[특허]FPGA device for image classification

FPGA device for image classification 원문보기

IPC분류정보
국가/구분	United States(US) Patent 등록
국제특허분류(IPC7판)	G06K-009/62 G06T-007/70 G06T-001/20 H04N-007/18 G06T-007/13 B60W-010/18 B60W-010/20 B60W-010/06 B60W-030/09 G05D-001/00
출원번호	US-0496144 (2017-04-25)
등록번호	US-10255525 (2019-04-09)
발명자 / 주소	Totolos, Jr., George Silberman, Joshua Strother, Daniel Vallespi-Gonzalez, Carlos Parlour, David Bruce
출원인 / 주소	Uber Technologies, Inc.
대리인 / 주소	Dority & Manning, P.A.
인용정보	피인용 횟수 : 0 인용 특허 : 10

초록 ▼

Image processing systems can include one or more cameras configured to obtain image data, one or more memory devices configured to store a classification model that classifies image features within the image data as including or not including detected objects, and a field programmable gate array (FPGA) device coupled to the one or more cameras. The FPGA device is configured to implement one or more image processing pipelines for image transformation and object detection. The one or more image processing pipelines can generate a multi-scale image pyramid of multiple image samples having different scaling factors, identify and aggregate features within one or more of the multiple image samples having different scaling factors, access the classification model, provide the features as input to the classification model, and receive an output indicative of objects detected within the image data.

대표청구항 ▼

1. An image processing system, comprising: one or more cameras configured to obtain image data;one or more memory devices configured to store a classification model that classifies image features within the image data as including or not including detected objects; anda field programmable gate array (FPGA) device coupled to the one or more cameras, the FPGA device configured to implement one or more image processing pipelines for image transformation and object detection;the one or more image processing pipelines including a plurality of logic blocks and interconnectors programmed to: generate a multi-scale image pyramid of multiple image samples having different scaling factors; identify and aggregate features within one or more of the multiple image samples having different scaling factors; access the classification model stored in the one or more memory devices; provide the features within the one or more of the multiple image samples as input to the classification model; and produce an output indicative of objects detected within the image data;wherein the features identified and aggregated within the one or more of the multiple image samples comprise edge portions, and wherein the one or more image processing pipelines further include a plurality of logic blocks and interconnectors that are programmed to determine an angle classification for each of the identified edge portions, and to assign the edge portion to one of a plurality of different bins depending on the angle classification for that edge portion. 2. The image processing system of claim 1, wherein the one or more image processing pipelines further include a plurality of logic blocks and interconnectors that are programmed to determine a histogram descriptive of the plurality of different bins. 3. The image processing system of claim 2, wherein the histogram comprises a histogram of oriented gradients for the identified edge portions. 4. The image processing system of claim 1, wherein the plurality of different bins are defined to have different sizes based on an amount of image data in each image sample such that bin sizes are smaller for image samples having a greater amount of image data. 5. The image processing system of claim 1, wherein the one or more image processing pipelines include a plurality of logic blocks and interconnectors programmed to generate one or more channel images from the image data, each channel image corresponding to a feature map that maps a patch of one or more input pixels from the image data to an output pixel within the channel image. 6. The image processing system of claim 1, wherein the one or more image processing pipelines further include a plurality of logic blocks and interconnectors designed to convert intermediate stages of the image data from the one or more cameras from a floating point representation to fixed point integer-based representation. 7. The image processing system of claim 1, wherein the one or more image processing pipelines further include a plurality of logic blocks and interconnectors programmed to convert the image data from the one or more cameras into a multi-parameter representation including values corresponding to an image hue parameter, an image saturation parameter, and an image greyscale parameter. 8. The image processing system of claim 1, wherein the one or more image processing pipelines further include a plurality of logic blocks and interconnectors programmed to convert the image data from a representation having multiple color components to a greyscale representation. 9. The image processing system of claim 1, wherein the one or more image processing pipelines include a plurality of logic blocks and interconnectors programmed to determine a sliding window of fixed size, to analyze successive image patches within each of the multiple image samples using the sliding window of fixed size, and to identify objects of interest within the successive image patches. 10. A vehicle control system, comprising: one or more cameras configured to obtain image data within an environment proximate to a vehicle;a field programmable gate array (FPGA) device coupled to one or more cameras, the FPGA device configured to implement one or more image processing pipelines for image transformation and object detection, the one or more image processing pipelines including a plurality of logic blocks and interconnectors programmed to: generate from the image data a multi-scale image pyramid of multiple image samples having different scaling factors; identify and aggregate features within one or more of the multiple image samples having different scaling factors; and to detect one or more objects of interest within the multiple image samples based at least in part on the features, wherein the features identified and aggregated within the one or more of the multiple image samples comprise edge portions, and wherein the second image processing pipeline for object detection further includes a plurality of logic blocks and interconnectors that are programmed to determine an angle classification for each of the identified edge portions, and to assign the edge portion to one of a plurality of different bins depending on the angle classification for that edge portion;one or more computing devices configured to receive an output from the FPGA device and to further characterize the objects of interest. 11. The vehicle control system of claim 10, wherein the one or more computing devices are further configured to control motion of the vehicle based at least in part on the one or more objects of interest detected within the image data from the one or more cameras. 12. The vehicle control system of claim 10, wherein the one or more image processing pipelines further include a plurality of logic blocks and interconnectors that are programmed to determine a histogram of oriented gradients descriptive of the plurality of different bins. 13. The vehicle control system of claim 10, wherein the plurality of different bins are defined to have different sizes based on an amount of image data in each image sample such that bin sizes are smaller for image samples having a greater amount of image data. 14. The vehicle control system of claim 10, wherein the one or more image processing pipelines further include a plurality of logic blocks and interconnectors programmed to convert the image data from the one or more cameras into a multi-parameter representation including values corresponding to an image hue parameter, an image saturation parameter, and an image greyscale parameter. 15. The vehicle control system of claim 10, wherein the one or more image processing pipelines further include a plurality of logic blocks and interconnectors programmed to determine a sliding window of fixed size, to analyze successive image patches within each of the multiple image samples using the sliding window of fixed size, and to identify objects of interest within the successive image patches.

이 특허에 인용된 특허 (10)

Stein, Gideon P.; Ferencz, Andras D.; Avni, Ofer, Estimating distance to an object using a sequence of images recorded by a monocular camera.
상세보기
Craig, William C., GPS-based traction control system and method using data transmitted between vehicles.
상세보기
Walker, Scott; Wilson, Terry B.; Hamman, Gary M., Mechanically scanned parabolic reflector antenna.
상세보기
Schamp, Gregory G.; Davies, Owen A.; Demro, James C., Method of processing an image of a visual scene.
상세보기
Ernst, Jr.,Raymond P.; Wilson,Terry B., Multi-sensor integration for a vehicle.
상세보기
Vallespi-Gonzalez, Carlos, Object detection for an autonomous vehicle.
상세보기
Wilson, Terry B., Path prediction for vehicular collision warning system.
상세보기
Sanjay Devappa Rai ; Nicholas Barton ; Troy Taylor ; Xueming Henry Gu, Primary and secondary color manipulations using hue, saturation, luminance and area isolation.
상세보기
Breed, David S., System and method for preventing vehicular accidents.
상세보기
Ernst, Jr.,Raymond P.; Wilson,Terry B., Vehicular collision avoidance system.
상세보기

내보내기 메뉴

내보내기 구분

파일저장
인쇄
메일전송

구성항목

기본정보
상세정보

관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC

저장형식

Text(ASCII format)
Excel format
PIAS분석(.xls)

메일정보

받는사람 (필수): @
보내는사람 (선택): @
제목
내용: KISTI 검색결과 이메일 서비스

안내

총 건의 자료가 검색되었습니다.

다운받으실 자료의 인덱스를 입력하세요. (1-10,000)

검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다.

데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요)

다운로드 파일은 UTF-8 형태로 저장됩니다.
파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오.

Text(ASCII format)
Excel format

AI-Helper ※ AI-Helper는 을 사용합니다.

AI-Helper

안녕하세요, AI-Helper입니다. 좌측 "선택된 텍스트"에서 텍스트를 선택하여 요약, 번역, 용어설명을 실행하세요.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.

IPC	Description
A	생활필수품
A62	인명구조; 소방(사다리 E06C)
A62B	인명구조용의 기구, 장치 또는 방법(특히 의료용에 사용되는 밸브 A61M 39/00; 특히 물에서 쓰이는 인명구조 장치 또는 방법 B63C 9/00; 잠수장비 B63C 11/00; 특히 항공기에 쓰는 것, 예. 낙하산, 투출좌석 B64D; 특히 광산에서 쓰이는 구조장치 E21F 11/00)
A62B-1/08	.. 윈치 또는 풀리에 제동기구가 있는 것

연합인증

FPGA device for image classification 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

이 특허에 인용된 특허 (10)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 공고번호, 공고일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표출원인, 출원인국적, 출원인주소, 발명자, 발명자E, 발명자코드, 발명자주소, 발명자 우편번호, 발명자국적, 대표IPC, IPC코드, 요약, 미국특허분류, 대리인주소, 대리인코드, 대리인(한글), 대리인(영문), 국제공개일자, 국제공개번호, 국제출원일자, 국제출원번호, 우선권, 우선권주장일, 우선권국가, 우선권출원번호, 원출원일자, 원출원번호, 지정국, Citing Patents, Cited Patents
저장형식	Text(ASCII format) Excel format PIAS분석(.xls)
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

FPGA device for image classification 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

전체(0) 논문(0) 특허(0) 보고서(0)

전체(0) 논문(0) 특허(0) 보고서(0)

이 특허에 인용된 특허 (10)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트