Apparatus, systems, and methods for providing configurable computational imaging pipeline
IPC Classification
Country/Type: United States (US) Patent, Granted
International Patent Classification (IPC, 7th edition): G06F-015/80; G06F-015/167; G06F-009/38
Application No.: US-0082645 (2013-11-18)
Patent No.: US-9146747 (2015-09-29)
Inventors
Moloney, David
Richmond, Richard
Donohoe, David
Barry, Brendan
Brick, Cormac
Vesa, Ovidiu Andrei
Applicant
LINEAR ALGEBRA TECHNOLOGIES LIMITED
Attorney/Agent
Wilmer Cutler Pickering Hale and Dorr LLP
Citation Information
Times cited: 0
Patents cited: 32
Abstract
The present application relates generally to a parallel processing device. The parallel processing device can include a plurality of processing elements, a memory subsystem, and an interconnect system. The memory subsystem can include a plurality of memory slices, at least one of which is associated with one of the plurality of processing elements and comprises a plurality of random access memory (RAM) tiles, each tile having individual read and write ports. The interconnect system is configured to couple the plurality of processing elements and the memory subsystem. The interconnect system includes a local interconnect and a global interconnect.
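The abstract can be read as a small concurrency model: every processing element owns one memory slice built from RAM tiles with individual read/write ports, reachable over a fast local interconnect by its owner and over a global interconnect by everyone else. The sketch below is an illustrative model of that layout only, not the patented hardware; the class names, tile counts, and interleaving scheme are assumptions made for the example.

```python
# Toy model of the memory subsystem described in the abstract (illustrative
# only): per-element memory slices made of RAM tiles, with local vs. global
# interconnect paths tracked separately.

class RamTile:
    """One RAM tile with its own read and write port."""
    def __init__(self, words=1024):
        self.data = [0] * words

    def read(self, addr):
        return self.data[addr]

    def write(self, addr, value):
        self.data[addr] = value


class MemorySlice:
    """A slice groups several tiles; because each tile has its own ports,
    accesses that land on different tiles need not clash."""
    def __init__(self, n_tiles=4, words_per_tile=1024):
        self.tiles = [RamTile(words_per_tile) for _ in range(n_tiles)]

    def _route(self, addr):
        # Simple interleaving: consecutive words land on consecutive tiles.
        return self.tiles[addr % len(self.tiles)], addr // len(self.tiles)

    def read(self, addr):
        tile, offset = self._route(addr)
        return tile.read(offset)

    def write(self, addr, value):
        tile, offset = self._route(addr)
        tile.write(offset, value)


class Interconnect:
    """Routes an access over the local path (element to its own slice)
    or the global path (element to any other slice)."""
    def __init__(self, slices):
        self.slices = slices
        self.local_accesses = 0
        self.global_accesses = 0

    def access(self, pe_id, slice_id, addr, value=None):
        if pe_id == slice_id:
            self.local_accesses += 1
        else:
            self.global_accesses += 1
        s = self.slices[slice_id]
        if value is None:
            return s.read(addr)
        s.write(addr, value)


slices = [MemorySlice() for _ in range(4)]  # one slice per processing element
bus = Interconnect(slices)
bus.access(0, 0, 5, value=42)    # PE0 writes its own slice: local path
print(bus.access(1, 0, 5))       # PE1 reads PE0's slice: global path; prints 42
```

Keeping a producer's output in the consumer's own slice maximizes local-path traffic, which is the scheduling concern the representative claims return to below.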
Representative Claims
1. An electronic device comprising: a parallel processing device comprising: a plurality of processing elements each configured to execute instructions; a memory subsystem comprising a plurality of memory slices including a first memory slice associated with one of the plurality of processing elements, wherein the first memory slice comprises a plurality of random access memory (RAM) tiles each having individual read and write ports; and an interconnect system configured to couple the plurality of processing elements and the memory subsystem, wherein the interconnect system includes: a local interconnect configured to couple the first memory slice and the one of the plurality of processing elements, and a global interconnect configured to couple the first memory slice and the remaining of the plurality of processing elements; a processor, in communication with the parallel processing device, configured to run a module stored in memory that is configured to: receive a flow graph associated with a data processing process, wherein the flow graph comprises a plurality of nodes and a plurality of edges connecting two or more of the plurality of nodes, wherein each node identifies an operation and each edge identifies a relationship between the connected nodes; and assign a first node of the plurality of nodes to a first processing element of the parallel processing device and a second node of the plurality of nodes to a second processing element of the parallel processing device, thereby parallelizing operations associated with the first node and the second node.
2. The electronic device of claim 1, wherein the flow graph is provided in an extensible markup language (XML) format.
3. The electronic device of claim 1, wherein the module is configured to assign the first node of the plurality of nodes to the first processing element based on a past performance of a memory subsystem in the parallel processing device.
4. The electronic device of claim 3, wherein the memory subsystem of the parallel processing device comprises a counter that is configured to count a number of memory clashes over a predetermined period of time, and the past performance of the memory subsystem comprises the number of memory clashes measured by the counter.
5. The electronic device of claim 1, wherein the module is configured to assign the first node of the plurality of nodes to the first processing element while the parallel processing device is operating at least a portion of the flow graph.
6. The electronic device of claim 1, wherein the module is configured to receive a plurality of flow graphs, and assign all operations associated with the plurality of flow graphs to a single processing element in the parallel processing device.
7. The electronic device of claim 1, wherein the module is configured to stagger memory accesses by the processing elements to reduce memory clashes.
8. The electronic device of claim 1, wherein the electronic device includes a mobile device.
9. The electronic device of claim 1, wherein the flow graph is specified using an application programming interface (API) associated with the parallel processing device.
10. The electronic device of claim 1, wherein the module is configured to provide input image data to the plurality of processing elements by: dividing the input image data into a plurality of strips; and providing one of the plurality of strips of the input image data to one of the plurality of processing elements.
11. The electronic device of claim 10, wherein a number of the plurality of strips of the input image data is the same as a number of the plurality of processing elements.
12. A method comprising: receiving, at a processor in communication with a parallel processing device, a flow graph associated with a data processing process, wherein the flow graph comprises a plurality of nodes and a plurality of edges connecting two or more of the plurality of nodes, wherein each node identifies an operation and each edge identifies a relationship between the connected nodes; and assigning a first node of the plurality of nodes to a first processing element of the parallel processing device and a second node of the plurality of nodes to a second processing element of the parallel processing device, thereby parallelizing operations associated with the first node and the second node, wherein the parallel processing device also comprises: a memory subsystem comprising a plurality of memory slices including a first memory slice associated with the first processing element, wherein the first memory slice comprises a plurality of random access memory (RAM) tiles each having individual read and write ports; and an interconnect system configured to couple the first processing element, the second processing element, and the memory subsystem, wherein the interconnect system includes: a local interconnect configured to couple the first memory slice and the first processing element, and a global interconnect configured to couple the first memory slice and the second processing element.
13. The method of claim 12, wherein the flow graph is provided in an extensible markup language (XML) format.
14. The method of claim 12, wherein assigning the first node of the plurality of nodes to the first processing element of the parallel processing device comprises assigning the first node of the plurality of nodes to the first processing element based on a past performance of a first memory slice in the parallel processing device.
15. The method of claim 14, further comprising counting, at a counter in the memory subsystem, a number of memory clashes in the first memory slice over a predetermined period of time, and the past performance of the first memory slice comprises the number of memory clashes in the first memory slice.
16. The method of claim 12, wherein assigning the first node of the plurality of nodes to the first processing element is performed while the parallel processing device is operating at least a portion of the flow graph.
17. The method of claim 12, further comprising staggering memory accesses by the processing elements to the first memory slice in order to reduce memory clashes.
18. The method of claim 12, wherein the flow graph is specified using an application programming interface (API) associated with the parallel processing device.
19. The method of claim 12, further comprising providing an input image data to the plurality of processing elements by dividing the input image data into a plurality of strips and providing one of the plurality of strips of the input image data to one of the plurality of processing elements.
20. The method of claim 19, wherein a number of the plurality of strips of the input image data is the same as a number of the plurality of processing elements.
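Claims 1, 10, 11, and 12 describe two cooperating ideas: mapping flow-graph nodes (operations) onto processing elements, and splitting input image data into one strip per element. The sketch below illustrates those ideas only; the round-robin placement policy, the toy graph, and the row-wise strip layout are assumptions made for the example, not the method the patent actually claims.

```python
# Illustrative sketch of flow-graph node assignment and image strip division
# (the placement policy and data shapes are assumed, not taken from the patent).

def assign_nodes(nodes, n_elements):
    """Map each node (operation) to a processing-element id, round-robin."""
    return {node: i % n_elements for i, node in enumerate(nodes)}

def split_into_strips(image_rows, n_elements):
    """Divide the rows of an image into n_elements contiguous strips,
    spreading any remainder over the first strips."""
    base, extra = divmod(len(image_rows), n_elements)
    strips, start = [], 0
    for i in range(n_elements):
        size = base + (1 if i < extra else 0)
        strips.append(image_rows[start:start + size])
        start += size
    return strips

# A toy flow graph: a blur node feeds both a gradient and a threshold node.
nodes = ["blur", "gradient", "threshold"]
edges = [("blur", "gradient"), ("blur", "threshold")]

placement = assign_nodes(nodes, n_elements=2)
print(placement)  # {'blur': 0, 'gradient': 1, 'threshold': 0}

# Edges whose endpoints land on different elements would need the global
# interconnect; same-element edges can stay on the local path.
cross = [(a, b) for a, b in edges if placement[a] != placement[b]]
print(cross)      # [('blur', 'gradient')]

rows = [[y] * 8 for y in range(10)]   # 10 rows of dummy pixel data
strips = split_into_strips(rows, 4)   # one strip per element, as in claim 11
print([len(s) for s in strips])       # [3, 3, 2, 2]
```

Claims 3 and 4 suggest a feedback loop on top of this: a clash counter in the memory subsystem reports past contention, and the placement step could prefer elements whose slices clashed least.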
Patents cited by this patent (32)
Comair, Claude; Li, Xin; Abou-Samra, Samir; Champagne, Robert; Fam, Sun Tjen; Ghali, Prasanna; Pan, Jun, 3D transformation matrix compression and decompression.
Seong, Nak hee; Lim, Kyoung mook; Jeong, Seh woong; Park, Jae hong; Im, Hyung jun; Bae, Gun young; Kim, Young duck, Apparatus and method for dispatching very long instruction word having variable length.
Iwata, Yasushi (JP); Asato, Akira (JP), Data processing device to compress and decompress VLIW instructions by selectively storing non-branch NOP instructions.
Pitsianis, Nikos P.; Pechanek, Gerald George; Rodriguez, Ricardo, Efficient complex multiplication and fast fourier transform (FFT) implementation on the ManArray architecture.
Pitsianis, Nikos P.; Pechanek, Gerald G.; Rodriguez, Ricardo E., Efficient complex multiplication and fast fourier transform (FFT) implementation on the manarray architecture.
Coleman, Charles H. (Redwood City, CA); Miller, Sidney D. (Mountain View, CA); Smidth, Peter (Menlo Park, CA), Method and apparatus for image data compression using combined luminance/chrominance coding.
Pechanek, Gerald G.; Revilla, Juan Guillermo; Barry, Edwin F., Methods and apparatus for dynamic very long instruction word sub-instruction selection for execution time parallelism in an indirect very long instruction word processor.
Pechanek, Gerald G.; Revilla, Juan Guillermo; Barry, Edwin F., Methods and apparatus for dynamic very long instruction word sub-instruction selection for execution time parallelism in an indirect very long instruction word processor.
Pechanek, Gerald G.; Revilla, Juan Guillermo; Barry, Edwin Franklin, Methods and apparatus for dynamic very long instruction word sub-instruction selection for execution time parallelism in an indirect very long instruction word processor.
Drabenstott, Thomas L.; Pechanek, Gerald G.; Barry, Edwin F.; Kurak, Jr., Charles W., Methods and apparatus to support conditional execution in a VLIW-based array processor with subword execution.
Drabenstott, Thomas L.; Pechanek, Gerald G.; Barry, Edwin F.; Kurak, Jr., Charles W., Methods and apparatus to support conditional execution in a VLIW-based array processor with subword execution.
Drabenstott, Thomas L.; Pechanek, Gerald George; Barry, Edwin Franklin; Kurak, Jr., Charles W., Methods and apparatus to support conditional execution in a VLIW-based array processor with subword execution.
Drabenstott, Thomas L.; Pechanek, Gerald G.; Barry, Edwin F.; Kurak, Jr., Charles W., Methods and apparatus to support conditional execution in a VLIW-based array processor with subword execution.
Drabenstott, Thomas L.; Pechanek, Gerald G.; Barry, Edwin F.; Kurak, Jr., Charles W., Methods and apparatus to support conditional execution in a VLIW-based array processor with subword execution.
Hall, William E. (Beaverton, OR); Stigers, Dale A. (Hillsboro, OR); Decker, Leslie F. (Portland, OR), Parallel vector processing system for individual and broadcast distribution of operands and control information.
Topham, Nigel Peter, Processor and method for generating and storing compressed instructions in a program memory and decompressed instructions in an instruction cache wherein the decompressed instructions are assigned imaginary addresses derived from information stored in the program memory with the compressed instructions.
Topham, Nigel Peter, Processor and method for generating and storing compressed instructions in a program memory and decompressed instructions in an instruction cache wherein the decompressed instructions are assigned imaginary addresses derived from information stored in the program memory with the compressed instructions.
Booth, Jr., Lawrence A.; Rosenzweig, Joel; Burr, Jeremy, System and method for high-speed communications between an application processor and coprocessor.
Haikonen, Pentti (FI); Juhola, Janne M. (FI); Latva-Rasku, Petri (FI), Video compressing method wherein the direction and location of contours within image blocks are defined using a binary picture of the block.