[Patent] Arithmetic node including general digital signal processing functions for an adaptive computing machine

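IPC classification information
Country / Type	United States (US) Patent, granted
International Patent Classification (IPC, 7th edition)	G06F-015/173 G06F-015/80
Application number	US-0367188 (2003-02-13)
Registration number	US-8949576 (2015-02-03)
Inventor / Address	Hogenauer, Eugene B.
Applicant / Address	NVIDIA Corporation
Agent / Address	Patterson & Sheridan, LLP
Citation information	Times cited: 0; Cited patents: 49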

Abstract

An apparatus for processing operations in an adaptive computing environment is provided. The adaptive computing environment includes at least one processing node. A node includes a memory configured to receive and store data. The data is received from a programmable interconnection network and stored. The node also includes an execution unit configured to perform a signal processing operation. The operation is performed using data retrieved from the memory and an output result is generated. The output result may be used for further computations or sent directly to the programmable interconnection network for transfer to another processing node in the adaptive computing environment.
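The data flow in the abstract can be illustrated with a minimal sketch. Everything named here (`Node`, `Interconnect`, `energy`, `halve`) is hypothetical and chosen only for illustration; the patent record does not define a software interface, and the real node is hardware. The sketch shows a node storing data received from the network, running one operation on the stored data, and forwarding the result to another node.

```python
from typing import Callable, Dict, List, Sequence


class Node:
    """Hypothetical processing node: a memory plus one execution unit."""

    def __init__(self, name: str, operation: Callable[[Sequence[float]], float]) -> None:
        self.name = name
        self.operation = operation       # stands in for the execution unit
        self.memory: List[float] = []    # stands in for the node memory

    def receive(self, data: Sequence[float]) -> None:
        # Data arriving from the interconnection network is stored in memory.
        self.memory.extend(data)

    def execute(self) -> float:
        # The operation runs on data retrieved from memory and yields a result.
        result = self.operation(self.memory)
        self.memory.clear()
        return result


class Interconnect:
    """Hypothetical stand-in for the programmable interconnection network."""

    def __init__(self) -> None:
        self.routes: Dict[str, Node] = {}

    def connect(self, source: Node, destination: Node) -> None:
        # "Programming" the network here just means recording a route.
        self.routes[source.name] = destination

    def send(self, source: Node, value: float) -> None:
        self.routes[source.name].receive([value])


def energy(samples: Sequence[float]) -> float:
    # Sum of squares, a simple signal processing operation.
    return sum(x * x for x in samples)


def halve(samples: Sequence[float]) -> float:
    return 0.5 * samples[0]


first = Node("energy", energy)
second = Node("scale", halve)
network = Interconnect()
network.connect(first, second)

first.receive([1.0, 2.0, 3.0])
network.send(first, first.execute())   # result forwarded to another node
result = second.execute()
print(result)  # 7.0
```

The result of the first node could equally be kept locally for further computation; the sketch forwards it only to show the second path the abstract mentions.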

Representative claims

1. An adaptive computing engine, comprising: a programmable interconnection network including a network root and a set of crosspoint switches, each crosspoint switch coupled to the network root, wherein the network root and the set of crosspoint switches can be programmed to configure the adaptive computing engine for one or more different tasks; and a plurality of nodes that each have a fixed and different architecture that corresponds to a particular algorithmic function, wherein each node is connected to one or more other nodes in the plurality of nodes by at least one crosspoint switch in the set of crosspoint switches, each node including: an execution unit configured to perform the particular algorithmic function associated with the node, the execution unit having an internal structure specific to the particular algorithmic function associated with the node, a memory configured to receive data and to store data, the memory having a size and a memory format, and a node wrapper configured to: receive data and configuration information from the programmable interconnection network, distribute the data and configuration information received from the interconnection network to the execution unit and to the memory, receive data from the execution unit and from the memory, and transmit the data received from the execution unit and from the memory to other nodes in the plurality of nodes and to one or more processing elements external to the adaptive computing engine via the programmable interconnection network.

2. The adaptive computing engine of claim 1, wherein each node in the plurality of nodes further includes: an instruction cache configured to store one or more instructions and one or more operands; and a controller configured to control the operation of the execution unit and the address generator by performing the steps of: retrieving an instruction from the instruction cache, causing the address generator to generate an address for the instruction cache at which one or more operands associated with the instruction are stored, retrieving the one or more operands from the memory based on the address, and transmitting the instruction and the one or more operands to the execution unit for execution.

3. The adaptive computing engine of claim 2, wherein the controller retrieves one or more instructions from sequential addresses in the instruction cache until a branch instruction is retrieved from the instruction cache.

4. The adaptive computing engine of claim 3, wherein the branch instruction is an unconditional branch instruction including a branch address specifying a location in the instruction cache of a subsequent instruction to be executed that is based on a value stored in a computed value latch that is set during execution of a previous instruction.

5. The adaptive computing engine of claim 3, wherein the branch instruction is a conditional branch instruction, the controller continues to retrieve instructions from sequential addresses in the instruction cache according to a binary value stored in a conditional status latch that is set during execution of a previous instruction.

6. The adaptive computing engine of claim 1, configured to execute an instruction loop, the adaptive computing engine comprising: a loop stack configured to receive an address of a start instruction in the instruction cache, an address of an end instruction in the instruction cache, and a maximum number of loop iterations; and a program counter configured to record a current number of loop iterations, wherein the loop stack is popped and a subsequent instruction loop is executed when the maximum number of loop iterations is equal to the current number of loop iterations.

7. The adaptive computing engine of claim 1, further comprising: a system bus interface configured to provide communication with one or more computer systems; a network input interface configured to send and receive real-time data; an external memory interface configured to be coupled to one or more external memory devices; and a network output interface configured to provide communication with one or more other adaptive computing engines.

8. The adaptive computing engine of claim 7, wherein the adaptive computing engine is coupled to one or more other adaptive computing engines that are connected in a sequence, and the last other adaptive computing engine in the sequence includes a feedback connection to the adaptive computing engine.

9. The adaptive computing engine of claim 1, wherein the programmable interconnection network is configured to cause the plurality of nodes to implement a linear algorithmic operation, a non-linear algorithmic operation, a finite state machine operation, a memory operation, a bit manipulation, a fast Fourier transform, an arithmetic logic function, a multiply-accumulate function, or a discrete cosine transformation.

10. A computing system, comprising a first adaptive computing engine and a second adaptive computing engine, wherein the first adaptive computing engine and the second adaptive computing engine each comprise: a programmable interconnection network including a network root and a set of crosspoint switches, each crosspoint switch coupled to the network root, wherein the network root and the set of crosspoint switches can be programmed to configure the adaptive computing engine for one or more different tasks; and a plurality of nodes that each have a fixed and different architecture that corresponds to a particular algorithmic function, wherein each node is connected to one or more other nodes in the plurality of nodes by at least one crosspoint switch in the set of crosspoint switches, each node including: an execution unit configured to perform the particular algorithmic function associated with the node, the execution unit having an internal structure specific to the particular algorithmic function associated with the node, a memory configured to receive data and to store data, the memory having a size and a memory format, and a node wrapper configured to: receive data and configuration information from the programmable interconnection network, distribute the data and configuration information received from the interconnection network to the execution unit and to the memory, receive data from the execution unit and from the memory, and transmit the data received from the execution unit and from the memory to other nodes in the plurality of nodes and to one or more processing elements external to the adaptive computing engine via the programmable interconnection network.

11. The computing system of claim 10, wherein each node in the plurality of nodes further includes: an instruction cache configured to store one or more instructions and one or more operands; and a controller configured to control the operation of the execution unit and the address generator by performing the steps of: retrieving an instruction from the instruction cache, causing the address generator to generate an address for the instruction cache at which one or more operands associated with the instruction are stored, retrieving the one or more operands from the memory based on the address, and transmitting the instruction and the one or more operands to the execution unit for execution.

12. The computing system of claim 11, wherein the controller retrieves one or more instructions from sequential addresses in the instruction cache until a branch instruction is retrieved from the instruction cache.

13. The computing system of claim 12, wherein the branch instruction is an unconditional branch instruction including a branch address specifying a location in the instruction cache of a subsequent instruction to be executed that is based on a value stored in a computed value latch that is set during execution of a previous instruction.

14. The computing system of claim 12, wherein the branch instruction is a conditional branch instruction, the controller continues to retrieve instructions from sequential addresses in the instruction cache according to a binary value stored in a conditional status latch that is set during execution of a previous instruction.

15. The computing system of claim 10, configured to execute an instruction loop, the adaptive computing engine comprising: a loop stack configured to receive an address of a start instruction in the instruction cache, an address of an end instruction in the instruction cache, and a maximum number of loop iterations; and a program counter configured to record a current number of loop iterations, wherein the loop stack is popped and a subsequent instruction loop is executed when the maximum number of loop iterations is equal to the current number of loop iterations.

16. The computing system of claim 10, wherein the first adaptive computing engine and the second adaptive computing engine each further include: a system bus interface configured to provide communication with one or more computer systems; a network input interface configured to send and receive real-time data; an external memory interface configured to be coupled to one or more external memory devices; and a network output interface configured to provide communication with one or more other adaptive computing engines.

17. The computing system of claim 16, wherein the network output interface included in the first adaptive computing engine is coupled to the network input interface of the second adaptive computing engine, and the network output interface included in the second adaptive computing engine is coupled to the network input interface of the first adaptive computing engine comprising a feedback connection from the second adaptive computing engine to the first adaptive computing engine.

18. The computing system of claim 10, wherein the programmable interconnection network is configured to cause the plurality of nodes to implement a linear algorithmic operation, a non-linear algorithmic operation, a finite state machine operation, a memory operation, a bit manipulation, a fast Fourier transform, an arithmetic logic function, a multiply-accumulate function, or a discrete cosine transformation.

Patents cited by this patent (49)

Freeman Ross H. (San Jose CA), Configurable electrical circuit having configurable logic elements and configurable interconnects.
Popli Sanjay (Sunnyvale CA) Pickett Scott (Los Gatos CA) Hawley David (Belmont CA) Moni Shankar (Santa Clara CA) Camarota Rafael C. (San Jose CA), Configuration features in a configurable logic array.
Vorbach Martin Andreas,DEX ; Munch Robert Markus,DEX, Dynamically reconfigurable data processing system.
Wittig Ralph D. ; Mohan Sundararajarao ; Carberry Richard A., FPGA configurable logic block with multi-purpose logic/memory circuit.
Cloutier Jocelyn, FPGA-based processor.
Kundu, Arunangshu; Goldfein, Arnold; Plants, William C.; Hightower, David, Field programmable gate array and microcontroller system-on-a-chip.
Trimberger Stephen M., Field programmable gate array having programming instructions in the configuration bitstream.
Law Edwin S. ; Buch Kiran B. ; Baxter Glenn A. ; Pang Raymond C., Hardwire logic device emulating an FPGA.
Stephen L. Wasson, Heterogeneous programmable gate array.
Martin Vorbach DE; Robert Munch DE, I/O and memory bus system for DFPS and units with two or multi-dimensional programmable cell architectures.
Vorbach Martin,DEX ; Munch Robert,DEX, I/O and memory bus system for DFPs and units with two- or multi-dimensional programmable cell architectures.
Tavana Danesh ; Yee Wilson K. ; Trimberger Stephen M., Integrated circuit with field programmable and application specific logic areas.
Cooke Laurence H. ; Phillips Christopher E. ; Wong Dale, Integrated processor and programmable data path chip for reconfigurable computing.
Wong Dale ; Phillips Christopher E. ; Cooke Laurence H., Integrated processor and programmable data path chip for reconfigurable computing.
DeHon Andre ; Mirsky Ethan ; Knight ; Jr. Thomas F., Intermediate-grain reconfigurable processing device.
Martin Vorbach DE; Robert Munch DE, Internal bus system for DFPS and units with two- or multi-dimensional programmable cell architectures, for managing large volumes of data with a high interconnection complexity.
Master Paul L. ; Hatley William T. ; Scheuermann II Walter J. ; Goodman Margaret J., Method and apparatus for adaptable digital protocol processing.
Cummings Mark R., Method and apparatus for communicating information.
Bertolet Allan Robert ; Clinton Kim P.N. ; Gould Scott Whitney ; Keyser III Frank Ray ; Reny Timothy Shawn ; Zittritsch Terrance John, Method and system for layout and schematic generation for heterogeneous arrays.
Cooke Laurence H. ; Phillips Christopher E. ; Wong Dale, Method for compiling high level programming languages into an integrated processor with reconfigurable logic.
Vorbach, Martin; Munch, Robert, Method for deadlock-free configuration of dataflow processors and modules with a two- or multidimensional programmable cell structure (FPGAs, DPGAs, etc.).
Vorbach,Martin; May,Frank; Nückel,Armin, Method for debugging reconfigurable architectures.
Martin Vorbach DE; Robert Munch DE, Method for hierarchical caching of configuration data having dataflow processors and modules having two-or multidimensional programmable cell structure (FPGAs, DPGAs, etc.)--.
Harrison David A. ; Silver Joshua M. ; Soe Soren T., Method for programming complex PLD having more than one function block type.
May,Frank; Nückel,Armin; Vorbach,Martin, Method for translating programs for reconfigurable architectures.
Vorbach, Martin; Munch, Robert, Method of repairing integrated circuits.
Vorbach, Martin; Munch, Robert, Method of self-synchronization of configurable elements of a programmable module.
Vorbach Martin,DEX ; Munch Robert,DEX, Method of the self-synchronization of configurable elements of a programmable unit.
Vorbach,Martin; Baumgarte,Volker, Methods and devices for treating and processing data.
Vorbach,Martin; Baumgarte,Volker; Ehlers,Gerd; May,Frank; Nückel,Armin, Pipeline configuration unit protocols and communication.
Kawamoto Koji (Itami JPX), Pipeline processor with hardware loop function using instruction address stack for holding content of program counter an.
Camarota Rafael C. (San Jose CA) Furtek Frederick C. (Menlo Park CA) Ho Walford W. (Saratoga CA) Browder Edward H. (Saratoga CA), Programmable logic cell and array.
Camarota Rafael C. (San Jose CA) Furtek Frederick C. (Menlo Park CA) Ho Walford W. (Saratoga CA) Browder Edward H. (Saratoga CA), Programmable logic cell and array with bus repeaters.
Trimberger Stephen M. ; Carberry Richard A. ; Johnson Robert Anders ; Wong Jennifer, Programmable logic device including configuration data or user data memory slices.
Katsutoshi Ito JP, Radio communication apparatus employing a rake receiver.
Ebeling William Henry Carl ; Cronquist Darren Charles ; Franklin Paul David, Reconfigurable computing architecture for providing pipelined data paths.
Alan David Marshall GB; Anthony Stansfield GB; Jean Vuillemin FR, Reconfigurable processor devices.
Vorbach,Martin, Reconfigurable sequencer structure.
Trimberger Stephen M., Reprogrammable instruction set accelerator.
Vorbach,Martin; Bretz,Daniel, Router.
Vorbach Martin,DEX ; Munch Robert,DEX, Run-time reconfiguration method for programmable units.
Kelleher Brian M. ; Dewey Thomas E., Scalable graphics processor architecture.
Kopp Randall L. (Irvine CA) Johnson S. Val (Anaheim CA), Single-chip self-configurable parallel processor.
Iadanza Joseph Andrew (Hinesburg VT), System and method for dynamically reconfiguring a programmable gate array.
Davis Donald J. ; Bennett Toby D. ; Harris Jonathan C. ; Miller Ian D. ; Edwards Stephen G., System and method for programming the hardware of field programmable gate arrays (FPGAs) and related reconfiguration resources as if they were software by creating hardware objects.
Martin Vorbach DE; Robert Munch DE, UNIT FOR PROCESSING NUMERIC AND LOGIC OPERATIONS FOR USE IN CENTRAL PROCESSING UNITS (CPUS), MULTIPROCESSOR SYSTEMS, DATA-FLOW PROCESSORS (DSPS), SYSTOLIC PROCESSORS AND FIELD PROGRAMMABLE GATE ARRAY.
Furtek Frederick C. (Menlo Park CA) Camarota Rafael C. (San Jose CA), Versatile programmable logic cell for use in configurable logic arrays.
Agrawal Prathima ; Cravatts Mark Robert ; Trotter John Andrew ; Srivastava Mani Bhushan, Wireless adapter architecture for mobile computing.
Athanas Peter ; Bittner ; Jr. Ray A., Worm-hole run-time reconfigurable processor field programmable gate array (FPGA).
