[Patent] Decoupled scalar/vector computer architecture system and method


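IPC classification information
Country / type	United States (US) Patent, granted
International Patent Classification (IPC 7th edition)	G06F-009/38
Application number	US-0643586 (2003-08-18)
Registration number	US-7334110 (2008-02-19)
Inventors / address	Faanes, Gregory J.; Scott, Steven L.; Lundberg, Eric P.; Moore, Jr., William T.; Johnson, Timothy J.
Applicant / address	Cray Inc.
Agent / address	Schwegman, Lundberg & Woessner, P.A.
Citation information	Times cited: 42; Patents cited: 73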

Abstract

In a computer system having a scalar processing unit and a vector processing unit, wherein the vector processing unit includes a vector dispatch unit, a system and method of decoupling operation of the scalar processing unit from that of the vector processing unit, the method comprising sending a vector instruction from the scalar processing unit to the vector dispatch unit, wherein sending includes marking the vector instruction as complete if the vector instruction is not a vector memory instruction and if the vector instruction does not require scalar operands, reading a scalar operand, wherein reading includes transferring the scalar operand from the scalar processing unit to the vector dispatch unit, predispatching the vector instruction within the vector dispatch unit if the vector instruction is scalar committed, dispatching the predispatched vector instruction if all required operands are ready, and executing the dispatched vector instruction as a function of the scalar operand.
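The flow the abstract describes (receive, predispatch once scalar committed, dispatch once operands are ready) can be sketched as a small software model. This is only an illustrative sketch, not the patented hardware: the class names, the `seq` program-order number, and the single `committed_upto` counter are assumptions introduced here, and "executing" is reduced to recording the instruction with its operand.

```python
from collections import deque
from dataclasses import dataclass
from typing import Optional


@dataclass
class VectorInstr:
    name: str
    seq: int                       # hypothetical program-order sequence number
    needs_scalar: bool = False     # does the instruction require a scalar operand?
    operand: Optional[int] = None  # filled in when the scalar unit delivers it


class VectorDispatchUnit:
    """Toy model of a predispatch queue feeding a dispatch queue."""

    def __init__(self):
        self.predispatch = deque()
        self.dispatch = deque()
        self.committed_upto = -1   # assumption: all seq <= this are scalar committed
        self.executed = []

    def receive(self, instr):
        # The scalar unit sends the instruction even if operands are not ready.
        self.predispatch.append(instr)

    def scalar_commit(self, seq):
        self.committed_upto = max(self.committed_upto, seq)

    def deliver_operand(self, seq, value):
        # Scalar operand transferred from the scalar unit to the dispatch unit.
        for queue in (self.predispatch, self.dispatch):
            for instr in queue:
                if instr.seq == seq:
                    instr.operand = value

    def step(self):
        # Predispatch in order: only the head may move, and only when every
        # earlier-issued instruction is scalar committed.
        while self.predispatch and self.predispatch[0].seq - 1 <= self.committed_upto:
            self.dispatch.append(self.predispatch.popleft())
        # Dispatch in order: the head waits until its scalar operand is ready.
        while self.dispatch:
            head = self.dispatch[0]
            if head.needs_scalar and head.operand is None:
                break
            self.executed.append((head.name, head.operand))
            self.dispatch.popleft()
```

In this model the scalar side never blocks in `receive`; the waiting happens inside the vector dispatch unit, which is the decoupling the abstract refers to.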

Representative claims

What is claimed is:

1. In a computer system having a scalar processing unit and a vector processing unit, wherein the vector processing unit includes a vector dispatch unit and wherein the vector dispatch unit includes a predispatch queue and a dispatch queue, a method of decoupling operation of the scalar processing unit from that of the vector processing unit, the method comprising: dispatching vector instructions from the scalar processing unit to the vector dispatch unit, wherein dispatching includes sending the vector instructions from the scalar processing unit to the vector dispatch unit even if all scalar operands are not ready and even if all scalar instructions issued prior to the vector instructions are not scalar committed; queueing up the vector instructions received from the scalar processing unit in the vector dispatch unit's predispatch queue; reading scalar operands from the scalar processing unit, wherein reading includes transferring the scalar operands from the scalar processing unit to the vector dispatch unit; predispatching the vector instructions from the predispatch queue to the dispatch queue in the order received, wherein predispatching includes determining if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed and transferring the vector instruction from the predispatch queue to the dispatch queue only if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed; dispatching the predispatched vector instruction from the dispatch queue if all required scalar operands are ready; and executing the vector instruction dispatched from the dispatch queue as a function of the scalar operands.

2. The method according to claim 1, wherein executing the dispatched vector instruction includes translating an address associated with the vector instruction and trapping on a translation fault.

3. The method according to claim 1, wherein the method further includes marking the vector instruction as complete, wherein marking the vector instruction as complete includes: if the vector instruction is not a memory instruction and if the vector instruction does not require scalar operands, indicating that the vector instruction is complete when the vector instruction is dispatched from the scalar processing unit; if the vector instruction is not a memory instruction but requires scalar operands, indicating that the vector instruction is complete when the scalar operands are available; and if the vector instruction is a memory instruction, indicating that the vector instruction is complete when the vector address has been translated; and graduating the vector instruction if the vector instruction is marked complete.

4. In a computer system having a scalar processing unit and a vector processing unit, wherein the vector processing unit includes a vector dispatch unit and wherein the vector dispatch unit includes a predispatch queue and a dispatch queue, a method of decoupling operation of the scalar processing unit from that of the vector processing unit, the method comprising: dispatching vector instructions from the scalar processing unit to the vector dispatch unit, wherein dispatching includes sending the vector instructions from the scalar processing unit to the vector dispatch unit even if all scalar operands are not ready and even if all scalar instructions issued prior to the vector instructions are not scalar committed; queueing up the vector instructions received from the scalar processing unit in the vector dispatch unit's predispatch queue; reading scalar operands from the scalar processing unit, wherein reading includes transferring the scalar operands from the scalar processing unit to the vector dispatch unit; predispatching, the vector instructions from the predispatch queue to the dispatch queue in the order received, wherein predispatching includes determining if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed and transferring the vector instruction from the predispatch queue to the dispatch queue only if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed; dispatching the predispatched vector instruction from the dispatch queue if all required scalar operands are ready; generating an address for a vector load; issuing a vector load request to memory; receiving vector data from memory; storing the vector data in a load buffer; transferring the vector data from the load buffer to a vector register; and executing the vector instruction dispatched from the vector dispatch queue on the vector data stored in the vector register.

5. The method according to claim 4, wherein the vector processing unit includes a vector execute unit and a vector load/store unit, wherein issuing a vector load request to memory includes issuing and executing vector memory references in the vector load/store unit when the vector load store unit has received the instruction and memory operands from the scalar processing unit.

6. The method according to claim 4, wherein storing the vector data in a load buffer and transferring the vector data from the load buffer to a vector register are decoupled from each other.

7. The method according to claim 4, wherein storing the vector data in a load buffer includes writing memory load data to the load buffer until all previous memory operations complete without fault.

8. The method according to claim 5, wherein storing the vector data in a load buffer and transferring the vector data from the load buffer to a vector register are decoupled from each other.

9. The method according to claim 5, wherein storing the vector data in a load buffer includes writing memory load data to the load buffer until all previous memory operations complete without fault.

10. In a computer system having a scalar processing unit and a vector processing unit, wherein the vector processing unit includes a vector dispatch unit and wherein the vector dispatch unit includes a predispatch queue and a dispatch queue, a method of decoupling operation of the scalar processing unit from that of the vector processing unit, the method comprising: dispatching vector instructions from the scalar processing unit to the vector dispatch unit, wherein dispatching includes sending the vector instructions from the scalar processing unit to the vector dispatch unit even if all scalar operands are not ready and even if all scalar instructions issued prior to the vector instructions are not scalar committed; queueing up the vector instructions received from the scalar processing unit in the vector dispatch unit's predispatch queue; reading scalar operands from the scalar processing unit, wherein reading includes transferring the scalar operands from the scalar processing unit to the vector dispatch unit; predispatching the vector instructions from the predispatch queue to the dispatch queue in the order received, wherein predispatching includes determining if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed and transferring the vector instruction from the predispatch queue to the dispatch queue only if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed; dispatching the predispatched vector instruction from the dispatch queue if all required scalar operands are ready; generating a first and a second address for a vector load; issuing first and second vector load requests to memory; receiving vector data associated with the first and second addresses from memory; storing vector data associated with the first address in a first vector register; storing vector data associated with the second address in a second vector register; executing a vector instruction on the vector data stored in the first vector register; renaming the second vector register; and executing the vector instruction dispatched from the dispatch queue on the vector data stored in the renamed vector register.

11. The method according to claim 10, wherein the vector processing unit includes a vector execute unit and a vector load/store unit, wherein issuing a vector load request to memory includes issuing and executing vector memory references in the vector load/store unit when the vector load store unit has received the instruction and memory operands from the scalar processing unit.

12. A computer system, comprising: a scalar processing unit; and a vector processing unit, wherein the vector processing unit includes a vector dispatch unit, a vector execute unit and a vector load/store unit, wherein the vector dispatch unit includes a predispatch queue and a dispatch queue; wherein the scalar processing unit dispatches a vector instructions to the vector dispatch unit even if all scalar operands associated with the instructions are not ready; wherein the scalar processing unit reads scalar operands and transfers the read scalar operands from the scalar processing unit to the vector dispatch unit; wherein the vector dispatch unit stores vector instructions received from the scalar processing unit in the vector dispatch unit's predispatch queue, predispatches the vector instructions from the predispatch queue to the dispatch queue in the order received; wherein the vector dispatch unit determines if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed and transfers the vector instruction from the predispatch queue to the dispatch queue only if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed; wherein the vector dispatch unit dispatches the predispatched vector instruction from the vector dispatch unit to one or more of the vector execute unit and the vector load/store unit if all required scalar operands are ready; wherein the vector load/store unit receives an instruction and memory operands from the scalar processing unit, issues and executes a vector memory load reference as a function of the instruction and the memory operands received from the scalar processing unit, and stores data received as a result of the vector memory reference in a load buffer; and wherein the vector execute unit issues the vector memory load instruction and transfers the data received as a result of the vector memory reference from the load buffer to a vector register.

13. The system according to claim 12, wherein the load buffer stores memory load data until it is determined that no previous memory operation will fail and, if no previous memory operations have failed, the load buffer transfers the data to the vector register.

14. In a computer system having a scalar processing unit and a vector processing unit, wherein the vector processing unit includes a vector dispatch unit and wherein the vector dispatch unit includes a predispatch queue and a dispatch queue, a method of decoupling scalar and vector execution, comprising: dispatching scalar instructions to a scalar instruction queue in the scalar processing unit; dispatching a vector instruction that requires scalar operands from the scalar processing unit to the scalar instruction queue and to the predispatch queue, wherein dispatching includes sending the vector instruction from scalar processing unit to the predispatch queue even if all scalar operands are not ready; queueing up two or more vector instructions received from the scalar processing unit in the predispatch queue; executing the vector instruction in the scalar processing unit, wherein executing the vector instruction in the scalar processing unit includes writing a scalar operand to a scalar operand queue; predispatching, within the vector dispatch unit, the vector instructions from the predispatch queue to the dispatch queue in the order received, wherein predispatching includes determining if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed and transferring the vector instruction from the predispatch queue to the dispatch queue only if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed; notifying the vector processing unit that the scalar operand is available in the scalar operand queue; dispatching the predispatched vector instruction from the dispatch queue of the vector dispatch unit, if all required scalar operands are ready; and executing the vector instruction dispatched from the vector dispatch unit in the vector processing unit, wherein executing the vector instruction in the vector processing unit includes reading the scalar operand from the scalar operand queue.

15. The method according to claim 14, wherein the method further includes marking the vector instruction as complete, wherein marking the vector instruction as complete includes: if the vector instruction is not a memory instruction and if the vector instruction does not require scalar operands, indicating that the vector instruction is complete when the vector instruction is dispatched from the scalar processing unit; if the vector instruction is not a memory instruction but requires scalar operands, indicating that the vector instruction is complete when the scalar operands are available; and if the vector instruction is a memory instruction, indicating that the vector instruction is complete when the vector address has been translated; and graduating the vector instruction if the vector instruction is marked complete.

16. In a computer system having a scalar processing unit and a vector processing unit, wherein the vector processing unit includes a vector dispatch unit and wherein the vector dispatch unit includes a predispatch queue and a dispatch queue, a method of decoupling a vector memory reference and a vector execution, comprising: dispatching scalar instructions to a scalar instruction queue in the scalar processing unit; dispatching a vector instruction from the scalar processing unit to the scalar instruction queue and to the predispatch queue in the vector processing unit, wherein dispatching includes sending the vector instruction from the scalar processing unit to the predispatch queue in the vector processing unit even if all scalar operands are not ready; queueing up two or more vector instructions received from the scalar processing unit in the predispatch queue; executing the vector instruction in the scalar processing unit, wherein executing the vector instruction in the scalar processing unit includes generating an address and writing the address to a scalar operand queue; predispatching, within the vector dispatch unit, the vector instructions from the predispatch queue to the dispatch queue in the order received, wherein predispatching includes determining if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed and transferring the vector instruction from the predispatch queue to the dispatch queue only if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed; notifying the vector processing unit that the address is available in the scalar operand queue; dispatching the predispatched vector instruction from the dispatch queue of the vector processing unit if all required scalar operands are ready; and executing the vector instruction dispatched from the vector processing unit in the vector processing unit, wherein executing the vector instruction in the vector processing unit includes reading the address from the scalar operand queue and generating a memory request as a function of the address read from the scalar operand queue.

17. The method according to claim 16, wherein the method further includes marking the vector instruction as complete, wherein marking the vector instruction as complete includes: if the vector instruction is not a memory instruction and if the vector instruction does not require scalar operands, indicating that the vector instruction is complete when the vector instruction is dispatched from the scalar processing unit; if the vector instruction is not a memory instruction but requires scalar operands, indicating that the vector instruction is complete when the scalar operands are available; and if the vector instruction is a memory instruction, indicating that the vector instruction is complete when the vector address has been translated; and graduating the vector instruction if the vector instruction is marked complete.

18. In a computer system having a scalar processing unit and a vector processing unit, wherein the vector processing unit includes a vector dispatch unit and wherein the vector dispatch unit includes a predispatch queue and a dispatch queue, a method of executing a vector instruction, comprising: dispatching scalar instructions to a scalar instruction queue in the scalar processing unit; dispatching a vector instruction from the scalar processing unit to the scalar instruction queue and to the predispatch queue in the vector processing unit, wherein dispatching includes sending the vector instruction from the scalar processing unit to the predispatch queue in the vector processing unit even if all scalar operands are not ready; queueing up two or more vector instructions received from the scalar processing unit in the predispatch queue; executing the vector instruction in the scalar processing unit, wherein executing the vector instruction in the scalar processing unit includes generating an address and writing the address to a scalar operand queue; predispatching, within the vector dispatch unit, the vector instructions from the predispatch queue to the dispatch queue in the order received, wherein predispatching includes determining if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed and transferring the vector instruction from the predispatch queue to the dispatch queue only if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed; notifying the vector processing unit that the address is available in the scalar operand queue; dispatching the predispatched vector instruction from the dispatch queue of the vector processing unit if all required scalar operands are ready; and executing the vector instruction dispatched from the vector processing unit in the vector processing unit, wherein executing the dispatched vector instruction in the vector processing unit includes: reading the address from the scalar operand queue; generating a memory request as a function of the address read from the scalar operand queue; receiving vector data from memory; storing the vector data in a load buffer; transferring the vector data from the load buffer to a vector register; and executing a vector instruction on the vector data stored in the vector register.

19. In a computer system having a scalar processing unit and a vector processing unit, wherein the vector processing unit includes a vector dispatch unit and wherein the vector dispatch unit includes a predispatch queue and a dispatch queue, a method of unrolling a loop, comprising: preparing a first and a second vector instruction, wherein each vector instruction executes an iteration through the loop and wherein each vector instruction requires calculation of a scalar loop value; dispatching the first and second vector instructions from the scalar processing unit to a scalar instruction queue in the scalar processing unit and to the predispatch queue in the vector processing unit, wherein dispatching includes sending the first and second vector instructions from the scalar processing unit to the predispatch queue in the vector processing unit even if all scalar operands are not ready; queueing up the first and second vector instructions received from the scalar processing unit in the predispatch queue; executing a portion of each vector instruction in the scalar processing unit, wherein executing a portion of each vector instruction in the scalar processing unit includes writing a scalar operand representing the scalar loop value calculated for each vector instruction to a scalar operand queue; predispatching, within the vector processing unit, each vector instruction received from the scalar processing unit if all previously received vector instructions are scalar committed; predispatching, within the vector dispatch unit, each vector instruction from the predispatch queue to the dispatch queue in the order received, wherein predispatching includes determining if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed and transferring the vector instruction from the predispatch queue to the dispatch queue only if all scalar instructions issued prior to the vector instruction at the head of the predispatch queue are scalar committed; notifying the vector processing unit that the scalar operand is available in the scalar operand queue; dispatching the predispatched vector instruction from the dispatch queue of the vector processing unit if all required scalar operands are ready; and executing the first and second vector instructions dispatched from the dispatch queue of the vector processing unit, wherein executing the dispatched vector instruction in the vector processing unit includes reading the scalar operands associated with each instruction from the scalar operand queue.
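The three-way rule for marking a vector instruction complete (claims 3, 15 and 17) reduces to a small decision function. The sketch below is only a reading aid under stated assumptions: the enum, its member names, and the function name are hypothetical and do not appear in the patent.

```python
from enum import Enum, auto


class CompleteWhen(Enum):
    """Hypothetical labels for the point at which an instruction is marked complete."""
    AT_DISPATCH = auto()          # when dispatched from the scalar processing unit
    OPERANDS_AVAILABLE = auto()   # when its scalar operands are available
    ADDRESS_TRANSLATED = auto()   # when the vector address has been translated


def completion_point(is_memory: bool, needs_scalar_operands: bool) -> CompleteWhen:
    # Memory instructions complete once the vector address is translated,
    # regardless of whether they also take scalar operands.
    if is_memory:
        return CompleteWhen.ADDRESS_TRANSLATED
    # Non-memory instructions with scalar operands wait for those operands.
    if needs_scalar_operands:
        return CompleteWhen.OPERANDS_AVAILABLE
    # Everything else is complete as soon as the scalar unit dispatches it.
    return CompleteWhen.AT_DISPATCH
```

Per the claims, an instruction marked complete can then graduate, so a non-memory instruction without scalar operands need not hold up the scalar unit's graduation beyond its dispatch.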

Patents cited by this patent (73)

Nugent Steven F. (Portland OR), Adaptive message routing for multi-dimensional networks.
Blasbalg Herman (Gaithersburg MD), Adaptive packet length traffic control in a local area network.
Bruckert William (Northboro MA) Bissett Thomas D. (Derry NH) Kovalcin David (Grafton MA) Nene Ravi (Chelmsford MA), Apparatus and method for documenting faults in computing modules.
Hashimoto Shin,JPX ; Masaki Reiji,JPX, Apparatus for analyzing operations of parallel processing system.
Barnes George H. (Wayne PA) Lundstrom Stephen F. (Wayne PA) Shafer Philip E. (Holmes PA), Array processor architecture.
Vishin Sanjay ; Aybay Gunes, Auxiliary translation lookaside buffer for assisting in accessing data in remote address spaces.
Kessler Richard E. ; Oberlin Steven M. ; Thorson Gregory M., Barrier and eureka synchronization architecture for multiprocessors.
Oberlin Steven M. (Chippewa Falls WI) Fromm Eric C. (Eau Claire WI), Barrier synchronization for distributed memory massively parallel processing systems.
Ishizaka Kenichi,JPX, Barrier synchronization system in parallel data processing.
McMahan Steven C., Branch processing unit with target cache read prioritization protocol for handling multiple hits.
Shibata Masabumi,JPX ; Nakajima Atsushi,JPX ; Fujiwara Shisei,JPX, Cache coherency control method and multi-processor system using the same.
Koyanagi, Hisao, Cache consistent control of subsequent overlapping memory access during specified vector scatter instruction execution.
Chang, Stephen S., Cache states for multiprocessor cache coherency protocols.
Hall Barbara A. (Endwell NY) Huang Kevin C. (Endicott NY) Jabusch John D. (Endwell NY) Ngai Agnes Y. (Endwell NY), Central processing unit checkpoint retry for store-in and store-through cache systems.
Buchholz Dale R. (Palatine IL), Channel access control in a communication system.
Chen Steve S. (Chippewa Falls) Simmons Frederick J. (Neillsville) Spix George A. (Eau Claire) Wilson Jimmie R. (Eau Claire) Miller Edward C. (Eau Claire) Eckert Roger E. (Eau Claire) Beard Douglas R., Cluster architecture for a highly parallel scalar/vector multiprocessor system.
Whaley Kenneth M. ; Tarolli Gary, Command data transport to a graphics processing device from a CPU performing write reordering operations.
Nagai Yasuhiro (Bunkyo JPX) Sasaki Ryoichi (Fujisawa JPX) Suzuki Michio (Yokohama NY JPX) Yosioka Shunichi (New York NY) Mizuhara Noboru (Kawasaki JPX), Communication circuit switching or parallel operation system.
Mendelsohn Noah R. (Arlington MA) Perchik James (Cambridge MA) Hancock Thomas R. (Somerville MA), Component replacement control for fault-tolerant data processing system.
Le Boudec Jean-Yves (Adliswil CHX) Truong Linh (Gattikon CHX), Connectionless ATM data services.
Nagashima, Shigeo; Torii, Shunichi; Omoda, Koichiro; Inagami, Yasuhiro, Data processing system including scalar data processor and vector data processor.
Papadopoulos Gregory M. (Acton MA) Nikhil Rishiyur S. (Arlington MA) Greiner Robert J. (Chandler AZ) Arvind (Arlington MA), Data processing system with synchronization coprocessor for multiple threads.
Papadopoulos Gregory M. (Burlington MA) Nikhil Rishiyur S. (Arlington MA) Greiner Robert J. (Chandler AZ) Arvind (Arlington MA), Data processing system with synchronization coprocessor for multiple threads.
Easki Hiroshi (Yokohama JPX) Natsubori Shigeyasu (Yokohama JPX) Saito Takeshi (Tokyo JPX) Tsuda Yoshiyuki (Kawasaki JPX) Matsuzawa Shigeo (Tokyo JPX), Data-transfer routing management for packet-oriented digital communication system including ATM networks.
Ogura Takao (Kawasaki JPX) Amemiya Shigeo (Kawasaki JPX) Tezuka Koji (Kawasaki JPX) Chujo Takafumi (Kawasaki JPX), Distributed control of telecommunication network for setting up an alternative communication path.
Ben-Ayed Mondher (Rochester NY) Merriam Charles W. (Rochester NY), Dynamic routing system for a multinode communications network.
Ackerman Dennis F. (Boynton Beach FL) Desai Himanshu H. (Boca Raton FL) Gupta Ram K. (Boca Raton FL) Srinivasan Ravi R. (Boca Raton FL), Exception handling method and apparatus for a microkernel data processing system.
Madan Herb. S. (Marina del Rey CA) Chow Edward (San Dimas CA), Fault tolerant hypercube computer system architecture.
Tsuchiya Paul F. (Lake Hopatcong NJ), General internet method for routing packets in a communications network.
Shu Renben (St. Paul MN) Du David H. C. (New Brighton MN), Improved hypercube topology for multiprocessor computer systems.
Flaig Charles M. (Pasadena CA) Seitz Charles L. (San Luis Rey CA), Inter-computer message routing system with each computer having separate routing automata for each dimension of the net.
Mario D. Nemirovsky ; Adolfo M. Nemirovsky ; Narendra Sankar, Interstream control and communications for multi-streaming digital processors.
Carter Nicholas P. ; Keckler Stephen W. ; Dally William J., Memory system with global address translation.
Nugent Steven F. (Portland OR), Message routing in a multiprocessor computer system.
Beard Douglas R. (Eleva WI) Phelps Andrew E. (Eau Claire WI) Woodmansee Michael A. (Eau Claire WI) Blewett Richard G. (Altoona WI) Lohman Jeffrey A. (Eau Claire WI) Silbey Alexander A. (Eau Claire WI, Method and apparatus for chaining vector instructions.
Drysdale, Tracy Garrett; Bobholz, Scott P, Method and apparatus for communicating between processing entities in a multi-processor.
Peterson John C. (Alta Loma CA) Chow Edward (San Dimas CA) Madan Herb S. (Marina del Rey CA), Method and apparatus for eliminating unsuccessful tries in a search tree.
Klausler Peter Michael, Method and apparatus for processing a set of data values with plural processing units mask bits generated by other processing units.
Dion Rodgers ; Darrell Boggs ; Amit Merchant ; Rajesh Kota ; Rachel Hsu ; Keshavan Tiruvallur, Method and apparatus for processing an event occurrence within a multithreaded processor.
Fossum Tryggve (Northboro MA) Hetherington Ricky C. (Northboro MA) Fite ; Jr. David B. (Northboro MA) Manley Dwight P. (Holliston MA) McKeen Francis X. (Westboro MA) Murray John E. (Acton MA), Method and apparatus using a cache and main memory for both vector processing and scalar processing by prefetching cache.
Rolfe David B. (West Hurley NY), Method for interconnecting and system of interconnected processing elements by controlling network density.
Chujo Takafumi (Hachiouji JPX) Komine Hiroaki (Yamato JPX) Miyazaki Keiji (Kawasaki JPX) Ogura Takao (Kawasaki JPX) Soejima Tetsuo (Tama JPX), Method for searching for alternate path in communication network.
Shiojiri Hirohisa (Tokyo JPX) Koga Toshio (Tokyo JPX), Method of adaptively multiplexing a plurality of video channel data using channel data assignment information obtained f.
Neches Philip M. (Pasadena CA), Multi processor sorting network for sorting while transmitting concurrently presented messages by message content to del.
Mori Kinji (Yokohama JPX) Miyamoto Shoji (Kawasaki JPX) Ihara Hirokazu (Machida JPX), Multi-dimensional structured computer system.
Barrett Linda (Raleigh NC) Long Lynn D. (Chapel Hill NC) Menditto Louis F. (Raleigh NC) Stagg Arthur J. (Raleigh NC) Ward Raymond E. (Durham NC), Multi-path channel (MPC) interface with user transparent, unbalanced, dynamically alterable computer input/output channe.
den Haan, Petrus A. M.; Hopmans, Franciscus P. M., Multi-processor computer system with distributed memory and an interprocessor communication mechanism, and method for operating such mechanism.
Summer ; Jr. Charles F. (Orlando FL) Pettus Robert O. (Lexington SC) Bonnell Ronald D. (Lexington SC) Huhns Michael N. (Irmo SC) Stephens Larry M. (Columbia SC), Multiple-microcomputer processing.
Baum Richard I. (Poughkeepsie NY) Brotman Charles H. (Poughkeepsie NY) Rymarczyk James W. (Poughkeepsie NY), Multiprocessing packet switching connection system having provision for error correction and recovery.
Yamazaki Takeshi (Tokyo JPX), Multiprocessor system for locally managing address translation table.
Frink Craig R. (Chelmsford MA) Bryg William R. (Saratoga CA) Chan Kenneth K. (San Jose CA) Hotchkiss Thomas R. (Groton MA) Odineal Robert D. (Roseville CA) Williams James B. (Lowell MA) Ziegler Micha, Multiprocessor system for maintaining cache coherency by checking the coherency in the order of the transactions being i.
Nesheim William A. ; Guzovskiy Aleksandr, Multiprocessor system having mapping table in each node to map global physical addresses to local physical addresses of.
Deneau, Thomas M., Multiprocessor system implementing virtual memory using a shared memory, and a page replacement method for maintaining paged memory coherence.
Teraslinna Kari T. (Boulder CO), N+K sparing in a telecommunications switching environment.
Baror Gigy, Organization of an integrated cache unit for flexible usage in supporting multiprocessor operations.
Ogura Takao (Kawasaki JPX) Amemiya Shigeo (Kawasaki JPX) Tezuka Koji (Kawasaki JPX) Chujo Takafumi (Kawasaki JPX), Packet directional path identifier transfer system.
Pierce Paul R. (Portland OR), Parallel processing system virtual connection method and apparatus with protection and flow control.
Bowles James E., Reducing cache snooping overhead in a multilevel cache system with inclusion field in shared cache indicating state of.
Scott, Steven L.; Dickson, Chris; Fromm, Eric C.; Anderson, Michael L., Remote address translation in a multiprocessor system.
Scott, Steven L., Remote translation mechanism for a multi-node system.
Childs Philip L. (Endicott NY) Olnowich Howard T. (Endicott NY) Skovira Joseph F. (Binghamton NY), SYNC-NET- a barrier synchronization apparatus for multi-stage networks.
Nickolls John R. (Los Altos CA) Zapisek John (Cupertino CA) Kim Won S. (Fremont CA) Kalb Jeffery C. (Saratoga CA) Blank W. Thomas (Palo Alto CA) Wegbreit Eliot (Palo Alto CA) Van Horn Kevin (Mountain, Scalable processor to processor and processor-to-I/O interconnection network and method for parallel processing arrays.
Beard Douglas R. (Eleva WI) Phelps Andrew E. (Eau Claire WI) Woodmansee Michael A. (Eau Claire WI) Blewett Richard G. (Altoona WI) Lohman Jeffrey A. (Eau Claire WI) Silbey Alexander A. (Eau Claire WI, Scalar/vector processor.
Dunning Dave (Portland OR), Self-timed mesh routing chip with data broadcasting.
Nakazato, Satoshi, Shared memory type vector processing system, including a bus for transferring a vector processing instruction, and control method thereof.
Meyers Steven D. (Hurley NY) Ngo Hung C. (Kingston NY) Schwartz Paul R. (Kingston NY), Single register arbiter circuit.
DeLano Eric R. ; Buckley Michael A. ; Weir Duncan C., Software assisted hardware TLB miss handler.
Schimmel Curt F., System and method for maintaining translation look-aside buffer (TLB) consistency.
Horie Takeshi (Kawasaki JPX) Ikesaka Morio (Yokohama JPX) Ishihata Hiroaki (Tokyo JPX), System for controlling communication between parallel computers.
Richard L. Frank ; Gopalan Arun ; Michael J. Cusson ; Daniel E. O'Shaughnessy, System for efficiently maintaining translation lockaside buffer consistency in a multi-threaded, multi-processor virtual memory system.
Stone Harold S. (Chappaqua NY), Technique for parallel synchronization.
Dally William J. (Arlington MA) Seitz Charles L. (San Luis Rey CA), Torus routing chip.
Hansen Craig C., Virtual memory system with local and global virtual address translation.

Patents citing this patent (42)

Mahan, Justin Michael; Hutchins, Edward A.; Toksvig, Michael J. M., Address independent shader program loading.
Stewart, Malcolm; Ors, Ali Osman; Laroche, Daniel, Apparatus and method of vector unit sharing.
Flachs, Brian; Johns, Charles Ray; Weigand, Ulrich, Ceasing parallel processing of first set of loops upon selectable number of monitored terminations and processing second set.
Karandikar, Ashish; Agarwal, Pooja, Configurable SIMD engine with high, low and mixed precision modes.
Karandikar, Ashish; Gadre, Shirish; Gruner, Frederick R.; Sijstermans, Franciscus W., Context switching on a video processor having a scalar execution unit and a vector execution unit.
Eichenberger, Alexandre E.; Flachs, Brian K.; Johns, Charles R.; Nutter, Mark R., Data parallel function call for determining if called routine is data parallel.
Eichenberger, Alexandre E.; Flachs, Brian K.; Johns, Charles R.; Nutter, Mark R., Data parallel function call for determining if called routine is data parallel.
Scott, Steven L.; Faanes, Gregory J., Decoupling of write address from its associated write data in a store to a shared memory in a multiprocessor system.
Nilsson, Anders; Tell, Eric; Alfredsson, Erik, Digital signal processor and method for addressing a memory in a digital signal processor.
Crow, Franklin C.; Sewall, Jeffrey R., Interrupt handling techniques in the rasterizer of a GPU.
Crow, Franklin C.; Sewall, Jeffrey R., Interrupt handling techniques in the rasterizer of a GPU.
Scott, Steven L., Latency tolerant distributed shared memory multiprocessor computer.
Karandikar, Ashish; Gadre, Shirish; Lew, Stephen D., Latency tolerant system for executing video processing operations.
Fukagawa, Masao, Loading/discarding acquired data for vector load instruction upon determination of prediction success of multiple preceding branch instructions.
Vasudevan, Nalini; Bharadwaj, Jayashankar; Hughes, Christopher J.; Girkar, Milind B.; Charney, Mark J.; Valentine, Robert; Lee, Victor W.; Kim, Daehyun; Hartono, Albert; Baghsorkhi, Sara S., Loop vectorization methods and apparatus.
Pautsch, Gregory W.; Pautsch, Adam, Method and apparatus for cooling electronic components.
Kohn, James R., Method and apparatus for indirectly addressed vector load-add-store across multi-processors.
Kohn, James R., Method and apparatus for indirectly addressed vector load-add-store across multi-processors.
Danskin, John M.; Tamasi, Anthony Michael, Method and system for implementing fragment operation processing across a graphics bus interconnect.
Bowen, Andrew D., Method and system for non stalling pipeline instruction fetching from memory.
Karandikar, Ashish; Gadre, Shirish; Salek, Amir H., Methods and systems for command acceleration in a video processor via translation of scalar instructions into vector instructions.
Bharadwaj, Jayashankar; Vasudevan, Nalini; Hartono, Albert; Baghsorkhi, Sara S., Methods and systems to vectorize scalar computer program loops having loop-carried dependences.
Lew, Stephen D.; Karandikar, Ashish; Gadre, Shirish; Sijstermans, Franciscus W., Multi context execution on a video processor.
Karandikar, Ashish; Gadre, Shirish; Lew, Stephen D.; Cheng, Christopher T., Multidimensional datapath processing in a video processor.
Garg, Atul; Sharma, Anil, Multistandard hardware video encoder.
Scott, Steven L.; Faanes, Gregory J.; Stephenson, Brick; Moore, Jr., William T.; Kohn, James R., Multistream processing memory-and barrier-synchronization method and apparatus.
Kanuri, Mrudula, Optimal use of buffer space by a storage controller which writes retrieved data directly to a memory.
Karandikar, Ashish; Gadre, Shirish; Sijstermans, Franciscus W.; Su, Zhiqiang Jonathan, Pipelined L2 cache for memory transfers for a video processor.
Vamanan, Balajee; Methar, Tukaram; Kanuri, Mrudula; Krishnan, Sreenivas, Processing of read requests in a memory controller using pre-fetch mechanism.
Mahan, Justin Michael; Hutchins, Edward A.; Kubalska, Ewa M.; Battle, James T., Program sequencer for generating indeterminant length shader programs for a graphics processor.
Lew, Stephen D.; Gadre, Shirish; Karandikar, Ashish; Sijstermans, Franciscus W., Programmable DMA engine for implementing memory transfers and video processing for a video processor.
Scott, Steven L.; Faanes, Gregory J.; Stephenson, Brick; Moore, Jr., William T.; Kohn, James R., Relaxed memory consistency model.
Sheets, Kitrick; Hastings, Andrew B., Remote translation mechanism for a multinode system.
Garg, Atul; Venkatapuram, Prahlad, Rewind-enabled hardware encoder.
Sheets, Kitrick; Williams, Josh; Gettler, Jonathan; Piatz, Steve; Hastings, Andrew B.; Hill, Peter; Bravatto, James G.; Kohn, James R.; Titus, Greg, Scheduling synchronization of programs running as streams on multiple processors.
Mahan, Justin Michael; Hutchins, Edward A., Shader program instruction fetch.
Mahan, Justin Michael; Hutchins, Edward A., Software assisted shader merging.
Su, Zhiqiang Jonathan; Karandikar, Ashish, State machine control for a pipelined L2 cache to implement memory transfers for a video processor.
Gadre, Shirish; Karandikar, Ashish; Lew, Stephen D., Stream processing in a video processor.
Faanes, Gregory J.; Lundberg, Eric P.; Scott, Steven L.; Baird, Robert J., System and method for processing memory instructions using a forced order queue.
Luu, Viet-Tam; Pflughaupt, Russell, Validating a graphics pipeline using pre-determined schedules.
Gadre, Shirish; Karandikar, Ashish; Lew, Stephen D.; Cheng, Christopher T., Video processor having scalar and vector components.


