$\require{mediawiki-texvc}$

연합인증

연합인증 가입 기관의 연구자들은 소속기관의 인증정보(ID와 암호)를 이용해 다른 대학, 연구기관, 서비스 공급자의 다양한 온라인 자원과 연구 데이터를 이용할 수 있습니다.

이는 여행자가 자국에서 발행 받은 여권으로 세계 각국을 자유롭게 여행할 수 있는 것과 같습니다.

연합인증으로 이용이 가능한 서비스는 NTIS, DataON, Edison, Kafe, Webinar 등이 있습니다.

한번의 인증절차만으로 연합인증 가입 서비스에 추가 로그인 없이 이용이 가능합니다.

다만, 연합인증을 위해서는 최초 1회만 인증 절차가 필요합니다. (회원이 아닐 경우 회원 가입이 필요합니다.)

연합인증 절차는 다음과 같습니다.

최초이용시에는
ScienceON에 로그인 → 연합인증 서비스 접속 → 로그인 (본인 확인 또는 회원가입) → 서비스 이용

그 이후에는
ScienceON 로그인 → 연합인증 서비스 접속 → 서비스 이용

연합인증을 활용하시면 KISTI가 제공하는 다양한 서비스를 편리하게 이용하실 수 있습니다.

Multistream processing memory-and barrier-synchronization method and apparatus 원문보기

IPC분류정보
국가/구분 United States(US) Patent 등록
국제특허분류(IPC7판)
  • G06F-012/00
  • G06F-009/52
  • G06F-009/46
출원번호 US-0643741 (2003-08-18)
등록번호 US-7437521 (2008-10-14)
발명자 / 주소
  • Scott,Steven L.
  • Faanes,Gregory J.
  • Stephenson,Brick
  • Moore, Jr.,William T.
  • Kohn,James R.
출원인 / 주소
  • Cray Inc.
대리인 / 주소
    Schwegman, Lundberg & Woessner, P.A.
인용정보 피인용 횟수 : 38  인용 특허 : 84

초록

A method and apparatus to provide specifiable ordering between and among vector and scalar operations within a single streaming processor (SSP) via a local synchronization (Lsync) instruction that operates within a relaxed memory consistency model. Various aspects of that relaxed memory consistency

대표청구항

What is claimed is: 1. An apparatus comprising: a memory interface; a plurality of queues connected to the memory interface, including a first queue and a second queue, wherein each of the plurality of queues holds pending memory requests and enforces an ordering in the commitment of the pending me

이 특허에 인용된 특허 (84)

  1. Nugent Steven F. (Portland OR), Adaptive message routing for multi-dimensional networks.
  2. Blasbalg Herman (Gaithersburg MD), Adaptive packet length traffic control in a local area network.
  3. Bruckert William (Northboro MA) Bissett Thomas D. (Derry NH) Kovalcin David (Grafton MA) Nene Ravi (Chelmsford MA), Apparatus and method for documenting faults in computing modules.
  4. Hashimoto Shin,JPX ; Masaki Reiji,JPX, Apparatus for analyzing operations of parallel processing system.
  5. Barnes George H. (Wayne PA) Lundstrom Stephen F. (Wayne PA) Shafer Philip E. (Holmes PA), Array processor architecture.
  6. Leedom George W. ; Moore William T., Associative scalar data cache with write-through capabilities for a vector processor.
  7. Vishin Sanjay ; Aybay Gunes, Auxiliary translation lookaside buffer for assisting in accessing data in remote address spaces.
  8. Kessler Richard E. ; Oberlin Steven M. ; Thorson Gregory M., Barrier and eureka synchronization architecture for multiprocessors.
  9. Oberlin Steven M. (Chippewa Falls WI) Fromm Eric C. (Eau Claire WI), Barrier synchronization for distributed memory massively parallel processing systems.
  10. Ishizaka Kenichi,JPX, Barrier synchronization system in parallel data processing.
  11. McMahan Steven C., Branch processing unit with target cache read prioritization protocol for handling multiple hits.
  12. Shibata Masabumi,JPX ; Nakajima Atsushi,JPX ; Fujiwara Shisei,JPX, Cache coherency control method and multi-processor system using the same.
  13. Koyanagi, Hisao, Cache consistent control of subsequent overlapping memory access during specified vector scatter instruction execution.
  14. Chang, Stephen S., Cache states for multiprocessor cache coherency protocols.
  15. Hall Barbara A. (Endwell NY) Huang Kevin C. (Endicott NY) Jabusch John D. (Endwell NY) Ngai Agnes Y. (Endwell NY), Central processing unit checkpoint retry for store-in and store-through cache systems.
  16. Buchholz Dale R. (Palatine IL), Channel access control in a communication system.
  17. Chen Steve S. (Chippewa Falls) Simmons Frederick J. (Neillsville) Spix George A. (Eau Claire) Wilson Jimmie R. (Eau Claire) Miller Edward C. (Eau Claire) Eckert Roger E. (Eau Claire) Beard Douglas R., Cluster architecture for a highly parallel scalar/vector multiprocessor system.
  18. Whaley Kenneth M. ; Tarolli Gary, Command data transport to a graphics processing device from a CPU performing write reordering operations.
  19. Nagai Yasuhiro (Bunkyo JPX) Sasaki Ryoichi (Fujisawa JPX) Suzuki Michio (Yokohama NY JPX) Yosioka Shunichi (New York NY) Mizuhara Noboru (Kawasaki JPX), Communication circuit switching or parallel operation system.
  20. Mendelsohn Noah R. (Arlington MA) Perchik James (Cambridge MA) Hancock Thomas R. (Somerville MA), Component replacement control for fault-tolerant data processing system.
  21. Le Boudec Jean-Yves (Adliswil CHX) Truong Linh (Gattikon CHX), Connectionless ATM data services.
  22. Nagashima, Shigeo; Torii, Shunichi; Omoda, Koichiro; Inagami, Yasuhiro, Data processing system including scalar data processor and vector data processor.
  23. Papadopoulos Gregory M. (Acton MA) Nikhil Rishiyur S. (Arlington MA) Greiner Robert J. (Chandler AZ) Arvind (Arlington MA), Data processing system with synchronization coprocessor for multiple threads.
  24. Papadopoulos Gregory M. (Burlington MA) Nikhil Rishiyur S. (Arlington MA) Greiner Robert J. (Chandler AZ) Arvind (Arlington MA), Data processing system with synchronization coprocessor for multiple threads.
  25. Easki Hiroshi (Yokohama JPX) Natsubori Shigeyasu (Yokohama JPX) Saito Takeshi (Tokyo JPX) Tsuda Yoshiyuki (Kawasaki JPX) Matsuzawa Shigeo (Tokyo JPX), Data-transfer routing management for packet-oriented digital communication system including ATM networks.
  26. Faanes,Gregory J.; Scott,Steven L.; Lundberg,Eric P.; Moore, Jr.,William T.; Johnson,Timothy J., Decoupled scalar/vector computer architecture system and method.
  27. Morton Steven G., Digital signal processor containing scalar processor and a plurality of vector processors operating from a single instruction.
  28. Ogura Takao (Kawasaki JPX) Amemiya Shigeo (Kawasaki JPX) Tezuka Koji (Kawasaki JPX) Chujo Takafumi (Kawasaki JPX), Distributed control of telecommunication network for setting up an alternative communication path.
  29. Ben-Ayed Mondher (Rochester NY) Merriam Charles W. (Rochester NY), Dynamic routing system for a multinode communications network.
  30. Madan Herb. S. (Marina del Rey CA) Chow Edward (San Dimas CA), Fault tolerant hypercube computer system architecture.
  31. Tsuchiya Paul F. (Lake Hopatcong NJ), General internet method for routing packets in a communications network.
  32. Shu Renben (St. Paul MN) Du David H. C. (New Brighton MN), Improved hypercube topology for multiprocessor computer systems.
  33. Kohn,James R., Indirectly addressed vector load-operate-store method and apparatus.
  34. Flaig Charles M. (Pasadena CA) Seitz Charles L. (San Luis Rey CA), Inter-computer message routing system with each computer having separate routinng automata for each dimension of the net.
  35. Thomas Basil Smith, III ; Robert Brett Tremaine, Memory system for permitting simultaneous processor access to a cache line and sub-cache line sectors fill and writeback to a system memory.
  36. Carter Nicholas P. ; Keckler Stephen W. ; Dally William J., Memory system with global address translation.
  37. Nugent Steven F. (Portland OR), Message routing in a multiprocessor computer system.
  38. Beard Douglas R. (Eleva WI) Phelps Andrew E. (Eau Claire WI) Woodmansee Michael A. (Eau Claire WI) Blewett Richard G. (Altoona WI) Lohman Jeffrey A. (Eau Claire WI) Silbey Alexander A. (Eau Claire WI, Method and apparatus for chaining vector instructions.
  39. Drysdale, Tracy Garrett; Bobholz, Scott P, Method and apparatus for communicating between processing entities in a multi-processor.
  40. Peterson John C. (Alta Loma CA) Chow Edward (San Dimas CA) Madan Herb S. (Marina del Rey CA), Method and apparatus for eliminating unsuccessful tries in a search tree.
  41. Shailender Chaudhry ; Marc Tremblay ; James M. O'Connor, Method and apparatus for enforcing memory reference dependencies through a load store unit.
  42. Dion Rodgers ; Darrell Boggs ; Amit Merchant ; Rajesh Kota ; Rachel Hsu ; Keshavan Tiruvallur, Method and apparatus for processing an event occurrence within a multithreaded processor.
  43. Fossum Tryggve (Northboro MA) Hetherington Ricky C. (Northboro MA) Fite ; Jr. David B. (Northboro MA) Manley Dwight P. (Holliston MA) McKeen Francis X. (Westboro MA) Murray John E. (Acton MA), Method and apparatus using a cache and main memory for both vector processing and scalar processing by prefetching cache.
  44. Seznec, Andre C., Method for ensuring maximum bandwidth on accesses to strided vectors in a bank-interleaved cache.
  45. Rolfe David B. (West Hurley NY), Method for interconnecting and system of interconnected processing elements by controlling network density.
  46. Chujo Takafumi (Hachiouji JPX) Komine Hiroaki (Yamato JPX) Miyazaki Keiji (Kawasaki JPX) Ogura Takao (Kawasaki JPX) Soejima Tetsuo (Tama JPX), Method for searching for alternate path in communication network.
  47. Shiojiri Hirohisa (Tokyo JPX) Koga Toshio (Tokyo JPX), Method of adaptively multiplexing a plurality of video channel data using channel data assignment information obtained f.
  48. Neches Philip M. (Pasadena CA), Multi processor sorting network for sorting while transmitting concurrently presented messages by message content to del.
  49. Mori Kinji (Yokohama JPX) Miyamoto Shoji (Kawasaki JPX) Ihara Hirokazu (Machida JPX), Multi-dimensional structured computer system.
  50. Barrett Linda (Raleigh NC) Long Lynn D. (Chapel Hill NC) Menditto Louis F. (Raleigh NC) Stagg Arthur J. (Raleigh NC) Ward Raymond E. (Durham NC), Multi-path channel (MPC) interface with user transparent, unbalanced, dynamically alterable computer input/output channe.
  51. den Haan, Petrus A. M.; Hopmans, Franciscus P. M., Multi-processor computer system with distributed memory and an interprocessor communication mechanism, and method for operating such mechanism.
  52. Baum Richard I. (Poughkeepsie NY) Brotman Charles H. (Poughkeepsie NY) Rymarczyk James W. (Poughkeepsie NY), Multiprocessing packet switching connection system having provision for error correction and recovery.
  53. Yamazaki Takeshi (Tokyo JPX), Multiprocessor system for locally managing address translation table.
  54. Frink Craig R. (Chelmsford MA) Bryg William R. (Saratoga CA) Chan Kenneth K. (San Jose CA) Hotchkiss Thomas R. (Groton MA) Odineal Robert D. (Roseville CA) Williams James B. (Lowell MA) Ziegler Micha, Multiprocessor system for maintaining cache coherency by checking the coherency in the order of the transactions being i.
  55. Nesheim William A. ; Guzovskiy Aleksandr, Multiprocessor system having mapping table in each node to map global physical addresses to local physical addresses of.
  56. Deneau, Thomas M., Multiprocessor system implementing virtual memory using a shared memory, and a page replacement method for maintaining paged memory coherence.
  57. Teraslinna Kari T. (Boulder CO), N+K sparing in a telecommunications switching environment.
  58. Barlow,Stephen; Bailey,Neil; Ramsdale,Timothy; Plowman,David; Swann,Robert, Narrow/wide cache.
  59. Baror Gigy, Organization of an integrated cache unit for flexible usage in supporting multiprocessor operations.
  60. Ogura Takao (Kawasaki JPX) Amemiya Shigeo (Kawasaki JPX) Tezuka Koji (Kawasaki JPX) Chujo Takafumi (Kawasaki JPX), Packet directional path identifier transfer system.
  61. Pierce Paul R. (Portland OR), Parallel processing system virtual connection method and apparatus with protection and flow control.
  62. Bowles James E., Reducing cache snooping overhead in a multilevel cache system with inclusion field in shared cache indicating state of.
  63. Scott, Steven L.; Dickson, Chris; Fromm, Eric C.; Anderson, Michael L., Remote address translation in a multiprocessor system.
  64. Scott, Steven L., Remote translation mechanism for a multi-node system.
  65. Childs Philip L. (Endicott NY) Olnowich Howard T. (Endicott NY) Skovira Joseph F. (Binghamton NY), SYNC-NET- a barrier synchronization apparatus for multi-stage networks.
  66. Nickolls John R. (Los Altos CA) Zapisek John (Cupertino CA) Kim Won S. (Fremont CA) Kalb Jeffery C. (Saratoga CA) Blank W. Thomas (Palo Alto CA) Wegbreit Eliot (Palo Alto CA) Van Horn Kevin (Mountain, Scalable processor to processor and processor-to-I/O interconnection network and method for parallel processing arrays.
  67. Beard Douglas R. (Eleva WI) Phelps Andrew E. (Eau Claire WI) Woodmansee Michael A. (Eau Claire WI) Blewett Richard G. (Altoona WI) Lohman Jeffrey A. (Eau Claire WI) Silbey Alexander A. (Eau Claire WI, Scalar/vector processor.
  68. Dunning Dave (Portland OR), Self-timed mesh routing chip with data broadcasting.
  69. Nakazato, Satoshi, Shared memory type vector processing system, including a bus for transferring a vector processing instruction, and control method thereof.
  70. Meyers Steven D. (Hurley NY) Ngo Hung C. (Kingston NY) Schwartz Paul R. (Kingston NY), Single register arbiter circuit.
  71. DeLano Eric R. ; Buckley Michael A. ; Weir Duncan C., Software assisted hardware TLB miss handler.
  72. Dutton Patrick Francis ; Gregor Steven Lee ; Li Hehching Harry, Storage subsystem including an error correcting cache and means for performing memory to memory transfers.
  73. Schimmel Curt F., System and method for maintaining translation look-aside buffer (TLB) consistency.
  74. David Parks, System and method providing cache coherency and atomic memory operations in a multiprocessor computer architecture.
  75. Horie Takeshi (Kawasaki JPX) Ikesaka Morio (Yokohama JPX) Ishihata Hiroaki (Tokyo JPX), System for controlling communication between parallel computers.
  76. Richard L. Frank ; Gopalan Arun ; Michael J. Cusson ; Daniel E. O'Shaughnessy, System for efficiently maintaining translation lockaside buffer consistency in a multi-threaded, multi-processor virtual memory system.
  77. Sakai Kenichi (Yohohama JPX), System for releasing suspended execution of scalar instructions following a wait instruction immediately upon change of.
  78. Stone Harold S. (Chappaqua NY), Technique for parallel synchronization.
  79. Dally William J. (Arlington MA) Seitz Charles L. (San Luis Rey CA), Torus routing chip.
  80. Faanes, Gregory J.; Lundberg, Eric P., Vector and scalar data cache for a vector multiprocessor.
  81. Gregory J. Faanes ; Eric P. Lundberg, Vector and scalar data cache for a vector multiprocessor.
  82. Kamiya Yasuaki (Tokyo JPX), Vector processing system for invalidating scalar cache memory block indicated by address in tentative vector store instr.
  83. Hansen Craig C., Virtual memory system with local and global virtual address translation.
  84. Van Loo William C. (Palo Alto CA) Ebrahim Zahir (Mountain View CA) Nishtala Satyanarayana (Cupertino CA) Normoyle Kevin (San Jose CA) Loewenstein Paul (Palo Alto CA) Coffin ; III Louis F. (San Jose C, Writeback cancellation processing system for use in a packet switched cache coherent multiprocessor system.

이 특허를 인용한 특허 (38)

  1. Ohlgren, Harry Carl Håkan; Lindquist, Carl Tobias, Allocating audio processing among a plurality of processing units with a global synchronization pulse.
  2. Godard, Roger Rawson; Kahlich, Arthur David; Schukat, Jan, CPU security mechanisms employing thread-specific protection domains.
  3. Morris, Terrel, Computer system and method for sharing computer memory.
  4. Morris, Terrel, Computer system and method for sharing computer memory.
  5. Steinmacher-Burow, Burkhard, Conditional access with timeout.
  6. Voigt, Douglas L., Coordinating replication of data stored in a non-volatile memory-based system.
  7. Scott, Steven L.; Faanes, Gregory J., Decoupling of write address from its associated write data in a store to a shared memory in a multiprocessor system.
  8. Reynolds, Nathan, Dynamic atomic bitsets.
  9. Guthrie, Guy L.; Helterhoff, Harmony L.; Jeremiah, Thomas L.; Ng, Alvan W.; Starke, William J.; Stuecheli, Jeffrey A.; Williams, Philip G., Empirically based dynamic control of acceptance of victim cache lateral castouts.
  10. Cargnoni, Robert A.; Guthrie, Guy L.; Helterhoff, Harmony L.; Starke, William J.; Stuecheli, Jeffrey A.; Williams, Phillip G., Empirically based dynamic control of transmission of victim cache lateral castouts.
  11. Hughes, Christopher J.; Chen, Yen-Kuang (Y. K.); Bomb, Mayank; Brandt, Jason W.; Buxton, Mark J.; Charney, Mark J.; Chennupaty, Srinivas; Corbal, Jesus; Dixon, Martin G.; Girkar, Milind B.; Hall, Jonathan C.; Ido, Hideki (Saito); Lachner, Peter; Neiger, Gilbert; Newburn, Chris J.; Parthasarathy, Rajesh S.; Toll, Bret L.; Valentine, Robert; Wiedemeier, Jeffrey G., Gathering and scattering multiple data elements.
  12. Hughes, Christopher J.; Chen, Yen-Kuang (Y. K.); Bomb, Mayank; Brandt, Jason W.; Buxton, Mark J.; Charney, Mark J.; Chennupaty, Srinivas; Corbal, Jesus; Dixon, Martin G.; Girkar, Milind B.; Hall, Jonathan C.; Ido, Hideki (Saito); Lachner, Peter; Neiger, Gilbert; Newburn, Chris J.; Parthasarathy, Rajesh S.; Toll, Bret L.; Valentine, Robert; Wiedemeier, Jeffrey G., Gathering and scattering multiple data elements.
  13. Guthrie, Guy L.; Ng, Alvan W.; Siegel, Michael S.; Starke, William J.; Williams, Derek E.; Williams, Phillip G., Handling castout cache lines in a victim cache.
  14. Ould-Ahmed-Vall, Elmoustapha; Doshi, Kshitij A.; Sair, Suleyman; Yount, Charles R., Instruction and logic to provide stride-based vector load-op functionality with mask duplication.
  15. Ould-Ahmed-Vall, Elmoustapha; Doshi, Kshitij A.; Sair, Suleyman; Yount, Charles R., Instruction and logic to provide vector loads with strides and masking functionality.
  16. Scott, Steven L., Latency tolerant distributed shared memory multiprocessor computer.
  17. Guthrie, Guy L.; Ng, Alvan W.; Siegel, Michael S.; Starke, William J.; Williams, Derek E.; Williams, Phillip G., Lateral cache-to-cache cast-in.
  18. Guthrie, Guy L.; Le, Hien M.; Ng, Alvan W.; Siegel, Michael S.; Williams, Derek E.; Williams, Phillip G., Lateral castout (LCO) of victim cache line in data-invalid state.
  19. Kohn, James R., Method and apparatus for indirectly addressed vector load-add-store across multi-processors.
  20. Guthrie, Guy L.; Helterhoff, Harmony L.; Starke, William J.; Williams, Phillip G.; Stuecheli, Jeffrey A., Mode-based castout destination selection.
  21. Bent, John M.; Faibish, Sorin; Grider, Gary, Partitioned key-value store with atomic memory operations.
  22. Sprangle, Eric; Rohillah, Anwar; Cavin, Robert; Forsyth, Tom; Abrash, Michael, Processor and system using a mask register to track progress of gathering and prefetching elements from memory.
  23. Lang, Christian Alexander; Mihaila, George Andrei; Stanoi, Ioana Roxana, Providing consistency in processing data streams.
  24. Scott, Steven L.; Faanes, Gregory J.; Stephenson, Brick; Moore, Jr., William T.; Kohn, James R., Relaxed memory consistency model.
  25. Sheets, Kitrick; Hastings, Andrew B., Remote translation mechanism for a multinode system.
  26. Guthrie, Guy L.; Starke, William J.; Stuecheli, Jeffrey; Williams, Derek E.; Puzak, Thomas R., Selective cache-to-cache lateral castouts.
  27. Blocksome, Michael; Dozsa, Gabor; Gooding, Thomas M.; Heidelberger, Philip; Kumar, Sameer; Mamidala, Amith R.; Miller, Douglas, Shared address collectives using counter mechanisms.
  28. Faanes, Gregory J.; Lundberg, Eric P.; Scott, Steven L.; Baird, Robert J., System and method for processing memory instructions using a forced order queue.
  29. Sprangle, Eric; Rohillah, Anwar; Cavin, Robert; Forsyth, Andrew T.; Abrash, Michael, System and method for using a mask register to track progress of gathering and scattering elements between data registers and memory.
  30. Biswas, Sukalpa; Shiu, Shinye; Wang, James, System cache with cache hint control.
  31. Daly, Jr., George William; Guthrie, Guy Lynn; Leavens, Ross Boyd; McDonald, Joseph Gerald; Siegel, Michael Steven; Starke, William John; Williams, Derek Edward, Techniques for write-after-write ordering in a coherency managed processor system that employs a command pipeline.
  32. Eichenberger, Alexandre E.; Gschwind, Michael K.; Salapura, Valentina, Vector loads with multiple vector elements from a same cache line in a scattered load operation.
  33. Ge, Yi; Takebe, Yoshimasa; Takahashi, Hiromasa, Vector processing circuit, command issuance control method, and processor system.
  34. Guthrie, Guy L.; Siegel, Michael S.; Starke, William J.; Williams, Derek E., Victim cache lateral castout targeting.
  35. Guthrie, Guy L.; Starke, William J.; Stuecheli, Jeffrey A.; Williams, Phillip G., Victim cache prefetching.
  36. Guthrie, Guy L.; Jeremiah, Thomas L.; McNeil, William L.; Patel, Piyush C.; Starke, William J.; Stuecheli, Jeffrey A., Victim cache replacement.
  37. Arimilli, Ravi K.; Guthrie, Guy L.; Cargnoni, Robert A.; Starke, William J.; Williams, Derek E., Virtual barrier synchronization cache.
  38. Arimilli, Ravi K.; Guthrie, Guy L.; Siegel, Michael; Starke, William J.; Williams, Derek E., Virtual barrier synchronization cache castout election.
섹션별 컨텐츠 바로가기

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

AI-Helper 아이콘
AI-Helper
안녕하세요, AI-Helper입니다. 좌측 "선택된 텍스트"에서 텍스트를 선택하여 요약, 번역, 용어설명을 실행하세요.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.

선택된 텍스트

맨위로