[특허]VLIW computer processing architecture having a scalable number of register files

VLIW computer processing architecture having a scalable number of register files 원문보기

IPC분류정보
국가/구분	United States(US) Patent 등록
국제특허분류(IPC7판)	G06F-015/80 G06F-015/76
출원번호	US-0802289 (2001-03-08)
발명자 / 주소	Saulsbury,Ashley Parkin,Michael Rice,Daniel S.
출원인 / 주소	Sun Microsystems, Inc.
대리인 / 주소	Townsend and Townsend and Crew LLP
인용정보	피인용 횟수 : 7 인용 특허 : 35

초록 ▼

According to the invention, a processing core is disclosed. The processing core includes one or more processing pipelines and a number of register flies. The processing pipelines having a total of N-number of processing paths, where each of the processing paths processes instructions on M-bit data words. Each of the number of register files has Q-number of registers that are each M-bits wide. The Q-number of registers within each of the plurality of register files are either private or global registers. When a value is written to one of said Q-number of said registers, which is a global register within one of said number of register files, the value is propagated to a corresponding global register in the other of the number of register files. When a value is written to one of said Q-number of the registers, which is a private register within one of said number of register files, the value is not propagated to a corresponding register in the other of said number of register files.

대표청구항 ▼

What is claimed is: 1. A processing core comprising: one or more processing pipelines having a total of N-number of processing paths, each of said processing paths for processing instructions on M-bit data words; and a plurality of register files, each having Q-number of registers, said Q-number of registers being M-bits wide; wherein said Q-number of registers within each of said plurality of register files are both private and global registers, and wherein when a value is written to one of said Q-number of said registers which is a global register within one of said plurality of register files, said value is propagated to a corresponding global register in the other of said plurality of register files, and wherein when a value is written to one of said Q-number of said registers which is a private register within one of said plurality of register files, said value is not propagated to a corresponding register in the other of said plurality of register files, wherein each of said Q-number of registers is bi-modal to programmably operate in both private and global modes. 2. The processing core as recited in claim 1, wherein for even values of N that are greater than one, every two of said N-number of processing paths share one of said plurality of register files. 3. The processing core as recited in claim 1, wherein a processing instruction comprises N-number of P-bit instructions appended together to form a very long instruction word (VLIW), and said N-number of processing paths process N-number of P-bit instructions in parallel. 4. The processor chip as recited in claim 3, wherein M=64, Q=64, and P=32. 5. The processing core as recited in claim 1, wherein said processing pipeline comprises an execute stage which includes an execute unit for each of said N-number of M-bit processing paths, each of said execute units comprising an integer processing unit, a load/store processing unit, a floating point processing unit, or any combination of one or more of said integer processing units, said load/store processing units, and said floating point processing units. 6. The processing core as recited in claim 5, wherein an integer processing unit and a floating point processing unit share one of said plurality of register files. 7. The processing core as recited in claim 1, wherein Q=64, and a 64-bit special register stores bits indicating whether a register in a register file is a private register or a global register, each bit in the 64-bit special register corresponding to one of said registers in said register file. 8. The processing core as recited in claim 1, wherein each of said plurality of register files is connected to a bus, and a value written to a global register in one of said plurality of register files is propagated to a corresponding global register in the other of said plurality of register files across said bus. 9. The processing core as recited in claim 1, wherein said plurality of register files are connected together in serial, and a value written to a first global register in a first of said plurality of register files is propagated to a corresponding first global register in a second of said plurality of register files connected directly to said first of said plurality of register files. 10. A VLIW processing core comprising: one or more processing pipelines each including a fetch stage, a decode stage, an execute stage, and a write-back stage, said execute stage having an execute unit comprising an integer processing unit, a load/store processing unit, a floating point processing unit, or any combination of one or more of said integer processing units, said load/store processing units, or said floating point processing units; and a register file for each of said one or more processing pipelines; wherein: an integer processing unit and a floating point processing unit within said one or more processing pipelines both access said register file, the register file is comprised of Q-number of registers, said Q-number of registers comprise both private and global registers, whereby each of said Q-number of registers is dynamically configurable to operate in both private and global modes, when a value is written to a one of said Q-number of said registers that is a configured to global register mode within one of said plurality of register files, said value is propagated to a corresponding global register in another register file within the VLIW processing core, and when a value is written to the one of said Q-number of said registers that is configured to private register mode within one of said plurality of register files, said value is not propagated to a corresponding register in another register file within the VLIW processing core. 11. In a computer system, a scalable computer processing architecture, comprising: one or more processor chips, each comprising: a processing core, including: a processing pipeline having N-number of processing paths, each of said processing paths for processing instructions on M-bit data words; and a plurality of register files, each having Q-number of registers, said Q-number of registers being M-bits wide; an I/O link configured to communicate with other of said one or more processor chips, if more than one, or with I/O devices; a communication controller in electrical communication with said processing core and said I/O link; said communication controller for controlling the exchange of data between a first one of said one or more processor chips and said other of said one or more processor chips; wherein: said computer processing architecture can be scaled larger by connecting together two or more of said processor chips in parallel via said I/O links of said processor chips, so as to create multiple processing core pipelines which share data therebetween, said Q-number of registers within each of said plurality of register files comprise both private and global registers, whereby each of said Q-number of registers is bi-modal to switch between private and global modes, when a value is written to a one of said Q-number of said registers which is switched to global register mode within one of said plurality of register files, said value is propagated to a corresponding global register in the other of said plurality of register files, and when a value is written to the one of said Q-number of said registers which is switched to private register mode within one of said plurality of register flies, said value is not propagated to a corresponding register in the other of said plurality of register files. 12. The computer processing architecture as recited in claim 11, wherein in said processing core of each of said processor chips, for even values of N that are greater than one, every two of said N-number of processing paths share one of said plurality of register files. 13. The computer processing architecture as recited in claim 11, wherein a processing instruction comprises N-number of P-bit instructions appended together to form a very long instruction word (VLIW) , and said N-number of processing paths process N-number of P-bit instructions in parallel. 14. The computer processing architecture as recited in claim 13, wherein M=64, Q=64, and P=32. 15. The computer processing architecture as recited in claim 11, wherein said processing pipeline comprises an execute stage which includes an execute unit for each of said N-number of M-bit processing paths, each of said execute units comprising an integer processing unit, a load/store processing unit, a floating point processing unit, or any combination of one or more of said integer processing units, said load/store processing units, and said floating point processing units. 16. The computer processing architecture as recited in claim 15, wherein an integer processing unit and a floating point processing unit share one of said plurality of register files. 17. The computer processing architecture as recited in claim 11, wherein Q=64, and a 64-bit special register stores bits indicating whether a register in a register file is a private register or a global register, each bit in the 64-bit special register corresponding to one of said registers in said register file. 18. The computer processing architecture as recited in claim 11, wherein each of said plurality of register files is connected to a bus, and a value written to a global register in one of said plurality of register files is propagated to a corresponding global register in the other of said plurality of register files across said bus. 19. The computer processing architecture as recited in claim 18, wherein said plurality of register files are connected together in serial, and a value written to a first global register in a first of said plurality of register files is propagated to a corresponding first global register in a second of said plurality of register files connected directly to said first of said plurality of register files. 20. The processing core as recited in claim 1, wherein said Q-number of registers within each of said plurality of register files can switch between being either private or global registers.

이 특허에 인용된 특허 (35)

Kumar Rajendra (Sunnyvale CA) Emerson Paul G. (San Jose CA), Cache memory system having secondary cache integrated with primary cache for use with VLSI circuits.
상세보기
Dye Thomas A. (Cedar Park TX), Cached random access memory device and system.
상세보기
Leung Wingyu ; Tam Kit Sang, Caching in a multi-processor computer system.
상세보기
Witt David B. (Austin TX), Computer memory architecture including a replacement cache.
상세보기
Mukesh K. Patel ; Chitrabhanu Dasgupta, Constant pool reference resolution method.
상세보기
Rao G. R. Mohan, DRAM with integral SRAM and arithmetic-logic units.
상세보기
Michael C. Greim ; James R. Bartlett, DSP intercommunication network.
상세보기
Jouppi Norman P. (Palo Alto CA), Data processing system and method with prefetch buffers.
상세보기
Jouppi Norman P. (Palo Alto CA) Eustace Alan (Palo Alto CA), Data processing system and method with small fully-associative cache and prefetch buffers.
상세보기
Kronstadt Eric P. (Westchester County NY) Gandhi Sharad P. (Santa Clara CA), Distributed cache in dynamic rams.
상세보기
Rao G. R. Mohan, Dual port random access memories and systems using the same.
상세보기
Lai Konrad K. (Aloha OR), Exclusive and/or partially inclusive extension cache system and method to minimize swapping therein.
상세보기
Puar Deepraj S. (Sunnyvale CA) Ranganathan Ravi (Cupertino CA), Graphics controller integrated circuit without memory interface.
상세보기
Puar Deepraj S. (Sunnyvale CA) Ranganathan Ravi (Cupertino CA), Graphics controller integrated circuit without memory interface.
상세보기
Hagersten Erik ; Zak ; Jr. Robert C., Hybrid NUMA COMA caching system and methods for selecting between the caching modes.
상세보기
Liberty Dean A., Hybrid NUMA/S-COMA system and method.
상세보기
Cook Peter W. (Mount Kisco NY), IC chips including ALUs and identical register files whereby a number of ALUs directly and concurrently write results to.
상세보기
Saulsbury Ashley ; Nowatzyk Andreas ; Pong Fong, Integrated processor/memory device with victim data cache.
상세보기
Saulsbury Ashley ; Nowatzyk Andreas ; Pong Fong, Integrated processor/memory device with victim data cache.
상세보기
Pechanek Gerald G. ; Kurak ; Jr. Charles W., Manifold array processor.
상세보기
Cushing David E. (Chelmsford MA) Kelly Richard P. (Nashua NH) Ledoux Robert V. (Litchfield NH) Shen Jian-Kuo (Belmont MA), Mechanism for automatically updating multiple unit register file memories in successive cycles for a pipelined processin.
상세보기
Pechanek Gerald G. ; Revilla Juan G., Merged array controller and processing element.
상세보기
Engdahl Jonathan R. (Chardon OH) Gee David J. (Ann Arbor MI) Lucak Mark A. (Hudson OH) Adams Shawn L. (Rocky River OH), Method and apparatus for exchanging different classes of data during different time intervals.
상세보기
Boggs Darrell D. (Aloha OR) Colwell Robert P. (Portland OR) Fetterman Michael A. (Hillsboro OR) Glew Andrew F. (Hillsboro OR) Gupta Ashwani K. (Beaverton OR) Hinton Glenn J. (Portland OR) Papworth Da, Method and apparatus for maintaining a macro instruction for refetching in a pipelined processor.
상세보기
Gerald G. Pechanek ; Edwin F. Barry, Methods and apparatus for dynamic instruction controlled reconfiguration register file with extended precision.
상세보기
Thomas L. Drabenstott ; Gerald G. Pechanek ; Edwin F. Barry ; Charles W. Kurak, Jr., Methods and apparatus to support conditional execution in a VLIW-based array processor with subword execution.
상세보기
Fujishima Kazuyasu (Hyogo-ken JPX) Matsuda Yoshio (Hyogo-ken JPX) Asakura Mikio (Hyogo-ken JPX), Semiconductor memory device for simple cache system.
상세보기
Ward Stephen A. (Chestnut Hill MA) Zak Robert C. (Somerville MA), Set associative memory.
상세보기
Levy Henry M. ; Eggers Susan J. ; Lo Jack ; Tullsen Dean M., Shared register storage mechanisms for multithreaded computer systems with out-of-order execution.
상세보기
Jouppi Norman P. (Palo Alto CA), System and method for exclusive two-level caching.
상세보기
Rim Min-Joong,KRX, System for fetching unit instructions and multi instructions from memories of different bit widths and converting unit instructions to multi instructions by adding NOP instructions.
상세보기
Hsu Fu-Chieh ; Leung Wingyu, System utilizing a DRAM array as a next level cache memory and method for operating same.
상세보기
Baltz Philip K. ; Simar ; Jr. Ray L., User-configurable on-chip program memory system.
상세보기
Ito, Hironobu; Sato, Hisakazu, VLIW processor accepting branching to any instruction in an instruction word set to be executed consecutively.
상세보기
Masubuchi Yoshio (Kawasaki JPX), Very large instruction word type computer for performing a data transfer between register files through a signal line pa.
상세보기

이 특허를 인용한 특허 (7)

Laudon, James, Doubling thread resources in a processor.
상세보기
Laudon, James, Doubling thread resources in a processor.
상세보기
Tran, Thang, Managing power of thread pipelines according to clock frequency and voltage specified in thread registers.
상세보기
Tran, Thang, Multi-threading processors, integrated circuit devices, systems, and processes of operation and manufacture.
상세보기
Tran, Thang, Multithreaded processor with plurality of scoreboards each issuing to plurality of pipelines.
상세보기
Yeh,Tse Yu, Power consumption reduction in a pipeline by stalling instruction issue on a load miss.
상세보기
Gschwind,Michael Karl; Hofstee,Harm Peter; Hopkins,Martin E.; Kahle,James Allan, SIMD-RISC microprocessor architecture.
상세보기

IPC	Description
A	생활필수품
A62	인명구조; 소방(사다리 E06C)
A62B	인명구조용의 기구, 장치 또는 방법(특히 의료용에 사용되는 밸브 A61M 39/00; 특히 물에서 쓰이는 인명구조 장치 또는 방법 B63C 9/00; 잠수장비 B63C 11/00; 특히 항공기에 쓰는 것, 예. 낙하산, 투출좌석 B64D; 특히 광산에서 쓰이는 구조장치 E21F 11/00)
A62B-1/08	.. 윈치 또는 풀리에 제동기구가 있는 것

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 공고번호, 공고일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표출원인, 출원인국적, 출원인주소, 발명자, 발명자E, 발명자코드, 발명자주소, 발명자 우편번호, 발명자국적, 대표IPC, IPC코드, 요약, 미국특허분류, 대리인주소, 대리인코드, 대리인(한글), 대리인(영문), 국제공개일자, 국제공개번호, 국제출원일자, 국제출원번호, 우선권, 우선권주장일, 우선권국가, 우선권출원번호, 원출원일자, 원출원번호, 지정국, Citing Patents, Cited Patents
저장형식	Text(ASCII format) Excel format PIAS분석(.xls)
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

VLIW computer processing architecture having a scalable number of register files 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

이 특허에 인용된 특허 (35)

이 특허를 인용한 특허 (7)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

VLIW computer processing architecture having a scalable number of register files 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

전체(0) 논문(0) 특허(0) 보고서(0)

전체(0) 논문(0) 특허(0) 보고서(0)

이 특허에 인용된 특허 (35)

이 특허를 인용한 특허 (7)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트