[Patent] Methods for improved simulation of integrated circuit designs


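IPC classification information
Country / Type	United States (US) Patent, Granted
International Patent Classification (IPC 7th edition)	G06F-017/50
Application number	US-0082971 (2008-04-14)
Registration number	US-8438003 (2013-05-07)
Inventor / Address	Agarwal, Rakesh; Baltaretu, Oana
Applicant / Address	Cadence Design Systems, Inc.
Agent / Address	Sawyer Law Group, P.C.
Citation information	Times cited: 0; Patents cited: 58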

Abstract

A method of improved simulator processing is provided. The method according to the current invention includes grouping frequently accessed data into one set id to improve memory hierarchy performance. The method further includes simulating predication in a non-predicated architecture to improve CPU performance. The simulated predication includes pseudo-predicated implementation of read-operation vector element access, pseudo-predicated implementation of write-operation vector element access, and predicated implementation of multi-way branches with assignment statements having a same left-hand-side (lhs). The method further includes determining a selection path in a multi-sensitive “always” block to reduce taken branches. The multi-sensitive “always” block selection path determination includes generating instance-specific code to save port allocation storage, and generating inlined instance-specific code to combine sensitive actions. The method further includes regenerating code affected by the assignment statement to implement value-change callback.
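The idea of a predicated multi-way branch whose arms all assign the same lhs can be illustrated with a small sketch. This is not the patented code generator; it only shows, under assumptions, how an if / else-if / else chain might be replaced by straight-line mask arithmetic. The function names `select_branching` and `select_predicated`, and the use of integer masks, are hypothetical choices made for this illustration.

```python
def select_branching(c0, c1, a, b, d):
    """Reference form: a multi-way branch where every arm assigns the same lhs."""
    if c0:
        lhs = a
    elif c1:
        lhs = b
    else:
        lhs = d
    return lhs


def select_predicated(c0, c1, a, b, d):
    """Branch-free form (illustrative assumption, not the patent's generated code).

    Each predicate becomes a mask that is all ones (-1) when its arm is
    selected and 0 otherwise. Exactly one mask is non-zero, so OR-ing the
    masked values yields the value of the selected arm.
    """
    m0 = -int(bool(c0))          # first arm taken when c0 holds
    m1 = -int(bool(c1)) & ~m0    # second arm only if the first was not taken
    m2 = ~(m0 | m1)              # else arm when neither predicate holds
    return (a & m0) | (b & m1) | (d & m2)
```

Both functions return the same value for every combination of the two conditions; the second one simply evaluates all arms and keeps one, which is the trade-off predication makes to avoid taken branches.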

Representative claims

1. A computer-implemented method of improving simulator processing, the method comprising:
  allocating data used by a simulation scheduler;
  simulating predication in a non-predicated architecture, wherein the simulated predication comprises:
    determination of a maximum pseudo-predicated instruction sequence length by considering target machine microarchitecture characteristics;
    implementation of multi-valued read-operation and multi-valued write-operation vector element access, wherein any of the multi-value read-operation and the multi-valued write-operation can be expressed as 0/1/X/Z bits; and
    implementation of multi-way branches with assignment statements having a same left-hand-side (lhs);
  determining a selection path in a multi-sensitive “always” block to reduce taken multi-way branches, and
  generating code;
  wherein allocating data used by a simulation scheduler further comprises:
    probing a line size of a processor cache;
    providing a software override of a value of the probed line size; and
    selecting one or more of a core routine algorithm and data structure for the simulation scheduler, wherein a sum of line sizes is not greater than a d1_linesize, wherein the d1_linesize is a line size of a level 1 data cache.
2. The computer-implemented method of claim 1, wherein a start address of the data structure is aligned at an address that is a multiple of the d1_linesize.
3. The computer-implemented method of claim 1, wherein a user specifies a set id of a class of central routines (S) as either a fixed value between a range of 0 and S−1 inclusive, or as a randomly chosen value in the range of 0 and S−1.
4. The computer-implemented method of claim 1, further comprising: applying programming constructs, wherein the programming constructs are unique to hardware description language (HDL).
5. The computer-implemented method of claim 1, wherein target machine microarchitecture characteristics are measured and the maximum pseudo-predicated instruction sequence length is determined, wherein a compiler-user-specified parameter can override the measured characteristics.
6. The computer-implemented method of claim 1, wherein a first phantom element at index −1 of each vector is introduced to conduct a pseudo-predicated evaluation of each vector.
7. The computer-implemented method of claim 1, wherein a second phantom element at index −2 of each vector is introduced and when the vector has X/Z bits the −2 index is a temporary storage location.
8. The computer-implemented method of claim 1, wherein assignment statements of the multi-way branch are converted to allow for the predication in a non-predicated architecture.
9. The computer-implemented method of claim 8, wherein assignment statements of the multi-way branch only having an “else” clause are converted to allow for the predication in a non-predicated architecture.
10. The computer-implemented method of claim 1, wherein code is inlined for each instance of a small module that directly encodes an actual parameter address.
11. The computer-implemented method of claim 10, wherein the module is viewed at compile time.
12. The computer-implemented method of claim 1, wherein if X/Z bits are present, a separate code area is branched for handling.
13. The computer-implemented method of claim 1, wherein condition checks are done only by mainline code, whereas code for statement bodies for each condition is stored in a separate code area.
14. The computer-implemented method of claim 13, wherein nesting of the separate code area is provided.
15. The computer-implemented method of claim 1, wherein an acc_vcl_add( ) command is executed when the generated code for an assignment is affected by a temporal call.
16. The computer-implemented method of claim 1, further comprising: assigning a unique id to each one of a format specifier.
17. The computer-implemented method of claim 16, wherein an I/O command only sends the format specifier id and data values to an I/O subsystem.
18. The computer-implemented method of claim 17, wherein the I/O subsystem runs on a separate processor/thread to offload a main simulation processor.
19. A system comprising:
  a memory; and
  a processor configured to:
    simulate predication in a non-predicated architecture, wherein the simulated predication comprises:
      determination of a maximum pseudo-predicated instruction sequence length by considering target machine microarchitecture characteristics;
      implementation of multi-valued read-operation and multi-valued write-operation vector element access, wherein any of the multi-valued read-operation and multi-valued write-operation can be expressed as 0/1/X/Z bits;
      implementation of multi-way branches with assignment statements having a same left-hand-side (lhs);
    and determining a selection path in a multi-sensitive “always” block to reduce taken multi-way branches,
    and generating code;
    wherein to allocate data used by a simulation scheduler further comprises to:
      probe a line size of a processor cache;
      provide a software override of a value of the probed line size; and
      select one or more of a core routine algorithm and data structure for the simulation scheduler, wherein a sum of line sizes is not greater than a d1_linesize, wherein the d1_linesize is a line size of a level 1 data cache.
20. The system of claim 19, wherein the processor is further configured to: apply programming constructs, wherein the programming constructs are unique to hardware description language (HDL).
21. The system of claim 19, wherein a start address of the data structure is aligned at an address that is a multiple of the d1_linesize.
22. The system of claim 19, wherein a user specifies a set id of a class of central routines (S) as either a fixed value between a range of 0 and S−1 inclusive, or as a randomly chosen value in the range of 0 and S−1.
23. The system of claim 19, wherein target machine microarchitecture characteristics are measured and the maximum pseudo-predicated instruction sequence length is determined, wherein a compiler-user-specified parameter can override the measured characteristics.
24. A non-transitory computer readable storage medium containing program instructions for improving simulator processing, wherein execution of program instructions by one or more processors of a computer causes the one or more processors to carry out the steps of:
  simulating predication in a non-predicated architecture, wherein the simulated predication comprises:
    determination of a maximum pseudo-predicated instruction sequence length by considering target machine microarchitecture characteristics;
    implementation of multi-valued read-operation and multi-valued write-operation vector element access, wherein any of the multi-valued read-operation and multi-valued write-operation can be expressed as 0/1/X/Z bits;
    implementation of multi-way branches with assignment statements having a same left-hand-side (lhs);
  determining a selection path in a multi-sensitive “always” block to reduce taken multi-way branches, and
  generating code;
  wherein allocating data used by a simulation scheduler further comprises:
    probing a line size of a processor cache;
    providing a software override of a value of the probed line size; and
    selecting one or more of a core routine algorithm and data structure for the simulation scheduler, wherein a sum of line sizes is not greater than a d1_linesize, wherein the d1_linesize is a line size of a level 1 data cache.
25. The non-transitory computer readable storage medium of claim 24, further comprising: applying programming constructs, wherein the programming constructs are unique to hardware description language (HDL).
26. The non-transitory computer readable storage medium of claim 24, wherein a start address of the data structure is aligned at an address that is a multiple of the d1_linesize.
27. The non-transitory computer readable storage medium of claim 24, wherein a user specifies a set id of a class of central routines (S) as either a fixed value between a range of 0 and S−1 inclusive, or as a randomly chosen value in the range of 0 and S−1.
28. The non-transitory computer readable storage medium of claim 24, wherein target machine microarchitecture characteristics are measured and the maximum pseudo-predicated instruction sequence length is determined, wherein a compiler-user-specified parameter can override the measured characteristics.
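Claim 6 mentions a phantom element at index −1 of each vector used for pseudo-predicated evaluation. The claims do not spell out the mechanism, so the following is only one plausible reading, stated as an assumption: a conditional element write is turned into an unconditional store whose target index is redirected to the phantom slot when the predicate is false. The class name `PhantomVector`, its methods, and the physical layout (logical index −1 stored at physical slot 0) are hypothetical and chosen for this sketch.

```python
class PhantomVector:
    """Vector of n elements plus one phantom slot at logical index -1.

    Illustrative assumption only: logical index i is kept at physical
    position i + 1, so the phantom element occupies physical slot 0.
    """

    def __init__(self, n, fill=0):
        self.n = n
        self._cells = [fill] * (n + 1)

    def write(self, index, value, pred):
        # Compute the effective index arithmetically instead of branching:
        # it equals `index` when pred holds and -1 (the phantom) otherwise.
        p = int(bool(pred))
        effective = index * p - (1 - p)
        # The store always executes; a false predicate only dirties the phantom.
        self._cells[effective + 1] = value

    def read(self, index):
        # index may be -1 to inspect the phantom slot.
        return self._cells[index + 1]
```

For example, after `v = PhantomVector(4)`, the call `v.write(3, 9, False)` leaves elements 0 to 3 untouched and stores 9 in the phantom slot, while `v.write(2, 7, True)` updates element 2 as usual.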

Patents cited by this patent (58)

Gaither, Blaine D.; Smith, Robert B., Analyzing effectiveness of a computer cache by estimating a hit rate based on applying a subset of real-time addresses to a model of the cache.
Blandy, Geoffrey Owen, Apparatus and method for implementing switch instructions in an IA64 architecture.
Steely, Jr., Simon C.; Macri Joseph Dominic, Apparatus and method for serialized set prediction.
Steely, Jr., Simon C.; Macri Joseph Dominic, Apparatus and method for serialized set prediction.
Babaian Boris A., RUX; Gruzdov Feodor A., RUX; Sakhin Yuli Kh., RUX; Volin Vladimir S., RUX; Volkonski Vladimir Yu., RUX, Architectural support for software pipelining of nested loops.
Killian, Earl A.; Gonzalez, Ricardo E.; Dixit, Ashish B.; Lam, Monica; Lichtenstein, Walter D.; Rowen, Christopher; Ruttenberg, John C.; Wilson, Robert P.; Wang, Albert Ren Rui; Maydan, Dror Eliezer, Automated processor generation system for designing a configurable processor and method for the same.
Bae Jong Hong, KRX; Hong Se Kyoung, KRX, Branch prediction apparatus having branch target buffer for effectively processing branch instruction.
Emma, Philip G.; Hartstein, Allan M.; Langston, Keith N.; Prasky, Brian R.; Puzak, Thomas R.; Webb, Charles F., Branch prediction instructions having mask values involving unloading and loading branch history data.
Nonomura Yo, JPX; Kikuchi Sumio, JPX, Branch predictor.
Franke Hubertus; Pattnaik Pratap Chandra; Krieger Orran Yaakov; Baransky Yurij Andrij, Cache architecture to enable accurate cache sensitivity.
Sato, Mitsuru; Kumon, Kouichi, Cache device and control method for controlling cache memories in a multiprocessor system.
Liao, Shih-wei; Rakvic, Ryan N.; Hankins, Richard A.; Wang, Hong; Wu, Gansha; Lueh, Guei-Yuan; Tian, Xinmin; Petersen, Paul M.; Shah, Sanjiv; Diep, Trung; Shen, John; Chinya, Gautham, Compiler-based scheduling optimization hints for user-level threads.
Dmitry M. Maslennikov RU; Valentine G. Tikhonov RU; Alexander I. Kasinsky RU; Vladimir Y. Volkonsky RU, Computer method and apparatus for compilation of multi-way decisions.
Jacobs Eino, Computer system, cache memory and process for cache entry replacement with selective locking of elements in different ways and groups.
Mills Jack D.; Wilkerson Christopher B., Decomposition of instructions into branch and sequential code sections.
Lin, Chang Fu, Embedded system with instruction prefetching device, and method for fetching instructions in embedded systems.
Cheong Hoichi (Austin TX); Hicks Dwain A. (Pflugerville TX); So Kimming (Austin TX), Hierarchical cache arrangement wherein the replacement of an LRU entry in a second level cache is prevented when the cac.
Wilson Peter J., High performance processor employing background memory move mechanism.
Finlay, Ian Richard; Lohman, Guy Maring, Information retrieval system and method using index ANDing for improving performance.
Ushiro Sotaro (Tokyo JPX), Input/output paging mechanism in a data processor.
Jaggar David Vivian, GBX, Invalid write recovery apparatus and method within cache memory.
Ueno, Toshiaki, Memory management system.
Thomas Basil Smith, III; Robert Brett Tremaine, Memory system for permitting simultaneous processor access to a cache line and sub-cache line sectors fill and writeback to a system memory.
Snyder, II, Wilson Parkhurst, Method and apparatus for accessing a cache memory utilization distingushing bit RAMs.
Edwards, Stephen A., Method and apparatus for converting a concurrent control flow graph into a sequential control flow graph.
Ebcioglu Mahmut Kemal; Groves Randall Dean, Method and apparatus for dynamic conversion of computer instructions.
Mark J. Charney; Philip G. Emma; Daniel A. Prener; Thomas R. Puzak, Method and apparatus for reducing latency in set-associative caches using set prediction.
John A. Wickeraad; Stephen B. Lyle; Brendan A. Voge, Method and apparatus for replacing cache lines in a cache memory.
Broughton, Jeffrey M.; Chen, Liang T.; Lam, William kwei cheung; Pappas, Derek E.; Chen, Ihao; McWilliams, Thomas M.; Narang, Ankur; Rubin, Jeffrey B.; Cohen, Earl T.; Parkin, Michael W.; Saulsbury, Ashley N., Method and apparatus for simulation system compiler.
Chaudhry, Shailender; Caprioli, Paul, Method and structure for concurrent branch prediction in a processor.
Levy Hanoch (Rockville MD); Morris Robert J. T. (Los Gatos CA), Method for the assignment of request streams to cache memories.
Devins, Robert J.; Ferro, Paul G.; Herzl, Robert D.; Kautzman, Mark E.; Mahler, Kenneth A.; Milton, David W., Method of developing re-usable software for efficient verification of system-on-chip integrated circuit designs.
Burgess Bradley (Austin TX), Method of loading instructions into an instruction cache by repetitively using a routine containing a mispredicted branc.
Moss, Robert W., Methods and structure for dynamic modifications to arbitration for a shared resource.
Masashi Sasahara JP; Rakesh Agarwal; Kamran Malik; Michael Raam, Microprocessor with virtual-to-physical address translation using flags.
Steely, Jr., Simon C. (Hudson NH), Multi-index multi-way set-associative cache.
Okamoto, Russell; Passmore, Greg, Multi-query optimization.
Moreno Jaime Humberto (Hartsdale NY), Object code compatible representation of very long instruction word programs.
Moreno Jaime Humberto, Object-code compatible representation of very long instruction word programs.
Hoxey Steven M. (Claremont CAX), Partitioning case statements for optimal execution performance.
Bell, Jr., Robert H.; Guthrie, Guy Lynn; Starke, William John; Stuecheli, Jeffrey Adam, Pipelining D states for MRU steerage during MRU/LRU member allocation.
Topham, Nigel Peter, Predicated execution of instructions in processors.
Jourdan, Stephan J.; Boggs, Darrell D.; Miller, John Alan; Singhal, Ronak, Prediction of load-store dependencies in a processing agent.
Palmer Mark L. (Hollis NH), Predictive cache system.
Kedem Gershon; Alexander Thomas, Predictive caching system and method based on memory access which previously followed a cache miss.
Fouts Douglas Jai, Predictive read cache memories for reducing primary cache miss latency in embedded microprocessor systems.
Robinson, John T.; Tremaine, Robert B.; Wazlowski, Michael E., Prioritizing and locking removed and subsequently reloaded cache lines.
Agarwal Rakesh; Malik Kamran; Teruyama Tatsuo, JPX, Processor method and apparatus for performing single operand operation and multiple parallel operand operation.
Berg, Stefan G.; Kim, Donglok; Kim, Yongmin, Program-directed cache prefetching for media processors.
Moll, Laurent R.; Glaskowsky, Peter N.; Rowlands, Joseph B., Re-fetching cache memory having coherent re-fetching.
Riordan Thomas J. (Los Altos CA), Structure and method for virtual-to-physical address translation in a translation lookaside buffer.
Mayer, Albrecht; Siebert, Harry, System and method for integrated circuit emulation.
Vincent Britto, System and method for selective transfer of application data between storage devices of a computer system through utilization of dynamic memory allocation.
Chen, William Y., System and method of using partially resolved predicates for elimination of comparison instruction.
Craft, David J.; Dixon, Brian P.; Volobuev, Yuri L.; Wyllie, James C., System for balancing multiple memory buffer sizes and method therefor.
Tachibana, Masayoshi, Task execution time estimating method.
Nair, Sreekumar Ramakrishnan, Using value-expression graphs for data-flow optimizations.
Circello Joseph C. (Phoenix AZ); Schimke David J. (Phoenix AZ), Zero-cycle multi-state branch cache prediction data processing system and method thereof.
