IPC분류정보
국가/구분 |
United States(US) Patent
등록
|
국제특허분류(IPC7판) |
|
출원번호 |
US-0670824
(2003-09-25)
|
등록번호 |
US-7392511
(2008-06-24)
|
발명자
/ 주소 |
- Brokenshire,Daniel Alan
- Hofstee,Harm Peter
- Minor,Barry L
- Nutter,Mark Richard
|
출원인 / 주소 |
- International Business Machines Corporation
|
대리인 / 주소 |
|
인용정보 |
피인용 횟수 :
9 인용 특허 :
59 |
초록
▼
A program is into at least two object files: one object file for each of the supported processor environments. During compilation, code characteristics, such as data locality, computational intensity, and data parallelism, are analyzed and recorded in the object file. During run time, the code chara
A program is into at least two object files: one object file for each of the supported processor environments. During compilation, code characteristics, such as data locality, computational intensity, and data parallelism, are analyzed and recorded in the object file. During run time, the code characteristics are combined with runtime considerations, such as the current load on the processors and the size of the data being processed, to arrive at an overall value. The overall value is then used to determine which of the processors will be assigned the task. The values are assigned based on the characteristics of the various processors. For example, if one processor is better at handling intensive computations against large streams of data, programs that are highly computationally intensive and process large quantities of data are weighted in favor of that processor. The corresponding object is then loaded and executed on the assigned processor.
대표청구항
▼
What is claimed is: 1. A computer-implemented method to load objects in a heterogeneous multiprocessor computer system, said method comprising: analyzing a source program for one or more program characteristics, the program characteristics selected from the group consisting of data locality, comput
What is claimed is: 1. A computer-implemented method to load objects in a heterogeneous multiprocessor computer system, said method comprising: analyzing a source program for one or more program characteristics, the program characteristics selected from the group consisting of data locality, computational intensity, and data parallelism; in response to the analyzing, compiling the source program into two object files, a first object file corresponding to a first instruction set and a second object file corresponding to a second instruction set; in response to the compiling, storing the program characteristics in each of the object files; receiving a request to execute a software task corresponding to the source program; selecting a processor from a plurality of dislike processors, wherein a first processor corresponds to the first instruction set and a second processor corresponds to the second instruction set, to execute the software task, the selecting comprising comparing one or more characteristics of the software task with the program characteristics stored in the first object file and the second object file; in response to selecting the first processor: loading the first object file into a shared memory, wherein the shared memory is shared by the plurality of dislike processors; and executing the loaded first object file by the first processor; and in response to selecting the second processor: loading the second object file into the shared memory; and executing the loaded second object file by the second processor. 2. The method as described in claim 1 wherein selecting the processor further comprises: retrieving the program characteristics; retrieving current system characteristics, wherein the current system characteristics includes processor load characteristics for the plurality of dislike processors; and combining the program characteristics and the current system characteristics to determine which of the dislike processors to assign the software task. 3. The method as described in claim 2 wherein at least one of the current system characteristics is selected from the group consisting of processor availability for each of the dislike processors, and a data size of data being processed by the software task. 4. The method as described in claim 1 wherein executing the first object file by the first processor further comprises: determining that the first processor has a scheduler that schedules tasks for the first processor; and scheduling the first object file to execute on the first processor, the scheduling including: writing a software code identifier corresponding to the first object file to a run queue corresponding to the first processor. 5. The method as described in claim 1 wherein executing the second object file by the second processor further comprises: signaling the second processor; and reading, by the second processor, the second object file from the shared memory into a local memory corresponding to the second processor. 6. The method as described in claim 5 further comprising: writing an instruction block in the shared memory, the instruction block including an address of the loaded second object file and an address of an input buffer; and reading the loaded second object file and the input buffer from the locations identified in the instruction block to the second processor's local memory. 7. The method as described in claim 6 further comprising: signaling the second processor from one of the other processors, the signaling including: writing the address of the instruction block to a mailbox that corresponds to the second processor; and reading, by the second processor, the instruction block in response to the signal.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.