Method and system for exploiting service level objectives to enable resource sharing in a communication network having a plurality of application environments
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G06F-015/173
G06F-015/16
G06F-017/50
G06F-009/45
G06G-007/62
G06G-007/00
출원번호
US-0991339
(2001-11-13)
등록번호
US-7310672
(2007-12-18)
발명자
/ 주소
Rolia,Jerome
출원인 / 주소
Hewlett Packard Development Company, L.P.
인용정보
피인용 횟수 :
11인용 특허 :
29
초록▼
A method and system for resource sharing in a communication network supporting a plurality of application environments. Specifically, one embodiment of the present invention discloses a method ensuring only sufficient computational resources are used by a multi-component system as needed to meet sys
A method and system for resource sharing in a communication network supporting a plurality of application environments. Specifically, one embodiment of the present invention discloses a method ensuring only sufficient computational resources are used by a multi-component system as needed to meet system, subsystem, and/or component-level service level objectives. Demand values are calculated for a plurality of components in an application environment. The demand values are calculated from throughput and utilization metrics collected at each of the plurality of components. Response time metrics are predicted from the demand values. The application environment is modeled in response to the response time metrics to determine the optimum number of computational resources needed for each of the components in satisfying a functional objective. A dynamic resource manager communicates with a plurality of component managers, one for each of the plurality of components, to allocate computational resources throughout the application environment.
대표청구항▼
What is claimed is: 1. A method of resource allocation comprising: a) calculating a plurality of demand values for a plurality of components, wherein said plurality of demand values is calculated from a combination of throughput and utilization metrics, wherein said components are communicatively c
What is claimed is: 1. A method of resource allocation comprising: a) calculating a plurality of demand values for a plurality of components, wherein said plurality of demand values is calculated from a combination of throughput and utilization metrics, wherein said components are communicatively coupled in series, wherein processing of a request from a user received at a first component of said plurality of components proceeds forward through said components to a last component in said series and then backward through said components to said first component and then to said user, wherein a performance of service is suspended at each of said components after said processing of a request by said each of said components, and wherein said metrics are measurable at points between said components; b) predicting a plurality of response time metrics for said plurality of components based on said plurality of demand values; c) modeling said plurality of components based on an objective function that responds to conditions as represented by said plurality of response time metrics when at least one of said plurality of response time metrics does not satisfy at least one of a plurality of service level objectives to determine a new effective distribution of computational resources throughout said plurality of components such that said plurality of components that are modeled satisfies said plurality of service level objectives; and d) allocating computational resources throughout said plurality of components to reflect said new effective distribution. 2. The method as described in claim 1, wherein said plurality of components comprise an application environment. 3. The method as described in claim 1, wherein said at least one of a plurality of service level objectives applies to said plurality of components on a system-wide basis. 4. The method as described in claim 1, wherein said at least one of a plurality of service level objectives applies to said plurality of components on a subsystem basis. 5. The method as described in claim 1, wherein said at least one of a plurality of service level objectives applies to one of said plurality of components. 6. The method as described in claim 1, wherein a) further comprises: receiving a plurality of metric values from said plurality of components, said plurality of metric values used to calculate said demand values. 7. The method as described in claim 1, wherein c) comprises: inputting said plurality of demand values into a predictive model to determine said new effective distribution of computational resources. 8. The method as described in claim 1, wherein d) comprises: removing computational resources from said plurality of components. 9. The method as described in claim 1, wherein d) comprises: adding computational resources to said plurality of components. 10. A method of resource allocation in an application environment comprising: a) receiving a plurality of metric values from a plurality of components of said application environment, wherein said components are communicatively coupled in series, wherein processing of a request from a user received at a first component of said plurality of components proceeds forward through said components to a last component in said series and then backward through said components to said first component and then to said user, wherein a performance of service is suspended at each of said components after said processing of a request by said each of said components, and wherein said metric values are measurable at points between said components; b) calculating a plurality of demand values from said plurality of metric values; c) predicting a plurality of response time metrics for each of said plurality of components based on said plurality of demand values; d) modeling said plurality of components based on an objective function that responds to conditions as represented by said plurality of response time metrics when at least one of said plurality of response time metrics does not satisfy at least one of a plurality of service level objectives applying to said plurality of components on a system level to determine a new effective distribution of computational resources for said plurality of components such that response time metrics associated with said plurality of components that are modeled satisfies said plurality of service level objective, wherein said new effective distributions results in an optimum number of said plurality of components; and e) allocating computational resources throughout said plurality of components to reflect said optimum number. 11. The method of resource allocation as described in claim 10, wherein d) further comprises: determining a plurality of optimum numbers of computational resources, one for each of said plurality of components, that represents said new effective distribution of computational resources. 12. The method of resource allocation as described in claim 10, wherein e) comprises: removing computational resources from said plurality of components. 13. The method as described in claim 10, wherein e) comprises: adding computational resources to said plurality of components. 14. The method as described in claim 10, wherein c) comprises: predicting said plurality of response time metrics using a prediction modeling technique. 15. The method as described in claim 14, wherein said plurality of metric values includes throughput metrics and utilization metrics. 16. The method as described in claim 10, wherein c) comprises: inputting said plurality of demand values into a predictive model to determine said optimum number. 17. A computer system comprising: a processor; a computer readable memory coupled to said processor and containing program instructions that, when executed, implement a method of resource allocation comprising: a) calculating a plurality of demand values for a plurality of components, wherein said plurality of demand values is calculated from a combination of throughput and utilization metrics, wherein said components are communicatively coupled in series, wherein processing of a request from a user received at a first component of said plurality of components proceeds forward through said components to a last component in said series and then backward through said components to said first component and then to said user, wherein a performance of service is suspended at each of said components after said processing of a request by said each of said components, and wherein said metrics are measurable at points between said components; b) predicting a plurality of response time metrics for said plurality of components based on said plurality of demand values; c) modeling said plurality of components based on an objective function that responds to conditions as represented by said plurality of response time metrics when at least one of said plurality of response time metrics does not satisfy at least one of a plurality of service level objectives to determine a new effective distribution of computational resources throughout said plurality of components such that said plurality of components that are modeled satisfies said plurality of service level objectives; and d) allocating computational resources throughout said plurality of components to reflect said new effective distribution. 18. The computer system as described in claim 17, wherein said plurality of components comprise an application environment. 19. The computer system as described in claim 17, wherein said at least one of a plurality of service level objectives applies to said plurality of components on a system-wide basis. 20. The computer system as described in claim 17, wherein said at least one of a plurality of service level objectives applies to said plurality of components on a subsystem basis. 21. The computer system as described in claim 17, wherein said at least one of a plurality of service level objectives applies to one of said plurality of components. 22. The computer system as described in claim 17, wherein a) in said method of resource allocation further comprises: receiving a plurality of metric values from said plurality of components, said plurality of metric values used to calculate said demand values. 23. The computer system as described in claim 17, wherein c) in said method of resource allocation comprises: inputting said plurality of demand values into a predictive model to determine said new effective distribution of computational resources. 24. The computer system as described in claim 17, wherein d) in said method of resource allocation comprises: removing computational resources from said plurality of components. 25. The computer system as described in claim 17, wherein d) in said method of resource allocation comprises: adding computational resources to said plurality of components. 26. A communication network comprising: a plurality of computational resources; an application environment having a plurality of network nodes coupled together; a plurality of components in said application environment servicing an application, each of said plurality of components including at least one computational resource from said plurality of computational resources, each of said plurality of components residing on one of said plurality of network nodes, wherein said components are communicatively coupled in series, wherein processing of a request from a user received at a first component of said plurality of components proceeds forward through said components to a last component in said series and then backward through said components to said first component and then to said user, wherein a performance of service is suspended at each of said components after said processing of a request by said each of said components; a plurality of metrics measured at each of said plurality of components for calculating a plurality of demand values, wherein said metrics are measurable at points between said components, wherein said plurality of demand values is calculated from a combination of throughput and utilization metrics; a functional objective for defining an optimum number of computational resources in said application environment; and a dynamic resource manager coupled to said application environment for modeling said plurality of components based on said functional objective that responds to conditions as represented by said plurality of demand values when at least one of said plurality of demand values does not satisfy at least one of a plurality of service level objectives to determine a new effective distribution of computational resources throughout each of said plurality of components such that said plurality of components that are modeled satisfies said plurality of service level objectives. 27. The communication network as described in claim 26, wherein said plurality of metrics comprises throughput metrics and utilization metrics. 28. The communication network as described in claim 26, further comprising: a prediction model for predicting a plurality of response time metrics for said plurality of components based on said plurality of demand values; and a mathematical model for modeling said plurality of components in response to said plurality of response time metrics for determining said new effective distribution of computational resources. 29. The communication network as described in claim 26, further comprising: a plurality of component managers, one for each of said plurality of components, for managing the addition and removal of computational resources in said plurality of components in response to notices from said dynamic resource manager. 30. The communication network as described in claim 26, wherein said plurality of components comprise a local area network (LAN).
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (29)
Bigus Joseph Phillip, Adaptive resource allocation using neural networks.
Edward James Ellesson ; Roch Andre Guerin ; Sanjay Damodar Kamat ; Arvind Krishna ; Rajendran Rajan ; Dinesh Chandra Verma, Architecture for supporting service level agreements in an IP network.
Chu Lynn H. (Piscataway NJ) Dragoni ; Jr. Anthony H. (Warren NJ) Eichelman ; II Fredric R. (Princeton NJ) Huleatt Thomas O. (Neshanic Station NJ) Irgon Adam E. (East Brunswick NJ) Langion Martha S. (, Expert system for computer system resource management.
Zinky, John A.; Schantz, Richard R.; Bakken, David E.; Loyall, Joseph P., Framework for providing quality of service requirements in a distributed object-oriented computer system.
Goldszmidt,German; Lorrain,Jean A.; Maruyama,Kiyoshi; Verma,Dinesh Chandra, METHOD AND APPARATUS FOR DYNAMICALLY ADJUSTING RESOURCES ASSIGNED TO PLURALITY OF CUSTOMERS, FOR MEETING SERVICE LEVEL AGREEMENTS (SLAS) WITH MINIMAL RESOURCES, AND ALLOWING COMMON POOLS OF RESOURCES.
Leon L. Lumelsky ; Nelson R. Manohar, Management of service-oriented resources across heterogeneous media servers using homogenous service units and service signatures to configure the media servers.
Mangipudi, Krishna; Basani, Vijay, Method and apparatus for policy based class of service and adaptive service level management within the context of an internet and intranet.
Mangipudi,Krishna; Basani,Vijay, Method and apparatus for policy based class service and adaptive service level management within the context of an internet and intranet.
Al-Hilali Hilal ; Clarke Perry ; Guimbellot David Edward ; Howell David Andrew, Method and computer program product for estimating total resource usage requirements of a server application in a hypothetical user configuration.
Combs, Charles; Gold, Jeffrey; Mair, Brian; Pedersen, David; Schear, David, Method and system for contention controlled data exchange in a distributed network-based resource allocation.
Sheets, Kitrick B.; Smith, Philip S.; Engel, Stephen J.; Deng, Yuefan; Guistozzi, Joseph; Korobka, Alexander, Method and system for providing dynamic hosted service management across disparate accounts/sites.
Bouillet,Eric; Mitra,Debasis; Ramakrishnan,Kajamalai Gopalaswamy, Method for controlling the disposition of an incoming call based on the loading status of a route and on a test of each link along the route.
Bigus, Joseph Phillip; Hellerstein, Joseph L.; Squillante, Mark S.; Thathachar, Jayram S., System and method for generic automated tuning for performance management.
Mei,Mark Gee Gwo; Wu,Kun Lung; Yu,Philip Shi Lung, System for providing gracefully degraded services on the internet under overloaded conditions responsive to HTTP cookies of user requests.
Jorgensen, Jacob W., Transmission control protocol/internet protocol (TCP/IP) packet-centric wireless point to multi-point (PTMP) transmission system architecture.
Das, Rajarshi; Kephart, Jeffrey Owen; Tesauro, Gerald James; Walsh, William Edward, Method and apparatus for utility-based dynamic resource allocation in a distributed computing system.
Peterson, Jonathan M.; Stark, Magnus H.; Kozee, Jacqueline E.; Atkinson, M. Cort; Hodges, Donna Kay; Bonner, Bridget N.; Muth, Mario, Methods and systems for electronic data exchange utilizing centralized management technology.
Hodges, Donna Kay; Kreiner, Barrett M.; Menon, Jai P.; Walton, Tommy W.; Rehkopf, Thomas W.; Tischer, Steven Neil, Service-oriented architecture systems and methods.
Murthy, Seshashayee; Singh, Aameek; Uttamchandani, Sandeep M., System, method, and apparatus for server-storage-network optimization for application service level agreements.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.