System and method for monitoring cluster partner boot status over a cluster interconnect
IPC classification information
Country / Type
United States (US) Patent
Registered
International Patent Classification (IPC 7th edition)
G06F-015/167
G06F-015/16
Application number
US-0284795
(2002-10-31)
Inventor / Address
Gole, Abhijeet
Applicant / Address
Network Appliance, Inc.
Agent / Address
Cesari and McKenna LLP
Citation information
Times cited: 7
Cited patents: 33
Abstract
A system and method for monitoring cluster partner boot status over an RDMA-capable transport protocol is provided. Upon initialization, a storage system in a cluster generates a virtual interface connection using a predetermined and pre-assigned fibre channel identification and virtual interface number. The surviving storage system may then open a connection to this predetermined and predefined virtual interface connection and issue RDMA commands thereto. This surviving storage system may utilize the RDMA capabilities of the virtual interface protocol to obtain a set of boot statistics to determine whether the booting file or storage system is progressing normally.
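The flow described in the abstract can be illustrated with a small in-memory simulation. This is only a sketch under stated assumptions: it does not use real RDMA or Virtual Interface hardware, and the names `WELL_KNOWN_FC_ID`, `WELL_KNOWN_VI_NUMBER`, `BootingNode`, `Interconnect`, and the layout of the boot variables are hypothetical, since the record only states that the identification is predetermined and pre-assigned.

```python
from dataclasses import dataclass, field

# Hypothetical well-known identifiers. The record says only that they are
# predetermined and pre-assigned; these values are made up for illustration.
WELL_KNOWN_FC_ID = 0x01
WELL_KNOWN_VI_NUMBER = 7


@dataclass
class BootingNode:
    """Stand-in for the memory region a booting storage system exposes."""
    boot_vars: dict = field(
        default_factory=lambda: {"stage": "firmware", "progress": 0}
    )

    def advance(self, stage: str) -> None:
        # Record the new boot stage and bump the progress counter.
        self.boot_vars["stage"] = stage
        self.boot_vars["progress"] += 1


class Interconnect:
    """Toy cluster interconnect: maps (fc_id, vi_number) to a node."""

    def __init__(self) -> None:
        self._endpoints = {}

    def listen(self, fc_id: int, vi_number: int, node: BootingNode) -> None:
        # The booting node publishes itself at the well-known address.
        self._endpoints[(fc_id, vi_number)] = node

    def rdma_read(self, fc_id: int, vi_number: int) -> dict:
        # Simulated one-sided read: copies the remote variables without
        # calling any method on the target node.
        node = self._endpoints[(fc_id, vi_number)]
        return dict(node.boot_vars)


fabric = Interconnect()
partner = BootingNode()
fabric.listen(WELL_KNOWN_FC_ID, WELL_KNOWN_VI_NUMBER, partner)

partner.advance("kernel")
snapshot = fabric.rdma_read(WELL_KNOWN_FC_ID, WELL_KNOWN_VI_NUMBER)
print(snapshot)  # {'stage': 'kernel', 'progress': 1}
```

Because the address is fixed in advance, the surviving node in this sketch needs no handshake with the booting node before reading, which is the property the abstract relies on.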
Representative claim
What is claimed is:
1. A method for monitoring a cluster partner boot status over a remote direct memory access-capable transport protocol, the method comprising the steps of: creating, on a booting cluster partner, a protocol interface, the protocol interface utilizing a predetermined and pre-assigned identification; opening, by a surviving cluster partner, a connection to the created protocol interface; and performing, by the surviving cluster partner via the connection, a remote direct memory access to obtain a set of boot variables of the booting cluster partner.
2. The method of claim 1 further comprising the steps of: determining, from the set of boot variables, whether the booting cluster partner is progressing normally; performing, in response to the booting cluster partner not progressing normally, a failover routine; and completing, in response to the booting cluster partner progressing normally, an initialization routine on the booting cluster partner.
3. The method of claim 1 wherein the transport protocol further comprises a virtual interface protocol.
4. The method of claim 1 wherein the predetermined and pre-assigned identification further comprises a virtual interface number and fibre channel identification.
5. A method for monitoring cluster partner boot status over a cluster interconnect, the method comprising the steps of: generating, upon boot initialization, one or more protocol connections, the protocol connections utilizing a predetermined and pre-assigned identification; issuing, by a non-booting cluster partner, a remote direct memory access operation to the booting cluster partner; and whereby the remote direct memory access operation retrieves a set of boot variables from the booting cluster partner.
6. The method of claim 5 further comprising the steps of: determining, from the set of boot variables, whether the booting cluster partner is progressing normally.
7. A storage system cluster comprising: a booting storage system, the booting storage system configured to create, using predetermined and preassigned identification, one or more connection interfaces; a surviving storage system, the surviving storage system configured to perform remote direct memory access operations using the connection interfaces to obtain a set of boot variables; and whereby the surviving storage system is further configured to determine from the set of boot variables whether to perform a failover operation.
8. The storage system cluster of claim 7 wherein the remote direct memory access operations are performed using a virtual interface connection.
9. A storage system, comprising: a first cluster partner configured to: i) begin an initialization process in response a failure of the first cluster partner, and ii) store a boot status of the initialization process as one or more boot variables; a second cluster partner configured to monitor the boot status of the first cluster partner; and a cluster interconnect to connect the first and second cluster partners, wherein the second cluster partner is further configured to request the boot variables from the first cluster partner over the cluster interconnect prior to completion of the initialization process.
10. The storage system of claim 9, wherein the second cluster partner is further configured to determine that the initialization process on the first cluster partner is not progressing normally based on the boot variables, and in response, to perform a failover routine.
11. The storage system of claim 9, wherein the second cluster partner is further configured to determine that the initialization process on the first cluster partner is progressing normally based on the boot variables, and in response, to allow the initialization process to complete.
12. The storage system of claim 9, further comprising: a memory of the first cluster partner configured to store the boot variables in a particular location, wherein the second cluster partner is further adapted to request the boot variables by sending a read request to the memory of the first cluster partner for the particular location.
13. The storage system of claim 9, wherein the second cluster partner is further configured to determine the boot variables of the first cluster partner by a remote direct memory access by the second cluster partner to the first cluster partner.
14. The storage system of claim 9, wherein the cluster interconnect comprises a Virtual Interface.
15. The storage system of claim 9, wherein the cluster interconnect is configured to utilize a Fibre Channel protocol.
16. The storage system of claim 9, wherein the first and second cluster partners are storage systems.
17. The storage system of claim 9, wherein at least one of the boot variables indicates a boot stage of the initialization process.
18. The storage system of claim 9, wherein at least one of the boot variables indicates a boot progress of the initialization process.
19. The storage system of claim 9, wherein the boot progress is indicated by a value adapted to be incremented in response to progression of the initialization process.
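The decision step in claims 2, 10, 11 and 17 to 19 (a boot stage, a progress value that is incremented, and a choice between failover and letting the boot finish) might look roughly like the following. This is a hedged sketch, not the patented method: the claims do not say how "progressing normally" is judged, so the rule used here (the progress counter must increase between polls, with a tolerance of `max_stalled_polls`) and the names `decide`, `final_stage`, and the return strings are all assumptions made for illustration.

```python
from typing import Iterable, Mapping


def decide(
    snapshots: Iterable[Mapping[str, int]],
    max_stalled_polls: int = 2,
    final_stage: int = 3,
) -> str:
    """Classify a sequence of polled boot-variable snapshots.

    Each snapshot is assumed to hold an integer "stage" and an integer
    "progress" counter that the booting partner increments as it advances.
    """
    stalled = 0
    last_progress = None
    for snap in snapshots:
        if snap["stage"] >= final_stage:
            # Initialization reached its last stage: let it complete.
            return "boot_complete"
        if last_progress is not None and snap["progress"] <= last_progress:
            # Counter did not move since the previous poll.
            stalled += 1
            if stalled >= max_stalled_polls:
                return "failover"
        else:
            stalled = 0
        last_progress = snap["progress"]
    # Ran out of snapshots without a verdict either way.
    return "still_booting"


healthy = [
    {"stage": 1, "progress": 1},
    {"stage": 2, "progress": 5},
    {"stage": 3, "progress": 9},
]
hung = [
    {"stage": 1, "progress": 4},
    {"stage": 1, "progress": 4},
    {"stage": 1, "progress": 4},
]

print(decide(healthy))  # boot_complete
print(decide(hung))     # failover
```

A monotonically increasing counter keeps the check cheap for the monitoring side: it only compares two integers per poll and does not need to interpret what the booting partner is doing at each stage.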
Patents cited by this patent (33)
Uzrad-Nali, Oran; Har-Chen, Dror, Apparatus and method for receive transport protocol termination.
Gordon, Barry; Baranoff, Keith Andrew; Davies, Keith; Hollows, Robin L., Autonomously established secure and persistent internet connection and autonomously reestablished without user intervention that connection if it lost.
Suzuki, Haruyoshi, Information processing method and apparatus using a storage medium storing all necessary software and content to configure and operate the apparatus.
Byers Russell Francis,CAX ; Duchaine Joseph Marcel Gilles,CAX ; Schuett Michael Leonard,CAX ; Grootenboer Cornelius Jacob,GBX, Method and controller for controlling shutdown of a processing unit.
Ohran Richard S. ; Rollins Richard N. ; Ohran Michael R. ; Marsden Wally, Method for improving recovery performance from hardware and software errors in a fault-tolerant computer system.
Hitz David ; Malcolm Michael ; Lau James ; Rakitzis Byron, Method for maintaining consistent states of a file system and for creating user-accessible read-only copies of a file s.
Wallach Walter A. ; Findlay Bruce ; Pellicer Thomas J. ; Chrabaszcz Michael, Method for providing a fault tolerant network using distributed server processes to remap clustered network resources to other servers during server failure.
McCown Patricia M. (Cresskill NJ) Conway Timothy J. (Highland Park NJ) Jessen Karl M. (Bayonne NJ), Methods and apparatus for monitoring system performance.
Ekrot Alexander C. ; Singer James H. ; Hemphill John M. ; Autor Jeffrey S. ; Galloway William C. ; Alexander Dennis J., Multi-server fault tolerance using in-band signalling.
Hitz David (Sunnyvale CA) Schwartz Allan (Saratoga CA) Lau James (Cupertino CA) Harris Guy (Mountain View CA), Multiple facility operating system architecture.
Hitz David ; Schwartz Allan ; Lau James ; Harris Guy, Multiple software-facility component operating system for co-operative processor control within a multiprocessor computer system.
Row Edward J. (Mountain View CA) Boucher Laurence B. (Saratoga CA) Pitts William M. (Los Altos CA) Blightman Stephen E. (San Jose CA), Parallel I/O network file server architecture.
Row Edward J. (Mountain View CA) Boucher Laurence B. (Saratoga CA) Pitts William M. (Los Altos CA) Blightman Stephen E. (San Jose CA), Parallel I/O network file server architecture.
Beardsley Brent Cameron (Tucson AZ) Hathorn Roger Gregory (Tucson AZ) Holley Bret Wayne (Tucson AZ) Iskiyan James Lincoln (Tucson AZ), Remote copy system for setting request interconnect bit in each adapter within storage controller and initiating request.
Chakkalamattam Jos Paul ; Steven Michael French ; Aidon Paul Jennery ; Glenn Edwards Brew ; James Richard Schoech, Server based configuration of network computers via machine classes.
Clowes Richard F. (New York NY) Tims Fred W. (Springfield Center NY), Workstation-implemented data storage re-routing for server fault-tolerance on computer networks.
Andreas,Savva, Apparatus and the method for integrating NICs with RDMA capability but no hardware memory protection in a system without dedicated monitoring processes.
Tyrrell, John Charles; Roussos, Konstantinos; Smoot, Peter Logan, System and method for implementing a flexible storage manager with threshold control.