Client-side repository in a networked deduplicated storage system
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G06F-017/30
G06F-011/14
출원번호
US-0324848
(2011-12-13)
등록번호
US-9104623
(2015-08-11)
발명자
/ 주소
Retnamma, Manoj Kumar Vijayan
Attarde, Deepak Raghunath
Joshi, Hetalkumar N.
출원인 / 주소
Commvault Systems, Inc.
대리인 / 주소
Knobbe, Martens, Olson & Bear, LLP
인용정보
피인용 횟수 :
16인용 특허 :
137
초록▼
A storage system according to certain embodiments includes a client-side repository (CSR). The CSR may communicate with a client at a higher data transfer rate than the rate used for communication between the client and secondary storage. During copy operations, for instance, some or all of the data
A storage system according to certain embodiments includes a client-side repository (CSR). The CSR may communicate with a client at a higher data transfer rate than the rate used for communication between the client and secondary storage. During copy operations, for instance, some or all of the data being backed up or otherwise copied to secondary storage is stored in the CSR. During restore operations, copies of the data stored in the CSR is accessed from the CSR instead of from secondary storage, improving performance. Remaining data blocks not stored in the CSR can be restored from secondary storage.
대표청구항▼
1. A method for restoring data to a client system from secondary storage, the method comprising: providing one or more computer processors;performing with a media agent executing in the one or more computer processors, a secondary copy operation that copies a plurality of data blocks associated with
1. A method for restoring data to a client system from secondary storage, the method comprising: providing one or more computer processors;performing with a media agent executing in the one or more computer processors, a secondary copy operation that copies a plurality of data blocks associated with primary storage in a client system to secondary storage located remotely from the primary storage, wherein the client system communicates with the secondary storage via a wide area network and wherein the secondary copy operation creates a secondary copy of the plurality of data blocks in the secondary storage;during performance of the secondary copy operation, creating with the media agent, for each data block of the plurality of data blocks a hash signature for each data block according to a deduplication scheme;during performance of the secondary copy operation, further copying at least a portion of the data blocks and a first copy of hash signatures associated with the data blocks to a client-side repository comprising at least computer memory, wherein client-side repository is different than the secondary storage, and wherein the client system communicates with the client-side repository over a local area network;during performance of the secondary copy operation, populating an index in communication with the media agent with a second copy of the hash signatures associated with the plurality of the data blocks stored in the secondary storage;further creating with the media agent, age information associated with the time of the creation of the secondary copy of the plurality of data blocks in the secondary storage;storing in at least computer memory, the age information about the time of creation of the secondary copy of the plurality of data blocks in secondary storage;receiving at the media agent, a request to restore data to the client system;consulting with the media agent, the age information to determine the time of the creation of the secondary copy in secondary storage of at least one data block associated with the restore data;based on the age information of the time of the creation of the secondary copy in secondary storage of the at least one data block associated with the restore data, deciding with the media agent whether to query the client-side repository remote from the secondary storage as to whether the client-side repository is populated with a copy of the at least one data block associated with the restore data;in response to determining that the age of creation of the secondary copy in secondary storage of the at least one data block associated with the restore data satisfies a threshold age, querying the client-side repository with the second copy of the hash signature from the index to determine whether the first copy of the hash signature is stored in the client-side repository;receiving an answer from the client-side repository indicating a result of the query; andin response to the answer, accessing the at least one data block associated with the restore data from secondary storage for transmission to the client system when the result indicates that the client-side repository is not populated with the first hash signature, wherein the at least one data block associated with the restore data is restored from the client-side repository to an information store of the client system via the local area network when the client-side repository is populated with the first hash signature; andin response to determining that the age of creation of the secondary copy in secondary storage of the at least one data block associated with the restore data does not satisfy the threshold age, restoring the at least one data block from secondary storage for transmission to the client system via the wide area network. 2. The method of claim 1, wherein the client system communicates with the client-side repository via a local area network and the client system communicates with the secondary storage via a wide area network. 3. The method of claim 1, wherein the age information comprises a copy session identifier identifying a particular copy session associated with the at least one data block associated with the restore data. 4. The method of claim 1, wherein the age information is indicative of when the at least one data block associated with the restore data was stored at the secondary storage. 5. The method of claim 1, wherein the age information is indicative of when the at least one data block associated with the restore data was stored at the secondary storage relative to when one or more other data blocks were stored at the secondary storage. 6. The method of claim 1, wherein determining that the age of the at least one data block associated with the restore data satisfies the threshold age comprises determining that the age is less than the threshold age. 7. The method of claim 1, wherein the age information provides an indication as to the likelihood that the at least one data block associated with the restore data was pruned from the client-side repository. 8. The method of claim 1, wherein the secondary copy of the plurality of data blocks were previously copied to the secondary storage during a backup operation. 9. The method of claim 1, wherein the secondary copy of the plurality of data blocks were previously copied to the secondary storage during a replication operation. 10. A storage system, comprising: one or more computer processors;primary storage in a client system wherein the primary storage stores a plurality of data blocks created by the client system;secondary storage located remotely from the primary storage and is in communication with the client system via a wide area network, the secondary storage storing in response to a secondary copy operation, a secondary copy of the plurality of data blocks and according to a deduplication scheme that creates a hash signature for each of the data blocks;a client-side repository comprising at least computer memory that is different than secondary storage and is in communication with the client system via a local area network, the client-side repository stores copies of at least a portion the data blocks copied to secondary storage and stores a first copy of the hash signatures associated with the portion of the data blocks;a media agent executing in the one or more computer processors, the media agent having an index comprising at least computer memory, the index storing at least a second copy of the hash signatures associated with the plurality of the data blocks stored in the secondary storage, the index further storing age information associated with the time of the creation of the secondary copy of the plurality of data blocks in the secondary storage;in response to receiving a request to restore data to the client system, the media agent further configured to: consult the age information to determine the time of the creation of the secondary copy in secondary storage of at least one data block associated with the restore data;based on the age information of the time of the creation of the secondary copy in secondary storage of the at least one data block associated with the restore data, determine whether to query a client-side repository remote from the secondary storage as to whether the client-side repository is populated with a copy of the at least one data block associated with the restore data;in response to a determination that the age of creation of the secondary copy in secondary storage of the at least one data block satisfies a threshold age, query the client-side repository with the second copy of the hash signature from the index to determine whether the first copy of the hash signature is stored in the client-side repository;receive an answer from the client-side repository indicating a result of the query;in response to the answer, access the at least one data block associated with the restore data from secondary storage for transmission to the client system when the result indicates that the client-side repository is not populated with the first hash signature, wherein the data block is restored from the client-side repository to an information store of the client system via the local area network when the client-side repository is populated with the first hash signature; andin response to a determination that the age of creation of the secondary copy in secondary storage of the at least one data block associated with the restore data does not satisfy the threshold age, restoring the at least one data block from secondary storage for transmission to the client system via the wide area network. 11. The storage system of claim 10, wherein the age information comprises a copy session identifier identifying a particular copy session associated with the at least one data block associated with the restore data. 12. The storage system of claim 10, wherein the client system communicates with the client-side repository via a local area network and the client system communicates with the secondary storage via a wide area network. 13. The storage system of claim 10, wherein the age information is indicative of when the at least one data block associated with the restore data was stored at the secondary storage. 14. The storage system of claim 10, wherein the age information is indicative of when the at least one data block associated with the restore data was stored at the secondary storage relative to when one or more other data blocks were stored in the secondary storage. 15. The storage system of claim 10, wherein to determine that the age of creation of the secondary copy in secondary storage of the at least one data block associated with the restore data satisfies the threshold age, the media agent is configured to determine that the age is less than the threshold age. 16. The storage system of claim 10, wherein the age information provides an indication as to the likelihood that the at least one data block associated with the restore data was pruned from the client-side repository.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (137)
Ranade, Dilip Madhusudan; Shelat, Radha; Kabra, Navin, Adaptive caching for a distributed file sharing system.
Yuval Ofek ; Zoran Cakeljic ; Samuel Krikler IL; Sharon Galtzur IL; Michael Hirsch IL; Dan Arnon ; Peter Kamvysselis, Apparatus and methods for copying, backing up, and restoring data using a backup segment size larger than the storage block size.
Griffin David (Maynard MA) Campbell Jonathan (Acton MA) Reilly Michael (Sterling MA) Rosenbaum Richard (Pepperell MA), Arrangement with cooperating management server node and network service node.
Nakano Toshio (Odawara JPX) Nozawa Masafumi (Odawara JPX) Kurano Akira (Odawara JPX) Hisano Kiyoshi (Odawara JPX) Hoshino Masayuki (Odawara JPX), Backup control method and system in data processing system using identifiers for controlling block data transfer.
Kitajima Hiroyuki (Yokohama) Yamamoto Akira (Yokohama) Doi Takashi (Hadano) Nozawa Masafumi (Odawara JPX), Buffered peripheral system and method for backing up and retrieving data to and from backup memory device.
Ludmila Cherkasova ; Martin F. Arlitt ; Richard J. Friedrich ; Tai Jin, Caching protocol method and system based on request frequency and relative storage duration.
Cole Leo J. (Raleigh NC) Frantz Curtis J. (Durham NC) Lee Jeannette (Raleigh NC) Ordanic Zvonimir (Raleigh NC) Plank Larry K. (Rochester MN), Centralized management in a computer network.
Carpenter Kelly S. (Fremont CA) Dearing Gerard M. (San Jose CA) Nick Jeffrey M. (Fishkill NY) Strickland Jimmy P. (Saratoga CA) Swanson Michael D. (Poughkeepsie NY) Wilkinson Wendell W. (Hyde Park NY, Coherence controls for store-multiple shared data coordinated by cache directory entries in a shared electronic storage.
Senator Steven T. ; Fuller Billy J., Computer system method and apparatus providing for various versions of a file without requiring data copy or log operati.
Fecteau Jean G. (Toronto NY CAX) Gdaniec Joseph M. (Vestal NY) Hennessy James P. (Endicott NY) MacDonald John F. (Vestal NY) Osisek Damian L. (Vestal NY), Computer system which supports asynchronous commitment of data.
Dunphy William E. (Westminster CO) Halladay Steven M. (Louisville CO) Moy Michael E. (Lafayette CO) Munro Frederick G. (Broomfield CO), Data storage and protection system.
Yanai Moshe (Framingham MA) Vishlitzky Natan (Brookline MA) Alterescu Bruno (Newton MA) Castel Daniel (Framingham MA) Shklarsky Gadi (Brookline MA), Data storage system controlled remote data mirroring with respectively maintained data indices.
Fortier Richard W. (Acton MA) Mastors Robert M. (Ayer MA) Taylor Tracy M. (Upton MA) Wallace John J. (Franklin MA), Digital data processor with improved backup storage.
Kenley Gregory (Northboro MA) Ericson George (Schrewsbury MA) Fortier Richard (Acton MA) Holland Chuck (Northboro MA) Mastors Robert (Ayer MA) Pownell James (Natick MA) Taylor Tracy (Upton MA) Wallac, Digital data storage system with improved data migration.
Christenson,Nikolai Paul; Fritchie,Scott Ernest Lystig; Larson,James Stephen, Electronic mail system with methodology providing distributed message store.
Xu Yikang ; Vahalia Uresh K. ; Jiang Xiaoye ; Gupta Uday ; Tzelnic Percy, File server system using file system storage, data movers, and an exchange of meta data among data movers for file locking and direct access to shared file systems.
Lagueux, Jr., Richard A.; Stave, Joel H.; Yeaman, John B.; Stevens, Brian E.; Higgins, Robert M.; Collins, James M., Graphical user interface for configuration of a storage system.
Urevig Paul D. ; Malnati James R. ; Ethen Donald J. ; Weber Herbert L., Grouping shared resources into one or more pools and automatically re-assigning shared resources from where they are not currently needed to where they are needed.
Cane David ; Hirschman David, High performance backup via selective file saving which can perform incremental backups and exclude files and uses a cha.
Barney Rock D. ; Schwols Keith ; Nelson Ellen M., Integration of a database into file management software for protecting, tracking and retrieving data.
Martin Charles W. (Richardson TX) Reid Fredrick S. (Plano TX) Forbus Gary L. (Dallas TX) Adams Steve M. (Plano TX) Shannon C. Patrick (Garland TX) Pirpich Eric A. (Garland TX), Mass data storage and retrieval system.
Kedem Nadav,ILX, Mass storage subsystem and backup arrangement for digital data processing system which permits information to be backed up while host computer(s) continue(s) operating in connection with information .
Long Robert M., Media element library with non-overlapping subset of media elements and non-overlapping subset of media element drives accessible to first host and unaccessible to second host.
Kullick Steven E. ; Spirakis Charles S. ; Titus Diane J., Method and apparatus for transferring archival data among an arbitrarily large number of computer devices in a networked.
Eastridge Lawrence E. (Tucson AZ) Kern Robert F. (Tucson AZ) Kern Ronald M. (Tucson AZ) Mikkelsen Claus W. (Morgan Hill CA) Ratliff James M. (Tucson AZ), Method and system for automated backup copy ordering in a time zero backup copy session.
Eastridge Lawrence E. (Tucson AZ) Kern Robert F. (Tucson AZ) Micka William F. (Tucson AZ) Mikkelsen Claus W. (Morgan Hill CA) Ratliff James M. (Tucson AZ), Method and system for automated termination and resumption in a time zero backup copy process.
Walter A. Hubis ; William G. Deitz, Method and system for controlling access share storage devices in a network environment by configuring host-to-volume mapping data structures in the controller memory for granting and denying access .
Chron, Edward Gustav; Menon, Jaishankar Moothedath, Method and system for providing consistent data modification information to clients in a storage system.
Aoyama Yuki,JPX ; Takahashi Toru,JPX ; Wakayama Satoshi,JPX, Method of and an apparatus for displaying version information and configuration information and a computer-readable recording medium on which a version and configuration information display program i.
Haustein, Nils; Klein, Craig A.; Troppens, Ulf; Winarski, Daniel J., Method of and system for deduplicating backed up data in a client-server environment.
Wahlert, Brian M; Berkowitz, Brian T; van Ingen, Catharine; Rangegowda, Dharshan; Jazayeri, Mike, Method, system, and apparatus for creating saved searches and auto discovery groups for a data protection system.
Palliyil, Sudarshan; Venkateshamurthy, Shivakumara; Vijayaraghavan, Srinivas Belur; Aswathanarayana, Tejasvi, Methods, apparatus and computer programs for enhanced access to resources within a network.
Pisello Thomas (De Bary FL) Crossmier David (Casselberry FL) Ashton Paul (Oviedo FL), Network management system having virtual catalog overview of files distributively stored across network domain.
Prahlad, Anand; May, Andreas; Lunde, Norman R.; Zhou, Lixin; Kumar, Avinash; Ngo, David, Snapshot storage and management system with indexing and user interface.
Crockett Robert N. (Tucson AZ) Kern Ronald M. (Tucson AZ) Micka William F. (Tucson AZ), Software directed microcode state save for distributed storage controller.
Mutalik Madhav ; Senie Faith M., System and method for performing file-handling operations in a digital data processing system using an operating system-independent file map.
Moulton, Gregory Hagan, System and method for unorchestrated determination of data sequences using sticky byte factoring to determine breakpoints in digital sequences.
Huai ReiJane (Old Brookville NY) Daly Robert (Ronkonkoma NY) Curti Walter (Dix Hills NY) Mohan Deepak (Huntington NY) Chueh James Kuang-Ru (Bayside NY) Louie Larry (Forest Hills NY), System and parallel streaming and data stripping to back-up a network.
Stoppani ; Jr. Peter (Woodinville WA), System for allocating storage spaces based upon required and optional service attributes having assigned piorities.
Bamford Roger J. (Woodside CA) Howard Forrest W. (Berkeley CA) Kabcenell Dirk A. (Portola Valley CA) Miner Robert N. (San Francisco CA), System for database integrity with multiple logs assigned to client subsets.
Flynn Rex A. (Belmont MA) Anick Peter G. (Marlboro MA), System for reconstructing prior versions of indexes using records indicating changes between successive versions of the.
Saether Christian D. (Seattle WA) Stoppani ; Jr. Peter (Woodinville WA), System of device independent file directories using a tag between the directories and file descriptors that migrate with.
Prahlad, Anand; Schwartz, Jeremy A.; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for classifying and transferring information in a storage network.
Borghetti, Stefano; Sgro', Antonio Mario; Corte, Gianluca Della; Gianfagna, Leonida, Thread based view and archive for simple mail transfer protocol (SMTP) clients devices and methods.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.