Backup using a client-side signature repository in a networked storage system
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G06F-017/30
G06F-011/14
출원번호
US-0916434
(2013-06-12)
등록번호
US-9251186
(2016-02-02)
발명자
/ 주소
Muller, Marcus S.
Ngo, David
출원인 / 주소
CommVault Systems, Inc.
대리인 / 주소
Knobbe, Martens, Olson & Bear, LLP
인용정보
피인용 횟수 :
16인용 특허 :
140
초록▼
A storage system according to certain embodiments includes a client-side signature repository that includes information representative of a set of data blocks stored in primary storage. During copy or backup operations, the system can use the client-side signature repository to identify data blocks
A storage system according to certain embodiments includes a client-side signature repository that includes information representative of a set of data blocks stored in primary storage. During copy or backup operations, the system can use the client-side signature repository to identify data blocks located in primary storage that are new or that have changed. The system can also use the client-side signature repository to identify multiple locations within primary storage where different instances of the data blocks are located.
대표청구항▼
1. A method of generating a backup data set for a client computing device by using a signature repository residing in a primary storage subsystem, the method comprising: for each respective client computing device of one or more client computing devices in a primary storage subsystem: monitoring sto
1. A method of generating a backup data set for a client computing device by using a signature repository residing in a primary storage subsystem, the method comprising: for each respective client computing device of one or more client computing devices in a primary storage subsystem: monitoring storage of a plurality of files formed of data blocks generated by one or more software applications running on the respective client computing device,wherein the plurality of files are stored as primary data in a primary data store associated with the respective client computing device;maintaining, by a repository agent executing on one or more processors in the primary storage subsystem, a repository indicating at least which data blocks of the monitored files are stored in the primary storage subsystem, wherein at least a first data block of the data blocks stored in a first primary data primary subsystem forms at least a portion of a first file stored in the first primary data store and a second data block that matches the first data block is stored in a second primary data store of the primary storage subsystem and forms at least a portion of a second file stored in the second primary data store,wherein the first file is generated b at least one software application running on a first client computing device associated with the first primary data store and the second file is generated by at least one software application running on a second client computing device associated with the second primary data store, andwherein the first data block and the second data block are stored in a native format of the at least one software application that generated the first data block and the second data block, respectively; andin response to instructions to create a secondary copy of the first file in a secondary storage subsystem, identifying a set of data blocks that do not have corresponding secondary copy data blocks stored in secondary storage subsystem, the set of data blocks forming at least a portion of the first file and including the first data block;querying the repository to identify at least a first group of data blocks of the set of data blocks for which matching data blocks are stored in the primary storage subsystem, wherein the first group of data blocks includes the first data block;identifying a location of the matching data blocks within the primary storage subsystem, wherein the matching data blocks include at least the second data block; andretrieving the matching data blocks from one or more of the data stores associated with the one or more client computing devices, including the second data block stored in the second primary data store. 2. The method of claim 1, further comprising: generating a copy operation index that identifies one or more data blocks that form first file including the retrieved first group of data blocks; andcommunicating the retrieved first group of data blocks and the copy operation index to the secondary storage subsystem over a wide-area network (WAN), wherein the secondary storage subsystem creates the secondary copy of the first file using at least the retrieved first group of data blocks. 3. The method of claim 1, wherein the repository agent resides on a computing device that is separate from each of the one or more client computing devices. 4. The method of claim 1, wherein the repository is maintained in one or more storage devices that are separate from the data stores of the one or more client computing devices. 5. The method of claim 1, wherein the repository comprises a central repository including signatures which correspond to the data blocks that are stored in the data store associated with all of the one or more client computing devices, wherein the signatures are generated by a signature generator. 6. The method of claim 5, wherein the first group of data blocks is identified using the signatures in the repository. 7. The method of claim 5, wherein the repository comprises signatures corresponding to all of the data blocks that are stored in the primary storage subsystem. 8. The method of claim 5, wherein each client computing device includes a signature generator. 9. The method of claim 5, wherein the repository agent comprises the signature generator. 10. The method of claim 5, further comprising maintaining in the repository, for each data block referenced in the repository: a storage location of the referenced data block within the data store associated with the client computing device that stores the referenced data block. 11. The method of claim 1, wherein the repository includes only metadata and does not include copies of the data blocks referenced in the repository. 12. The method of claim 1, wherein the secondary storage subsystem comprises deduplicated data. 13. The method of claim 1, wherein the primary data store of one or more client computing devices comprises deduplicated data. 14. A storage system for generating a backup data set for a client computing device using a signature repository, the system comprising: a repository indicating which data blocks are stored in a primary storage subsystem, the primary storage subsystem including one or more client computing devices each having an associated data store, wherein the data blocks are generated by one or more software applications running on the one or more client computing devices,wherein at least a first data block of the data blocks stored in a first primary data store of the primary storage subs stem forms at least a portion of a first file stored in the first primary data store and a second data block that matches the first data block is stored in a second primary data store of the primary storage subsystem and forms at least a portion of a second file stored in the second primary data store,wherein the first file is generated b at least one software application running on a first client computing device associated with the first primary data store and the second file is generated by at least one software application running on a second client computing device associated with the second primary data store, andwherein the first data block and the second data block are stored in a native format of the at least one software application that generated the first data block and the second data block, respectively; anda repository agent executing on one or more processors and configured to: maintain the repository; andin response to instructions to create a secondary copy in a secondary storage subsystem of the first file, identify a set of data blocks that do not have corresponding secondary copy data blocks stored in the secondary storage subsystem, the set of data blocks forming at least a portion of the first file and including the first data block;query the repository to identify at least a first group of data blocks of the set of data blocks for which matching data blocks are stored in the primary storage subsystem; andprovide retrieval information usable to locate the matching data blocks within the primary storage subsystem, wherein the matching data blocks include at least the second data block. 15. The system of claim 14, wherein the repository agent is further configured to generate a copy operation index that identifies one or more data blocks that form the first file including the retrieved first group of data blocks; andcommunicate the retrieved first group of data blocks and the copy operation index to the secondary storage subsystem which creates the secondary copy of the first file using at least the retrieved first group of data blocks. 16. The system of claim 14, wherein the repository agent resides on a computing device that is separate from each of the one or more client computing devices. 17. The system of claim 14, wherein the repository is maintained in one or more storage devices that are separate from the data stores of the one or more client computing devices. 18. The system of claim 14, wherein the repository comprises a central repository including signatures which correspond to the data blocks that are stored in the data store associated with all of the one or more client computing devices, wherein the signatures are generated by a signature generator. 19. A computer-readable, non-transitory storage medium having one or more computer-executable modules for generating a backup data set for a client computing device, the one or more computer-executable modules comprising: a first module in communication with one or more client computing devices and configured to: maintain a repository indicating data blocks that are stored in a primary storage subsystem, the primary storage subsystem comprising one or more client computing devices each having an associated data store, wherein the data blocks are generated by one or more software applications running on the one or more client computing devices;wherein at least a first data block of the data blocks stored in a first primary data store of the primary storage subs stem forms at least a portion of a first file stored in the first primary data store and a second data block that matches the first data block is stored in a second primary data store of the primary storage subsystem and forms at least a portion of a second file stored in the second primary data store,wherein the first file is generated b at least one software application running on a first client computing device associated with the first primary data store and the second file is generated by at least one software application running on a second client computing device associated with the second primary data store; andin response to instructions to create a secondary copy in a secondary storage subsystem of the first file, identify a set of data blocks that do not have corresponding secondary copy data blocks stored in the secondary storage subs stem, the set of data blocks forming at least a portion of the first file and including the first data block;query the repository to identify at least a first group of data blocks of the set of data blocks for which matching data blocks are not stored in the primary storage subsystem; andprovide retrieval information usable to locate the matching data blocks within the primary storage subsystem, wherein the matching data blocks include at least the second data block. 20. The computer-readable, non-transitory storage medium of claim 19, wherein the first module is further configured to generate a copy operation index that identifies one or more data blocks that form the first file including the retrieved first group of data blocks.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (140)
Ranade, Dilip Madhusudan; Shelat, Radha; Kabra, Navin, Adaptive caching for a distributed file sharing system.
Yuval Ofek ; Zoran Cakeljic ; Samuel Krikler IL; Sharon Galtzur IL; Michael Hirsch IL; Dan Arnon ; Peter Kamvysselis, Apparatus and methods for copying, backing up, and restoring data using a backup segment size larger than the storage block size.
Griffin David (Maynard MA) Campbell Jonathan (Acton MA) Reilly Michael (Sterling MA) Rosenbaum Richard (Pepperell MA), Arrangement with cooperating management server node and network service node.
Nakano Toshio (Odawara JPX) Nozawa Masafumi (Odawara JPX) Kurano Akira (Odawara JPX) Hisano Kiyoshi (Odawara JPX) Hoshino Masayuki (Odawara JPX), Backup control method and system in data processing system using identifiers for controlling block data transfer.
Kitajima Hiroyuki (Yokohama) Yamamoto Akira (Yokohama) Doi Takashi (Hadano) Nozawa Masafumi (Odawara JPX), Buffered peripheral system and method for backing up and retrieving data to and from backup memory device.
Ludmila Cherkasova ; Martin F. Arlitt ; Richard J. Friedrich ; Tai Jin, Caching protocol method and system based on request frequency and relative storage duration.
Cole Leo J. (Raleigh NC) Frantz Curtis J. (Durham NC) Lee Jeannette (Raleigh NC) Ordanic Zvonimir (Raleigh NC) Plank Larry K. (Rochester MN), Centralized management in a computer network.
Carpenter Kelly S. (Fremont CA) Dearing Gerard M. (San Jose CA) Nick Jeffrey M. (Fishkill NY) Strickland Jimmy P. (Saratoga CA) Swanson Michael D. (Poughkeepsie NY) Wilkinson Wendell W. (Hyde Park NY, Coherence controls for store-multiple shared data coordinated by cache directory entries in a shared electronic storage.
Senator Steven T. ; Fuller Billy J., Computer system method and apparatus providing for various versions of a file without requiring data copy or log operati.
Fecteau Jean G. (Toronto NY CAX) Gdaniec Joseph M. (Vestal NY) Hennessy James P. (Endicott NY) MacDonald John F. (Vestal NY) Osisek Damian L. (Vestal NY), Computer system which supports asynchronous commitment of data.
Dunphy William E. (Westminster CO) Halladay Steven M. (Louisville CO) Moy Michael E. (Lafayette CO) Munro Frederick G. (Broomfield CO), Data storage and protection system.
Yanai Moshe (Framingham MA) Vishlitzky Natan (Brookline MA) Alterescu Bruno (Newton MA) Castel Daniel (Framingham MA) Shklarsky Gadi (Brookline MA), Data storage system controlled remote data mirroring with respectively maintained data indices.
Fortier Richard W. (Acton MA) Mastors Robert M. (Ayer MA) Taylor Tracy M. (Upton MA) Wallace John J. (Franklin MA), Digital data processor with improved backup storage.
Kenley Gregory (Northboro MA) Ericson George (Schrewsbury MA) Fortier Richard (Acton MA) Holland Chuck (Northboro MA) Mastors Robert (Ayer MA) Pownell James (Natick MA) Taylor Tracy (Upton MA) Wallac, Digital data storage system with improved data migration.
Christenson,Nikolai Paul; Fritchie,Scott Ernest Lystig; Larson,James Stephen, Electronic mail system with methodology providing distributed message store.
Xu Yikang ; Vahalia Uresh K. ; Jiang Xiaoye ; Gupta Uday ; Tzelnic Percy, File server system using file system storage, data movers, and an exchange of meta data among data movers for file locking and direct access to shared file systems.
Lagueux, Jr., Richard A.; Stave, Joel H.; Yeaman, John B.; Stevens, Brian E.; Higgins, Robert M.; Collins, James M., Graphical user interface for configuration of a storage system.
Urevig Paul D. ; Malnati James R. ; Ethen Donald J. ; Weber Herbert L., Grouping shared resources into one or more pools and automatically re-assigning shared resources from where they are not currently needed to where they are needed.
Cane David ; Hirschman David, High performance backup via selective file saving which can perform incremental backups and exclude files and uses a cha.
Barney Rock D. ; Schwols Keith ; Nelson Ellen M., Integration of a database into file management software for protecting, tracking and retrieving data.
Martin Charles W. (Richardson TX) Reid Fredrick S. (Plano TX) Forbus Gary L. (Dallas TX) Adams Steve M. (Plano TX) Shannon C. Patrick (Garland TX) Pirpich Eric A. (Garland TX), Mass data storage and retrieval system.
Kedem Nadav,ILX, Mass storage subsystem and backup arrangement for digital data processing system which permits information to be backed up while host computer(s) continue(s) operating in connection with information .
Long Robert M., Media element library with non-overlapping subset of media elements and non-overlapping subset of media element drives accessible to first host and unaccessible to second host.
Kullick Steven E. ; Spirakis Charles S. ; Titus Diane J., Method and apparatus for transferring archival data among an arbitrarily large number of computer devices in a networked.
Eastridge Lawrence E. (Tucson AZ) Kern Robert F. (Tucson AZ) Kern Ronald M. (Tucson AZ) Mikkelsen Claus W. (Morgan Hill CA) Ratliff James M. (Tucson AZ), Method and system for automated backup copy ordering in a time zero backup copy session.
Eastridge Lawrence E. (Tucson AZ) Kern Robert F. (Tucson AZ) Micka William F. (Tucson AZ) Mikkelsen Claus W. (Morgan Hill CA) Ratliff James M. (Tucson AZ), Method and system for automated termination and resumption in a time zero backup copy process.
Walter A. Hubis ; William G. Deitz, Method and system for controlling access share storage devices in a network environment by configuring host-to-volume mapping data structures in the controller memory for granting and denying access .
Chron, Edward Gustav; Menon, Jaishankar Moothedath, Method and system for providing consistent data modification information to clients in a storage system.
Aoyama Yuki,JPX ; Takahashi Toru,JPX ; Wakayama Satoshi,JPX, Method of and an apparatus for displaying version information and configuration information and a computer-readable recording medium on which a version and configuration information display program i.
Haustein, Nils; Klein, Craig A.; Troppens, Ulf; Winarski, Daniel J., Method of and system for deduplicating backed up data in a client-server environment.
Wahlert, Brian M; Berkowitz, Brian T; van Ingen, Catharine; Rangegowda, Dharshan; Jazayeri, Mike, Method, system, and apparatus for creating saved searches and auto discovery groups for a data protection system.
Palliyil, Sudarshan; Venkateshamurthy, Shivakumara; Vijayaraghavan, Srinivas Belur; Aswathanarayana, Tejasvi, Methods, apparatus and computer programs for enhanced access to resources within a network.
Pisello Thomas (De Bary FL) Crossmier David (Casselberry FL) Ashton Paul (Oviedo FL), Network management system having virtual catalog overview of files distributively stored across network domain.
Prahlad, Anand; May, Andreas; Lunde, Norman R.; Zhou, Lixin; Kumar, Avinash; Ngo, David, Snapshot storage and management system with indexing and user interface.
Crockett Robert N. (Tucson AZ) Kern Ronald M. (Tucson AZ) Micka William F. (Tucson AZ), Software directed microcode state save for distributed storage controller.
Friend,John; Belshe,Michael; Collins,Roger; Bennett,Mike, System and method for full wireless synchronization of a data processing apparatus with a messaging system.
Mutalik Madhav ; Senie Faith M., System and method for performing file-handling operations in a digital data processing system using an operating system-independent file map.
Moulton, Gregory Hagan, System and method for unorchestrated determination of data sequences using sticky byte factoring to determine breakpoints in digital sequences.
Patel, Sujal M.; Mikesell, Paul A., System and methods for providing a distributed file system utilizing metadata to track information about data stored throughout the system.
Huai ReiJane (Old Brookville NY) Daly Robert (Ronkonkoma NY) Curti Walter (Dix Hills NY) Mohan Deepak (Huntington NY) Chueh James Kuang-Ru (Bayside NY) Louie Larry (Forest Hills NY), System and parallel streaming and data stripping to back-up a network.
Stoppani ; Jr. Peter (Woodinville WA), System for allocating storage spaces based upon required and optional service attributes having assigned piorities.
Bamford Roger J. (Woodside CA) Howard Forrest W. (Berkeley CA) Kabcenell Dirk A. (Portola Valley CA) Miner Robert N. (San Francisco CA), System for database integrity with multiple logs assigned to client subsets.
Flynn Rex A. (Belmont MA) Anick Peter G. (Marlboro MA), System for reconstructing prior versions of indexes using records indicating changes between successive versions of the.
Saether Christian D. (Seattle WA) Stoppani ; Jr. Peter (Woodinville WA), System of device independent file directories using a tag between the directories and file descriptors that migrate with.
Prahlad, Anand; Schwartz, Jeremy A.; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for classifying and transferring information in a storage network.
Borghetti, Stefano; Sgro', Antonio Mario; Corte, Gianluca Della; Gianfagna, Leonida, Thread based view and archive for simple mail transfer protocol (SMTP) clients devices and methods.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.