System and method for storing redundant information
IPC classification information
Country / Type
United States (US) Patent
Granted
International Patent Classification (IPC, 7th edition)
G06F-007/00
G06F-017/00
Application number
US-0251022
(2011-09-30)
Registration number
US-8285683
(2012-10-09)
Inventors / Address
Prahlad, Anand
Gokhale, Parag
Kottomtharayil, Rajiv
Retnamma, Manoj K. Vijayan
Attarde, Deepak R.
Applicant / Address
CommVault Systems, Inc.
Agent / Address
Perkins Coie LLP
Citation information
Times cited: 19
Patents cited: 147
Abstract
A method and system for reducing storage requirements and speeding up storage operations by reducing the storage of redundant data includes receiving a request that identifies one or more data objects to which to apply a storage operation. For each data object, the storage system determines if the data object contains data that matches another data object to which the storage operation was previously applied. If the data objects do not match, then the storage system performs the storage operation in a usual manner. However, if the data objects do match, then the storage system may avoid performing the storage operation.
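The matching step described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes that "matching" is decided by comparing a SHA-256 digest of each object's content against an index of digests already stored, and the class and method names (DedupStore, store) are hypothetical.

```python
import hashlib


class DedupStore:
    """Toy in-memory store (hypothetical; for illustration only)."""

    def __init__(self):
        self.blobs = {}    # digest -> bytes, one instance per unique content
        self.catalog = {}  # object name -> digest, for every object seen

    def store(self, name, data):
        """Apply the storage operation; return True if data was written."""
        # Assumption: a content digest decides whether two objects match.
        digest = hashlib.sha256(data).hexdigest()
        self.catalog[name] = digest
        if digest in self.blobs:
            # Matches an object stored earlier: avoid the storage operation.
            return False
        # No match: perform the storage operation in the usual manner.
        self.blobs[digest] = data
        return True


store = DedupStore()
results = [
    store.store("a/report.doc", b"quarterly numbers"),
    store.store("b/report-copy.doc", b"quarterly numbers"),
    store.store("c/notes.txt", b"meeting notes"),
]
```

Here the second call finds a matching digest and skips the write, so three catalog entries share two stored instances.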
Representative claims
1. A method in a computer system for restoring data from a deduplicated copy, the method comprising: receiving a request to restore to a destination location one or more data objects originally located within a hierarchy, wherein the copy of data contains information describing the hierarchy, but only contains a reduced instance of each data object of the hierarchy for storage on sequential media, and wherein the request specifies an original location within the hierarchy of the data objects to be restored; for each data object in the request: determining whether the data at the original location within the hierarchy specified by the request contains the data object itself or a reference to a data object and the data object is stored elsewhere; wherein the request contains a digest value for identifying each data object, wherein the determining comprises comparing the digest value to an index of digest values of data objects stored within the computer system; if the original location within the hierarchy specified by the request contains a reference to a data object, resolving the reference to find the data object and copying the data object to the destination location; and if the original location within the hierarchy specified by the request contains the data object itself, copying the data object to the destination location.
2. The method of claim 1 wherein the hierarchy is a file system hierarchy, and wherein the copy of data contains a reduced instance of the data of each file in the file system hierarchy, and wherein additional instances of each file in the original location are stored as stubs within the copy of data that reference the single instance of the data of each file.
3. The method of claim 1 wherein the reference further comprises a pointer to the location of the actual data for the data object, and wherein resolving the reference comprises locating the actual data based on the pointer.
4. The method of claim 1 wherein the one or more data objects are email messages and wherein the hierarchy comprises folders within an email message store.
5. The method of claim 1 wherein the copy of data is stored on one or more sequential media, wherein the reference further comprises a media identifier and offset of the actual data object, and wherein resolving the reference comprises locating the media identified by the media identifier and seeking to the offset.
6. The method of claim 1 wherein the copying the data object to the destination location further comprises decrypting the data object.
7. A system for improving the retrieval of a data object in a network environment that contains multiple instances of the data object, the system comprising: at least one processor; at least one memory, coupled to the processor; means, coupled among the at least one processor and memory, for receiving a request to access the data object at a first location; means, coupled among the at least one processor and memory, for determining at least one alternative location of an instance of the requested data object, wherein the at least one alternative location is not the first location; means, coupled among the at least one processor and memory, for selecting an instance of the data object from among the determined at least one alternative location to use to satisfy the request; and means, coupled among the at least one processor and memory, for sending the selected instance of the data object in response to the request, wherein, coupled among the at least one processor and memory, the request contains a digest value for identifying the data object, and wherein, coupled among the at least one processor and memory, determining the at least one alternative location of an instance of the requested data object comprises comparing the digest value to an index of digest values of data objects stored within the network environment.
8. The system of claim 7 wherein a computer system from which the request is received is closer to the selected alternative location than to the first location.
9. The system of claim 7 wherein the request is an HTTP request for a data object located on the world wide web, and wherein the alternative location is a location within a private local area network.
10. The system of claim 7 wherein the means for selecting an instance of the data object comprises means for determining the fastest location from which the data object is available.
11. The system of claim 7 wherein the means for selecting an instance of the data object comprises means for identifying multiple copies containing instances of the data object and determining attributes of each copy that affect an accessing speed for the data object.
12. The system of claim 7 wherein the means for sending the selected instance of the data object comprises means for copying the data object from the selected alternative location to an intermediate location, and then from the intermediate location to a location specified in the request.
13. At least one tangible computer-readable medium storing instructions, which when executed by at least one data processing device performs a method for restoring data from a deduplicated copy, comprising: receiving a request to restore to a destination location one or more data objects originally located within a hierarchy, wherein the copy of data contains information describing the hierarchy, but only contains a reduced instance of each data object of the hierarchy for storage on sequential media, and wherein the request specifies an original location within the hierarchy of the data objects to be restored; for each data object in the request: determining whether the data at the original location within the hierarchy specified by the request contains the data object itself or a reference to a data object and the data object is stored elsewhere; wherein the request contains a digest value for identifying each data object, wherein the determining comprises comparing the digest value to an index of digest values of data objects stored within a computer system; if the original location within the hierarchy specified by the request contains a reference to a data object, resolving the reference to find the data object and copying the data object to the destination location; and if the original location within the hierarchy specified by the request contains the data object itself, copying the data object to the destination location.
14. The tangible computer-readable medium of claim 13 wherein the hierarchy is a file system hierarchy, and wherein the copy of data contains a reduced instance of the data of each file in the file system hierarchy, and wherein additional instances of each file in the original location are stored as stubs within the copy of data that reference the single instance of the data of each file.
15. The tangible computer-readable medium of claim 13 wherein the reference further comprises a pointer to the location of the actual data for the data object, and wherein resolving the reference comprises locating the actual data based on the pointer.
16. The tangible computer-readable medium of claim 13 wherein the one or more data objects are email messages and wherein the hierarchy comprises folders within an email message store.
17. The tangible computer-readable medium of claim 13 wherein the copy of data is stored on one or more sequential media, wherein the reference further comprises a media identifier and offset of the actual data object, and wherein resolving the reference comprises locating the media identified by the media identifier and seeking to the offset.
18. The tangible computer-readable medium of claim 13 wherein the copying the data object to the destination location further comprises decrypting the data object.
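The restore flow recited in claims 1 and 5 can be sketched roughly as below. This is an illustrative sketch under stated assumptions, not the claimed implementation: sequential media are modeled as byte strings keyed by a media identifier, the Reference type and its length field are hypothetical (the claims name only a media identifier and an offset), and the digest comparison against an index is omitted for brevity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Reference:
    """Hypothetical stub that points at the single stored instance."""
    media_id: str
    offset: int
    length: int  # assumption: the claims mention only media id and offset


def restore(paths, copy, media, destination):
    """Copy each requested object from the deduplicated copy to destination."""
    for path in paths:
        entry = copy[path]  # what the copy holds at the original location
        if isinstance(entry, Reference):
            # Reference: locate the medium, then seek to the offset.
            volume = media[entry.media_id]
            data = volume[entry.offset:entry.offset + entry.length]
        else:
            # The location holds the data object itself.
            data = entry
        destination[path] = data
    return destination


# One "tape" holding two objects back to back (toy data).
media = {"tape-01": b"hello worldsecond object"}
copy = {
    "docs/a.txt": b"hello world",               # the object itself
    "docs/b.txt": Reference("tape-01", 0, 11),  # stub for a duplicate
    "mail/c.eml": Reference("tape-01", 11, 13),
}
restored = restore(["docs/a.txt", "docs/b.txt", "mail/c.eml"], copy, media, {})
```

Both branches end in the same copy to the destination; only the lookup differs, which is why the hierarchy can be restored in full even though each object's data is stored once.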
Patents cited by this patent (147)
Ackaouy,Emmanuel; Amdur,Matthew B.; Prakash,Ashish; Ayyar,Kartik, Apparatus and method for tandem operation in a storage network.
Yuval Ofek ; Zoran Cakeljic ; Samuel Krikler IL; Sharon Galtzur IL; Michael Hirsch IL; Dan Arnon ; Peter Kamvysselis, Apparatus and methods for copying, backing up, and restoring data using a backup segment size larger than the storage block size.
Griffin David (Maynard MA) Campbell Jonathan (Acton MA) Reilly Michael (Sterling MA) Rosenbaum Richard (Pepperell MA), Arrangement with cooperating management server node and network service node.
Nakano Toshio (Odawara JPX) Nozawa Masafumi (Odawara JPX) Kurano Akira (Odawara JPX) Hisano Kiyoshi (Odawara JPX) Hoshino Masayuki (Odawara JPX), Backup control method and system in data processing system using identifiers for controlling block data transfer.
Kitajima Hiroyuki (Yokohama) Yamamoto Akira (Yokohama) Doi Takashi (Hadano) Nozawa Masafumi (Odawara JPX), Buffered peripheral system and method for backing up and retrieving data to and from backup memory device.
Cole Leo J. (Raleigh NC) Frantz Curtis J. (Durham NC) Lee Jeannette (Raleigh NC) Ordanic Zvonimir (Raleigh NC) Plank Larry K. (Rochester MN), Centralized management in a computer network.
Carpenter Kelly S. (Fremont CA) Dearing Gerard M. (San Jose CA) Nick Jeffrey M. (Fishkill NY) Strickland Jimmy P. (Saratoga CA) Swanson Michael D. (Poughkeepsie NY) Wilkinson Wendell W. (Hyde Park NY, Coherence controls for store-multiple shared data coordinated by cache directory entries in a shared electronic storage.
Senator Steven T. ; Fuller Billy J., Computer system method and apparatus providing for various versions of a file without requiring data copy or log operati.
Fecteau Jean G. (Toronto NY CAX) Gdaniec Joseph M. (Vestal NY) Hennessy James P. (Endicott NY) MacDonald John F. (Vestal NY) Osisek Damian L. (Vestal NY), Computer system which supports asynchronous commitment of data.
Dunphy William E. (Westminster CO) Halladay Steven M. (Louisville CO) Moy Michael E. (Lafayette CO) Munro Frederick G. (Broomfield CO), Data storage and protection system.
Yanai Moshe (Framingham MA) Vishlitzky Natan (Brookline MA) Alterescu Bruno (Newton MA) Castel Daniel (Framingham MA) Shklarsky Gadi (Brookline MA), Data storage system controlled remote data mirroring with respectively maintained data indices.
Hagerstrom, Carl F.; Hutchinson, Thomas Dixon; Bharthulwar, Shridhar; Tinius, Paul E., Detecting and managing orphan files between primary and secondary data stores.
Fortier Richard W. (Acton MA) Mastors Robert M. (Ayer MA) Taylor Tracy M. (Upton MA) Wallace John J. (Franklin MA), Digital data processor with improved backup storage.
Kenley Gregory (Northboro MA) Ericson George (Schrewsbury MA) Fortier Richard (Acton MA) Holland Chuck (Northboro MA) Mastors Robert (Ayer MA) Pownell James (Natick MA) Taylor Tracy (Upton MA) Wallac, Digital data storage system with improved data migration.
Christenson,Nikolai Paul; Fritchie,Scott Ernest Lystig; Larson,James Stephen, Electronic mail system with methodology providing distributed message store.
Alam Salim ; Bhalerao Vinayak A. ; Wu Charles ; Hu George ; Ferrell John I., File object synchronization between a desktop computer and a mobile device.
Xu Yikang ; Vahalia Uresh K. ; Jiang Xiaoye ; Gupta Uday ; Tzelnic Percy, File server system using file system storage, data movers, and an exchange of meta data among data movers for file locking and direct access to shared file systems.
Lagueux, Jr., Richard A.; Stave, Joel H.; Yeaman, John B.; Stevens, Brian E.; Higgins, Robert M.; Collins, James M., Graphical user interface for configuration of a storage system.
Urevig Paul D. ; Malnati James R. ; Ethen Donald J. ; Weber Herbert L., Grouping shared resources into one or more pools and automatically re-assigning shared resources from where they are not currently needed to where they are needed.
Barney Rock D. ; Schwols Keith ; Nelson Ellen M., Integration of a database into file management software for protecting, tracking and retrieving data.
Douceur,John R.; Theimer,Marvin M.; Adya,Atul; Bolosky,William J., Locating potentially identical objects across multiple computers based on stochastic partitioning of workload.
Douceur,John R.; Theimer,Marvin M.; Adya,Atul; Bolosky,William J., Locating potentially identical objects across multiple computers based on stochastic partitioning of workload.
Martin Charles W. (Richardson TX) Reid Fredrick S. (Plano TX) Forbus Gary L. (Dallas TX) Adams Steve M. (Plano TX) Shannon C. Patrick (Garland TX) Pirpich Eric A. (Garland TX), Mass data storage and retrieval system.
Kedem Nadav,ILX, Mass storage subsystem and backup arrangement for digital data processing system which permits information to be backed up while host computer(s) continue(s) operating in connection with information .
Long Robert M., Media element library with non-overlapping subset of media elements and non-overlapping subset of media element drives accessible to first host and unaccessible to second host.
Kullick Steven E. ; Spirakis Charles S. ; Titus Diane J., Method and apparatus for transferring archival data among an arbitrarily large number of computer devices in a networked.
Eastridge Lawrence E. (Tucson AZ) Kern Robert F. (Tucson AZ) Kern Ronald M. (Tucson AZ) Mikkelsen Claus W. (Morgan Hill CA) Ratliff James M. (Tucson AZ), Method and system for automated backup copy ordering in a time zero backup copy session.
Eastridge Lawrence E. (Tucson AZ) Kern Robert F. (Tucson AZ) Micka William F. (Tucson AZ) Mikkelsen Claus W. (Morgan Hill CA) Ratliff James M. (Tucson AZ), Method and system for automated termination and resumption in a time zero backup copy process.
Walter A. Hubis ; William G. Deitz, Method and system for controlling access share storage devices in a network environment by configuring host-to-volume mapping data structures in the controller memory for granting and denying access .
Chefalas, Thomas E.; Mastrianni, Steven J., Method and system for processing backup data associated with application, querying metadata files describing files accessed by the application.
Chron, Edward Gustav; Menon, Jaishankar Moothedath, Method and system for providing consistent data modification information to clients in a storage system.
Aoyama Yuki,JPX ; Takahashi Toru,JPX ; Wakayama Satoshi,JPX, Method of and an apparatus for displaying version information and configuration information and a computer-readable recording medium on which a version and configuration information display program i.
Wahlert, Brian M; Berkowitz, Brian T; van Ingen, Catharine; Rangegowda, Dharshan; Jazayeri, Mike, Method, system, and apparatus for creating saved searches and auto discovery groups for a data protection system.
Wolfgang, John Jay; Boyd, Kenneth Wayne; Day, III, Kenneth Fairclough; Doatmas, Philip Matthew; Dahman, Kirby Grant, Method, system, and program for data synchronization between a primary storage device and a secondary storage device by determining whether a first identifier and a second identifier match, where a unique identifier is associated with each portion of data.
Palliyil, Sudarshan; Venkateshamurthy, Shivakumara; Vijayaraghavan, Srinivas Belur; Aswathanarayana, Tejasvi, Methods, apparatus and computer programs for enhanced access to resources within a network.
Pisello Thomas (De Bary FL) Crossmier David (Casselberry FL) Ashton Paul (Oviedo FL), Network management system having virtual catalog overview of files distributively stored across network domain.
Sawdon, Wayne A.; Haskin, Roger L.; Schmuck, Frank B.; Wyllie, James C., Plurality of file systems using weighted allocation to allocate space on one or more storage devices.
Prahlad, Anand; May, Andreas; Lunde, Norman R.; Zhou, Lixin; Kumar, Avinash; Ngo, David, Snapshot storage and management system with indexing and user interface.
Crockett Robert N. (Tucson AZ) Kern Ronald M. (Tucson AZ) Micka William F. (Tucson AZ), Software directed microcode state save for distributed storage controller.
Mutalik Madhav ; Senie Faith M., System and method for performing file-handling operations in a digital data processing system using an operating system-independent file map.
Prahlad, Anand; Gokhale, Parag; Kottomtharayil, Rajiv; Vijayan Retnamma, Manoj K.; Attarde, Deepak R., System and method for storing redundant information.
Moulton, Gregory Hagan, System and method for unorchestrated determination of data sequences using sticky byte factoring to determine breakpoints in digital sequences.
Patel, Sujal M.; Mikesell, Paul A., System and methods for providing a distributed file system utilizing metadata to track information about data stored throughout the system.
Huai ReiJane (Old Brookville NY) Daly Robert (Ronkonkoma NY) Curti Walter (Dix Hills NY) Mohan Deepak (Huntington NY) Chueh James Kuang-Ru (Bayside NY) Louie Larry (Forest Hills NY), System and parallel streaming and data stripping to back-up a network.
Stoppani ; Jr. Peter (Woodinville WA), System for allocating storage spaces based upon required and optional service attributes having assigned piorities.
Flynn Rex A. (Belmont MA) Anick Peter G. (Marlboro MA), System for reconstructing prior versions of indexes using records indicating changes between successive versions of the.
Saether Christian D. (Seattle WA) Stoppani ; Jr. Peter (Woodinville WA), System of device independent file directories using a tag between the directories and file descriptors that migrate with.
Prahlad, Anand; Schwartz, Jeremy A.; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for classifying and transferring information in a storage network.
Prahlad, Anand; Agrawal, Vijay H., Continuous data protection over intermittent connections, such as continuous data backup for laptops or wireless devices.
Prahlad, Anand; Agrawal, Vijay H., Continuous data protection over intermittent connections, such as continuous data backup for laptops or wireless devices.
Senthilnathan, Muthusamy; Thati, Ravi; Kumarasamy, Paramasivam; Mishra, Hemant, Predicting scale of data migration between production and archive storage systems, such as for enterprise customers having large and/or numerous files.