System and method for storing redundant information
IPC classification information
Country / Type
United States (US) Patent
Granted
International Patent Classification (IPC, 7th edition)
G06F-007/00
G06F-017/30
G11B-005/86
G06F-011/14
Application number
US-0262878
(2014-04-28)
Registration number
US-9236079
(2016-01-12)
Inventors / Address
Prahlad, Anand
Gokhale, Parag
Kottomtharayil, Rajiv
Vijayan, Manoj Kumar
Attarde, Deepak Raghunath
Applicant / Address
Commvault Systems, Inc.
Agent / Address
Perkins Coie LLP
Citation information
Times cited: 3
Cited patents: 227
Abstract
A method and system for reducing storage requirements and speeding up storage operations by reducing the storage of redundant data includes receiving a request that identifies one or more data objects to which to apply a storage operation. For each data object, the storage system determines if the data object contains data that matches another data object to which the storage operation was previously applied. If the data objects do not match, then the storage system performs the storage operation in a usual manner. However, if the data objects do match, then the storage system may avoid performing the storage operation.
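As a rough illustration of the check described in the abstract, the following Python sketch skips the storage operation when an object's content matches one that was already stored. The SHA-256 digest used as the matching test, and the names `DedupStore`, `store`, and `load`, are assumptions made for this example; the abstract does not say how matching objects are detected.

```python
import hashlib


class DedupStore:
    """Toy in-memory store that keeps one instance per distinct content."""

    def __init__(self):
        self.instances = {}   # digest -> stored bytes (the single instance)
        self.references = {}  # object name -> digest of its instance

    def store(self, name, data):
        """Return True if data was newly stored, False if it was deduplicated."""
        # Assumption: equal SHA-256 digests are treated as "matching" objects.
        digest = hashlib.sha256(data).hexdigest()
        is_new = digest not in self.instances
        if is_new:
            # No match: perform the storage operation in the usual manner.
            self.instances[digest] = data
        # Match or not, record which instance this object refers to.
        self.references[name] = digest
        return is_new

    def load(self, name):
        """Return the content of a stored object by following its reference."""
        return self.instances[self.references[name]]
```

Storing the same bytes under two names keeps only one instance, while both names remain retrievable through their references.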
Representative claims
1. A computer system for restoring data from a sequential storage medium, wherein the data has been deduplicated, the system comprising: at least one processor; at least one data storage device coupled to the at least one processor; a receiving unit configured to receive a request to restore at least first and second different data objects from a deduplicated copy of the first and second data objects, wherein the deduplicated copy of the first and second data objects is stored on a sequential storage medium, wherein the first and second data objects, prior to deduplication and storage on the sequential storage medium, had multiple, identical instances, and wherein the deduplicated copy contains one instance, stored on the sequential storage medium, of the first and second data objects, and information describing one or more references to the one instance of the first and second data objects; an identifying unit, in response to the received request, configured to: when a requested first or second data object is stored as an instance, identify a location on the sequential storage medium of the instance of the respective first or second data objects, and when a requested first or second data object is stored as a reference to an instance, identify a location on the sequential storage medium of the instance of the respective first or second data objects, and configured to sort the identified locations; and a restoring unit configured to restore the requested data objects to a random-access storage medium from the sorted locations on the sequential storage medium.
2. The computer system of claim 1, further comprising a retrieving unit configured to retrieve an index from the sequential storage medium, wherein the index comprises, for each of the first and second data objects: (1) an identifier of the first or second data object, (2) information indicating whether the first or second data object is stored as an instance or a reference in the deduplicated copy, (3) when the first or second data object is stored as a reference, an identifier of a source first or second data object for the reference, respectively, and (4) a storage location on the sequential storage medium when the first or second data object is stored as an instance.
3. The computer system of claim 2, wherein the identifying unit identifies the locations for the requested first or second data objects from the index, without accessing the references.
4. The computer system of claim 1, further comprising a recovering unit configured to: copy a chunk of data on the sequential storage medium corresponding to the requested first or second data object to a random access storage medium, determine whether the chunk of data contains an instance of the first or second data object or a reference to an instance, and when the chunk of data contains a reference to an instance, retrieve the instance from the sequential storage medium using the reference.
5. The computer system of claim 1, wherein the information describing a reference to an instance comprises a media identifier identifying a storage medium on which the instance is stored and an offset within the identified storage medium to the instance.
6. The computer system of claim 1, further comprising a retrieving unit configured to retrieve the information describing an original location of each first or second data object, wherein the restoring unit is further configured to restore the first or second data objects in accordance with the retrieved information.
7. A computer system for restoring data from a single-instance copy on a sequential storage medium, comprising: a receiving unit configured to receive a request to restore one or more data objects from a single-instance copy of the data objects on a sequential storage medium, wherein some of the data objects are or were identical, and wherein the single-instance copy contains information describing a first instance of each of the one or more data objects, and one or more references to the first instances as stored on the sequential storage medium; an identifying unit configured to, for each of the one or more data objects: identify the storage location of the instance when the data object is stored as an instance in the single-instance copy, and identify the storage location of the first instance when the data object is stored as a reference to a first instance in the single-instance copy; and a restoring unit configured to restore the one or more data objects on a random-access storage medium in an order of the identified storage locations on the sequential storage medium.
8. A non-transitory computer-readable medium containing instructions for controlling a computer system to execute a method of copying a deduplicated copy of data from a sequential storage medium to a random-access storage medium for data restoration, the method comprising: receiving a request to restore data objects from a deduplicated copy of data objects stored on a sequential storage medium, wherein the deduplicated copy contains information describing a first instance of each of the data objects, and one or more references to the first instances as stored on the sequential storage medium; in response to the request, recreating at least a portion of the deduplicated copy on the random-access storage medium, wherein the references in the recreated copy refer to first instances as stored on the random-access storage medium; receiving a request to restore one of the data objects to a destination location; determining whether the data object to be restored is stored as an instance or a reference in the deduplicated copy on the random-access storage medium; and when the data object is stored as an instance, storing the instance in the destination location, but when the data object is stored as a reference to a first instance, storing, if necessary, the first instance in the destination location.
9. The non-transitory computer-readable medium of claim 8, wherein a reference to a first instance comprises a media identifier identifying a storage medium on which the first instance is stored and an offset within the storage medium to the first instance, and wherein at least some of the data objects are of different data types or formats.
10. The non-transitory computer-readable medium of claim 8, wherein the method further comprises: copying an index from the sequential storage medium to the random-access storage medium, wherein the index comprises, for each of the data objects, (1) an identifier of the data object, and (2) information indicating whether the data object is stored as an instance or a reference in the deduplicated copy.
11. The non-transitory computer-readable medium of claim 10, wherein the determining step is performed based on the copied index.
12. A method for copying a deduplicated copy of data from a sequential storage medium to a random-access storage medium for data restoration, the method comprising: receiving a request to restore data objects from a deduplicated copy of data objects stored on a sequential storage medium, wherein the deduplicated copy contains information describing a first instance of each of the data objects, and one or more references to the first instances as stored on the sequential storage medium; in response to the request, recreating at least a portion of the deduplicated copy on the random-access storage medium, wherein the references in the recreated copy refer to first instances as stored on the random-access storage medium; receiving a request to restore one of the data objects to a destination location; determining whether the data object to be restored is stored as an instance or a reference in the deduplicated copy on the random-access storage medium; and when the data object is stored as an instance, storing the instance in the destination location, but when the data object is stored as a reference to a first instance, storing, if necessary, the first instance in the destination location.
13. The method of claim 12, wherein a reference to a first instance comprises a media identifier identifying a storage medium on which the first instance is stored and an offset within the storage medium to the first instance, and wherein at least some of the data objects are of different data types or formats.
14. The method of claim 12, further comprising: copying an index from the sequential storage medium to the random-access storage medium, wherein the index comprises, for each of the data objects, (1) an identifier of the data object, and (2) information indicating whether the data object is stored as an instance or a reference in the deduplicated copy.
15. The method of claim 14, wherein the determining step is performed based on the copied index.
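The restore flow recited in claims 1 to 3 (look up each requested object in an index, resolve references to the location of their instance, sort the locations, then read the sequential medium in that order) can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the patented implementation: the names `IndexEntry`, `plan_restore`, and `restore` are hypothetical, the tape is modeled as a dictionary from offset to bytes, and every reference is assumed to point directly at an instance.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class IndexEntry:
    """One index record per data object, loosely following claim 2."""
    object_id: str
    is_instance: bool
    source_id: Optional[str] = None  # set when the object is stored as a reference
    offset: Optional[int] = None     # set when the object is stored as an instance


def plan_restore(index, requested):
    """Return (offset, [object ids]) pairs sorted by offset on the medium."""
    by_offset = {}
    for object_id in requested:
        entry = index[object_id]
        if not entry.is_instance:
            # Resolve through the index, without reading the reference itself.
            entry = index[entry.source_id]
        by_offset.setdefault(entry.offset, []).append(object_id)
    # Ascending offsets let a sequential medium be read in a single pass.
    return sorted(by_offset.items())


def restore(tape, index, requested):
    """Copy the requested objects to a dict standing in for random-access storage."""
    restored = {}
    for offset, object_ids in plan_restore(index, requested):
        data = tape[offset]  # one read per stored instance
        for object_id in object_ids:
            restored[object_id] = data
    return restored
```

Grouping by offset before sorting means an instance shared by several requested objects is read only once, which is the practical benefit of sorting locations on a medium that is slow to seek.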
Patents cited by this patent (227)
Tarui Toshiaki (Kokubunji JPX) Sukegawa Naonobu (Kokubunji JPX) Fujii Hiroaki (Hadano JPX) Kitai Katsuyoshi (Kokubunji JPX), Access control method for a shared main memory in a multiprocessor based upon a directory held at a storage location of.
Bates, Allen Keith; Haustein, Nils; Klein, Craig Anthony; Troppens, Ulf; Winarski, Daniel James, Apparatus and method to select a deduplication protocol for a data storage library.
Yuval Ofek ; Zoran Cakeljic ; Samuel Krikler IL; Sharon Galtzur IL; Michael Hirsch IL; Dan Arnon ; Peter Kamvysselis, Apparatus and methods for copying, backing up, and restoring data using a backup segment size larger than the storage block size.
Griffin David (Maynard MA) Campbell Jonathan (Acton MA) Reilly Michael (Sterling MA) Rosenbaum Richard (Pepperell MA), Arrangement with cooperating management server node and network service node.
Dile, James Michael; Nguyen, Joanne T.; Piletski, Vadzim Ivanovich; Smith, James Patrick, Backing-up and restoring files including files referenced with multiple file names.
Nakano Toshio (Odawara JPX) Nozawa Masafumi (Odawara JPX) Kurano Akira (Odawara JPX) Hisano Kiyoshi (Odawara JPX) Hoshino Masayuki (Odawara JPX), Backup control method and system in data processing system using identifiers for controlling block data transfer.
Kitajima Hiroyuki (Yokohama) Yamamoto Akira (Yokohama) Doi Takashi (Hadano) Nozawa Masafumi (Odawara JPX), Buffered peripheral system and method for backing up and retrieving data to and from backup memory device.
Worley ; Jr. William S. (Saratoga CA) Bryg William R. (Saratoga CA) Baum Allen (Palo Alto CA), Cache memory consistency control with explicit software instructions.
Cole Leo J. (Raleigh NC) Frantz Curtis J. (Durham NC) Lee Jeannette (Raleigh NC) Ordanic Zvonimir (Raleigh NC) Plank Larry K. (Rochester MN), Centralized management in a computer network.
Carpenter Kelly S. (Fremont CA) Dearing Gerard M. (San Jose CA) Nick Jeffrey M. (Fishkill NY) Strickland Jimmy P. (Saratoga CA) Swanson Michael D. (Poughkeepsie NY) Wilkinson Wendell W. (Hyde Park NY, Coherence controls for store-multiple shared data coordinated by cache directory entries in a shared electronic storage.
Senator Steven T. ; Fuller Billy J., Computer system method and apparatus providing for various versions of a file without requiring data copy or log operati.
Fecteau Jean G. (Toronto NY CAX) Gdaniec Joseph M. (Vestal NY) Hennessy James P. (Endicott NY) MacDonald John F. (Vestal NY) Osisek Damian L. (Vestal NY), Computer system which supports asynchronous commitment of data.
Reed Drummond Shattuck ; Heymann Peter Earnshaw ; Mushero Steven Mark ; Jones Kevin Benard ; Oberlander Jeffrey Todd ; Banay Dan, Computer-based communication system and method using metadata defining a control structure.
Prahlad, Anand; Agrawal, Vijay H., Continuous data protection over intermittent connections, such as continuous data backup for laptops or wireless devices.
Prahlad, Anand; Agrawal, Vijay H., Continuous data protection over intermittent connections, such as continuous data backup for laptops or wireless devices.
Midgely Christopher W. (Framingham MA) Holland Charles J. (Northboro MA) Webb John W. (Sutton MA) Gonsalves Manuel (Brookline MA), Continuously-snapshotted protection of computer files.
Dunphy William E. (Westminster CO) Halladay Steven M. (Louisville CO) Moy Michael E. (Lafayette CO) Munro Frederick G. (Broomfield CO), Data storage and protection system.
Yanai Moshe (Framingham MA) Vishlitzky Natan (Brookline MA) Alterescu Bruno (Newton MA) Castel Daniel (Framingham MA) Shklarsky Gadi (Brookline MA), Data storage system controlled remote data mirroring with respectively maintained data indices.
Hagerstrom, Carl F.; Hutchinson, Thomas Dixon; Bharthulwar, Shridhar; Tinius, Paul E., Detecting and managing orphan files between primary and secondary data stores.
Fortier Richard W. (Acton MA) Mastors Robert M. (Ayer MA) Taylor Tracy M. (Upton MA) Wallace John J. (Franklin MA), Digital data processor with improved backup storage.
Kenley Gregory (Northboro MA) Ericson George (Schrewsbury MA) Fortier Richard (Acton MA) Holland Chuck (Northboro MA) Mastors Robert (Ayer MA) Pownell James (Natick MA) Taylor Tracy (Upton MA) Wallac, Digital data storage system with improved data migration.
Christenson,Nikolai Paul; Fritchie,Scott Ernest Lystig; Larson,James Stephen, Electronic mail system with methodology providing distributed message store.
Alam Salim ; Bhalerao Vinayak A. ; Wu Charles ; Hu George ; Ferrell John I., File object synchronization between a desktop computer and a mobile device.
Xu Yikang ; Vahalia Uresh K. ; Jiang Xiaoye ; Gupta Uday ; Tzelnic Percy, File server system using file system storage, data movers, and an exchange of meta data among data movers for file locking and direct access to shared file systems.
Bates, Allen K.; Haustein, Nils; Klein, Craig A.; Krick, Frank; Troppens, Ulf; Winarski, Daniel, File system with internal deduplication and management of data blocks.
Lagueux, Jr., Richard A.; Stave, Joel H.; Yeaman, John B.; Stevens, Brian E.; Higgins, Robert M.; Collins, James M., Graphical user interface for configuration of a storage system.
Urevig Paul D. ; Malnati James R. ; Ethen Donald J. ; Weber Herbert L., Grouping shared resources into one or more pools and automatically re-assigning shared resources from where they are not currently needed to where they are needed.
Barney Rock D. ; Schwols Keith ; Nelson Ellen M., Integration of a database into file management software for protecting, tracking and retrieving data.
Douceur,John R.; Theimer,Marvin M.; Adya,Atul; Bolosky,William J., Locating potentially identical objects across multiple computers based on stochastic partitioning of workload.
Douceur,John R.; Theimer,Marvin M.; Adya,Atul; Bolosky,William J., Locating potentially identical objects across multiple computers based on stochastic partitioning of workload.
Martin Charles W. (Richardson TX) Reid Fredrick S. (Plano TX) Forbus Gary L. (Dallas TX) Adams Steve M. (Plano TX) Shannon C. Patrick (Garland TX) Pirpich Eric A. (Garland TX), Mass data storage and retrieval system.
Kedem Nadav,ILX, Mass storage subsystem and backup arrangement for digital data processing system which permits information to be backed up while host computer(s) continue(s) operating in connection with information .
Long Robert M., Media element library with non-overlapping subset of media elements and non-overlapping subset of media element drives accessible to first host and unaccessible to second host.
Kullick Steven E. ; Spirakis Charles S. ; Titus Diane J., Method and apparatus for transferring archival data among an arbitrarily large number of computer devices in a networked.
Archibald, Jr., John Edward; McKean, Brian Dennis, Method and apparatus for using extended disk sector formatting to assist in backup and hierarchical storage management.
Eastridge Lawrence E. (Tucson AZ) Kern Robert F. (Tucson AZ) Kern Ronald M. (Tucson AZ) Mikkelsen Claus W. (Morgan Hill CA) Ratliff James M. (Tucson AZ), Method and system for automated backup copy ordering in a time zero backup copy session.
Eastridge Lawrence E. (Tucson AZ) Kern Robert F. (Tucson AZ) Micka William F. (Tucson AZ) Mikkelsen Claus W. (Morgan Hill CA) Ratliff James M. (Tucson AZ), Method and system for automated termination and resumption in a time zero backup copy process.
Walter A. Hubis ; William G. Deitz, Method and system for controlling access share storage devices in a network environment by configuring host-to-volume mapping data structures in the controller memory for granting and denying access .
Chefalas, Thomas E.; Mastrianni, Steven J., Method and system for processing backup data associated with application, querying metadata files describing files accessed by the application.
Chron, Edward Gustav; Menon, Jaishankar Moothedath, Method and system for providing consistent data modification information to clients in a storage system.
Aoyama Yuki,JPX ; Takahashi Toru,JPX ; Wakayama Satoshi,JPX, Method of and an apparatus for displaying version information and configuration information and a computer-readable recording medium on which a version and configuration information display program i.
Wahlert, Brian M; Berkowitz, Brian T; van Ingen, Catharine; Rangegowda, Dharshan; Jazayeri, Mike, Method, system, and apparatus for creating saved searches and auto discovery groups for a data protection system.
Wolfgang, John Jay; Boyd, Kenneth Wayne; Day, III, Kenneth Fairclough; Doatmas, Philip Matthew; Dahman, Kirby Grant, Method, system, and program for data synchronization between a primary storage device and a secondary storage device by determining whether a first identifier and a second identifier match, where a unique identifier is associated with each portion of data.
Palliyil, Sudarshan; Venkateshamurthy, Shivakumara; Vijayaraghavan, Srinivas Belur; Aswathanarayana, Tejasvi, Methods, apparatus and computer programs for enhanced access to resources within a network.
MacHardy, Earle; Harvey, David; Duprey, Dennis, Methods, systems, and computer program products for mapped logical unit (MLU) replications, storage, and retrieval in a redundant array of inexpensive disks (RAID) environment.
Crescenti,John; Kavuri,Srinivas; Oshinsky,David Alan; Prahlad,Anand, Modular backup and retrieval system used in conjunction with a storage area network.
Pisello Thomas (De Bary FL) Crossmier David (Casselberry FL) Ashton Paul (Oviedo FL), Network management system having virtual catalog overview of files distributively stored across network domain.
Sawdon, Wayne A.; Haskin, Roger L.; Schmuck, Frank B.; Wyllie, James C., Plurality of file systems using weighted allocation to allocate space on one or more storage devices.
Bruce, Buford L.; Kim, Peter C.; Levi, Michael; Silliman, Albert; Wissmann, Joseph T.; Zaremba, Christopher, Providing archiving of individual mail content while maintaining a single copy mail store.
Prahlad, Anand; May, Andreas; Lunde, Norman R.; Zhou, Lixin; Kumar, Avinash; Ngo, David, Snapshot storage and management system with indexing and user interface.
Prahlad, Anand; May, Andreas; Lunde, Norman R.; Zhou, Lixin; Kumar, Avinash; Ngo, David, Snapshot storage and management system with indexing and user interface.
Crockett Robert N. (Tucson AZ) Kern Ronald M. (Tucson AZ) Micka William F. (Tucson AZ), Software directed microcode state save for distributed storage controller.
Ting, Daniel; Zheng, Ling; Manley, Stephen L.; DeStefano, John Frederick, System and method for managing data deduplication of storage systems utilizing persistent consistency point images.
Ting, Daniel; Zheng, Ling; Manley, Stephen L.; DeStefano, John Frederick, System and method for managing data deduplication of storage systems utilizing persistent consistency point images.
Mutalik Madhav ; Senie Faith M., System and method for performing file-handling operations in a digital data processing system using an operating system-independent file map.
Huang,Jau Hsiung; Tseng,Wei Hsin; Chou,Hung Te; Weng,Yung Chiuan, System and method for providing access to computer files across computer operating systems.
Prahlad, Anand; Gokhale, Parag; Kottomtharayil, Rajiv; Retnamma, Manoj K. Vijayan; Attarde, Deepak R., System and method for storing redundant information.
Prahlad, Anand; Gokhale, Parag; Kottomtharayil, Rajiv; Vijayan Retnamma, Manoj K.; Attarde, Deepak R., System and method for storing redundant information.
Moulton, Gregory Hagan, System and method for unorchestrated determination of data sequences using sticky byte factoring to determine breakpoints in digital sequences.
Patel, Sujal M.; Mikesell, Paul A., System and methods for providing a distributed file system utilizing metadata to track information about data stored throughout the system.
Huai ReiJane (Old Brookville NY) Daly Robert (Ronkonkoma NY) Curti Walter (Dix Hills NY) Mohan Deepak (Huntington NY) Chueh James Kuang-Ru (Bayside NY) Louie Larry (Forest Hills NY), System and parallel streaming and data stripping to back-up a network.
Frasier, Lawrence Martin; Resino, Robert George, System for adjusting resource allocation to a logical partition based on rate of page swaps and utilization by changing a boot configuration file.
Stoppani ; Jr. Peter (Woodinville WA), System for allocating storage spaces based upon required and optional service attributes having assigned priorities.
Sim-Tang, Siew Yong; Fraisl, Daniel J., System for moving real-time data events across a plurality of devices in a network for simultaneous data protection, replication, and access services.
Flynn Rex A. (Belmont MA) Anick Peter G. (Marlboro MA), System for reconstructing prior versions of indexes using records indicating changes between successive versions of the.
Morris Robert J. T. (Los Gatos CA), System for reducing storage requirements and transmission loads in a backup subsystem in client-server environment by tr.
Saether Christian D. (Seattle WA) Stoppani ; Jr. Peter (Woodinville WA), System of device independent file directories using a tag between the directories and file descriptors that migrate with.
Prahlad, Anand; Schwartz, Jeremy A.; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for classifying and transferring information in a storage network.
Prahlad, Anand; Schwartz, Jeremy A.; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for classifying and transferring information in a storage network.