Application-aware and remote single instance data management
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G06F-007/00
G06F-017/30
출원번호
US-0816687
(2015-08-03)
등록번호
US-9971784
(2018-05-15)
발명자
/ 주소
Gokhale, Parag
출원인 / 주소
COMMVAULT SYSTEMS, INC.
대리인 / 주소
Commvault Systems, Inc.
인용정보
피인용 횟수 :
0인용 특허 :
209
초록▼
A method and system for reducing storage requirements and speeding up storage operations by reducing the storage of redundant data includes receiving a request that identifies one or more files or data objects to which to apply a storage operation. For each file or data object, the storage system de
A method and system for reducing storage requirements and speeding up storage operations by reducing the storage of redundant data includes receiving a request that identifies one or more files or data objects to which to apply a storage operation. For each file or data object, the storage system determines if the file or data object contains data that matches another file or data object to which the storage operation was previously applied, based on awareness of the application that created the data object. If the data objects do not match, then the storage system performs the storage operation in a usual manner. However, if the data objects do match, then the storage system may avoid performing the storage operation with respect to the particular file or data object.
대표청구항▼
1. A system for copying files or data objects from a computer system at a first location to a second location, including single-instancing the files or data objects with a plurality of differing associated metadata, the system comprising: a processor; andmultiple hardware components, including:a sto
1. A system for copying files or data objects from a computer system at a first location to a second location, including single-instancing the files or data objects with a plurality of differing associated metadata, the system comprising: a processor; andmultiple hardware components, including:a storage operation manager component coupled to the processor and configured to receive a request to copy a file or data object from a computer system at a first location to a second location, wherein the first location and the second location are geographically remote from each other;a file cache component at the first location configured to: receive the file or data object to be copied from the computer system, andstore the file or data object before it is copied to the second location;a single instance database component at the first location configured to: extract metadata associated with the file or data object,query the second location to determine whether the file or data object is already stored at the second location,wherein the query includes the extracted metadata, andreceive a response from the second location that indicates whether the file or data object is already stored at the second location, wherein the response is based on determining at the second location whether the extracted metadata matches metadata from any files or data objects stored at the second location; andwherein the single instance database component at the first location is further configured to: when the file or data object is not already stored at the second location, copy the file or data object from the file cache component at the first location to the second location, andwhen the file or data object is already stored at the second location and the extracted metadata does not match metadata stored at the second location, (a) single-instance the file or data object at the second location by declining to copy the file or data object thereto from the file cache component at the first location, and(b) copy the extracted metadata to the second location and associate the extracted metadata with the already-stored file or data object at the second location, thereby storing for a single stored instance of the file or data object at the second location at least a first metadata version and a second metadata version that is different from the first metadata version. 2. The system of claim 1 wherein the storage operation manager component is further configured to receive a second request to copy a second file or data object from a second computer system at a third location to the second location, wherein the second location and the third location are geographically remote, and further comprising: a second file cache component at the third location configured to: receive the second file or data object to be copied from the second computer system, andstore the second file or data object before it is copied to the second location; anda second single instance database component at the third location configured to: query the second location to determine whether the second file or data object is already stored at the second location,receive a response from the second location that indicates whether the second file or data object is already stored at the second location, andwhen the second file or data object is not already stored at the second location, copy the second file or data object from the second file cache component to the second location. 3. The system of claim 1 wherein the single instance database component is further configured to: ascertain incremental changes in files or data objects stored at the file cache component; andcopy any incremental changes in files or data objects from the file cache component to the second location. 4. The system of claim 1 wherein the first location is associated with a satellite office and the second location is associated with a main office, wherein the file cache component is further configured to only store one copy of each file or data object, and wherein the single instance database component is further configured to copy the files or data objects that are unique in the first location to the second location. 5. The system of claim 1 wherein the single instance database component is further configured to: create a substantially unique identifier that represents the file or data object; andsend the substantially unique identifier to the second location as part of the query to determine whether the file or data object is already stored at the second location. 6. The system of claim 1, wherein the extracted metadata is associated with at least one of: permissions for the file or data object, a property of the file or data object, an access control list for the file or data object, an identifier for the file or data object, a size of the file or data object, a creation date of the file or data object, a creation date for the file or data object, and an access date for the file or data object. 7. A non-transitory computer-readable storage medium encoded with instructions for controlling a computer system to transfer files from a computer system at a source location to a target location, by a method comprising: receiving a request to transfer a file from a computer system at a source location to a target location, wherein the target location includes a single instance database, and wherein the source location and the target location are geographically remote from each other;sending a request to the single instance database to determine whether the file matches any file already stored by the single instance database and wherein metadata extracted from the file to be transferred matches any metadata associated with any file already stored by the single instance database;receiving a determination from the single instance database as to whether the file matches any file already stored by the single instance database;when the file does not match any file already stored at the target location, storing the file from the computer system at the source location to the single instance database at the target location;receiving a determination from the single instance database as to whether the extracted metadata matches any metadata associated with any file stored by the single instance database; andwhen the file is already stored by the single instance database and the extracted metadata does not match metadata associated with the already-stored file: (a) declining to store the file from the computer system to the single instance database at the target location, and (b) storing the extracted metadata from the computer system to the single instance database at the target location and associating the extracted metadata with the already-stored file, thereby storing for a single stored instance of the file at the single instance database at least a first metadata version and a second metadata version that is different from the first metadata version. 8. The computer-readable storage medium of claim 7 wherein the single instance database stores files received from computer systems at locations other than the source location and tracks such locations, and wherein the single instance database contains a reference count for each file indicating how many different tracked locations refer to the file. 9. The computer-readable storage medium of claim 7 wherein the request to store the file is received as part of a continuous data replication operation. 10. The computer-readable storage medium of claim 7 wherein sending a request to the single instance database includes: generating a substantially unique identifier that represents the file; andsending the substantially unique identifier to the single instance database. 11. The computer-readable storage medium of claim 7 wherein sending a request to the single instance database includes: identifying data objects within the file;generating substantially unique identifiers that represent each identified data object; andsending the substantially unique identifiers to the single instance database. 12. The computer-readable storage medium of claim 7, wherein the method further comprises: when the file matches a file already stored at the target location: incrementing a reference count in an index that refers to the file; andindicating to the requestor that the request to copy the file to the target location succeeded, without actually copying the file to the target location. 13. The computer-readable storage medium of claim 7 wherein sending a request to a single instance database at the target location includes: determining the size of the file; andsending the file size to the single instance database. 14. A method for transferring files from a computer system at a source location to a target location, the method comprising: receiving a request to transfer a file from a computer system at a source location to a target location, wherein the target location includes a single instance database, and wherein the source location and the target location are geographically remote from each other;sending a request to the single instance database to determine whether the file matches any file already stored by the single instance database and wherein metadata extracted from the file to be transferred matches any metadata associated with any file already stored by the single instance database;receiving a determination from the single instance database as to whether the file matches any file already stored by the single instance database;when the file does not match any file already stored at the target location, storing the file from the computer system at the source destination to the single instance database at the target location;receiving a determination from the single instance database as to whether the extracted metadata matches any metadata associated with any file stored by the single instance database; andwhen the file is already stored by the single instance database and the extracted metadata does not match metadata associated with the already-stored file: (a) declining to store the file from the computer system at the source location to the single instance database at the target location, and (b) storing the extracted metadata from the computer system to the target location and associating the extracted metadata with the already-stored file, thereby storing for a single stored instance of the file at the single instance database at least a first metadata version and a second metadata version that is different from the first metadata version. 15. The method of claim 14, wherein the single instance database stores files received from computer systems at locations other than the source location and tracks such locations, and wherein the single instance database contains a reference count for each file indicating how many different tracked locations refer to the file. 16. The method of claim 14, wherein the request to store the file is received as part of a continuous data replication operation. 17. The method of claim 14, wherein sending a request to the single instance database includes: generating a substantially unique identifier that represents the file; andsending the substantially unique identifier to the single instance database. 18. The method of claim 14, wherein sending a request to the single instance database includes: identifying data objects within the file;generating substantially unique identifiers that represent each identified data object; andsending the substantially unique identifiers to the single instance database. 19. The method of claim 14, wherein the method further comprises: when the file matches a file already stored at the target location: incrementing a reference count in an index that refers to the file; andindicating to the requestor that the request to copy the file to the target location succeeded, without actually copying the file to the target location.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (209)
Ackaouy,Emmanuel; Amdur,Matthew B.; Prakash,Ashish; Ayyar,Kartik, Apparatus and method for tandem operation in a storage network.
Bates, Allen Keith; Haustein, Nils; Klein, Craig Anthony; Troppens, Ulf; Winarski, Daniel James, Apparatus and method to select a deduplication protocol for a data storage library.
Yuval Ofek ; Zoran Cakeljic ; Samuel Krikler IL; Sharon Galtzur IL; Michael Hirsch IL; Dan Arnon ; Peter Kamvysselis, Apparatus and methods for copying, backing up, and restoring data using a backup segment size larger than the storage block size.
Griffin David (Maynard MA) Campbell Jonathan (Acton MA) Reilly Michael (Sterling MA) Rosenbaum Richard (Pepperell MA), Arrangement with cooperating management server node and network service node.
Dile, James Michael; Nguyen, Joanne T.; Piletski, Vadzim Ivanovich; Smith, James Patrick, Backing-up and restoring files including files referenced with multiple file names.
Nakano Toshio (Odawara JPX) Nozawa Masafumi (Odawara JPX) Kurano Akira (Odawara JPX) Hisano Kiyoshi (Odawara JPX) Hoshino Masayuki (Odawara JPX), Backup control method and system in data processing system using identifiers for controlling block data transfer.
Kitajima Hiroyuki (Yokohama) Yamamoto Akira (Yokohama) Doi Takashi (Hadano) Nozawa Masafumi (Odawara JPX), Buffered peripheral system and method for backing up and retrieving data to and from backup memory device.
Cole Leo J. (Raleigh NC) Frantz Curtis J. (Durham NC) Lee Jeannette (Raleigh NC) Ordanic Zvonimir (Raleigh NC) Plank Larry K. (Rochester MN), Centralized management in a computer network.
Carpenter Kelly S. (Fremont CA) Dearing Gerard M. (San Jose CA) Nick Jeffrey M. (Fishkill NY) Strickland Jimmy P. (Saratoga CA) Swanson Michael D. (Poughkeepsie NY) Wilkinson Wendell W. (Hyde Park NY, Coherence controls for store-multiple shared data coordinated by cache directory entries in a shared electronic storage.
Chung, Keicy, Computer storage device having separate read-only space and read-write space, removable media component, system management interface, and network interface.
Senator Steven T. ; Fuller Billy J., Computer system method and apparatus providing for various versions of a file without requiring data copy or log operati.
Fecteau Jean G. (Toronto NY CAX) Gdaniec Joseph M. (Vestal NY) Hennessy James P. (Endicott NY) MacDonald John F. (Vestal NY) Osisek Damian L. (Vestal NY), Computer system which supports asynchronous commitment of data.
Prahlad, Anand; Agrawal, Vijay H., Continuous data protection over intermittent connections, such as continuous data backup for laptops or wireless devices.
Prahlad, Anand; Agrawal, Vijay H., Continuous data protection over intermittent connections, such as continuous data backup for laptops or wireless devices.
Prahlad, Anand; Agrawal, Vijay H., Continuous data protection over intermittent connections, such as continuous data backup for laptops or wireless devices.
Prahlad, Anand; Agrawal, Vijay H., Continuous data protection over intermittent connections, such as continuous data backup for laptops or wireless devices.
Dunphy William E. (Westminster CO) Halladay Steven M. (Louisville CO) Moy Michael E. (Lafayette CO) Munro Frederick G. (Broomfield CO), Data storage and protection system.
Yanai Moshe (Framingham MA) Vishlitzky Natan (Brookline MA) Alterescu Bruno (Newton MA) Castel Daniel (Framingham MA) Shklarsky Gadi (Brookline MA), Data storage system controlled remote data mirroring with respectively maintained data indices.
Hagerstrom, Carl F.; Hutchinson, Thomas Dixon; Bharthulwar, Shridhar; Tinius, Paul E., Detecting and managing orphan files between primary and secondary data stores.
Fortier Richard W. (Acton MA) Mastors Robert M. (Ayer MA) Taylor Tracy M. (Upton MA) Wallace John J. (Franklin MA), Digital data processor with improved backup storage.
Kenley Gregory (Northboro MA) Ericson George (Schrewsbury MA) Fortier Richard (Acton MA) Holland Chuck (Northboro MA) Mastors Robert (Ayer MA) Pownell James (Natick MA) Taylor Tracy (Upton MA) Wallac, Digital data storage system with improved data migration.
Christenson,Nikolai Paul; Fritchie,Scott Ernest Lystig; Larson,James Stephen, Electronic mail system with methodology providing distributed message store.
Alam Salim ; Bhalerao Vinayak A. ; Wu Charles ; Hu George ; Ferrell John I., File object synchronization between a desktop computer and a mobile device.
Xu Yikang ; Vahalia Uresh K. ; Jiang Xiaoye ; Gupta Uday ; Tzelnic Percy, File server system using file system storage, data movers, and an exchange of meta data among data movers for file locking and direct access to shared file systems.
Bates, Allen K.; Haustein, Nils; Klein, Craig A.; Krick, Frank; Troppens, Ulf; Winarski, Daniel, File system with internal deduplication and management of data blocks.
Lagueux, Jr., Richard A.; Stave, Joel H.; Yeaman, John B.; Stevens, Brian E.; Higgins, Robert M.; Collins, James M., Graphical user interface for configuration of a storage system.
Urevig Paul D. ; Malnati James R. ; Ethen Donald J. ; Weber Herbert L., Grouping shared resources into one or more pools and automatically re-assigning shared resources from where they are not currently needed to where they are needed.
Barney Rock D. ; Schwols Keith ; Nelson Ellen M., Integration of a database into file management software for protecting, tracking and retrieving data.
Douceur,John R.; Theimer,Marvin M.; Adya,Atul; Bolosky,William J., Locating potentially identical objects across multiple computers based on stochastic partitioning of workload.
Douceur,John R.; Theimer,Marvin M.; Adya,Atul; Bolosky,William J., Locating potentially identical objects across multiple computers based on stochastic partitioning of workload.
Martin Charles W. (Richardson TX) Reid Fredrick S. (Plano TX) Forbus Gary L. (Dallas TX) Adams Steve M. (Plano TX) Shannon C. Patrick (Garland TX) Pirpich Eric A. (Garland TX), Mass data storage and retrieval system.
Kedem Nadav,ILX, Mass storage subsystem and backup arrangement for digital data processing system which permits information to be backed up while host computer(s) continue(s) operating in connection with information .
Long Robert M., Media element library with non-overlapping subset of media elements and non-overlapping subset of media element drives accessible to first host and unaccessible to second host.
Kullick Steven E. ; Spirakis Charles S. ; Titus Diane J., Method and apparatus for transferring archival data among an arbitrarily large number of computer devices in a networked.
Eastridge Lawrence E. (Tucson AZ) Kern Robert F. (Tucson AZ) Kern Ronald M. (Tucson AZ) Mikkelsen Claus W. (Morgan Hill CA) Ratliff James M. (Tucson AZ), Method and system for automated backup copy ordering in a time zero backup copy session.
Eastridge Lawrence E. (Tucson AZ) Kern Robert F. (Tucson AZ) Micka William F. (Tucson AZ) Mikkelsen Claus W. (Morgan Hill CA) Ratliff James M. (Tucson AZ), Method and system for automated termination and resumption in a time zero backup copy process.
Walter A. Hubis ; William G. Deitz, Method and system for controlling access share storage devices in a network environment by configuring host-to-volume mapping data structures in the controller memory for granting and denying access .
Chefalas, Thomas E.; Mastrianni, Steven J., Method and system for processing backup data associated with application, querying metadata files describing files accessed by the application.
Chron, Edward Gustav; Menon, Jaishankar Moothedath, Method and system for providing consistent data modification information to clients in a storage system.
Aoyama Yuki,JPX ; Takahashi Toru,JPX ; Wakayama Satoshi,JPX, Method of and an apparatus for displaying version information and configuration information and a computer-readable recording medium on which a version and configuration information display program i.
Wahlert, Brian M; Berkowitz, Brian T; van Ingen, Catharine; Rangegowda, Dharshan; Jazayeri, Mike, Method, system, and apparatus for creating saved searches and auto discovery groups for a data protection system.
Wolfgang, John Jay; Boyd, Kenneth Wayne; Day, III, Kenneth Fairclough; Doatmas, Philip Matthew; Dahman, Kirby Grant, Method, system, and program for data synchronization between a primary storage device and a secondary storage device by determining whether a first identifier and a second identifier match, where a unique identifier is associated with each portion of data.
Palliyil, Sudarshan; Venkateshamurthy, Shivakumara; Vijayaraghavan, Srinivas Belur; Aswathanarayana, Tejasvi, Methods, apparatus and computer programs for enhanced access to resources within a network.
Crescenti,John; Kavuri,Srinivas; Oshinsky,David Alan; Prahlad,Anand, Modular backup and retrieval system used in conjunction with a storage area network.
Pisello Thomas (De Bary FL) Crossmier David (Casselberry FL) Ashton Paul (Oviedo FL), Network management system having virtual catalog overview of files distributively stored across network domain.
Sawdon, Wayne A.; Haskin, Roger L.; Schmuck, Frank B.; Wyllie, James C., Plurality of file systems using weighted allocation to allocate space on one or more storage devices.
Bruce, Buford L.; Kim, Peter C.; Levi, Michael; Silliman, Albert; Wissmann, Joseph T.; Zaremba, Christopher, Providing archiving of individual mail content while maintaining a single copy mail store.
Prahlad, Anand; May, Andreas; Lunde, Norman R.; Zhou, Lixin; Kumar, Avinash; Ngo, David, Snapshot storage and management system with indexing and user interface.
Prahlad, Anand; May, Andreas; Lunde, Norman R.; Zhou, Lixin; Kumar, Avinash; Ngo, David, Snapshot storage and management system with indexing and user interface.
Crockett Robert N. (Tucson AZ) Kern Ronald M. (Tucson AZ) Micka William F. (Tucson AZ), Software directed microcode state save for distributed storage controller.
Ting, Daniel; Zheng, Ling; Manley, Stephen L.; DeStefano, John Frederick, System and method for managing data deduplication of storage systems utilizing persistent consistency point images.
Ting, Daniel; Zheng, Ling; Manley, Stephen L.; DeStefano, John Frederick, System and method for managing data deduplication of storage systems utilizing persistent consistency point images.
Mutalik Madhav ; Senie Faith M., System and method for performing file-handling operations in a digital data processing system using an operating system-independent file map.
Huang,Jau Hsiung; Tseng,Wei Hsin; Chou,Hung Te; Weng,Yung Chiuan, System and method for providing access to computer files across computer operating systems.
Prahlad, Anand; Gokhale, Parag; Kottomtharayil, Rajiv; Retnamma, Manoj K. Vijayan; Attarde, Deepak R., System and method for storing redundant information.
Prahlad, Anand; Gokhale, Parag; Kottomtharayil, Rajiv; Vijayan Retnamma, Manoj K.; Attarde, Deepak R., System and method for storing redundant information.
Moulton, Gregory Hagan, System and method for unorchestrated determination of data sequences using sticky byte factoring to determine breakpoints in digital sequences.
Patel, Sujal M.; Mikesell, Paul A., System and methods for providing a distributed file system utilizing metadata to track information about data stored throughout the system.
Huai ReiJane (Old Brookville NY) Daly Robert (Ronkonkoma NY) Curti Walter (Dix Hills NY) Mohan Deepak (Huntington NY) Chueh James Kuang-Ru (Bayside NY) Louie Larry (Forest Hills NY), System and parallel streaming and data stripping to back-up a network.
Frasier, Lawrence Martin; Resino, Robert George, System for adjusting resource allocation to a logical partition based on rate of page swaps and utilization by changing a boot configuration file.
Stoppani ; Jr. Peter (Woodinville WA), System for allocating storage spaces based upon required and optional service attributes having assigned piorities.
Sim-Tang, Siew Yong; Fraisl, Daniel J., System for moving real-time data events across a plurality of devices in a network for simultaneous data protection, replication, and access services.
Flynn Rex A. (Belmont MA) Anick Peter G. (Marlboro MA), System for reconstructing prior versions of indexes using records indicating changes between successive versions of the.
Saether Christian D. (Seattle WA) Stoppani ; Jr. Peter (Woodinville WA), System of device independent file directories using a tag between the directories and file descriptors that migrate with.
Prahlad, Anand; Schwartz, Jeremy A.; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for classifying and transferring information in a storage network.
Prahlad, Anand; Schwartz, Jeremy A.; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for classifying and transferring information in a storage network.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.