Creation of synthetic backups within deduplication storage system
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G06F-007/00
G06F-017/00
G06F-017/30
G06F-011/14
출원번호
US-0801756
(2013-03-13)
등록번호
US-9697222
(2017-07-04)
발명자
/ 주소
Aronovich, Lior
Hirsch, Michael
Toaff, Yair
출원인 / 주소
INTERNATIONAL BUSINESS MACHINES CORPORATION
대리인 / 주소
Griffiths & Seaton PLLC
인용정보
피인용 횟수 :
0인용 특허 :
31
초록▼
A deduplication storage system and a backup application create a synthetic backup. Metadata instructions are provided to the deduplication storage system. Each of the metadata instructions specifies the data segment of an originating backup and a designated location of the data segment in the synthe
A deduplication storage system and a backup application create a synthetic backup. Metadata instructions are provided to the deduplication storage system. Each of the metadata instructions specifies the data segment of an originating backup and a designated location of the data segment in the synthetic backup. Each of the metadata instructions are processed by locating those data sub-segments in the deduplication storage system specified by the data segment in each of the metadata instructions, and creating metadata references to each of the data sub-segments and adding the metadata references to metadata of the synthetic backup being created.
대표청구항▼
1. A method for creating a synthetic backup by a deduplication storage system and a backup application using a processor device, comprising: providing the deduplication storage system a plurality of metadata instructions, wherein each of the plurality of metadata instructions specifies a data segmen
1. A method for creating a synthetic backup by a deduplication storage system and a backup application using a processor device, comprising: providing the deduplication storage system a plurality of metadata instructions, wherein each of the plurality of metadata instructions specifies a data segment of an originating backup and a designated location of the data segment in the synthetic backup;processing each of the plurality of metadata instructions by each of:partitioning each data segment input into each of a plurality of fixed-sized data sub-segments, each sub-segment referencing a plurality of stored sub-segments,for each of the plurality of data sub-segments, during the construction of the synthetic backup, calculating each of a plurality of input deduplication digests based on a retrieved plurality of stored deduplication digests,locating those of the plurality of data sub-segments in the deduplication storage system specified by the data segment in each of the plurality of metadata instructions, andcreating metadata references to each of the plurality of data sub-segments and adding the metadata references to metadata of the synthetic backup being created wherein the references includes physical and logical data patterns;constructing the synthetic backup by consolidating the plurality of metadata instructions that reference adjacent backup data segments into a single metadata instruction;wherein the synthetic backup includes data from already existing full backup and its subsequent incremental backups dating until a specific point in time. 2. The method of claim 1, further including using a plurality of metadata calculations and metadata operations to process each of the plurality of metadata instructions. 3. The method of claim 1, further including incorporating a plurality of storage data sub-segments in the deduplication storage system in all data segments of a stored backup. 4. The method of claim 3, further including performing one of: associating a backup data segment with one of the plurality of storage data sub-segments during a deduplication process, andstoring association information of the backup data segment with one of the plurality of storage data sub-segments in metadata associated with the backup data segment. 5. The method of claim 4, further including performing reference counts in a granularity of the plurality of storage data sub-segments. 6. The method of claim 5, further including incrementing values of the reference counts of the plurality of storage data sub-segments that are referenced by the data segment specified in each of the plurality of metadata instructions in accordance with a number of new references to each of the plurality of storage data sub-segments. 7. A system for creating a synthetic backup by a deduplication storage system and a backup application, comprising: the deduplication storage system;at least one processor device, operable in the deduplication computing storage environment, wherein the at least one processor device:provides the deduplication storage system a plurality of metadata instructions, wherein each of the plurality of metadata instructions specifies a data segment of an originating backup and a designated location of the data segment in the synthetic backup, processes each of the plurality of metadata instructions by each of:partitioning each data segment input into each of a plurality of fixed-sized data sub-segments, each sub-segment referencing a plurality of stored sub-segments,for each of the plurality of data sub-segments, during the construction of the synthetic backup, calculating each of a plurality of input deduplication digests based on a retrieved plurality of stored deduplication digests,locating those of the plurality of data sub-segments in the deduplication storage system specified by the data segment in each of the plurality of metadata instructions, andcreating metadata references to each of the plurality of data sub-segments and adding the metadata references to metadata of the synthetic backup being created wherein the references includes physical and logical data patterns,constructing the synthetic backup by consolidates the plurality of metadata instructions that reference adjacent backup data segments into a single metadata instruction;wherein the synthetic backup includes data from already existing full backup and its subsequent incremental backups dating until a specific point in time. 8. The system of claim 7, wherein the at least one processor device uses a plurality of metadata calculations and metadata operations to process each of the plurality of metadata instructions. 9. The system of claim 7, wherein the at least one processor device incorporates a plurality of storage data sub-segments in the deduplication storage system all data segments of a stored backup. 10. The system of claim 9, wherein the at least one processor device performs one of: associating a backup data segment with one of the plurality of storage data sub-segments during a deduplication process, andstoring association information of the backup data segment with one of the plurality of storage data sub-segments in metadata associated with the backup data segment. 11. The system of claim 10, wherein the at least one processor device performs reference counts in a granularity of the plurality of storage data sub-segments. 12. The system of claim 11, wherein the at least one processor device increments values of the reference counts of the plurality of storage data sub-segments that are referenced by the data segment specified in each of the plurality of metadata instructions in accordance with a number of new references to each of the plurality of storage data sub-segments. 13. A computer program product for creating a synthetic backup by a deduplication storage system and a backup application using at least one processor device, the computer program product comprising a non-transitory computer-readable storage medium having computer-readable program code portions stored therein, the computer-readable program code portions comprising: an executable portion that provides the deduplication storage system a plurality of metadata instructions, wherein each of the plurality of metadata instructions specifies a data segment of an originating backup and a designated location of the data segment in the synthetic backup;an executable portion that processes each of the plurality of metadata instructions by each of:partitioning each data segment input into each of a plurality of fixed-sized data sub-segments, each sub-segment referencing a plurality of stored sub-segments,for each of the plurality of data sub-segments, during the construction of the synthetic backup, calculating each of a plurality of input deduplication digests based on a retrieved plurality of stored deduplication digests,locating those of the plurality of data sub-segments in the deduplication storage system specified by the data segment in each of the plurality of metadata instructions, andcreating metadata references to each of the plurality of data sub-segments and adding the metadata references to metadata of the synthetic backup being created wherein the references includes physical and logical data patterns;an executable portion that constructing the synthetic backup by consolidates the plurality of metadata instructions that reference adjacent backup data segments into a single metadata instruction;wherein the synthetic backup includes data from already existing full backup and its subsequent incremental backups dating until a specific point in time. 14. The computer program product of claim 13, further including an executable portion that uses a plurality of metadata calculations and metadata operations to process each of the plurality of metadata instructions. 15. The computer program product of claim 13, further including an executable portion that incorporates a plurality of storage data sub-segments in the deduplication storage system in all data segments of a stored backup. 16. The computer program product of claim 15, further including an executable portion that performs one of: associating a backup data segment with one of the plurality of storage data sub-segments during a deduplication process, andstoring association information of the backup data segment with one of the plurality of storage data sub-segments in metadata associated with the backup data segment. 17. The computer program product of claim 16, further including an executable portion that performs reference counts in a granularity of the plurality of storage data sub-segments. 18. The computer program product of claim 17, further including an executable portion that increments values of the reference counts of the plurality of storage data sub-segments, that are referenced by the data segment specified in each of the plurality of metadata instructions, in accordance with a number of new references to each of the plurality of storage data sub-segments.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (31)
Nakao, Yoshio, Apparatus and method for generating digest according to hierarchical structure of topic.
Yuval Ofek ; Zoran Cakeljic ; Samuel Krikler IL; Sharon Galtzur IL; Michael Hirsch IL; Dan Arnon ; Peter Kamvysselis, Apparatus and methods for copying, backing up, and restoring data using a backup segment size larger than the storage block size.
McCanne, Steven; Demmer, Michael J., Content-based segmentation scheme for data compression in storage and transmission including hierarchical segment representation.
McGrattan, Emma K.; Ball, Stephen; Moucaddem, Sami R.; Rivet, Jean-Francois; Kuo, Chin L.; Yang, Frank H., Method and apparatus for data backup using data blocks.
Woodhill James R. (Houston TX) Woodhill Louis R. (Richmond TX) More ; Jr. William Russell (Houston TX) Berlin Jay Harris (Houston TX), System and method for distributed storage management on networked computer systems using binary object identifiers.
Wittenberg David K. (Hudson MA) Leichter Jerrold S. (Stamford CT), System for controlling access to a secure system by verifying acceptability of proposed password by using hashing and gr.
Hirsch, Michael; Bitner, Haim; Aronovich, Lior; Asher, Ron; Bachmat, Eitan; Klein, Shmuel T., Systems and methods for efficient data searching, storage and reduction.
Raizen, Helen S.; Bappe, Michael E.; Nikolaevich, Agarkov Vadim; Biester, William Carl; Ruef, Richard; Owen, Karl M., Systems and methods for using thin provisioning to reclaim space identified by data reduction processes.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.