[특허]Efficient construction of synthetic backups within deduplication storage system

Efficient construction of synthetic backups within deduplication storage system 원문보기

IPC분류정보
국가/구분	United States(US) Patent 등록
국제특허분류(IPC7판)	G06F-007/00
출원번호	US-0488180 (2012-06-04)
등록번호	US-8682854 (2014-03-25)
발명자 / 주소	Aronovich, Lior Hirsch, Michael Toaff, Yair
출원인 / 주소	International Business Machines Corporation
대리인 / 주소	Griffiths & Seaton PLLC
인용정보	피인용 횟수 : 1 인용 특허 : 15

초록 ▼

A deduplication storage system enables new input data to be deduplicated with data of synthetic backups already constructed, and for this purpose efficiently calculates deduplication digests for synthetic backups being constructed, based on already existing digests of data referenced by the synthetic backups. For each input data segment of the plurality of input data segments of a synthetic backup being constructed, a plurality of deduplication digests of stored data segments, referenced by the input data segment, is retrieved from an index. Each input data segment is partitioned into each of a plurality of fixed-sized data sub-segments. A calculation is performed producing a deduplication digest for a data sub-segment, where the calculation is based on the retrieved deduplication digests of the plurality of stored data sub-segments referenced by the input data sub-segment.

대표청구항 ▼

1. For a plurality of new input data segments in a deduplication storage system, a method of facilitating construction of a synthetic backup by a processor device, the synthetic backup being independent of and constructed from an originating backup being a full, existing backup, the method comprising: for each new input data segment of the plurality of new input data segments, retrieving a plurality of stored deduplication digests of stored data segments, referenced by the new input data segments, the stored data segments being data taken from the originating backup, and the plurality of stored deduplication digests being deduplication digests calculated from the stored data segments,partitioning each new input data segment into each of a plurality of fixed-sized data sub-segments,for each of the plurality of data sub-segments, during the construction of the synthetic backup, calculating each of a plurality of input deduplication digests based on the retrieved plurality of stored deduplication digests,aggregating each of the plurality of sub-segment deduplication digests to generate a deduplication digest of each new input data segment,searching the plurality of stored deduplication digests of the stored data segments for matches with the deduplication digest of each new input data segment to thereby deduplicate each new input data segment, andforming a deduplication digest of the synthetic backup from the deduplication digests of each new input data segment. 2. The method of claim 1, further including subsequent to the searching, storing the plurality of input deduplication digests in the index, wherein a stored deduplication digest of the plurality of stored deduplication digests matched with an input deduplication digest of the plurality of input deduplication digests may be displaced by the input deduplication digest. 3. The method of claim 1, further comprising: creating a metadata file in the deduplicated storage system, andoptimizing successive storage instructions. 4. The method of claim 3, further including, for each optimized storage instruction: retrieving a metadata segment associated with the new input data segment indicated by the optimized storage instruction,adjusting the metadata segment to reference solely the new input data segment,copying the adjusted metadata segment to the metadata file of the synthetic backup, andfor each storage block referenced by the metadata segment, incrementing a reference count value. 5. The method of claim 1, wherein calculating each of a plurality of sub-segment deduplication digests includes: calculating a hash value for each block in the plurality of new input data segments in byte offsets,arranging a selected plurality of maximal hash values in descending order according to an order of significance,identifying blocks in determined positions relative to the blocks associated with the maximal hash values as shifted blocks,selecting a subset of the hash values of the shifted blocks for a first distinguishing characteristic of the plurality of input data sub-segments, andselecting an additional subset of the hash values of the shifted blocks, for a second distinguishing characteristic of the plurality of new input data segments. 6. The method of claim 5, further including configuring a distinguishing characteristics (DC) index for the plurality of new input data segments for storing the second distinguishing characteristic, and configuring a storage identifiers (SI) index for the plurality of input data sub-segments for storing the first distinguishing characteristic. 7. The method of claim 6, further including calculating the first and second distinguishing characteristics.

이 특허에 인용된 특허 (15)

Laffin, Aaron Wallace, Creating synthetic backup images on a remote computer system.
상세보기
Farber, David A.; Lachman, Ronald D., De-duplication of data in a data processing system.
상세보기
Zhu,Ming Benjamin; Li,Kai; Patterson,R. Hugo, Efficient data storage system.
상세보기
David A. Farber ; Ronald D. Lachman, Identifying and requesting data in network using identifiers which are based on contents of data.
상세보기
Vaikar, Amol Manohar, Method and apparatus for continuous data protection.
상세보기
McGrattan, Emma K.; Ball, Stephen; Moucaddem, Sami R.; Rivet, Jean-Francois; Kuo, Chin L.; Yang, Frank H., Method and apparatus for data backup using data blocks.
상세보기
Ralph Shnelvar, Method and apparatus for storing information in a data processing system.
상세보기
Van Ingen, Catharine; Berkowitz, Brian T., Method and system for synthetic backup and restore.
상세보기
Stringham, Russell, Methods and systems for creating full backups.
상세보기
Efstathopoulos, Petros; Guo, Fanglu; Shah, Dharmesh, Progressive sampling for deduplication indexing.
상세보기
Zeis, Mike; Wu, Weibao, Source classification for performing deduplication in a backup operation.
상세보기
Narayanan, Priyesh, Synthetic differential backups creation for a database using binary log conversion.
상세보기
Niles,Ronald S.; Lam,Wai, System and method for backing up data.
상세보기
Woodhill James R. (Houston TX) Woodhill Louis R. (Richmond TX) More ; Jr. William Russell (Houston TX) Berlin Jay Harris (Houston TX), System and method for distributed storage management on networked computer systems using binary object identifiers.
상세보기
Wittenberg David K. (Hudson MA) Leichter Jerrold S. (Stamford CT), System for controlling access to a secure system by verifying acceptability of proposed password by using hashing and gr.
상세보기

이 특허를 인용한 특허 (1)

Pang, Hung Hing Anthony; Botelho, Fabiano; Ekambaram, Dhanabal; Garg, Nitin, Opportunistic fragmentation repair.
상세보기

내보내기 메뉴

내보내기 구분

파일저장
인쇄
메일전송

구성항목

기본정보
상세정보

관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC

저장형식

Text(ASCII format)
Excel format
PIAS분석(.xls)

메일정보

받는사람 (필수): @
보내는사람 (선택): @
제목
내용: KISTI 검색결과 이메일 서비스

안내

총 건의 자료가 검색되었습니다.

다운받으실 자료의 인덱스를 입력하세요. (1-10,000)

검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다.

데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요)

다운로드 파일은 UTF-8 형태로 저장됩니다.
파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오.

Text(ASCII format)
Excel format

AI-Helper ※ AI-Helper는 을 사용합니다.

AI-Helper

안녕하세요, AI-Helper입니다. 좌측 "선택된 텍스트"에서 텍스트를 선택하여 요약, 번역, 용어설명을 실행하세요.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.

IPC	Description
A	생활필수품
A62	인명구조; 소방(사다리 E06C)
A62B	인명구조용의 기구, 장치 또는 방법(특히 의료용에 사용되는 밸브 A61M 39/00; 특히 물에서 쓰이는 인명구조 장치 또는 방법 B63C 9/00; 잠수장비 B63C 11/00; 특히 항공기에 쓰는 것, 예. 낙하산, 투출좌석 B64D; 특히 광산에서 쓰이는 구조장치 E21F 11/00)
A62B-1/08	.. 윈치 또는 풀리에 제동기구가 있는 것

연합인증

Efficient construction of synthetic backups within deduplication storage system 원문보기