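Country / Type | United States (US) Patent, Registered |
---|---|
International Patent Classification (IPC, 7th edition) |
|
Application number | US-0992408 (2016-01-11) |
Registration number | US-10061535 (2018-08-28) |
Inventor / Address |
|
Applicant / Address |
|
Agent / Address |
|
Citation information | Times cited: 0; Patents cited: 270 |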
A method and system for reducing storage requirements and speeding up storage operations by reducing the storage of redundant data includes receiving a request that identifies one or more data objects to which to apply a storage operation. For each data object, the storage system determines if the data object contains data that matches another data object to which the storage operation was previously applied. If the data objects do not match, then the storage system performs the storage operation in a usual manner. However, if the data objects do match, then the storage system may avoid performing the storage operation.
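The matching step described in the abstract can be illustrated with a minimal Python sketch. It assumes that "matches" means equal SHA-256 digests, which is only one possible reading; the class name `SingleInstanceStore` and its methods are hypothetical and do not come from the patent.

```python
import hashlib


class SingleInstanceStore:
    """Hypothetical store that skips the storage operation for content seen before."""

    def __init__(self):
        # Maps a content digest to the name of the first object stored with it.
        self._seen = {}

    def store(self, name: str, data: bytes) -> bool:
        """Return True if the data was stored, False if the operation was skipped."""
        digest = hashlib.sha256(data).hexdigest()  # assumption: SHA-256 decides a match
        if digest in self._seen:
            # Matching object already handled: avoid repeating the storage operation.
            return False
        self._seen[digest] = name
        return True

    def first_instance(self, data: bytes):
        """Return the name of the first object stored with this content, or None."""
        return self._seen.get(hashlib.sha256(data).hexdigest())
```

Calling `store` twice with the same bytes under different names stores the content once; the second call returns `False`, which corresponds to the "may avoid performing the storage operation" branch.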
1. A method performed by a computer system of storing a single-instance copy on a sequential storage medium, wherein the single instance copy is created from copies of original data objects, the method comprising: receiving or accessing multiple data objects from a computer network; wherein some of the multiple data objects are substantially identical according to a hashing algorithm; storing, on a random-access storage medium, a single-instance copy of the multiple data objects; wherein the single-instance copy contains a copy of only one of the substantially identical data objects; and wherein the random-access storage medium includes at least one reference to the copy of the only one of the substantially identical data objects; storing the single-instance copy of the multiple data objects on a sequential storage medium by: transferring the copy of the only one of the substantially identical data objects from the random-access storage medium to the sequential storage medium; and transferring the at least one reference to the copy of the substantially identical data objects from the random-access storage medium to the sequential storage medium after the copy of the only one of the substantially identical data objects is stored on the sequential storage medium.
2. The method of claim 1, wherein receiving or accessing multiple data objects from a computer network includes receiving or accessing multiple data objects from multiple, different logical locations within a computer network, wherein the single-instance copy contains information identifying the logical locations, and wherein transferring the at least one reference to the copy of the substantially identical data objects from the random-access storage medium to the sequential storage medium after the copy of the only one of the substantially identical data objects is stored on the sequential storage medium includes storing the information identifying the logical locations.
3. The method of claim 1, wherein storing the at least one reference to the copy of the substantially identical data objects from the random-access storage medium to the sequential storage medium after the copy of the only one of the substantially identical data objects is stored on the sequential storage medium includes storing a reference count to track a number of references that refer to the copy of one of the substantially identical data objects.
4. The method of claim 1, wherein a reference to the copy of one of the substantially identical data objects comprises a media identifier identifying a storage medium on which the copy is stored and an offset within the identified storage medium to the copy.
5. The method of claim 1, further comprising maintaining an index on the random-access storage medium, wherein the index comprises, for each of the multiple data objects: an identifier of the data object, information indicating whether the data object is stored as a copy or a reference to a copy, and an identifier of a source copy when the data object is stored as a reference to the source copy.
6. The method of claim 1, wherein storing, on the random-access storage medium, the single-instance copy of the multiple data objects includes storing the single-instance copy of the multiple data objects using an index that comprises, for each of the multiple data objects: an identifier of the data object, information indicating whether the data object is stored as a copy or a reference to a copy, and an identifier of a source copy when the data object is stored as a reference to the source copy.
7. The method of claim 1, wherein receiving or accessing multiple data objects from a computer network includes receiving multiple data objects from multiple, different logical locations within a computer network, and wherein the single-instance copy contains information identifying the logical locations, further comprising: transferring, before transferring the information identifying the logical locations on the sequential storage medium, an index that comprises, for each of the multiple data objects: an identifier of the data object, information indicating whether the data object is stored as a copy or a reference to a copy, and an identifier of a source copy when the data object is stored as a reference to the source copy.
8. The method of claim 1, wherein at least some of the multiple data objects are of different types or formats, and wherein the different types or formats correspond to documents, email messages, and configuration settings.
9. A method performed by a computer system of storing a de-duplicated copy of data objects on a sequential storage medium, comprising: receiving one or more data objects in a hierarchy, wherein some of the data objects are identified as identical based on hashing; storing, on a random-access storage medium, a de-duplicated copy of the one or more data objects, wherein the de-duplicated copy contains information describing: a first instance of each of the one or more data objects, and one or more references to the one or more first instances as stored on the random-access storage medium; and transferring the de-duplicated copy of the one or more data objects from the random-access storage medium to a sequential storage medium for storage on the sequential storage medium.
10. The method of claim 9, wherein the data objects are of at least two different object types, and wherein the different types or formats correspond to documents, email messages, or configuration settings.
11. The method of claim 9, wherein at least one of the data objects has an archive file format.
12. A non-transitory computer-readable medium containing instructions for controlling a computer system to execute a method of storing a copy of data objects on a sequential storage medium, the method comprising: receiving or accessing multiple data objects from a computer network; wherein some of the multiple data objects are substantially identical according to a hashing algorithm; storing, on a random-access storage medium, a single-instance copy of the multiple data objects; wherein the single-instance copy contains a copy of only one of the substantially identical data objects; and wherein the random-access storage medium includes at least one reference to the copy of the only one of the substantially identical data objects; storing the single-instance copy of the multiple data objects on a sequential storage medium by: transferring the copy of the only one of the substantially identical data objects from the random-access storage medium to the sequential storage medium; and transferring the at least one reference to the copy of the substantially identical data objects from the random-access storage medium to the sequential storage medium after the copy of the only one of the substantially identical data objects is stored on the sequential storage medium.
13. The non-transitory computer-readable medium of claim 12, wherein receiving or accessing multiple data objects from a computer network includes receiving or accessing multiple data objects from multiple, different logical locations within a computer network, wherein the single-instance copy contains information identifying the logical locations, and wherein transferring the at least one reference to the copy of the substantially identical data objects from the random-access storage medium to the sequential storage medium after the copy of the only one of the substantially identical data objects is stored on the sequential storage medium includes storing the information identifying the logical locations.
14. The non-transitory computer-readable medium of claim 12, wherein storing the at least one reference to the copy of the substantially identical data objects from the random-access storage medium to the sequential storage medium after the copy of the only one of the substantially identical data objects is stored on the sequential storage medium includes storing a reference count to track a number of references that refer to the copy of one of the substantially identical data objects.
15. The non-transitory computer-readable medium of claim 12, wherein a reference to the copy of one of the identified as identical data objects comprises a media identifier identifying a storage medium on which the copy is stored and an offset within the identified storage medium to the copy.
16. The non-transitory computer-readable medium of claim 12, further comprising maintaining an index on the random-access storage medium, wherein the index comprises, for each of the multiple data objects: an identifier of the data object, information indicating whether the data object is stored as a copy or a reference to a copy, and an identifier of a source copy when the data object is stored as a reference to the source copy.
17. The non-transitory computer-readable medium of claim 12, wherein storing, on the random-access storage medium, the single-instance copy of the multiple data objects includes storing the single-instance copy of the multiple data objects using an index that comprises, for each of the multiple data objects: an identifier of the data object, information indicating whether the data object is stored as a copy or a reference to a copy, and an identifier of a source copy when the data object is stored as a reference to the source copy.
18. The non-transitory computer-readable medium of claim 12, wherein receiving or accessing multiple data objects from a computer network includes receiving multiple data objects from multiple, different logical locations within a computer network, and wherein the single-instance copy contains information identifying the logical locations, further comprising: transferring, before transferring the information identifying the logical locations on the sequential storage medium, an index that comprises, for each of the multiple data objects: an identifier of the data object, information indicating whether the data object is stored as a copy or a reference to a copy, and an identifier of a source copy when the data object is stored as a reference to the source copy.
19. The non-transitory computer-readable medium of claim 12, wherein at least some of the multiple data objects are of different types or formats, and wherein the different types or formats correspond to documents, email messages, and configuration settings.
20. The non-transitory computer-readable medium of claim 12, wherein the sequential storage medium includes a magnetic tape.
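The index, reference, and transfer-ordering elements recited in the claims can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: SHA-256 stands in for the unspecified hashing algorithm, a `bytearray` stands in for the sequential medium, and every name here (`IndexEntry`, `Reference`, `build_index`, `write_sequential`, the media identifier `TAPE-0001`) is hypothetical.

```python
import hashlib
from dataclasses import dataclass
from typing import Optional


@dataclass
class IndexEntry:
    object_id: str                    # identifier of the data object
    is_reference: bool                # stored as a copy (False) or as a reference (True)
    source_id: Optional[str] = None   # source copy, set only for references


@dataclass
class Reference:
    object_id: str
    media_id: str   # which storage medium holds the copy
    offset: int     # byte offset of the copy within that medium


def build_index(objects):
    """Stage objects on the random-access side: keep one copy per distinct content."""
    first_by_digest = {}
    index, payloads = [], {}
    for object_id, data in objects:
        digest = hashlib.sha256(data).hexdigest()  # assumption: SHA-256 as the hash
        source = first_by_digest.get(digest)
        if source is None:
            first_by_digest[digest] = object_id
            payloads[object_id] = data
            index.append(IndexEntry(object_id, False))
        else:
            index.append(IndexEntry(object_id, True, source))
    return index, payloads


def write_sequential(index, payloads, media_id="TAPE-0001"):
    """Append every copy first, then derive references that point at stored copies."""
    tape = bytearray()
    offsets = {}
    for entry in index:
        if not entry.is_reference:
            offsets[entry.object_id] = len(tape)
            tape += payloads[entry.object_id]
    # References are built only after all copies are on the medium.
    references = [
        Reference(e.object_id, media_id, offsets[e.source_id])
        for e in index if e.is_reference
    ]
    ref_counts = {}
    for e in index:
        if e.is_reference:
            ref_counts[e.source_id] = ref_counts.get(e.source_id, 0) + 1
    return bytes(tape), references, ref_counts
```

In this sketch the references and reference counts are returned rather than serialized onto the medium; the point is only the ordering, where each reference is resolved to a media identifier and offset after the copy it refers to has been written.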
Copyright KISTI. All Rights Reserved.