IPC분류정보
국가/구분 |
United States(US) Patent
등록
|
국제특허분류(IPC7판) |
|
출원번호 |
UP-0330197
(2008-12-08)
|
등록번호 |
US-7852237
(2011-02-10)
|
발명자
/ 주소 |
- McCanne, Steven
- Demmer, Michael J.
|
출원인 / 주소 |
- Riverbed Technology, Inc.
|
대리인 / 주소 |
Townsend & Townsend and Crew LLP
|
인용정보 |
피인용 횟수 :
4 인용 특허 :
34 |
초록
▼
In a coding system, input data within a system is encoded. The input data might include sequences of symbols that repeat in the input data or occur in other input data encoded in the system. The encoding includes determining a target segment size, determining a window size, identifying a fingerprint
In a coding system, input data within a system is encoded. The input data might include sequences of symbols that repeat in the input data or occur in other input data encoded in the system. The encoding includes determining a target segment size, determining a window size, identifying a fingerprint within a window of symbols at an offset in the input data, determining whether the offset is to be designated as a cut point and segmenting the input data as indicated by the set of cut points. For each segment so identified, the encoder determines whether the segment is to be a referenced segment or an unreferenced segment, replacing the segment data of each referenced segment with a reference label and storing a reference binding in a persistent segment store for each referenced segment, if needed. Hierarchically, the process can be repeated by grouping references into groups, replacing the grouped references with a group label, storing a binding between the grouped references and group label, if one is not already present, and repeating the process. The number of levels of hierarchy can be fixed in advanced or it can be determined from the content encoded.
대표청구항
▼
What is claimed is: 1. A method comprising: compressing a data stream by: segmenting at least portions of the data stream into segments based on content of the data stream; responsive to a respective segment being a recurring segment, replacing the segment with a corresponding reference; and respon
What is claimed is: 1. A method comprising: compressing a data stream by: segmenting at least portions of the data stream into segments based on content of the data stream; responsive to a respective segment being a recurring segment, replacing the segment with a corresponding reference; and responsive to a respective segment not being a recurring segment, creating a reference for the segment and replacing the segment with the created reference; and storing the compressed data in a storage system. 2. The method of claim 1, wherein the size of the segments is not fixed. 3. The method of claim 1, wherein segmenting the data stream comprises generating a fingerprint of a group of data symbols. 4. The method of claim 3, wherein segmenting the data stream comprises determining a segmentation boundary based on a respective fingerprint. 5. The method of claim 3, wherein generating the fingerprint comprises generating a hash based on the group of data symbols. 6. The method of claim 1, wherein compressing the data stream further comprises compressing the references hierarchically. 7. The method of claim 6, wherein compressing the references hierarchically comprises assigning a hash to a sequence of hashes. 8. The method of claim 6, wherein compressing the references hierarchically comprises: segmenting a stream of references into groups; responsive to a respective group being a recurring group, replacing the group with a corresponding reference; and responsive to a respective group not being a recurring group, creating a reference for the group and replacing the group with the created reference. 9. The method of claim 6, wherein the compressed data stream includes references associated with different hierarchy levels. 10. The method of claim 1, further comprising storing segment-to-reference binding information in a persistent storage. 11. The method of claim 1, further comprising allowing an average segment size to be set. 12. A system, comprising: a segmentation mechanism configured to segment at least portions of a data stream into segments based on content of the data stream; a replacement mechanism configured to replace a respective segment with a corresponding reference in response to the segment being a recurring segment; and a binding mechanism configured to create a reference for a respective segment and replacing the segment with the created reference in response to the segment not being a recurring segment, thereby compressing the data stream. 13. The system of claim 12, wherein the size of the segments is not fixed. 14. The system of claim 12, wherein the segmentation mechanism comprises a fingerprinting mechanism configured to generate a fingerprint of a group of data symbols. 15. The system of claim 14, wherein the segmentation mechanism is configured to determine a segmentation boundary based on a respective fingerprint. 16. The system of claim 14, wherein while generating the fingerprint, the fingerprinting mechanism is configured to generate a hash based on the group of data symbols. 17. The system of claim 12, further comprising a mechanism to hierarchically assign references to sequences of references. 18. The system of claim 17, wherein the references comprise hashes. 19. The system of claim 12, wherein the segmentation mechanism is further configured to segment a stream of references into groups; wherein the replacement mechanism is further configured to replacing a respective group with a corresponding reference responsive to the group being a recurring group; and wherein the binding mechanism is further configured to create a reference for a respective group and replacing the group with the created reference responsive to the group not being a recurring group. 20. The system of claim 17, wherein the compressed data stream includes references associated with different hierarchy levels. 21. The system of claim 12, wherein the binding mechanism is further configured to store segment-to-reference binding information in a persistent storage. 22. The system of claim 12, wherein the segmentation mechanism is further configured to allow an average segment size to be set. 23. A method comprising: receiving a file or a portion of a file; compressing the file or portion thereof by: at least partially segmenting the file or portion thereof into segments based on content of the file; responsive to a respective segment being a recurring segment, replacing the segment with a corresponding reference; responsive to a respective segment not being a recurring segment, creating a reference for the segment and replacing the segment with the created reference; and saving the file or portion thereof to a storage system. 24. A method comprising: compressing a file by: segmenting at least portions of the file into segments based on content of the file; responsive to a respective segment being a recurring segment, replacing the segment with a corresponding reference; and responsive to a respective segment not being a recurring segment, creating a reference for the segment and replacing the segment with the created reference; and storing the compressed file in a storage system. 25. The method of claim 24, wherein the size of the segments is not fixed. 26. The method of claim 24, wherein segmenting the file comprises generating a fingerprint of a group of data symbols. 27. The method of claim 26, wherein segmenting the file comprises determining a segmentation boundary based on a respective fingerprint. 28. The method of claim 26, wherein generating the fingerprint comprises generating a hash based on the group of data symbols. 29. The method of claim 24, wherein compressing the file further comprises compressing the references hierarchically. 30. The method of claim 29, wherein compressing the references hierarchically comprises assigning a hash to a sequence of hashes. 31. The method of claim 29, wherein compressing the references hierarchically comprises: segmenting a stream of references into groups; responsive to a respective group being a recurring group, replacing the group with a corresponding reference; and responsive to a respective group not being a recurring group, creating a reference for the group and replacing the group with the created reference. 32. The method of claim 29, wherein the compressed file includes references associated with different hierarchy levels. 33. The method of claim 24, further comprising storing segment-to-reference binding information in a persistent storage. 34. The method of claim 24, further comprising allowing an average segment size to be set.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.