Data classification systems and methods for organizing a metabase
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G06F-007/00
G06F-017/00
G06F-003/00
G06F-009/44
G06F-009/46
G06F-013/00
출원번호
UP-0605931
(2006-11-28)
등록번호
US-7849059
(2011-01-31)
발명자
/ 주소
Prahlad, Anand
Schwartz, Jeremy Alan
Ngo, David
Brockway, Brian
Muller, Marcus S.
출원인 / 주소
CommVault Systems, Inc.
대리인 / 주소
Knobbe, Martens, Olson & Bear, LLP
인용정보
피인용 횟수 :
37인용 특허 :
129
초록▼
Systems and methods for managing electronic data are disclosed. Various data management operations can be performed based on a metabase formed from metadata. Such metadata can be identified from an index of data interactions generated by a journaling module, and obtained from their associated data o
Systems and methods for managing electronic data are disclosed. Various data management operations can be performed based on a metabase formed from metadata. Such metadata can be identified from an index of data interactions generated by a journaling module, and obtained from their associated data objects stored in one or more storage devices. In various embodiments, such processing of the index and storing of the metadata can facilitate, for example, enhanced data management operations, enhanced data identification operations, enhanced storage operations, data classification for organizing and storing the metadata, cataloging of metadata for the stored metadata, and/or user interfaces for managing data. In various embodiments, the metabase can be configured in different ways. For example, the metabase can be stored separately from the data objects so as to allow obtaining of information about the data objects without accessing the data objects or a data structure used by a file system.
대표청구항▼
What is claimed is: 1. A method of identifying data to store in a metabase, the method comprising: monitoring with a journaling module, data interactions between at least one application and one or more of a plurality of data objects stored in a file system, wherein the journaling module is separat
What is claimed is: 1. A method of identifying data to store in a metabase, the method comprising: monitoring with a journaling module, data interactions between at least one application and one or more of a plurality of data objects stored in a file system, wherein the journaling module is separate from the application and wherein the journaling module populates an index with entries about the data interactions; displaying with one or more computer processors a user interface that allows a user to input a user-defined tag expression, wherein the user-defined tag expression comprises information associated with data interactions the user desires to track; tagging entries in the index that meet the user-defined tag expression, wherein the tagging associates a tag identifier with the entries; scanning entries in the index to identify at least a first entry from the index associated with the tag identifier, wherein the first entry corresponds to a first data interaction with a first data object meeting the tag expression; obtaining from the index first metadata about the data interaction associated with the first entry; accessing the first data object associated with the first entry, to obtain second metadata, wherein the second metadata comprises information about the first data object that the user desires to track; obtaining from the index third metadata, wherein the third metadata comprises the tag identifier associated with the first entry; updating a metabase with the first, second and third metadata such that the metabase associates the tag identifier with first metadata obtained from the index and the second metadata obtained from the first data object, wherein the metabase is stored separately from the first data object and separately from the file system containing the first data object, wherein said updating further comprises determining which of a plurality of metabases comprises records storing first, second or third metadata associated with the first data object; and in response to a user request for information about data interactions associated with the user-defined tag expression, accessing the first, the second or the third metadata in the metabase to determine data interactions that meet the user-defined tag expression without accessing either the plurality of data objects or the file system. 2. The method of claim 1, wherein information about the selected entry comprises information indicative of modifications to the first data object. 3. The method of claim 1, wherein the first, second and third metadata in the metabase is stored separately from the entire contents of the data objects. 4. The method of claim 1, further comprising accessing one or more of the first, second or third metadata associated with the data objects one or more times to update the metabase. 5. The method of claim 1, wherein said updating comprises: determining whether the selected entry in the index of data interactions has an existing record in the metabase; if no record exists corresponding to the selected entry, creating a new record in the metabase; and updating the existing record or the new record with at least a part of the information obtained from the selected entry. 6. The method of claim 1, wherein said selecting comprises determining whether the entry is a new entry in the index of data interactions. 7. The method of claim 6, wherein the entry is considered to be new if a time stamp of the entry is later than a time at which a previous entry was analyzed. 8. The method of claim 6, wherein the entry is considered to be new based on an identifier of the entry. 9. The method of claim 8, wherein the identifier comprises an update sequence number that identifies the entry in the index of data interactions. 10. The method of claim 1, further comprising initially populating the metabase by accessing the data objects so as to access available first, second or third metadata associated with the data objects. 11. The method of claim 10, additionally comprising: quiescing the data interactions associated with the at least one storage device; and performing said populating during said quiescing. 12. The method of claim 11, wherein said populating is performed during operation of the at least one storage device. 13. The method of claim 12, additionally comprising queuing the data interactions generated during said populating to allow capture of the data interactions during the accessing process. 14. The method of claim 1, additionally comprising receiving input regarding the user-defined tag expression, wherein said obtaining information is based at least in part on said user-defined tag expression. 15. A system for managing electronic data in a storage network, the system comprising: a journaling module executing in one or more processors that is configured to monitor data interactions between at least one application and one or more the plurality of data objects associated with a file system, wherein the journaling module is separate from the application and wherein the journaling module is further configured to populate an index with entries about the data interactions; a user interface executing in one or more computer processors, wherein the user interface allows a user to input a user-defined tag expression that comprises information associated with data interactions the user desires to track; a data classification module executing in one or more processors configured to: entries in the index that meet the user-defined tag expression, wherein the data classification module associates a tag identifier with the entries; scan entries in the index to identify at least a first entry from the index associated with the tag identifier, wherein the first entry corresponds to a first data interaction with a first data object meeting the tag expression; obtain from the index first metadata about the data interaction associated with the first entry; access the first data object associated with the first entry, to obtain second metadata, wherein the second metadata comprises information about the first data object that the user desires to track; obtain from the index third metadata, wherein the third metadata comprises the tag identifier associated with the first entry; update in a metabase the first, second and third metadata such that the metabase associates the tag identifier with the first metadata obtained from the index and the second metadata obtained from the first data object, wherein the metabase is stored separately from the first data object and separately from the file system containing the first data object, wherein said updating further comprises determining which of a plurality of metabases comprises records storing first, second or third metadata associated with the first data object; in response to a user request for information about data interactions associated with the user-defined tag expression, the data classification module is configured to access the first, the second or the third metadata in the metabase to determine data interactions that meet the user-defined tag expression without accessing either the plurality of data objects or the file system. 16. The system of claim 15, wherein the journal file is populated by a monitoring module. 17. The system of claim 15, wherein the data classification module is further configured to access the one or more data objects one or more times to update the metabase. 18. The system of claim 15, wherein the properties of the data objects are stored in the metabase separately from entire content of the data objects. 19. The system of claim 15, wherein the information obtained from the selected entry is indicative of modifications to metadata of the first data object resulting from the first data interaction. 20. The system of claim 19, wherein the first, second or third metadata comprises at least one of: a data owner, a last modified time, a last accessed time, a data object size and an application type. 21. The system of claim 15, wherein the data classification module is further configured to classify the one or more properties of the data object based on the user-defined tag expression. 22. The system of claim 15, wherein the data classification module is further configured to periodically scan the entries in the index. 23. The system of claim 22, wherein the data classification module is further configured to allow analysis of the one or more properties of the data objects based on a selected criteria without accessing the data objects.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (129)
Yuval Ofek ; Zoran Cakeljic ; Samuel Krikler IL; Sharon Galtzur IL; Michael Hirsch IL; Dan Arnon ; Peter Kamvysselis, Apparatus and methods for copying, backing up, and restoring data using a backup segment size larger than the storage block size.
Griffin David (Maynard MA) Campbell Jonathan (Acton MA) Reilly Michael (Sterling MA) Rosenbaum Richard (Pepperell MA), Arrangement with cooperating management server node and network service node.
Nakano Toshio (Odawara JPX) Nozawa Masafumi (Odawara JPX) Kurano Akira (Odawara JPX) Hisano Kiyoshi (Odawara JPX) Hoshino Masayuki (Odawara JPX), Backup control method and system in data processing system using identifiers for controlling block data transfer.
Kitajima Hiroyuki (Yokohama) Yamamoto Akira (Yokohama) Doi Takashi (Hadano) Nozawa Masafumi (Odawara JPX), Buffered peripheral system and method for backing up and retrieving data to and from backup memory device.
Cole Leo J. (Raleigh NC) Frantz Curtis J. (Durham NC) Lee Jeannette (Raleigh NC) Ordanic Zvonimir (Raleigh NC) Plank Larry K. (Rochester MN), Centralized management in a computer network.
Carpenter Kelly S. (Fremont CA) Dearing Gerard M. (San Jose CA) Nick Jeffrey M. (Fishkill NY) Strickland Jimmy P. (Saratoga CA) Swanson Michael D. (Poughkeepsie NY) Wilkinson Wendell W. (Hyde Park NY, Coherence controls for store-multiple shared data coordinated by cache directory entries in a shared electronic storage.
J. Paul Dourish ; John O. Lamping ; Thomas Rodden GB, Collaborative document management system with customizable filing structures that are mutually intelligible.
Eric C. Peters ; Stanley Rabinowitz ; Herbert R. Jacobs ; Richard Baker Gillett, Jr. ; Peter J. Fasciano, Computer system and process for transferring multiple high bandwidth streams of data between multiple storage units and multiple applications in a scalable and reliable manner.
Senator Steven T. ; Fuller Billy J., Computer system method and apparatus providing for various versions of a file without requiring data copy or log operati.
Fecteau Jean G. (Toronto NY CAX) Gdaniec Joseph M. (Vestal NY) Hennessy James P. (Endicott NY) MacDonald John F. (Vestal NY) Osisek Damian L. (Vestal NY), Computer system which supports asynchronous commitment of data.
Koseki, Michihiko; Yokoyama, Mamoru; Sumi, Masashi; Yamaguchi, Satoru; Taniwaki, Sadayoshi; Hamanaka, Seishiro, Data processing system with mechanism for restoring file systems based on transaction logs.
Dunphy William E. (Westminster CO) Halladay Steven M. (Louisville CO) Moy Michael E. (Lafayette CO) Munro Frederick G. (Broomfield CO), Data storage and protection system.
Yanai Moshe (Framingham MA) Vishlitzky Natan (Brookline MA) Alterescu Bruno (Newton MA) Castel Daniel (Framingham MA) Shklarsky Gadi (Brookline MA), Data storage system controlled remote data mirroring with respectively maintained data indices.
Fortier Richard W. (Acton MA) Mastors Robert M. (Ayer MA) Taylor Tracy M. (Upton MA) Wallace John J. (Franklin MA), Digital data processor with improved backup storage.
Kenley Gregory (Northboro MA) Ericson George (Schrewsbury MA) Fortier Richard (Acton MA) Holland Chuck (Northboro MA) Mastors Robert (Ayer MA) Pownell James (Natick MA) Taylor Tracy (Upton MA) Wallac, Digital data storage system with improved data migration.
Xu Yikang ; Vahalia Uresh K. ; Jiang Xiaoye ; Gupta Uday ; Tzelnic Percy, File server system using file system storage, data movers, and an exchange of meta data among data movers for file locking and direct access to shared file systems.
Lagueux, Jr., Richard A.; Stave, Joel H.; Yeaman, John B.; Stevens, Brian E.; Higgins, Robert M.; Collins, James M., Graphical user interface for configuration of a storage system.
Urevig Paul D. ; Malnati James R. ; Ethen Donald J. ; Weber Herbert L., Grouping shared resources into one or more pools and automatically re-assigning shared resources from where they are not currently needed to where they are needed.
Leighton,F. Thomson; Lewin, legal representative,Anne E.; Lewin, deceased,Daniel M., HTML delivery from edge-of-network servers in a content delivery network (CDN).
Ito Hiromichi,JPX ; Arai Masato,JPX ; Nakata Yukio,JPX ; Ito Toshiya,JPX ; Mori Mitsuru,JPX, Information processing system enabling access to different types of files, control method for the same and storage mediu.
Barney Rock D. ; Schwols Keith ; Nelson Ellen M., Integration of a database into file management software for protecting, tracking and retrieving data.
Oshinsky, David Alan; Ignatius, Paul; Prahlad, Anand; May, Andreas, Logical view and access to data managed by a modular data and storage management system.
Ignatius, Paul; Theisen, Marjorie H.; Oshinsky, David Alan; Kavuri, Srinivas, Logical view and access to physical storage in modular data and storage management system.
Martin Charles W. (Richardson TX) Reid Fredrick S. (Plano TX) Forbus Gary L. (Dallas TX) Adams Steve M. (Plano TX) Shannon C. Patrick (Garland TX) Pirpich Eric A. (Garland TX), Mass data storage and retrieval system.
Kedem Nadav,ILX, Mass storage subsystem and backup arrangement for digital data processing system which permits information to be backed up while host computer(s) continue(s) operating in connection with information .
Long Robert M., Media element library with non-overlapping subset of media elements and non-overlapping subset of media element drives accessible to first host and unaccessible to second host.
Amundson Daniel L. ; Halley Donald Ray ; Koeller Paul Douglas ; Koser Leonard William ; Smith Lynda Marie, Method and apparatus for data backup and recovery.
Kullick Steven E. ; Spirakis Charles S. ; Titus Diane J., Method and apparatus for transferring archival data among an arbitrarily large number of computer devices in a networked.
Eastridge Lawrence E. (Tucson AZ) Kern Robert F. (Tucson AZ) Kern Ronald M. (Tucson AZ) Mikkelsen Claus W. (Morgan Hill CA) Ratliff James M. (Tucson AZ), Method and system for automated backup copy ordering in a time zero backup copy session.
Eastridge Lawrence E. (Tucson AZ) Kern Robert F. (Tucson AZ) Micka William F. (Tucson AZ) Mikkelsen Claus W. (Morgan Hill CA) Ratliff James M. (Tucson AZ), Method and system for automated termination and resumption in a time zero backup copy process.
Walter A. Hubis ; William G. Deitz, Method and system for controlling access share storage devices in a network environment by configuring host-to-volume mapping data structures in the controller memory for granting and denying access .
Aoyama Yuki,JPX ; Takahashi Toru,JPX ; Wakayama Satoshi,JPX, Method of and an apparatus for displaying version information and configuration information and a computer-readable recording medium on which a version and configuration information display program i.
Biettron,Laurent; Pallu,Fr챕d챕ric; Tricot,Sylvie, Method of thematic classification of documents, themetic classification module, and search engine incorporating such a module.
Crescenti,John; Kavuri,Srinivas; Oshinsky,David Alan; Prahlad,Anand, Modular backup and retrieval system used in conjunction with a storage area network.
Pisello Thomas (De Bary FL) Crossmier David (Casselberry FL) Ashton Paul (Oviedo FL), Network management system having virtual catalog overview of files distributively stored across network domain.
Crockett Robert N. (Tucson AZ) Kern Ronald M. (Tucson AZ) Micka William F. (Tucson AZ), Software directed microcode state save for distributed storage controller.
Thomas Michael W. ; Allard James E. ; Howard Michael ; Chung Sophia ; Ferroni Cameron ; Henbenthal Douglas C. ; Ludeman John ; Stebbens Kim ; Sanders ; II Henry L. ; Treadwell ; III David R., System and method for administering a meta database as an integral component of an information server.
Kottomtharayil,Rajiv; Gokhale,Parag; Prahlad,Anand; Vijayan Retnamma,Manoj Kumar; Ngo,David; Devassy,Varghese, System and method for dynamically performing storage operations in a computer network.
Diaz Perez, Milton, System and method for managing, converting and displaying video content on a video-on-demand platform, including ads used for drill-down navigation and consumer-generated classified ads.
Richard J. Huebsch ; Robert J. Prieve ; Leonard Kampa, System and method for multiplexed data back-up to a storage tape and restore operations using client identification tags.
Mutalik Madhav ; Senie Faith M., System and method for performing file-handling operations in a digital data processing system using an operating system-independent file map.
Huai ReiJane (Old Brookville NY) Daly Robert (Ronkonkoma NY) Curti Walter (Dix Hills NY) Mohan Deepak (Huntington NY) Chueh James Kuang-Ru (Bayside NY) Louie Larry (Forest Hills NY), System and parallel streaming and data stripping to back-up a network.
Stoppani ; Jr. Peter (Woodinville WA), System for allocating storage spaces based upon required and optional service attributes having assigned piorities.
Flynn Rex A. (Belmont MA) Anick Peter G. (Marlboro MA), System for reconstructing prior versions of indexes using records indicating changes between successive versions of the.
Saether Christian D. (Seattle WA) Stoppani ; Jr. Peter (Woodinville WA), System of device independent file directories using a tag between the directories and file descriptors that migrate with.
Prahlad, Anand; Schwartz, Jeremy Alan; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for using metadata to enhance data management operations.
Horvitz, Eric J.; Kadie, Carl M.; Ozer, Stuart; Wong, Curtis G., Training, inference and user interface for guiding the caching of media content on local stores.
Prahlad, Anand; Schwartz, Jeremy A.; Ngo, David; Brockway, Brian; Muller, Marcus S.; Gokhale, Parag; Kottomtharayil, Rajiv, Method and system for offline indexing of content and classifying stored data.
Prahlad, Anand; Kavuri, Srinivas; Kottomtharayil, Rajiv; Amarendran, Arun Prasad; Brockway, Brian; Muller, Marcus S.; May, Andreas, Method and system for searching stored data.
Prahlad, Anand; Kavuri, Srinivas; Kottomtharayil, Rajiv; Amarendran, Arun Prasad; Brockway, Brian; Muller, Marcus S.; May, Andreas, Method and system for searching stored data.
Prahlad, Anand; Kavuri, Srinivas; Kottomtharayil, Rajiv; Amarendran, Arun Prasad; Brockway, Brian; Muller, Marcus S.; May, Andreas, Method and system for searching stored data.
Prahlad, Anand; Schwartz, Jeremy A.; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for classifying and transferring information in a storage network.
Prahlad, Anand; Schwartz, Jeremy A.; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for classifying and transferring information in a storage network.
Prahlad, Anand; Schwartz, Jeremy A.; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for classifying and transferring information in a storage network.
Prahlad, Anand; Schwartz, Jeremy Alan; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for using metadata to enhance data identification operations.
Prahlad, Anand; Schwartz, Jeremy Alan; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for using metadata to enhance data identification operations.
Prahlad, Anand; Schwartz, Jeremy Alan; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for using metadata to enhance data identification operations.
Prahlad, Anand; Schwartz, Jeremy Alan; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for using metadata to enhance data identification operations.
Prahlad, Anand; Schwartz, Jeremy Alan; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for using metadata to enhance data identification operations.
Prahlad, Anand; Schwartz, Jeremy Alan; Ngo, David; Brockway, Brian; Muller, Marcus S., Systems and methods for using metadata to enhance data identification operations.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.