A region based information retrieval system improves on conventional information retrieval systems by breaking down documents into one or more region(s) and processing the additional information available at a region level of analysis. When looking at regions, it becomes possible to quickly distingu
A region based information retrieval system improves on conventional information retrieval systems by breaking down documents into one or more region(s) and processing the additional information available at a region level of analysis. When looking at regions, it becomes possible to quickly distinguish between groups of related documents, quickly ignore or focus on certain information, track recent evolutions of documents, as well as understand the historical relationships, heritage, and versions of these documents. This is all possible whether or not the document publishers specify where the content originally came from.
대표청구항▼
1. A system for retrieving information from document collections comprising: a document collection subsystem for managing documents in a document database;a region finding, splitting and graphing subsystem for analyzing documents in the document database, establishing regions of these documents, whe
1. A system for retrieving information from document collections comprising: a document collection subsystem for managing documents in a document database;a region finding, splitting and graphing subsystem for analyzing documents in the document database, establishing regions of these documents, where the region is less than the containing document and the bounds of each region is defined by the existence of at least one other identical or nearly identical region elsewhere in the document database, identifying region sets of such identical or nearly identical regions across documents, and storing these regions sets in a region set database;an indexing subsystem for making the region sets searchable and storing the index information in a searchable index of region set database; anda searching and ranking subsystem for finding region sets in the region set database using the searchable index of region sets based on an information request, creating a list of region set clusters of closely related region sets from the regions sets found, where the relations are based on the relationships in the region set graphs obtained from the region set graphs database, and communicating the search results. 2. The system of claim 1, wherein the region finding, splitting and graphing subsystem further establishes region set graphs with relationships between the region sets, and stores these region set graphs in a region set graph database. 3. The system of claim 1, wherein the searching and ranking subsystem further creates a list of documents from the document database which contains regions belonging to the region sets found and communicates these results. 4. The system of claim 1, wherein the results contain the regions sets within each regions set cluster. 5. The system of claim 4, wherein the regions sets within each regions set cluster are ranked based on the chronology of the region sets. 6. The system of claim 1, wherein the results contain the documents from the document database which contains regions belonging to the region sets found. 7. The system of claim 1, wherein the information request comprises a search text. 8. The system of claim 1, wherein the information request comprises similarity to a document, image, video, or other content. 9. The system of claim 1, wherein the relationship between region sets comprises content of the regions in one region set being a subset of content of regions in another region set. 10. The system of claim 1, wherein the relationship between region sets comprises content of the regions in one region set partially overlapping with content of regions in another region set. 11. A method of organizing and retrieving information in documents collections comprising: a document collection element for managing documents in a document database;a region finding, splitting and graphing element for analyzing documents in the document database, establishing regions of these documents, where the region is less than the containing document and the bounds of each region is defined by the existence of at least one other identical or nearly identical region elsewhere in the document database, identifying region sets of such identical or nearly identical regions across documents, and storing these regions sets in a region set database;an indexing element for making the region sets searchable and storing the index information in a searchable index of region set database; anda searching and ranking element for finding region sets in the region set database using the searchable index of region sets based on an information request, creating a list of region set clusters of closely related region sets from the regions sets found, where the relations are based on the relationships in the region set graphs obtained from the region set graphs database, and communicating the search results. 12. The method of claim 11, wherein the region finding, splitting and graphing element further establishes region set graphs with relationships between the region sets, and stores these region set graphs in a region set graph database. 13. The method of claim 11, wherein the relationship between these region sets are either one region set being a partial or full subset of the other region set or one region set containing a reference to or keywords of a document containing all or part of the other region set. 14. The method of claim 11, wherein an attempt is made to establish the strength of the relationship between the region sets. 15. The method of claim 11, wherein an attempt is made to establish the chronology of the region sets within region set clusters. 16. The method of claim 11, wherein the search results are ranked based on the relationships between the region sets. 17. The method of claim 11, wherein the search results are the documents partially or fully containing the identified sets of regions. 18. The method of claim 11, wherein the relationship between region sets comprises content of the regions in one region set partially overlapping with content of the regions in another region set. 19. The method of claim 11, wherein an attempt is made to establish the chronology of the documents within region set clusters.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (80)
Tardo, Joseph John; Frailong, Jean-Marc; Mendoza, Harold Lee; Haris, Shiv, Apparatus and method for cryptographic-based license management.
Downs Edgar ; Gruse George Gregory ; Hurtado Marco M. ; Lehman Christopher T. ; Milsted Kenneth Louis ; Lotspiech Jeffrey B., Electronic content delivery system.
Afifi Ashraf ; Chan Dominic ; Comuzzi Joseph J. ; Hart Johnson M. ; Pizzarello Antonio, Method and apparatus for analyzing computer code using weakest precondition.
Lee,Chris Guo; Matada,Anmol Neelammna; Wang,Ningning, Method and apparatus for determining relative relevance between portions of large electronic documents.
Anand, Ashok Kumar; Berry, Brian Jacob; Boey, Johnny Yit; Jandrisevits, Michael; May, Patrick King Wah; Wotzak, Gregory Paul, Method and system for automated integration of design analysis subprocesses.
Zhang,Benyu; Zeng,Hua Jun; Ma,Wei Ying; Xi,Wensi; Chen,Zheng; Fox,Edward A., Method and system for ranking objects based on intra-type and inter-type relationships.
Cline David C. (San Jose CA) Silverman Andrew P. (Los Gatos CA) Wymore Farrell W. (Mountain View CA), Method for analyzing calls of application program by inserting monitoring routines into the executable version and redir.
Broder Andrei Z. ; Glassman Steven C. ; Nelson Charles G. ; Manasse Mark S. ; Zweig Geoffrey G., Method for clustering closely resembling data objects.
Califano Andrea (New York NY), Method for finding a reference token sequence in an original token string within a database of token strings using appen.
Bharat Krishna Asur ; Henzinger Monika R., Method for ranking documents in a hyperlinked environment using connectivity and selective content analysis.
Ruffin Michael ; Jayaram Kristin R. ; Merenda Ann C. ; Morrison Timothy I. ; Ordonez Carlos A. ; Preston Allen H. ; Temple ; III Joseph L. ; Yan Eva L., Method, system and program product for evaluating the business requirements of an enterprise for generating business solution deliverables.
Baker Brenda Sue ; Church Kenneth Ward ; Helfman Jonathan Isaac ; Kernighan Brian W., Methods and apparatus for detecting and displaying similarities in large data sets.
Jain Ramesh ; Horowitz Bradley ; Fuller Charles E. ; Gupta Amarnath ; Bach Jeffrey R. ; Shu Chiao-fe, Similarity engine for content-based retrieval of images.
Driskell Dwight D. ; Greenspan Michael ; Henley Vivian C. ; Lane Nancy C. ; MacFarlane Lloyd ; Nielsen Betty J., System and method for associating services information with selected elements of an organization.
Devanbu Premkumar Thomas ; Stubblebine Stuart Gerald, System and method for providing assurance to a host that a piece of software possesses a particular property.
Premkumar Thomas Devanbu ; Stuart Gerald Stubblebine, System and method for providing assurance to a host that a piece of software possesses a particular property.
Ginter Karl L. ; Shear Victor H. ; Spahn Francis J. ; Van Wie David M., System and methods for secure transaction management and electronic rights protection.
Bergler,Peter M.; Parsons, Jr.,John E.; Hagan,Breen E.; Brockway,Tad Dennis; Leitman,Robert K., System and related methods for managing and enforcing software licenses.
Ginter Karl L. ; Shear Victor H. ; Sibert W. Olin ; Spahn Francis J. ; Van Wie David M., Systems and methods for secure transaction management and electronic rights protection.
Lang, Ulrich; Schreiner, Rudolf, Method and system for rapid accreditation/re-accreditation of agile IT environments, for example service oriented architecture (SOA).
Lang, Ulrich; Schreiner, Rudolf, Method and system for rapid accreditation/re-accreditation of agile IT environments, for example service oriented architecture (SOA).
※ AI-Helper는 부적절한 답변을 할 수 있습니다.