IPC분류정보
국가/구분 |
United States(US) Patent
등록
|
국제특허분류(IPC7판) |
|
출원번호 |
US-0303782
(2002-11-26)
|
우선권정보 |
JP-0041855 (1997-02-26) |
발명자
/ 주소 |
- Takahashi, Toru
- Kawaguchi, Hisamitsu
- Yamasaki, Noriyuki
|
출원인 / 주소 |
|
대리인 / 주소 |
Mattingly, Stanger & Malur, P.C.
|
인용정보 |
피인용 횟수 :
4 인용 특허 :
16 |
초록
▼
A text cataloging method includes a step of cataloging already-analyzed-text data obtained from an analysis of a logical structure of a text to be cataloged in a text database, a step of creating a structure index by sequentially superposing logical structures of texts to be cataloged, wherein a sin
A text cataloging method includes a step of cataloging already-analyzed-text data obtained from an analysis of a logical structure of a text to be cataloged in a text database, a step of creating a structure index by sequentially superposing logical structures of texts to be cataloged, wherein a single metaelement is used for representing a group of elements in the texts having the same position of appearance in one of the texts and the same element type, a single piece of meta-character-string data is used for representing a group of pieces of character-string data in the texts having the same position of appearance in one of the texts, and a context identifier is assigned to each metanode composing a tree-like structure of the structure index for uniquely identifying the metanode; a step of generating structured-full-text data composed of definitions of associative relations between all pieces of character-string data included in already-analyzed-text data of each text to be cataloged, and context identifiers of pieces of meta-character-string data in the structure index used for representing the pieces of character-string data; and a character-string-index updating step, including the sub-steps of extracting partial character strings, generating structured-character-position information, and updating a character-string index.
대표청구항
▼
1. A structured-text cataloging/searching method for a text searching system, in which a set of texts is searched for specific text contents, comprising the following steps: an already-analyzed-text data generating/cataloging step of cataloging, in a text database, already-analyzed-text data obtai
1. A structured-text cataloging/searching method for a text searching system, in which a set of texts is searched for specific text contents, comprising the following steps: an already-analyzed-text data generating/cataloging step of cataloging, in a text database, already-analyzed-text data obtained from an analysis of a logical structure of a text to be cataloged, said already-analyzed-text data generating/cataloging step being performed for a plurality of texts to be cataloged; and a structure-index creating step of creating a structure index, by sequentially superposing logical structures of said plurality of texts cataloged in said already-analyzed-text data generating/cataloging step; wherein said structure index has a tree-like structure composed of a plurality of metanodes; wherein a context identifier that uniquely identifies one of said metanodes is assigned to each metanode of said structure index; and wherein a group of structure elements having the same position of appearance for a plurality of texts are represented by a single metanode. 2. A structured-text cataloging/searching method according to claim 1, further comprising the steps of: a character-string-index updating step comprising the sub-steps of: extracting partial character strings each having a predetermined character count from each of a plurality of texts to be cataloged; andupdating a character string index by cataloging an associative relation between each of said partial character strings and structured character position information of that partial character string in said character string index; a structure-condition judging step of searching the structure index for a set of context identifiers satisfying a specific structure condition; a structured-character-position-information extracting step of extracting partial character strings from a search term, each extracted partial character string having a predetermined character count, and searching the character string index for a set of pieces of structured-character-position information matching said extracted partial character strings; and an index searching step of searching said set of pieces of structured-character-position information for specific pieces of structured-character-position information that have context identifiers found at said structure-condition judging step, and that have a positional relation among said specific pieces of structured-character-position information matching an order of arrangements of said partial character strings in said search term. 3. A structured-text cataloging/searching method according to claim 2, in which said character-string-index updating step comprises the sub-steps of: extracting character-position information of said partial character strings in a text to be cataloged, a text identifier for uniquely identifying said text in a text database, and a context identifier of a metanode representing character-string data including said partial character strings in a structure index; and generating structured-character-position information that includes said character-position information, said text identifier, and said context identifier. 4. A structured-text cataloging/searching method according to claim 2, in which said structure-index creating step creates a structure index by sequentially superposing logical structures of a plurality of texts to be cataloged; wherein the tree-like structures of two of said texts to be cataloged are superposed on each other in said structure index by: comparing nodes of one of said tree-like structures of said two texts with nodes of the other of said tree-like structures of said two texts;regarding a root node in said one of said tree-like structures as a counterpart that mutually corresponds to a root node in said other of said tree-like structures;regarding a non-root node in said one of said tree-like structures of said two texts as a counterpart that mutually corresponds to a no n-root node in said other of said tree-like structures if the non-root node in said one of said tree-like structures has a directly superordinate node mutually corresponding to a directly superordinate node of the non-root node in said other of said tree-like structures, if the non-root node in said one of said tree-like structures is the same type of node as the non-root node in said other of said tree-like structures, and if the non-root node in said one of said tree-like structures has the same appearance order number as the non-root node in said other of said tree-like structures, wherein the appearance order number indicates a position in a normal order of an array of brother nodes of the same type of node found by counting said brother nodes, starting from the head of said array; andrepresenting two mutually corresponding nodes by a single metanode in said structure index.5. A structured-text cataloging/searching method according to claim 2, in which said structure-index creating step creates a structure index by sequentially superposing logical structures of a plurality of texts to be cataloged; wherein the tree-like structures of two of said texts to be cataloged are superposed on each other in said structure index by: comparing nodes of one of said tree-like structures of said two texts with nodes of the other of said tree-like structures of said two texts;regarding a root node in said one of said tree-like structures as a counterpart that mutually corresponds to a root node in said other of said tree-like structures;regarding a non-root node in said one of said tree-like structures of said two texts as a counterpart that mutually corresponds to a non-root node in said other of said tree-like structures if the non-root node in said one of said tree-like structures has a directly superordinate node mutually corresponding to a directly superordinate node of the non-root node in said other of said tree-like structures, if the non-root node in said one of said tree-like structures is the same type of node as the non-root node in said other of said tree-like structures, and if the non-root node in said one of said tree-like structures has the same appearance order number as the non-root node in said other of said tree-like structures, wherein said appearance order number indicates a position in a reversed order of an array of brother nodes of the same type of node found by counting said brother nodes, starting from the end of said array; andrepresenting two mutually corresponding nodes by a single metanode in said structure index.6. A structured-text cataloging/searching method according to claim 2, in which said structure-index creating step creates a structure index by sequentially superposing logical structures of a plurality of texts to be cataloged; wherein the tree-like structures of two of said texts to be cataloged are superposed on each other in said structure index by: comparing nodes of one of said tree-like structures of said two texts with nodes of the other of said tree-like structures of said two texts;regarding a root node in said one of said tree-like structures as a counterpart that mutually corresponds to a root node in said other of said tree-like structures;regarding a non-root node in said one of said tree-like structures of said two texts as a counterpart that mutually corresponds to a non-root node in said other of said tree-like structures if the non-root node in said one of said tree-like structures has a directly superordinate node mutually corresponding to a directly superordinate node of the non-root node in said other of said tree-like structures, if the non-root node in said one of said tree-like structures is the same type of node as the non-root node in said other of said tree-like structures, and if either both the non-root node of said one of said tree-like structures and the non-root node of said other of said tree-like structures are head nodes of an array of brother nodes of the sa me type of node, or both the non-root node in said one of said tree-like structures and the non-root node of said other of said tree-like structures are not head nodes of an array of brother nodes of the same type of node; andrepresenting two mutually corresponding nodes by a single metanode in said structure index.7. A structured-text cataloging/searching method according to claim 2, in which said structure-index creating step creates a structure index by sequentially superposing logical structures of a plurality of texts to be cataloged; wherein the tree-like structures of two of said texts to be cataloged are superposed on each other in said structure index by: comparing nodes of one of said tree-like structures of said two texts with nodes of the other of said tree-like structures of said two texts;regarding a non-root node in said one of said tree-like structures of said two texts as a counterpart that mutually corresponds to a non-root node in said other of said tree-like structures if the non-root node in said one of said tree-like structures has a directly superordinate node mutually corresponding to a directly superordinate node of the non-root node in said other of said tree-like structures, if the non-root node in said one of said tree-like structures is the same type of node as the non-root node in said other of said tree-like structures, and if either both the non-root node of said one of said tree-like structures and the non-root node of said other of said tree-like structures are end nodes of an array of brother nodes of the same type of node, or both the non-root node in said one of said tree-like structures and the non-root node of said other of said tree-like structures are not end nodes of an array of brother nodes of the same type of node; andrepresenting two mutually corresponding nodes by a single metanode in said structure index.8. A structured-text cataloging/searching method according to claim 2, further comprising the following steps: searching said already-analyzed-text data for a structure and a descriptive complete string of characters which are regarded as improper objects to be searched; and deleting the structure and the descriptive complete string of characters from the already-analyzed-text data for an improper object found in said searching step; wherein the already-analyzed-text data from which the structures and the descriptive complete string of characters have been deleted in said deleting step, is cataloged in said cataloging step. 9. A computer-readable storage medium encoded with executable instructions constituting a program which, when executed by a computer, causes the computer to implement said already-analyzed-text data generating/cataloging step, said structure-index updating step, said character-string-index updating step, said structure-condition judging step, said structured-character-position-information extracting step, and said index searching step of said structured-text cataloging/searching method according to claim 2.10. A text searching system, comprising: a text cataloging sub-system in which a plurality of texts are cataloged for use as text search objects in a text search operation; a text searching server that causes a text database to be searched in response to a text search request; a text searching client from which a text search request is issued to the text searching server; and a network connecting the text cataloging sub-system, the text searching server, and the text search client; wherein the text cataloging sub-system creates a structure index based on the cataloging of the plurality of texts, the structure index containing context identifiers of character-string data derived from the cataloged texts; wherein said structure index has a tree-like structure composed of a plurality of metanodes; wherein a context identifier that uniquely identifies one of said metanodes is assigned to each metanode of said structure index; whe rein a group of structure elements having the same position of appearance for a plurality of texts are represented by a single metanode; wherein the text search request issued by the text searching client includes a search condition that is translated into a condition specification by the text searching server, from which condition specification the text searching server causes the structure index to be searched for agreement between the context identifiers and the search condition; and wherein the text searching server transmits a text search result to the text searching client upon completion of the search of the structure index. 11. A structured-text cataloging/searching method for a text searching system, in which a set of texts is searched for specific text contents, wherein each text has a hierarchical structure of elements, comprising the following steps: a text cataloging step, which catalogs the order information of every element within the sequence of elements residing on the same level in said hierarchical structure of elements; and a searching step, which finds every text which satisfies a search condition specifying the order of an element within the sequence of elements residing on the same level in said hierarchical structure of elements, by referring to the order information cataloged in said cataloging step, wherein said searching step finds every text that satisfies a search condition specifying a number which represents the occurrence order of an element within the sequence of elements residing on the same level in said hierarchical structure of elements, by referring to the order information cataloged in said cataloging step. 12. A structured-text cataloging/searching method for a text searching system, in which a set of texts is searched for specific text contents, wherein each text has a hierarchical structure of elements, comprising the following steps: a text cataloging step, which catalogs the order information of every element within the sequence of elements residing on the same level in said hierarchical structure of elements; and a searching step, which finds every text which satisfies a search condition specifying the order of an element within the sequence of elements residing on the same level in said hierarchical structure of elements, by referring to the order information cataloged in said cataloging step, wherein said searching step finds every text that satisfies a search condition specifying a number which represents the reverse occurrence order of an element within the sequence of elements residing on the same level in said hierarchical structure of elements, by referring to the order information cataloged in said cataloging step. 13. A structured-text cataloging/searching method for a text searching system, in which a set of texts is searched for specific text contents, wherein each text has a hierarchical structure of elements, comprising the following steps: a text cataloging step, which catalogs the order information of every element within the sequence of elements residing on the same level in said hierarchical structure of elements; and a searching step, which finds every text which satisfies a search condition specifying the order of an element within the sequence of elements residing on the same level in said hierarchical structure of elements, by referring to the order information cataloged in said cataloging step, wherein said searching step finds every text that satisfies a search condition specifying whether an element is positioned at the beginning or not within the sequence of elements residing on the same level in said hierarchical structure of elements, by referring to the order information cataloged in said cataloging step. 14. A structured-text cataloging/searching method for a text searching system, in which a set of texts is searched for specific text contents, wherein each text has a hierarchical structure of elements, comprising the fo llowing steps: a text cataloging step, which catalogs the order information of every element within the sequence of elements residing on the same level in said hierarchical structure of elements; and a searching step, which finds every text which satisfies a search condition specifying the order of an element within the sequence of elements residing on the same level in said hierarchical structure of elements, by referring to the order information cataloged in said cataloging step, wherein said searching step finds every text that satisfies a search condition specifying whether an element is positioned at the ending or not within the sequence of elements residing on the same level in said hierarchical structure of elements, by referring to the order information cataloged in said cataloging step. 15. A structured-text cataloging/searching method for a text searching system, in which a set of texts is searched for specific text contents, wherein each text has a hierarchical structure of elements, comprising the following steps: a text cataloging step, which catalogs the order information of every element residing on the same level in said hierarchical structure of elements; and a searching step, which finds every text which satisfies a search condition specifying the order of an element in said hierarchical structure of elements, by referring to the order information cataloged in said cataloging step. 16. A computer-program which is constructed by executable instructions which, when executed by a computer, causes the computer to implement said cataloging step, and said searching step of said structured-text cataloging/searching method according to claim 15.17. A structured-text cataloging/searching method according to claim 15, wherein said searching step finds every text that satisfies a search condition specifying a number which represents the occurrence order of an element in said hierarchical structure of elements, by referring to the order information cataloged in said cataloging step.18. A structured-text cataloging/searching method according to claim 15, wherein said searching step finds every text that satisfies a search condition specifying a number which represents the reverse occurrence order of an element in said hierarchical structure of elements, by referring to the order information cataloged in said cataloging step.19. A structured-text cataloging/searching method according to claim 15, wherein said searching step finds every text that satisfies a search condition specifying whether an element is positioned at the beginning or not in said hierarchical structure of elements, by referring to the order information cataloged in said cataloging step.20. A structured-text cataloging/searching method according to claim 15, wherein said searching step finds every text that satisfies a search condition specifying whether an element is positioned at the ending or not in said hierarchical structure of elements, by referring to the order information cataloged in said cataloging step.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.