Document length as a static relevance feature for ranking search results
IPC classification information
Country / Type: United States (US) Patent, Granted
International Patent Classification (IPC 7th edition): G06F-017/30; G06F-007/00
Application number: US-0207910 (2008-09-10)
Registration number: US-9348912 (2016-05-24)
Inventors: Tankovich, Vladimir; Meyerzon, Dmitriy; Taylor, Michael James
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
Agent: Akhter, Julie Kane
Citation information
Times cited: 0
Cited patents: 203
Abstract
Embodiments are configured to provide information based on a user query. In an embodiment, a system includes a search component having a ranking component that can be used to rank search results as part of a query response. In one embodiment, the ranking component includes a ranking algorithm that can use the length of documents returned in response to a search query to rank search results.
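The claims of this patent define the length feature as a document's word count divided by the average word count of the documents in the same result set. The following is a minimal sketch of that normalization, not the patented implementation. The function name `normalized_lengths`, the whitespace-based word count, and the dictionary input format are assumptions made for illustration.

```python
from typing import Dict


def normalized_lengths(docs: Dict[str, str]) -> Dict[str, float]:
    """Return word count / average word count for each document in a result set.

    `docs` maps a hypothetical document id to its text. Words are counted by
    splitting on whitespace, which is an assumption; the patent only says that
    length corresponds to a number of words.
    """
    counts = {doc_id: len(text.split()) for doc_id, text in docs.items()}
    if not counts:
        return {}
    average = sum(counts.values()) / len(counts)
    if average == 0:
        # Every document is empty, so no meaningful ratio exists.
        return {doc_id: 0.0 for doc_id in counts}
    return {doc_id: n / average for doc_id, n in counts.items()}


if __name__ == "__main__":
    results = {
        "a": "one two",
        "b": "one two three four",
        "c": "one two three",
    }
    print(normalized_lengths(results))
```

A value of 1.0 means the document is exactly as long as the average document in the result set; values below or above 1.0 mark shorter or longer documents.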
Representative claims
1. A system for providing information comprising: one or more processors; one or more computer storage media storing computer executable instructions that when executed by the one or more processors provide: a search component configured to locate a search result based on a query input; a database component configured to store information associated with the query input including one or more ranking features, wherein the one or more ranking features are associated with a user action or user inaction associated with the search result which are collected with respect to the search result for a same query or a similar query previously received, and wherein one ranking feature of the one or more ranking features is associated with a normalized document length wherein the normalized document length is determined by dividing a length of a document to be ranked by an average length of a set of documents included in the search result, wherein the document to be ranked is included in the set of documents, wherein the length of the document corresponds to a number of words in the document; and a ranking component configured to rank the search result based, at least in part, on a ranking function and the one or more ranking features, including an action-based feature, an inaction-based feature and a normalized document length feature, wherein the search component uses the rank of the search result when providing search results according to a ranking order.
2. The system of claim 1, wherein a transform function converts the normalized document length into a ranking value between zero and one.
3. The system of claim 2, wherein the transform function is defined as: F(D) = D, D ≤ 1; F(D) = 0.5 + (3 − D)/4, 1 < D ≤ 3; F(D) = 0.5, D > 3, wherein D represents the normalized document length and F(D) represents the ranking value.
4. The system of claim 1, wherein the ranking component uses one or more click-through parameters when ranking the search result, wherein the one or more click-through parameters further comprise one or more of the following: a click parameter associated with a number of times that the search result has been clicked; a skip parameter associated with a number of times that the search result has been skipped; a first stream parameter corresponding to a union of query strings associated with a clicked search result; and a second stream parameter corresponding to a union of query strings associated with a skipped search result.
5. The system of claim 4, wherein the search component is further configured to update the one or more click-through parameters including using information associated with how the search result was interacted with when updating the one or more of the click-through parameters.
6. The system of claim 5, wherein the search component is further configured to update the one or more click-through parameters, wherein the update of the one or more click-through parameters corresponds with a selected search result or a skipped search result.
7. The system of claim 1, wherein the one or more ranking features comprise one or more dynamic ranking features selected from a group consisting of body, title, author, generated title, an anchor text, and a URL, and one or more static ranking features selected from a group consisting of click distance, URL depth, file type, and language.
8. A non-transitory computer-readable storage medium storing computer executable instructions that when executed by one or more processors provide a search engine configured to: receive information associated with a query; locate a search result associated with the query, wherein the search result includes one or more documents; calculate a first input associated with a click parameter and the search result; calculate a second input associated with a skip parameter and the search result; calculate a third input associated with a normalized document length of the one or more documents included in the search result, wherein the normalized document length of the one or more documents is obtained by dividing a length of each document of the one or more documents by an average length of each document of the one or more documents included in the search result, wherein the length of the document corresponds to a number of words in the document; wherein the length of the document corresponds to a number of words in the document; store information associated with the query including one or more ranking features, wherein the one or more ranking features are associated with a user action or user inaction associated with the search result which are collected with respect to the search result for a same query or a similar query previously received, and wherein one ranking feature of the one or more ranking features is associated with a normalized document length, wherein the document to be ranked is included in the one or more documents; ranking the search result based on a ranking determination using the ranking features, the first input, the second input, and the third input; and provide the search result according to the ranking determination.
9. The non-transitory computer-readable storage medium of claim 8, further configured to calculate a fourth input associated with a first stream parameter and the search result; calculate a fifth input associated with a second stream parameter and the search result; and rank the one or more documents included in the search result using at least four of the first input, the second input, the third input, the fourth input, and the fifth input.
10. The non-transitory computer-readable storage medium of claim 8, further configured to update a store with click parameter and skip parameter updates associated with received interactions with the one or more documents included in the search result.
11. The non-transitory computer-readable storage medium of claim 8, further configured to update a store with stream parameter updates associated with received interactions with the one or more documents included in the search result.
12. A method of providing information comprising: searching to locate a search result based on a query input; storing information associated with the query input including one or more ranking features, wherein the one or more ranking features are associated with a user action or user inaction associated with the search result which are collected with respect to the search result for a same query or a similar query previously received, and wherein one ranking feature of the one or more ranking features is associated with a normalized document length wherein the normalized document length is determined by dividing a length of a document to be ranked by an average length of a set of documents included in the search result, wherein the document to be ranked is included in the set of documents, wherein the length of the document corresponds to a number of words in the document; and ranking the search result based, at least in part, on a ranking function and the one or more ranking features, including an action-based feature, an inaction-based feature and a normalized document length feature, wherein the search component uses the rank of the search result when providing search results according to a ranking order.
13. The method of claim 12, further comprising: determining a fourth input value associated with a text stream and a received selection of at least one of the one or more query candidates; and ranking the one or more query candidates based in part on a scoring determination using a scoring function and one or more of the first input value, the second input value, the third input value and the fourth input value.
14. The method of claim 12, further comprising ranking the one or more query candidates according to a numerical order.
15. The system of claim 1, wherein the normalized document length is determined independently from a file type of one or more documents included in the search result.
16. The non-transitory computer-readable storage medium of claim 8, wherein the normalized document length of the one or more documents included in the search result is determined independently from a file type of the one or more documents included in the search result.
17. The non-transitory computer-readable storage medium of claim 8, wherein the normalized document length of the one or more documents is obtained by dividing a length of each document of the one or more documents by an average length of each document of the one or more documents included in the search result.
18. The method of claim 12, wherein the normalized document length of the at least one of the query candidates is determined independently from a file type of the at least one of the query candidates.
19. The method of claim 12, wherein the normalized document length of the at least one of the query candidates is obtained by dividing a length of the at least one of the query candidates by an average length of the one or more query candidates included in a result of the query.
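The claims combine a length-derived ranking value in the range zero to one with click and skip parameters. The sketch below illustrates one way such features could be combined; it is not the patent's ranking function, which the claims leave unspecified. The piecewise form of `length_transform` (breakpoints at 1 and 3) is assumed for illustration and should be checked against the granted claim text. The `Result` class, the weighted-sum `score`, and all weights are hypothetical.

```python
from dataclasses import dataclass
from typing import List


def length_transform(d: float) -> float:
    """Map a normalized document length to a value in [0, 1].

    Assumed piecewise form: identity up to the average length, a linear
    decline between 1x and 3x the average, and a constant 0.5 beyond that.
    """
    if d <= 1.0:
        return d
    if d <= 3.0:
        return 0.5 + (3.0 - d) / 4.0
    return 0.5


@dataclass
class Result:
    doc_id: str             # hypothetical identifier
    clicks: int             # times this result was clicked for the query
    skips: int              # times this result was skipped for the query
    normalized_length: float


def score(r: Result, w_click: float = 1.0, w_skip: float = 0.5,
          w_len: float = 1.0) -> float:
    """Hypothetical weighted sum of click, skip, and length features."""
    shown = r.clicks + r.skips
    click_rate = r.clicks / shown if shown else 0.0
    skip_rate = r.skips / shown if shown else 0.0
    return (w_click * click_rate
            - w_skip * skip_rate
            + w_len * length_transform(r.normalized_length))


def rank(results: List[Result]) -> List[Result]:
    """Order results from highest to lowest score."""
    return sorted(results, key=score, reverse=True)


if __name__ == "__main__":
    ranked = rank([
        Result("c", clicks=0, skips=0, normalized_length=1.0),
        Result("b", clicks=8, skips=2, normalized_length=5.0),
        Result("a", clicks=8, skips=2, normalized_length=1.0),
    ])
    print([r.doc_id for r in ranked])
```

Under these assumed weights, two results with identical click history are separated only by length: the one at average length scores above the one five times longer.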
Patents cited by this patent (203)
Simmonds Christopher D.,GBX ; Jack Ian,GBX ; Marincic Dusan,GBX ; Wilkes Anthony M.,GBX, Accessing network resources using network resource replicator and captured login script for use when the computer is di.
Braden-Harder Lisa ; Corston Simon H. ; Dolan William B. ; Vanderwende Lucy H., Apparatus and methods for an information retrieval system that employs natural language processing of search results to.
Peterson, Leonard J.; Freedman, Steven J.; Partovi, Hadi; Endres, Raymond E.; D'Souza, David J.; Ellerman, Erik Castedo; Jiggins, Julian P., Client-side system for scheduling delivery of web content and locally managing the web content.
Eichstaedt Matthias ; Ford Daniel Alexander ; Lehman Tobin Jon ; Lu Qi ; Teng Shang-Hua, Collaborative team crawling:Large scale information gathering over the internet.
Pant Sangam ; Andre David L. ; Watson Gray ; Green Richard M. ; Schiegg Michael J., Computer system with user-controlled relevance ranking of search results.
Leonardo C. Massarani, Content-indexing search system and method providing search results consistent with content filtering and blocking policies implemented in a blocking engine.
Khoyi Dana (Dracut MA) San Soucie Marc (Tyngsboro MA) Surprenant Carolyn E. (Dracut MA) Stern Laura O. (Woburn MA) Pham Ly-Huong T. (Chelmsford MA), Data integration by object management.
San Soucie Marc (Tyngsboro MA) Surprenant Carolyn E. (Dracut MA) Fitzgerald Thomas (Lowell MA) Walker Susan (Arlington MA), Data processor that customizes program behavior by using a resource retrieval capability.
Davis ; III James R. ; Sanders Daniel S. ; Pathakis Scott W. ; Bradshaw W. Brent ; Jensen Brian L. ; Hodgkinson Andrew A., Hybrid query apparatus and method.
Bowman Dwayne ; Ortega Ruben E. ; Linden Greg ; Spiegel Joel R., Identifying the items most relevant to a current query based on items selected in connection with similar queries.
Kyu-Young Whang KR; Byung-Kwon Park KR; Wook-Shin Han KR; Young-Koo Lee KR, Inverted index storage structure using subindexes and large objects for tight coupling of information retrieval with database management systems.
Ram Subbaroyan ; Yongdong Wang ; Paul Andre Gauthier ; Douglas Michael Cook ; Douglass Russell Judd, Method and apparatus for identifying spoof documents.
Birrell Andrew D. ; Wobber Edward P. ; Schroeder Michael, Method and apparatus for organizing and accessing electronic mail messages using labels and full text and label indexing.
Pratt, John P.; Johnson, Russell Clark; Millett, Ronald P.; Tietjen, Bruce R., Method and apparatus for organizing and using indexes utilizing a search decision table.
Douglass R. Judd ; Paul Gauthier ; J. Eric Baldeschwieler, Method and apparatus for retrieving documents based on information other than document content.
Mitchell, Frederick H.; Bainbridge, David K., Method and apparatus providing a graphical user interface for representing and navigating hierarchical networks.
Gilmour David L. ; Wang Hua-Wen, Method and system for constructing a knowledge profile of a user having unrestricted and restricted access portions according to respective levels of confidence of content of the portions.
Kobayashi, Mei; Takeda, Kohichi, Method and system for document collection final search result by arithmetical operations between search results sorted by multiple ranking metrics.
Barney, Jonathan A., Method and system for probabilistically quantifying and visualizing relevance between two or more citationally or contextually related data objects.
Raghavan, Prabhakar; Rajagopalan, Sridhar; Ravikumar, Shanmugasundaram; Tomkins, Andrew S., Method and system for trawling the World-wide Web to identify implicitly-defined communities of web pages.
Lewak Jerzy (Del Mar CA) Grzechnik Slawek (La Mesa CA) Matousek Jon (San Diego CA), Method for accessing computer files and data, using linked categories assigned to each data file record on entry of the.
Schultz John Michael, Method for identifying themes associated with a search query using metadata and for organizing documents responsive to the search query in accordance with the themes.
Day, Don Rutledge; Dutta, Rabindranath; Schell, David Allen, Method, system, and program for gathering indexable metadata on content at a data repository.
Fox, Kevin L.; Frieder, Ophir; Knepper, Margaret M.; Killam, Robert A.; Nemethy, Joseph M.; Cusick, Gregory J.; Snowberg, Eric J., Multiple engine information retrieval and visualization system.
Kirsch Steven T. ; Chang William I., Performing automated document collection and selection by providing a meta-index with meta-index values indentifying co.
Sung Chih-Ta (Princeton CA) Chan Tzoyao (Saratoga CA) Chang Richard (San Jose CA) Rosenau Mark A. (San Jose CA) Ort Jeffrey G. (Bellevue WA) Daum Daniel T. (San Jose CA) Sun Yuanyuan (San Jose CA), Programmable audio-video synchronization method and apparatus for multimedia systems.
Bowman Dwayne E. ; Ortega Ruben E. ; Hamrick Michael L. ; Spiegel Joel R. ; Kohn Timothy R., Refining search queries by the suggestion of correlated terms from prior searches.
Lamping John O. ; Dourish James P. ; Edwards Warren K. ; LaMarca Anthony G. ; Petersen Karin ; Salisbury Michael P. ; Terry Douglas B. ; Thornton James D., Self-contained document management based on document properties.
Belfiore Joseph D. ; Ellison-Taylor Ian M. ; Ramasubramanian Sankaranarayanan ; Chew Chee H. ; Berkun Scott E., Storage of sitemaps at server sites for holding information regarding content.
Candan, Kasim Selcuk; Li, Wen-Syan, System and method employing random walks for mining web page associations and usage to optimize user-oriented web page refresh and pre-fetch scheduling.
Chidlovskii Boris,FRX ; Glance Natalie S.,FRX ; Grasso Antonietta,FRX, System and method for collaborative ranking of search results employing user and group profiles derived from document collection content analysis.
Min, Shermann Loyall; Tanno, Constantin Lorenzo; Mainen, Zachary Frank; Softky, William Russell, System and method for context-based document retrieval.
Kraft, Reiner; Emens, Michael Lawrence; Yim, Peter Chi-Shing, System and method for providing a session query within the context of a dynamic search result set.
Huang, Anita Wai-Ling; Sundaresan, Neelakantan, System and method of ranking and retrieving documents based on authority scores of schemas and documents.
Horvitz, Eric J., System and methods for inferring informational goals and preferred level of detail of results in response to questions posed to an automated information-retrieval or question-answering service.
Monier Louis M., System for adding a new entry to a web page table upon receiving a web page including a link to another web page not having a corresponding entry in the web page table.
Pirolli Peter L. ; Pitkow James E. ; Huberman Bernardo A., System for ranking search results from a collection of documents using spreading activation techniques.
Rose Daniel E. ; Bornstein Jeremy J. ; Tiene Kevin ; Ponceleon Dulce B., System for ranking the relevance of information objects accessed by computer users.
Fagin,Ronald; McCurley,Kevin Snow; Novak,Jasmine; Ravikumar,Shanmugasundram; Sivakumar,Dandapani; Tomlin,John Anthony; Williamson,David Paul, System, method and service for ranking search results using a modular scoring system.
Sisk, Jacob; Bramlet, Heidi Eldenburg; Fain, Daniel C.; Mao, Jianchang; Rieck, Charity A., Term expansion using associative matching of labeled term pairs.
Najork Marc Alexander ; Heydon Clark Allan ; Wiener Janet Lynn, Web crawler system using plurality of parallel priority level queues having distinct associated download priority levels for prioritizing document downloading and maintaining document freshness.