IPC Classification Information
Country / Type | United States (US) Patent, Granted
International Patent Classification (IPC 7th edition) |
Application number | US-0269871 (2011-10-10)
Registration number | US-9183299 (2015-11-10)
Priority information | EP-10101012 (2010-11-19)
Inventors / Address | Friedman, Julian L.; Leigh, Richard T.
Applicant / Address | International Business Machines Corporation
Agent / Address |
Citation information | Times cited: 0; Cited patents: 5
Abstract
A method, apparatus and computer program for a search engine to rank a set of pages returned as search results from a search query. The search query and a first origin page are received. The search query includes one or more keywords and the first origin page is chosen based on information associated with the search query. A set of results is generated based on the one or more keywords. The click distance from the first origin page to each page in the returned set is then calculated. The click distance represents the number of intermediate pages that a user would need to traverse in order to progress from the first origin page to a page in the set of search results. The set of search results is then ranked based on the calculated click distance from the first origin page to each page in the set of results.
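The ranking step described in the abstract can be sketched as a breadth-first search over a link graph. This is only an illustration under stated assumptions, not the patented implementation: the names `click_distances`, `rank_by_click_distance`, and the sample `links` graph are hypothetical, and click distance is measured here as the number of link hops on the shortest path (counting intermediate pages instead would shift every nonzero value by one without changing the order).

```python
from collections import deque


def click_distances(links, origin):
    """Return the shortest number of link hops from origin to every reachable page.

    links maps a page to the pages it links to (a hypothetical in-memory graph).
    """
    dist = {origin: 0}
    queue = deque([origin])
    while queue:
        page = queue.popleft()
        for target in links.get(page, ()):
            if target not in dist:  # first visit in BFS is the shortest path
                dist[target] = dist[page] + 1
                queue.append(target)
    return dist


def rank_by_click_distance(results, links, origin):
    """Order keyword results by their click distance from the origin page."""
    dist = click_distances(links, origin)
    # Assumption: pages that cannot be reached from the origin sort last.
    return sorted(results, key=lambda page: dist.get(page, float("inf")))


# Hypothetical site structure and keyword results.
links = {
    "home": ["docs", "blog"],
    "docs": ["api"],
    "blog": ["post"],
    "api": ["reference"],
}
results = ["reference", "post", "docs", "orphan"]
ranked = rank_by_click_distance(results, links, "home")
print(ranked)  # ['docs', 'post', 'reference', 'orphan']
```

Because `sorted` is stable, results at the same click distance keep the order in which the keyword search returned them.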
Representative Claims
1. A method for a search engine to rank a set of pages returned as search results from a search query, the method comprising: receiving a search query, a first origin page, and a second origin page, the search query including one or more keywords, wherein the first origin page and the second origin page are chosen based on information associated with the search query; generating a set of search results based on the one or more keywords; creating an index database by storing at least information on the location of a plurality of web pages where at least one keyword was found and the number of times the keyword relating to the search query appears in each web page; establishing information on any relationship between the web pages; calculating a first click distance from the first origin page to each page returned in the set, wherein the first click distance represents the number of intermediate pages that a user would need to traverse in order to progress from the first origin page to a page in the set of search results, wherein the first click distances are normalized, wherein normalizing the calculated first click distance is performed by scaling the first click distance by a maximum click distance from the first origin page to a farthest result from the first origin page; calculating a second click distance from the second origin page to each page returned in the set, wherein the second click distance represents the number of intermediate pages that a user would need to traverse in order to progress from the second origin page to a page in the set of search results, wherein the second click distances are normalized, wherein normalizing the calculated second click distance is performed by scaling the second click distance by a maximum click distance from the second origin page to a farthest result from the second origin page; summing the normalized first and second click distance for each page in the set of results; ranking the set of search results based on the summed click distances; determining existence of any additional factors, wherein the additional factors include user preferences or other information that affect speed and accessibility of the web pages for completing the search query; and re-ranking the search results by applying a weighted value when additional factors were determined, wherein the weight of the value is determined by type of the additional factor determined.
2. The method of claim 1, wherein the determining of when to apply a weighting to a click distance is performed based on the distance between the first origin page and a particular page in the set of search results.
3. The method of claim 2, further comprising: receiving a plurality of origin pages with the search query, wherein each origin page is chosen based on information associated with the search query; calculating normalized click distances for each page in the set of search results for each of the plurality of origin pages; and summing the calculated normalized click distances for each page in the set of search results.
4. The method of claim 3, further comprising: calculating the click distance from each origin page in the plurality of origin pages to each page in the set of search results; and setting a threshold that when exceeded the click distance is considered too distant to be calculated in the search.
5. The method of claim 4, wherein normalizing the calculated click distances is performed by $D_i(r) / \max(D_i)$, where $D_i$ is the set of all click distances for the origin page $O_i$ and $D_i(r)$ is the click distance from $O_i$ to the result $r$.
6. The method of claim 4, further comprising: searching the index when receiving a new query to determine if relationships between pages have been established; using the calculated click distances from the index; and ranking the pages in the search results with respect to the plurality of origin pages.
7. The method of claim 4, further comprising: determining when to apply a weighting to a click distance representing the distance between the second origin page and a particular page in the set of search results.
8. The method of claim 1, further comprising: identifying at least one origin page; composing the search query including the one or more keywords specified by the user; and sending the search query and the at least one identified origin page to the search engine.
9. The method of claim 8, wherein the step of identifying the first origin page comprises one of: receiving the first origin page from the user; identifying the first origin page based on the page from which the search query is submitted; identifying the first origin page based on the geographic location of the user; and identifying the first origin page based on information stored about the user.
10. An apparatus to rank a set of pages returned as search results from a search query, the apparatus comprising: a computer comprising a non-transitory computer readable medium, the medium storing a search engine configured to receive a search query, a first origin page, and a second origin page, the search query including one or more keywords, wherein the first origin page and the second origin page are chosen based on information associated with the search query; the search engine comprising instructions executing a method, the method comprising: generating a set of search results based on the one or more keywords; calculating a click distance from the first origin page to each page returned in the set, wherein the click distance represents the number of intermediate pages that a user would need to traverse in order to progress from the first origin page to a page in the set of search results; and calculating the set of search results, the calculating comprising: calculating a first click distance from the first origin page to each page returned in the set, wherein the first click distance represents the number of intermediate pages that a user would need to traverse in order to progress from the first origin page to a page in the set of search results, wherein the first click distances are normalized, wherein normalizing the calculated first click distance is performed by scaling the first click distance by a maximum click distance from the first origin page to a farthest result from the first origin page; calculating a second click distance from the second origin page to each page returned in the set, wherein the second click distance represents the number of intermediate pages that a user would need to traverse in order to progress from the second origin page to a page in the set of search results, wherein the second click distances are normalized, wherein normalizing the calculated second click distance is performed by scaling the second click distance by a maximum click distance from the second origin page to a farthest result from the second origin page; summing the normalized first and second click distance for each page in the set of results; ranking the set of search results based on the summed click distances; determining existence of any additional factors, wherein the additional factors include user preferences or other information that affect speed and accessibility of the web pages for completing the search query; and re-ranking the search results by applying a weighted value when additional factors were determined, wherein the weight of the value is determined by type of the additional factor determined.
11. The apparatus of claim 10, the executed method further comprising: determining when to apply the weighting to a click distance by analyzing the distance between the first origin page and a particular page in the set of search results.
12. The apparatus of claim 10, the executed method further comprising: receiving a plurality of origin pages with the search query, wherein each origin page is chosen based on information associated with the search query; calculating normalized click distances for each page in the set of search results for each of the plurality of origin pages; and summing the calculated normalized click distances for each page in the set of search results.
13. The apparatus of claim 12, the executed method further comprising: calculating the click distance from each origin page in the plurality of origin pages to each page in the set of search results.
14. The apparatus of claim 13, the executed method further comprising: setting a threshold that when exceeded the click distance is considered too distant to be calculated in the search.
15. The apparatus of claim 13, the executed method further comprising: using the calculated click distances to rank the pages in the search results with respect to the plurality of origin pages.
16. The apparatus of claim 13, the executed method further comprising: determining when to apply a weighting to a click distance representing the distance between a second origin page and a particular page in the set of search results.
17. The apparatus of claim 10, the executed method further comprising: identifying at least one origin page; composing the search query including the one or more keywords specified by the user; and sending the search query and the at least one identified origin page to the search engine.
18. The apparatus of claim 17, wherein identifying the first origin page comprises one of: receiving the first origin page from the user; identifying the first origin page based on the page from which the search query is submitted; identifying the first origin page based on the geographic location of the user; and identifying the first origin page based on information stored about the user.
19. A computer program product for ranking a set of pages returned as search results from a search query, the computer program product comprising: a non-transitory medium readable by a processing circuit and storing instructions for execution by the processing circuit for performing a method comprising: receiving a search query, a first origin page, and a second origin page, the search query including one or more keywords, wherein the first origin page and the second origin page are chosen based on information associated with the search query; the computer search engine generating a set of search results based on the one or more keywords; calculating a first click distance from the first origin page to each page returned in the set, wherein the first click distance represents the number of intermediate pages that a user would need to traverse in order to progress from the first origin page to a page in the set of search results, wherein the first click distances are normalized, wherein normalizing the calculated first click distance is performed by scaling the first click distance by a maximum click distance from the first origin page to a farthest result from the first origin page; calculating a second click distance from the second origin page to each page returned in the set, wherein the second click distance represents the number of intermediate pages that a user would need to traverse in order to progress from the second origin page to a page in the set of search results, wherein the second click distances are normalized, wherein normalizing the calculated second click distance is performed by scaling the second click distance by a maximum click distance from the second origin page to a farthest result from the second origin page; summing the normalized first and second click distance for each page in the set of results; ranking the set of search results based on the summed click distances; determining existence of any additional factors, wherein the additional factors include user preferences or other information that affect speed and accessibility of the web pages for completing the search query; and re-ranking the search results by applying a weighted value when additional factors were determined, wherein the weight of the value is determined by type of the additional factor determined.
20. The computer program product of claim 19, wherein the method further comprises: determining when to apply a weighting to a click distance representing the distance between the first origin page and a particular page in the set of search results.
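The normalization, summation, and threshold steps recited in the claims can be illustrated with a short sketch. It is a hedged example, not the patented implementation: all function names, the `max_depth` parameter, the `too_far_score` parameter, and the sample graph are hypothetical. Two assumptions are made that the claims do not specify: click distance is counted in link hops, and a result that is unreachable or beyond the threshold receives the maximum normalized value of 1.0 for that origin. The weighted re-ranking step is not covered.

```python
from collections import deque


def click_distances(links, origin, max_depth=None):
    """Breadth-first search from origin; pages beyond max_depth hops are not explored."""
    dist = {origin: 0}
    queue = deque([origin])
    while queue:
        page = queue.popleft()
        if max_depth is not None and dist[page] >= max_depth:
            continue  # threshold reached: do not follow links any further
        for target in links.get(page, ()):
            if target not in dist:
                dist[target] = dist[page] + 1
                queue.append(target)
    return dist


def summed_normalized_distances(results, links, origins, max_depth=None,
                                too_far_score=1.0):
    """Sum, over all origins, each result's distance scaled by the farthest result."""
    totals = {page: 0.0 for page in results}
    for origin in origins:
        dist = click_distances(links, origin, max_depth)
        # Maximum click distance from this origin to any reachable result.
        farthest = max((dist[p] for p in results if p in dist), default=0)
        for page in results:
            if page not in dist:
                totals[page] += too_far_score  # assumption, see lead-in
            elif farthest > 0:
                totals[page] += dist[page] / farthest
    return totals


# Hypothetical link graph with two origin pages.
links = {
    "home": ["docs", "blog"],
    "docs": ["api"],
    "blog": ["post"],
    "api": ["reference"],
    "team": ["blog", "api"],
}
results = ["docs", "post", "reference"]
totals = summed_normalized_distances(results, links, ["home", "team"])
ranked = sorted(results, key=totals.get)  # smaller sum means closer overall
print(ranked)  # ['docs', 'post', 'reference']
```

Scaling by the farthest result keeps each origin's contribution in the range 0 to 1, so an origin with a deep link structure does not dominate the sum. Passing `max_depth=2` in the example stops the search two hops from each origin, which leaves "reference" unreachable from "home" and assigns it `too_far_score` for that origin.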