System and method for associating an extensible set of data with documents downloaded by a web crawler

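IPC classification information
Country / Type: United States (US) Patent, granted
International Patent Classification (IPC 7th edition)
  • G06F-017/21
Application number: US-0433006 (1999-11-02)
Inventor / Address
  • Marc Alexander Najork
  • Clark Allan Heydon
Applicant / Address
  • Alta Vista Company
Agent / Address
    Pennie & Edmonds LLP
Citation information: Cited by: 125; Patents cited: 8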

Abstract

A web crawler downloads documents from among a plurality of host computers. The web crawler enqueues document addresses in a data structure called the Frontier. The Frontier generally includes a set of queues, with all document addresses sharing a respective common host component being stored in a r
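
The per-host queue layout described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the class name `Frontier`, the methods `enqueue` and `dequeue`, and the host rotation policy are hypothetical and introduced here only for illustration. The abstract itself says only that document addresses sharing a common host component are stored together in a respective queue.

```python
from collections import OrderedDict, deque
from urllib.parse import urlsplit


class Frontier:
    """Hypothetical frontier: one FIFO queue per host component."""

    def __init__(self):
        # Maps host -> queue of pending addresses.
        # Insertion order doubles as the host rotation order (an assumption).
        self._queues = OrderedDict()

    def enqueue(self, url):
        # All addresses with the same host component share one queue.
        host = urlsplit(url).hostname or ""
        self._queues.setdefault(host, deque()).append(url)

    def dequeue(self):
        """Return the next address, or None if the frontier is empty."""
        if not self._queues:
            return None
        # Take the host at the front of the rotation.
        host, queue = self._queues.popitem(last=False)
        url = queue.popleft()
        if queue:
            # Host still has pending addresses: move it to the back.
            self._queues[host] = queue
        return url

    def __len__(self):
        return sum(len(q) for q in self._queues.values())
```

Grouping addresses by host keeps every pending request for one server in a single place, which is what makes it possible to control how often that server is contacted. The rotation shown here is only one possible scheduling choice.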

Representative claim

1. A method of performing a continuous crawl for locating and downloading documents from among a plurality of host computers, comprising: (a) obtaining at least one referring document set that includes addresses of one or more referred documents; each referred document address including a host compon

Patents cited by this patent (8)

  1. Nieten Joseph Lee, Apparatus and method for data transfers through software agents using client-to-server and peer-to-peer transfers.
  2. Bowen Stephen J ; Brown Don R, Keyword searches of structured databases.
  3. Egger Daniel ; Cannon Shawn ; Sauers Ronald D., Method and apparatus for indexing, searching and displaying data.
  4. Mauldin Michael L., Method for searching a queued and ranked constructed catalog of files stored on a network.
  5. Sanu Sankrant ; Meyerzon Dmitriy, Method of web crawling utilizing address mapping.
  6. Belfiore Joseph D. ; Ellison-Taylor Ian M. ; Ramasubramanian Sankaranarayanan ; Chew Chee H. ; Berkun Scott E., Storage of sitemaps at server sites for holding information regarding content.
  7. Brown Eric William ; Chang Rong Nickle ; Ellozy Hamed Abdelfattah ; Prager John Martin ; So Edward Cholchin, System and method for hierarchically grouping and ranking a set of objects in a query context based on one or more rela.
  8. Lumsden Mark W., Technique for providing enhanced relevance information for documents retrieved in a multi database search.

Patents citing this patent (125)

  1. Obata, Kenji C; Meyerzon, Dmitriy, Adaptive web crawling using a statistical model.
  2. Masson, James Squires; Desai, Shikha Devesh; Estrada, Theresa Ann; Keslin, Michelle Elena; Lee, Yu Been; Whilden, Allison Anne; Dominguez, Enrique J., Automatic template generation based on previous documents.
  3. Davis, Russell T., Chart view for reusable data markup language.
  4. Petriuc, Mihai, Click distance determination.
  5. Davis, Russell T., Combining reusable data markup language documents.
  6. Dingsor, Andrew D.; Lanzen, Craig A.; Stenzel, Harley A., Congestion avoidance for threads in servers.
  7. Foulger, Michael G.; Gaul, Matthew J., Database interface and database analysis system.
  8. Tankovich, Vladimir; Meyerzon, Dmitriy; Poznanski, Victor, Detection of junk in search result ranking.
  9. Dean, Jeffrey A.; Silverstein, Craig; Gomes, Benedict; Ghemawat, Sanjay, Distributed crawling of hyperlinked documents.
  10. Dean, Jeffrey A.; Silverstein, Craig; Gomes, Benedict; Ghemawat, Sanjay, Distributed crawling of hyperlinked documents.
  11. Dean, Jeffrey A.; Silverstein, Craig; Gomes, Benedict; Ghemawat, Sanjay, Distributed crawling of hyperlinked documents.
  12. Wagers, Doug R., Document crawling systems and methods.
  13. Vaitheeswaran, Ganesh; Bhattacharjee, Arindam; Mahadevan, Ravichandran; Pasumarthi, Suresh, Document indexing based on categorization and prioritization.
  14. Tankovich, Vladimir; Meyerzon, Dmitriy; Taylor, Michael James, Document length as a static relevance feature for ranking search results.
  15. Shen, Shioupyn, Document near-duplicate detection.
  16. Shen, Shioupyn, Document near-duplicate detection.
  17. Zhu, Huican; Acharya, Anurag; Ibel, Max; Gobioff, Howard B., Document reuse in a search engine crawler.
  18. Zhu, Huican; Acharya, Anurag; Ibel, Max; Gobioff, Howard Bradley, Document reuse in a search engine crawler.
  19. Zhu, Huican; Ibel, Maximilian; Acharya, Anurag; Gobioff, Howard Bradley, Document reuse in a search engine crawler.
  20. Meyerzon, Dmitriy; Shnitko, Yauhen; Burges, Chris J. C.; Taylor, Michael James, Enterprise relevancy ranking using a neural network.
  21. Ellard, Daniel J., Extensible fingerprinting functions and content addressed storage system using the same.
  22. Robertson, Stephen; Zaragoza, Hugo; Taylor, Michael; Larimore, Stefan Isbein; Petriuc, Mihai, Field weighting in text searching.
  23. Diamond, Theodore George; Hendrick, Daniel Allen; Rehm, Eric Carl; Riesland, Melissa Anne, Full-text relevancy ranking.
  24. Prince, John, Fuzzy database retrieval.
  25. Abajian, Aram Christian, Grouping multimedia and streaming media search results.
  26. Abajian, Aram Christian, Grouping multimedia and streaming media search results.
  27. Waters, Christopher; de Haaff, Brian, Hosted searching of private local area network information.
  28. Waters, Christopher; de Haaff, Brian, Hosted searching of private local area network information.
  29. Waters, Christopher; de Haaff, Brian; Lockhart, Andrew, Hosted searching of private local area network information with support for add-on application.
  30. Waters, Christopher; de Haaff, Brian; Lockhart, Andrew, Hosted searching of private local area network information with support for add-on applications.
  31. Close, Tyler; Recker, John; Sayers, Craig; Robinson, Ian N, Identifying and displaying messages containing an identifier.
  32. Heydon, Clark Allan; Branson, Kenneth William, Insurance policy revisioning method and apparatus.
  33. Foulger, Michael G.; Gaul, Matthew J., Interactive intelligent searching with executable suggestions.
  34. Foulger, Michael G.; Gaul, Matthew J., Interactive intelligent searching with executable suggestions.
  35. Abajian, Aram Christian; Alexander, Robin Andrew; Lee, Scott Chao-Chueh; Dahl, Austin David; Derosa, John Anthony; Porter, Charles A.; Rehm, Eric Carl; Kolar, Jennifer Lynn; Sudanagunta, Srinivasan, Internet streaming media workflow architecture.
  36. Alpert, Jesse L., Managing items in crawl schedule.
  37. Davis, Russell T, Markup language system, method, and computer program product.
  38. Dengler, Patrick M.; Krishnan, Arvind K.; Singh, Jagdish; Sanchez, Lawrence M.; Shankar, Sai; Chittamuru, Satish Kumar; Pekic, Zoltan; Mondal, Nabarun; Kumar, Namendra; i Dalfó, Ricard Roma, Metadata driven user interface.
  39. Villadsen, Peter; Chen, Zhaoqi; Gottumukkala, Ramakanthachary S.; Calderon, Marcos, Metadata-based eventing supporting operations on data.
  40. Fedorynski, Pawel Aleksander; Samaddar, Sumitro, Method and apparatus for managing a backlog of pending URL crawls.
  41. Keefer, Alan Harrison, Method and apparatus for pricing insurance policies.
  42. Maykov, Alexey; Hurst, Matthew F., Method and apparatus for web crawling.
  43. Zhang, Ling; Yu, Shen, Method and device for indexing resource content in computer networks.
  44. Meyerzon, Dmitriy; Shoroff, Srikanth; Terek, F. Soner; Norin, Scott, Method and system for detecting duplicate documents in web crawls.
  45. Von Weihe, Daniel, Method and system for document retrieval with selective document comparison.
  46. Cooper, Jeremy S; Foulger, Michael G, Method and system for downloading network data at a controlled data transfer rate.
  47. Ren, Wenge, Method and system for implementing OSPF redundancy.
  48. Safa, John, Method and system for shared document approval.
  49. Zervas, Konstantin; Aronsson, Tomas, Method for dynamic caching.
  50. Zervas, Konstantin; Aronsson, Tomas, Method for optimizing utilization of client capacity.
  51. Hayward, Monte Duane, Method of disseminating advertisements using an embedded media player page.
  52. Hayward, Monte Duane, Method of disseminating advertisements using an embedded media player page.
  53. Hayward, Monte Duane, Method of disseminating advertisements using an embedded media player page.
  54. Hayward, Monte Duane, Method of sizing an embedded media player page.
  55. Foulger, Michael G.; Cooper, Jeremy S.; Luu, Michael Sea; van Gorder, Peter B., Method, system, and computer program product for employment market statistics generation and analysis.
  56. Foulger, Michael G.; van Gorder, Peter B., Method, system, and computer program product for propagating remotely configurable posters of host site content.
  57. Glover, Robin Wallace, Methods and systems for comparing presentation slide decks.
  58. Abajian, Aram Christian; Alexander, Robin Andrew; Lee, Scott Chao-Chueh; Dahl, Austin David; Derosa, John Anthony; Porter, Charles A.; Rehm, Eric Carl; Kolar, Jennifer Lynn; Sudanagunta, Srinivasan, Methods and systems for enhancing metadata.
  59. Abajian, Aram Christian; Alexander, Robin Andrew; Lee, Scott Chao-Chueh; Dahl, Austin David; Derosa, John Anthony; Porter, Charles A.; Rehm, Eric Carl; Kolar, Jennifer Lynn; Sudanagunta, Srinivasan, Methods and systems for enhancing metadata.
  60. More, Scott; Beyer, Ilya, Methods and systems for exact data match filtering.
  61. Abajian, Aram Christian, Methods and systems for grouping uniform resource locators based on masks.
  62. More, Scott, Methods and systems for image fingerprinting.
  63. Mulder, Samuel Peter Matthew, Methods and systems for monitoring documents exchanged over email applications.
  64. More, Scott, Methods and systems for preventing unauthorized disclosure of secure information using image fingerprinting.
  65. More, Scott; Beyer, Ilya; Sweeting, Daniel Christopher John, Methods and systems for protect agents using distributed lightweight fingerprints.
  66. More, Scott; Beyer, Ilya, Methods and systems to fingerprint textual information using word runs.
  67. More, Scott; Beyer, Ilya; Sweeting, Daniel Christopher John, Methods and systems to implement fingerprint lookups across remote agents.
  68. More, Scott; Beyer, Ilya; Sweeting, Daniel Christopher John, Methods and systems to implement fingerprint lookups across remote agents.
  69. Foulger, Michael G.; Chipperfield, Thomas R.; Cooper, Jeremy S., Methods, systems and articles of manufacture for scheduling execution of programs on computers having different operating systems.
  70. Carver, Anton P. T., Minimizing visibility of stale content in web searching including revising web crawl intervals of documents.
  71. Carver, Anton P. T., Minimizing visibility of stale content in web searching including revising web crawl intervals of documents.
  72. Carver, Anton P. T., Minimizing visibility of stale content in web searching including revising web crawl intervals of documents.
  73. Hendriks, Erik; Guajardo-Cespedes, Mario; Knych, Thomas William; Wang, Chen, Monitoring application loading.
  74. Shao, Weili; Zhou, Zehua; Husain, Aliasgar Mumtaz, Navigation system with point of interest harvesting mechanism and method of operation thereof.
  75. Jain, Arvind; Manku, Gurmeet Singh, Near-duplicate document detection for web crawling.
  76. Jain, Arvind; Manku, Gurmeet Singh, Near-duplicate document detection for web crawling.
  77. Best, Steven Francis; Brown, Michael Wayne; Cooper, Michael Richard, Personalized indexing and searching for information in a distributed data processing system.
  78. Best, Steven Francis; Brown, Michael Wayne; Cooper, Michael Richard, Personalized indexing and searching for information in a distributed data processing system.
  79. Redpath, Richard J., Plug-in parsers for configuring search engine crawler.
  80. Obata, Kenji; Meyerzon, Dmitriy, Proxy server using a statistical model.
  81. Obata, Kenji; Meyerzon, Dmitriy, Proxy server using a statistical model.
  82. Davis, Russell T., RDL search engine.
  83. Davis, Russell T.; Hampton, III, Luther Pearson, RDX enhancement of system and method for implementing reusable data markup language (RDL).
  84. Meyerzon, Dmitriy; Zaragoza, Hugo, Ranking search results using biased click distance.
  85. Meyerzon, Dmitriy; Li, Hang, Ranking search results using feature extraction.
  86. Meyerzon, Dmitriy; Zaragoza, Hugo, Ranking search results using language types.
  87. Poznanski, Victor; Wang, Oivind; Holm, Fredrik; Bodd, Nicolai; Tankovich, Vladimir; Meyerzon, Dmitriy, Re-ranking search results.
  88. Cooper, Jeremy S.; Foulger, Michael G., Regulating rates of requests by a spider engine to web sites by creating instances of a timing module.
  89. Sweet, Richard Eric; Rowe, Edward Royce Warren, Retrieving documents transitively linked to an initial document.
  90. Sweet, Richard Eric; Rowe, Edward Royce Warren, Retrieving documents transitively linked to an initial document.
  91. Davis, Russell T., Reusable data markup language.
  92. Davis, Russell T., Reusable data markup language.
  93. Davis, Russell T., Reusable macro markup language.
  94. Randall, Keith H., Scheduler for search engine crawler.
  95. Randall, Keith H., Scheduler for search engine crawler.
  96. Randall, Keith H., Scheduler for search engine crawler.
  97. Zhu, Huican; Ibel, Maximilian; Acharya, Anurag; Gobioff, Howard Bradley, Scheduler for search engine crawler.
  98. Zhu, Huican; Ibel, Maximilian; Acharya, Anurag; Gobioff, Howard Bradley, Scheduler for search engine crawler.
  99. Lin, Zhen; Stevens, Keith, Scheduling resource crawls.
  100. Tankovich, Vladimir; Li, Hang; Meyerzon, Dmitriy; Xu, Jun, Search results ranking using editing distance and document information.
  101. Dmitriy Meyerzon; Sankrant Sanu, Synchronizing crawler with notification source.
  102. Pasumarthi, Suresh; Bhattacharjee, Arindam; Nayak, Shiva Prasad; Vaitheeswaran, Ganesh, Synchronizing primary and secondary repositories.
  103. Sundaresan, Neelakantan, System and method for automatic generation of dynamic search abstracts contain metadata by crawler.
  104. Glover, Robin, System and method for determining document version geneology.
  105. Najork, Marc Alexander, System and method for distributed web crawling.
  106. Pate, Kenneth Allen; Chatwani, Robert; Dickenson, Nancy, System and method for managing shared collections.
  107. Pate, Kenneth Allen; Chatwani, Robert; Dickenson, Nancy, System and method for managing shared collections.
  108. Pate, Kenneth Allen; Chatwani, Robert; Dickenson, Nancy, System and method for managing shared collections.
  109. Blackman, David L.; Ching, Michael; Dill, Stephen; Gonzalez, Ivan Eduardo; Marcus, Adam; Meredith, Daniel Norin; Nguyen, Linda Anh Linh, System and method for prioritizing websites during a webcrawling process.
  110. Cooper, Jeremy S., System and method for proximity searching position information using a proximity parameter.
  111. Meyerzon, Dmitriy; Zaragoza, Hugo, System and method for ranking search results using click distance.
  112. Foulger, Michael G.; Chipperfield, Thomas R.; Cooper, Jeremy S., System and method for scheduling execution of cross-platform computer processes.
  113. Merrigan, Chadd Creighton; Peltonen, Kyle G.; Meyerzon, Dmitriy; Lee, David J., System and method for scoping searches using index keys.
  114. Mulder, Matthew, System and method for securing documents prior to transmission.
  115. Foulger, Michael G.; Chipperfield, Thomas R.; Cooper, Jeremy S.; Storms, Andrew C., System and method related to generating an email campaign.
  116. Foulger, Michael G.; Chipperfield, Thomas R.; Cooper, Jeremy S.; Storms, Andrew C., System and method related to generating and tracking an email campaign.
  117. Sundaresan, Neelakantan, System for weighted indexing of hierarchical documents.
  118. Hughes, Lucian P., System, method and article of manufacture for a user programmable diary interface link.
  119. Davis, Russell T, System, method, and computer program product for outputting markup language documents.
  120. Davis, Russell T, System, method, and computer program product for processing a markup document.
  121. Hayward, Monte Duane, Systems and methods for rendering content.
  122. Howe, Karen N.; Kolar, Jennifer L.; Sudanagunta, Srinivasan, Targeted advertising for playlists based upon search queries.
  123. Levy, Philip; Hada, Naoki, Using document templates to assemble a collection of documents.
  124. Levy, Philip; Hada, Naoki, Using document templates to assemble a collection of documents.
  125. Levy, Philip; Hada, Naoki, Using document templates to assemble a collection of documents.