$\require{mediawiki-texvc}$

연합인증

연합인증 가입 기관의 연구자들은 소속기관의 인증정보(ID와 암호)를 이용해 다른 대학, 연구기관, 서비스 공급자의 다양한 온라인 자원과 연구 데이터를 이용할 수 있습니다.

이는 여행자가 자국에서 발행 받은 여권으로 세계 각국을 자유롭게 여행할 수 있는 것과 같습니다.

연합인증으로 이용이 가능한 서비스는 NTIS, DataON, Edison, Kafe, Webinar 등이 있습니다.

한번의 인증절차만으로 연합인증 가입 서비스에 추가 로그인 없이 이용이 가능합니다.

다만, 연합인증을 위해서는 최초 1회만 인증 절차가 필요합니다. (회원이 아닐 경우 회원 가입이 필요합니다.)

연합인증 절차는 다음과 같습니다.

최초이용시에는
ScienceON에 로그인 → 연합인증 서비스 접속 → 로그인 (본인 확인 또는 회원가입) → 서비스 이용

그 이후에는
ScienceON 로그인 → 연합인증 서비스 접속 → 서비스 이용

연합인증을 활용하시면 KISTI가 제공하는 다양한 서비스를 편리하게 이용하실 수 있습니다.

Web crawler system using plurality of parallel priority level queues having distinct associated download priority levels for prioritizing document downloading and maintaining document freshness 원문보기

IPC분류정보
국가/구분 United States(US) Patent 등록
국제특허분류(IPC7판)
  • G06F-015/16
  • G06F-015/173
출원번호 US-0433007 (1999-11-02)
발명자 / 주소
  • Najork Marc Alexander
  • Heydon Clark Allan
  • Wiener Janet Lynn
출원인 / 주소
  • Alta Vista Company
대리인 / 주소
    Pennie & Edmonds LLP
인용정보 피인용 횟수 : 121  인용 특허 : 6

초록

A web crawler downloads documents from among a plurality of host computers. The web crawler enqueues document addresses in a data structure called the Frontier. The Frontier generally includes a set of queues, with all document addresses sharing a respective common host component being stored in a r

대표청구항

[ What is claimed is:] [1.]1. A method of performing a continuous crawl for locating and downloading documents from among a plurality of host computers, comprising:(a) obtaining at least one referring document set that includes addresses of one or more referred documents; each referred document addr

이 특허에 인용된 특허 (6)

  1. Eichstaedt Matthias ; Ford Daniel Alexander ; Lehman Tobin Jon ; Lu Qi ; Teng Shang-Hua, Collaborative team crawling:Large scale information gathering over the internet.
  2. Baclawski Kenneth P., Distributed computer database system and method employing intelligent agents.
  3. Ferrel Patrick J. ; Kerr Randy ; Nareddy Krishna ; Uppala Krishna, Information retrieval system in an on-line network including separate content and layout of published titles.
  4. Sanu Sankrant ; Meyerzon Dmitriy, Method of web crawling utilizing address mapping.
  5. Courtright ; II William V. ; Delaney William P. ; Fredin Gerald J., System controller with plurality of memory queues for prioritized scheduling of I/O requests from priority assigned clients.
  6. Monier Louis M., System for adding new entry to web page table upon receiving web page including link to another web page not having cor.

이 특허를 인용한 특허 (121)

  1. Obata,Kenji C; Meyerzon,Dmitriy, Adaptive web crawling using a statistical model.
  2. Sobel, William E.; Szor, Peter; McCorkendale, Bruce, Alternated update system and method.
  3. Takahashi, Kinya, Apparatus, method, program, and information processing system for prioritized data transfer to a network terminal.
  4. Prasad, Mukul Ranjan, Architecture for distributed, parallel crawling of interactive client-server applications.
  5. Zhu, Huican; Acharya, Anurag, Assigning document identification tags.
  6. Zhu, Huican; Acharya, Anurag, Assigning document identification tags.
  7. Petriuc, Mihai, Click distance determination.
  8. Suggs, Darrell; Scott, John; Fair, Robert L., Control of service workload management.
  9. Bright, Walter G., Controlling the order in which content is displayed in a browser.
  10. Okamura, Hideaki, Data processing system, data processing method, and program-providing medium therewith.
  11. Prasad, Mukul R., Detection of dead widgets in software applications.
  12. Tankovich, Vladimir; Meyerzon, Dmitriy; Poznanski, Victor, Detection of junk in search result ranking.
  13. Prasad, Mukul R.; Mesbah, Ali, Determining differences in an event-driven application accessed in different client-tier environments.
  14. Dean, Jeffrey A.; Silverstein, Craig; Gomes, Benedict; Ghemawat, Sanjay, Distributed crawling of hyperlinked documents.
  15. Dean, Jeffrey A.; Silverstein, Craig; Gomes, Benedict; Ghemawat, Sanjay, Distributed crawling of hyperlinked documents.
  16. Dean,Jeffrey A.; Silverstein,Craig; Gomes,Benedict; Ghemawat,Sanjay, Distributed crawling of hyperlinked documents.
  17. Wagers, Doug R., Document crawling systems and methods.
  18. Tankovich, Vladimir; Meyerzon, Dmitriy; Taylor, Michael James, Document length as a static relevance feature for ranking search results.
  19. Zhu, Huican; Acharya, Anurag; Ibel, Max; Gobioff, Howard B., Document reuse in a search engine crawler.
  20. Zhu, Huican; Acharya, Anurag; Ibel, Max; Gobioff, Howard Bradley, Document reuse in a search engine crawler.
  21. Zhu, Huican; Ibel, Maximilian; Acharya, Anurag; Gobioff, Howard Bradley, Document reuse in a search engine crawler.
  22. Meyerzon, Dmitriy; Shnitko, Yauhen; Burges, Chris J. C.; Taylor, Michael James, Enterprise relevancy ranking using a neural network.
  23. Robertson, Stephen; Zaragoza, Hugo; Taylor, Michael; Larimore, Stefan Isbein; Petriuc, Mihai, Field weighting in text searching.
  24. Waters, Christopher; de Haaff, Brian, Hosted searching of private local area network information.
  25. Waters, Christopher; de Haaff, Brian, Hosted searching of private local area network information.
  26. Waters, Christopher; de Haaff, Brian; Lockhart, Andrew, Hosted searching of private local area network information with support for add-on application.
  27. Waters, Christopher; de Haaff, Brian; Lockhart, Andrew, Hosted searching of private local area network information with support for add-on applications.
  28. Laucius, Andrew S.; Shakib, Darren A.; Seidman, Eytan D.; Forbes, Jonathan; Birney, Keith A., Incremental web crawler using chunks.
  29. Sato, Takao, Information managing device, information managing method, and non-transitory recording medium.
  30. Heydon, Clark Allan; Branson, Kenneth William, Insurance policy revisioning method and apparatus.
  31. Suggs, Darrell G.; Fair, Robert L.; Kimmel, Jeffrey S.; Rowe, Alan L.; Sarma, Joydeep Sen, Integrating control of service during cluster failover.
  32. Gao, Changju, Intelligent replication method.
  33. Zunger, Yonatan; Drobychev, Alexandre; Kesselman, Alexander; Vickrey, Rebekah C.; Dachille, Frank C.; Datuashvili, George, Location assignment daemon (LAD) for a distributed storage system.
  34. Zunger, Yonatan; Drobychev, Alexandre; Kesselman, Alexander; Vickrey, Rebekah C.; Dachille, Frank C.; Datuashvili, George, Location assignment daemon (LAD) for a distributed storage system.
  35. Kumar, Arvind, Management of links to data embedded in blocks of data.
  36. Rabbers,David L.; Chung,Pi Yu; Susser,Martin; Hansen,Aaron; Scott,Brian, Method and apparatus for detecting insufficient memory for data extraction processes.
  37. Briscoe,Paul Roger; Hammer,Stephen Carl, Method and apparatus for enabling an internet web server to keep an accurate count of page hits.
  38. D'Urso, Mark S., Method and apparatus for intranet searching.
  39. D'Urso, Mark S., Method and apparatus for intranet searching.
  40. Keefer, Alan Harrison, Method and apparatus for pricing insurance policies.
  41. Coomer, Graham; Johnston, Nicholas, Method and system for asynchronous analysis of URLs in messages in a live message processing environment.
  42. Von Weihe, Daniel, Method and system for document retrieval with selective document comparison.
  43. Roegner, Michael W., Method and system for dynamically implementing an enterprise resource policy.
  44. Roegner, Michael W., Method and system for dynamically implementing an enterprise resource policy.
  45. Zunger, Yonatan, Method and system for efficiently replicating data in non-relational databases.
  46. Vickrey, Rebekah C.; Dachille, Frank C.; Gheorghita, Stefan V.; Zunger, Yonatan, Method and system for providing efficient access to a tape storage system.
  47. Roegner, Michael W., Method and system for selecting advertisements to be presented to a viewer.
  48. Roegner, Michael W., Method and system for selecting content items to be presented to a viewer.
  49. Roegner, Michael W., Method and system for selecting content items to be presented to a viewer.
  50. Roegner, Michael W., Method and system for selecting content items to be presented to a viewer.
  51. Elkan,Charles, Method and system for selecting documents by measuring document quality.
  52. Safa, John, Method and system for shared document approval.
  53. McKeeth, Jim, Method and system for updating a search engine.
  54. McKeeth, Jim, Method and system for updating a search engine.
  55. McKeeth, Jim, Method and system for updating a search engine database based on popularity of links.
  56. Zunger, Yonatan; Kesselman, Alexander; Drobychev, Alexandre, Method and system for uploading data into a distributed storage system.
  57. Meyerzon, Dmitriy; Sanu, Sankrant, Method of web crawling utilizing crawl numbers.
  58. Brodsky, Elizabeth Adleberg; Elnozahy, Elmootazbellah Nabil; Rajamony, Ramakrishnan, Method, apparatus and computer program product to crawl a web site.
  59. Hughes,Jeremy P. J.; Tate,Richard P., Method, system and computer program for controlling access in a distributed data processing system.
  60. Day, Don Rutledge; Dutta, Rabindranath; Schell, David Allen, Method, system, and program for gathering indexable metadata on content at a data repository.
  61. Glover, Robin Wallace, Methods and systems for comparing presentation slide decks.
  62. More, Scott; Beyer, Ilya, Methods and systems for exact data match filtering.
  63. More, Scott, Methods and systems for image fingerprinting.
  64. Mulder, Samuel Peter Matthew, Methods and systems for monitoring documents exchanged over email applications.
  65. More, Scott, Methods and systems for preventing unauthorized disclosure of secure information using image fingerprinting.
  66. More, Scott; Beyer, Ilya; Sweeting, Daniel Christopher John, Methods and systems for protect agents using distributed lightweight fingerprints.
  67. More, Scott; Beyer, Ilya, Methods and systems to fingerprint textual information using word runs.
  68. More, Scott; Beyer, Ilya; Sweeting, Daniel Christopher John, Methods and systems to implement fingerprint lookups across remote agents.
  69. More, Scott; Beyer, Ilya; Sweeting, Daniel Christopher John, Methods and systems to implement fingerprint lookups across remote agents.
  70. Carver, Anton P. T., Minimizing visibility of stale content in web searching including revising web crawl intervals of documents.
  71. Carver, Anton P. T., Minimizing visibility of stale content in web searching including revising web crawl intervals of documents.
  72. Carver, Anton P. T., Minimizing visibility of stale content in web searching including revising web crawl intervals of documents.
  73. Rosenberg, Naor; Zilberstein, Benny; Cohen, Eli, Network crawling prioritization.
  74. Reiner Kraft ; Michael Lawrence Emens, Network repository service directory for efficient web crawling.
  75. Reiner Kraft ; Michael Lawrence Emens, Network repository service for efficient web crawling.
  76. Kesselman, Alexander, Operating on objects stored in a distributed database.
  77. Dar, Affan Arshad; Saha, Sanjib, Peek and lock using queue partitioning.
  78. Kirshenbaum, Evan R.; Suermondt, Henri J.; Lillibridge, Mark David; Yuasa, Kei; Eshghi, Kave; Forman, George, Policy applicability determination.
  79. Osias, Michael J., Providing a status of a transaction with an application on a server.
  80. Obata, Kenji; Meyerzon, Dmitriy, Proxy server using a statistical model.
  81. Obata, Kenji; Meyerzon, Dmitriy, Proxy server using a statistical model.
  82. Zunger, Yonatan; Drobychev, Alexandre; Kesselman, Alexander; Vickrey, Rebekah C.; Dachille, Frank C.; Datuashvili, George, Pruning of blob replicas.
  83. Meyerzon, Dmitriy; Zaragoza, Hugo, Ranking search results using biased click distance.
  84. Meyerzon, Dmitriy; Li, Hang, Ranking search results using feature extraction.
  85. Meyerzon, Dmitriy; Zaragoza, Hugo, Ranking search results using language types.
  86. Poznanski, Victor; Wang, Oivind; Holm, Fredrik; Bodd, Nicolai; Tankovich, Vladimir; Meyerzon, Dmitriy, Re-ranking search results.
  87. Randall, Keith H., Scheduler for search engine crawler.
  88. Randall, Keith H., Scheduler for search engine crawler.
  89. Randall, Keith H., Scheduler for search engine crawler.
  90. Zhu, Huican; Ibel, Maximilian; Acharya, Anurag; Gobioff, Howard Bradley, Scheduler for search engine crawler.
  91. Zhu, Huican; Ibel, Maximilian; Acharya, Anurag; Gobioff, Howard Bradley, Scheduler for search engine crawler.
  92. Acharya, Anurag; Louz On, Michal; Roetter, Alexander C., Search engine with multiple crawlers sharing cookies.
  93. Tankovich, Vladimir; Li, Hang; Meyerzon, Dmitriy; Xu, Jun, Search results ranking using editing distance and document information.
  94. Varma,Anujan; Restrick,Robert C.; Bannur,Jaisimha, Selecting a queue for service in a queuing system.
  95. Drobychev, Alexandre; Kesselman, Alexander; Vickrey, Rebekah C.; Dachille, Frank C.; Datuashvili, George, Storage of data in a distributed storage system.
  96. Dmitriy Meyerzon ; Sankrant Sanu, Synchronizing crawler with notification source.
  97. Olston, Christopher, System and method for adaptively refreshing a web page.
  98. Sundaresan, Neelakantan, System and method for automatic generation of dynamic search abstracts contain metadata by crawler.
  99. Bhagwan, Varun; Desai, Rajesh M.; Jalan, Piyoosh, System and method for crawl policy management utilizing IP address and IP address range.
  100. Bhagwan, Varun; Desai, Rajesh M.; Jalan, Piyoosh, System and method for crawl policy management utilizing IP address and IP address range.
  101. Glover, Robin, System and method for determining document version geneology.
  102. Najork,Marc Alexander, System and method for distributed web crawling.
  103. Galai, Yaron; Itzhak, Oded, System and method for extracting content for submission to a search engine.
  104. Najork, Marc Alexander; Heydon, Clark Allan; Mitzenmacher, Michael; Henzinger, Monika H., System and method for near-uniform sampling of web page addresses.
  105. Meyerzon, Dmitriy; Zaragoza, Hugo, System and method for ranking search results using click distance.
  106. Kesselman, Alexander, System and method for replicating objects in a distributed storage system.
  107. Merrigan, Chadd Creighton; Peltonen, Kyle G.; Meyerzon, Dmitriy; Lee, David J., System and method for scoping searches using index keys.
  108. Mulder, Matthew, System and method for securing documents prior to transmission.
  109. Brenner,Larry Bert; Srinivas,Mysore Sathyanarayana; Van Fleet,James W., System and method for thread scheduling with weak preemption policy.
  110. Pitzel,Bradley John; Bobrovskiy,Stanislav; Roberts,William A., System and method for updating information via a network.
  111. Roegner, Michael W., System for managing access to protected resources.
  112. Roegner, Michael W., System for managing access to protected resources.
  113. Hughes, Lucian P., System, method and article of manufacture for a user programmable diary interface link.
  114. Prevost, Michel; Samson, Pierre Paul; Beaulieu, Francis; Perron, Yves, Systems and methods for subscription management in a multi-channel context aware communication environment.
  115. Zunger, Yonatan; Drobychev, Alexandre; Kesselman, Alexander; Vickrey, Rebekah C.; Dachille, Frank Clare; Datuashvili, George, Systems and methods of simulating the state of a distributed storage system.
  116. Prasad, Mukul Ranjan, Technique for coordinating the distributed, parallel crawling of interactive client-server applications.
  117. Prasad, Mukul Ranjan, Technique for stateless distributed parallel crawling of interactive client-server applications.
  118. Brodsky, Elizabeth A.; Elnozahy, Elmootazbellah N.; Rajamony, Ramakrishnan, Technology for web site crawling.
  119. McCorkendale, Bruce; Sobel, William E.; Szor, Peter, Update protection system and method.
  120. Andrews, Peter J.; Faisman, Alexander; Grabarnik, Genady; Shwartz, Larisa, Voice response systems browsing.
  121. Prasad, Mukul R., Web service for automated cross-browser compatibility checking of web applications.
섹션별 컨텐츠 바로가기

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

AI-Helper 아이콘
AI-Helper
안녕하세요, AI-Helper입니다. 좌측 "선택된 텍스트"에서 텍스트를 선택하여 요약, 번역, 용어설명을 실행하세요.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.

선택된 텍스트

맨위로