[특허]Proxy server using a statistical model

Proxy server using a statistical model 원문보기

IPC분류정보
국가/구분	United States(US) Patent 등록
국제특허분류(IPC7판)	G06F-017/30 G06F-017/00 G06F-015/16
출원번호	US-0603695 (2000-06-26)
발명자 / 주소	Obata, Kenji Meyerzon, Dmitriy
출원인 / 주소	Microsoft Corporation
대리인 / 주소	Christensen O'Connor Johnson Kindness PLLC
인용정보	피인용 횟수 : 39 인용 특허 : 10

초록 ▼

A computer based system and method of determining whether to re-fetch a previously retrieved document across a computer network is disclosed. The method utilizes a statistical model to determine whether the previously retrieved document likely changed since last accessed. The statistical model is continuously improving its accuracy by training internal probability distributions to reflect the actual experience with change rate patterns of the documents accessed. The decision of whether to access the document is based on the probability of change compared against a desired synchronization level, random selections, maximum limits on the amount of time since the document was last accessed, and other criterion. Once the decision to access is made, the document is checked for changes and this information is used to train the statistical model.

대표청구항 ▼

1. A computer-implemented method for selectively accessing a document in response to a current retrieval request, the document being identified by a document address specification, the document having been retrieved during a previous retrieval request, the method comprising:determining whether to access the document during the current retrieval request by identifying with the aid of a statistical model whether the document is likely to have changed since a previous retrieval request; and accessing the document if the determination produces an instruction indicative that the document at the document address specification should be accessed during the current retrieval request, wherein determining whether to access the document during the current retrieval request comprises computing a probability that the document is likely to have changed since a previous retrieval request, and further wherein computing the probability that the document is likely to have changed since a previous retrieval request comprises: selecting an active probability indicative of a proportion of documents in a plurality of documents that are changing at various change rates, the plurality of documents including the document, training the active probability to reflect an experience with the document during a plurality of previous document retrieval requests, and using the trained active probability to compute the probability that the document has changed since a previous retrieval request. 2. The method of claim 1, further comprising:selecting the probability that the document has changed since the previous document retrieval request as the active probability in the current retrieval request; and computing the probability that the document is likely to have changed since a previous retrieval request for the current retrieval request. 3. The method of claim 1, wherein training the active probability includes multiplying the active probability indicative of a change in the document by a training probability calculated using a statistical model.4. The method of claim 1, wherein determining whether to access the document during the current retrieval request with the aid of a statistical model further comprises:training a document probability distribution corresponding to the document address specification to reflect an experience with the document during a plurality of previous document retrieval requests, the document probability distribution including a plurality of probabilities; determining from the document probability distribution a probability that the document has changed; and making a determination of whether to access the document in a current document retrieval request based on the probability that the document has changed. 5. The method of claim 4, further comprising:calculating, based on the experience with the document during a plurality of previous document retrieval requests, a discrete random variable distribution that includes a plurality of training probabilities; multiplying each probability in the document probability distribution by a corresponding training probability from the discrete random variable distribution. 6. The method of claim 5, wherein the training probabilities are calculated using a Poisson process, the Poisson process including a Poisson equation (e^(?r*dt)) and a complementary Poisson equation (1?e^(?r*dt)).7. The method of claim 6, wherein the experience with the document during the plurality of previous document retrieval requests is derived from historical information associated with the document address specification.8. A computer-readable medium having computer-executable instructions for retrieving one document in a plurality of documents from a remote server, which when executed comprise:maintaining historical information representing prior changes to the one document at the remote server; initiating a document retrieval request procedure for retrieving particular documents in the plurality of documents; determining whether to access the one document from the remote server based on an analysis of the historical information representing prior changes to the one document at the remote server; and if the determination to access the one document is positive, identifying the one document for retrieval during the document retrieval procedure, wherein determining whether to retrieve the document further comprises: computing a probability that the one document has changed since the one document was last retrieved from the remote server, and further wherein computing the probability that the one document has changed comprises: beginning with a probability that a pre-defined proportion of documents in the plurality of documents has changed, and training the probability that the pre-defined proportion of documents has changed using the historical information associated with the one document to achieve the probability that the one document has changed since the one document was last retrieved. 9. The computer-readable medium of claim 8, further comprising making a random decision to retrieve the one document wherein the random decision is biased by the probability that the one document has changed.10. The computer-readable medium of claim 9, wherein the random decision is further biased by a synchronization level configured to influence the random decision based on a predetermined degree of tolerance for not retrieving the one document if the document is likely to have changed.11. The computer-readable medium of claim 9, wherein the random decision is made by a software routine adapted to simulate a flip of a coin.12. The computer-readable medium of claim 8, wherein:the historical information representing prior changes to the one document comprises for the one document, a change count representing the number of times the one document has been modified, an access count representing the number of times the one document has been accessed, a first access time representing the time the one document was first accessed, and a last access time representing the time the one document was last accessed; and wherein the step of training the probability comprises creating a timeline using the historical information, the timeline having representations thereon of no change intervals, change intervals, and no change chunk intervals. 13. The computer-readable medium of claim 12, wherein the step of training the probability further comprises:training the document probability distribution for each no change interval; training the document probability distribution for each change interval; and training the document probability distribution for each no change chunk interval. 14. The computer-readable medium of claim 8, wherein:the historical information representing prior changes to the one document includes a hash value associated with the one document, the hash value being a representation of the one document; and wherein the analysis includes a comparison of the hash value included in the historical information with another hash value calculated from information retrieved from the one document stored on the remote server. 15. The computer-readable medium of claim 14, wherein if the hash value included in the historical information does not match the other hash value associated with the one document stored on the remote server, updating the historical information to identify that the one document changed.

이 특허에 인용된 특허 (10)

Narendran Balakrishnan ; Rangarajan Sampath ; Yajnik Shalini, Data distribution techniques for load-balanced fault-tolerant web access.
상세보기
Meyerzon, Dmitriy; Shoroff, Srikanth; Terek, F. Soner; Norin, Scott, Method and system for detecting duplicate documents in web crawls.
상세보기
Pirolli Peter L. ; Pitkow James E., Prefetching and caching documents according to probability ranked need S list.
상세보기
Marc Alexander Najork ; Clark Allan Heydon, System and method for associating an extensible set of data with documents downloaded by a web crawler.
상세보기
Soumen Chakrabarti ; Byron Edward Dom ; Martin Henk van den Berg, System and method for focussed web crawling.
상세보기
Douglas M. Dillon, System and method for multicasting multimedia content.
상세보기
Sundaresan, Neelakantan; Yi, Jeonghee, System and method for the automatic mining of new relationships.
상세보기
Liddy Elizabeth D. ; Yu Edmund Szu-Li, System for retrieving multimedia information from the internet using multiple evolving intelligent agents.
상세보기
Najork Marc Alexander ; Heydon Clark Allan ; Wiener Janet Lynn, Web crawler system using plurality of parallel priority level queues having distinct associated download priority levels for prioritizing document downloading and maintaining document freshness.
상세보기
Wiener, Janet L.; Stata, Raymond P.; Burrows, Michael, Web page connectivity server.
상세보기

이 특허를 인용한 특허 (39)

Popek, Gerald; Blaser, Shane; Tamura, Randall; Nguyen, Thod; Warren, Terry, Accelerating network communications.
상세보기
Popek, Gerald; Blaser, Shane; Tamura, Randall; Nguyen, Thod; Warren, Terry, Accelerating network communications.
상세보기
Milner, Marius C., Automatic proxy setting modification.
상세보기
Milner, Marius C., Automatic proxy setting modification.
상세보기
Wang, Cheuksan Edward, Bloom filter for storing file access history.
상세보기
Petriuc, Mihai, Click distance determination.
상세보기
Palliyil, Sudarshan; Venkateshamurthy, Shivakumara; Aswathanarayana, Tejasvi, Computer program product and computer system for controlling performance of operations within a data processing system or networks.
상세보기
Tankovich, Vladimir; Meyerzon, Dmitriy; Poznanski, Victor, Detection of junk in search result ranking.
상세보기
Tankovich, Vladimir; Meyerzon, Dmitriy; Taylor, Michael James, Document length as a static relevance feature for ranking search results.
상세보기
Dean, Jeffrey; Haahr, Paul; Henzinger, Monika; Lawrence, Steve; Pfleger, Karl; Sercinoglu, Olcan; Tong, Simon, Document scoring based on query analysis.
상세보기
Lawrence, Steve, Document scoring based on traffic associated with a document.
상세보기
Meyerzon, Dmitriy; Shnitko, Yauhen; Burges, Chris J. C.; Taylor, Michael James, Enterprise relevancy ranking using a neural network.
상세보기
Robertson, Stephen; Zaragoza, Hugo; Taylor, Michael; Larimore, Stefan Isbein; Petriuc, Mihai, Field weighting in text searching.
상세보기
Palliyil, Sudarshan; Venkateshamurthy, Shivakumara; Vijayaraghavan, Srinivas Belur; Aswathanarayana, Tejasvi, Hash-based access to resources in a data processing network.
상세보기
Dengler, Patrick M.; Krishnan, Arvind K.; Singh, Jagdish; Sanchez, Lawrence M.; Shankar, Sai; Chittamuru, Satish Kumar; Pekic, Zoltan; Mondal, Nabarun; Kumar, Namendra; i Dalfó, Ricard Roma, Metadata driven user interface.
상세보기
Villadsen, Peter; Chen, Zhaoqi; Gottumukkala, Ramakanthachary S.; Calderon, Marcos, Metadata-based eventing supporting operations on data.
상세보기
Palliyil, Sudarshan; Venkateshamurthy, Shivakumara; Aswathanarayana, Tejasvi, Method and computer program product for identifying or managing vulnerabilities within a data processing network.
상세보기
Palliyil, Sudarshan; Venkateshamurthy, Shivakumara; Vijayaraghavan, Srinivas Belur; Aswathanarayana, Tejasvi, Methods, apparatus and computer programs for enhanced access to resources within a network.
상세보기
Gupta,Arun K.; Uppal,Rajiv K.; Parikh,Devang I., Object oriented based, business class methodology for generating quasi-static web pages at periodic intervals.
상세보기
Fredricksen, Eric Russell; Feng, Hanping; Kataru, Naga Sridhar; Harik, Georges, Prioritized preloading of documents to client.
상세보기
Meyerzon, Dmitriy; Zaragoza, Hugo, Ranking search results using biased click distance.
상세보기
Meyerzon, Dmitriy; Li, Hang, Ranking search results using feature extraction.
상세보기
Meyerzon, Dmitriy; Zaragoza, Hugo, Ranking search results using language types.
상세보기
Poznanski, Victor; Wang, Oivind; Holm, Fredrik; Bodd, Nicolai; Tankovich, Vladimir; Meyerzon, Dmitriy, Re-ranking search results.
상세보기
Fredricksen, Eric Russell; Feng, Hanping; Kataru, Naga Sridhar; Harik, Georges, Refreshing cached documents and storing differential document content.
상세보기
Tankovich, Vladimir; Li, Hang; Meyerzon, Dmitriy; Xu, Jun, Search results ranking using editing distance and document information.
상세보기
Thomas, Michael F.; Polak, Martin R.; Kunc, Dennis C.; Stanich, Jr., Frank N.; Ling, Raymond S., Security system.
상세보기
Thomas, Michael F.; Polak, Martin R.; Kunc, Dennis C.; Stanich, Jr., Frank N.; Ling, Raymond S., Security system.
상세보기
Lloyd, Matthew, Speculative rendering during cache revalidation.
상세보기
Meyerzon, Dmitriy; Zaragoza, Hugo, System and method for ranking search results using click distance.
상세보기
Merrigan, Chadd Creighton; Peltonen, Kyle G.; Meyerzon, Dmitriy; Lee, David J., System and method for scoping searches using index keys.
상세보기
Fredricksen, Eric Russell; Schneider, Fritz John; Dean, Jeffrey Adgate; Ghemawat, Sanjay; Provos, Niels; Harik, Georges, System and method of accessing a document efficiently through multi-tier web caching.
상세보기
Fredricksen, Eric Russell; Schneider, Fritz John; Dean, Jeffrey Adgate; Ghemawat, Sanjay; Provos, Niels; Harik, Georges, System and method of accessing a document efficiently through multi-tier web caching.
상세보기
Fredrickson, Eric Russell; Feng, Hanping; Kataru, Naga Sridhar; Harik, Georges, System and method of accessing a document efficiently through multi-tier web caching.
상세보기
Marmigere, Gerard; Picon, Joaquin; Secondo, Pierre, System and method to refresh proxy cache server objects.
상세보기
Eriksen, Bjorn Marius Aamodt; Laraki, Othman, Systems and methods for cache optimization.
상세보기
Eriksen, Bjorn Marius Aamodt; Rennie, Jeffrey Glenn; Laraki, Othman, Systems and methods for client authentication.
상세보기
Eriksen, Bjorn Marius Aamodt; Rennie, Jeffrey Glen; Laraki, Othman, Systems and methods for client cache awareness.
상세보기
Erikson, Bjorn Marius Aamodt; Laraki, Othman; Nicolaou, Cosmos; Feng, Hanping; Rennie, Jeffrey Glen; Severson, Denis Lee, Systems and methods of efficiently preloading documents to client devices.
상세보기

IPC	Description
A	생활필수품
A62	인명구조; 소방(사다리 E06C)
A62B	인명구조용의 기구, 장치 또는 방법(특히 의료용에 사용되는 밸브 A61M 39/00; 특히 물에서 쓰이는 인명구조 장치 또는 방법 B63C 9/00; 잠수장비 B63C 11/00; 특히 항공기에 쓰는 것, 예. 낙하산, 투출좌석 B64D; 특히 광산에서 쓰이는 구조장치 E21F 11/00)
A62B-1/08	.. 윈치 또는 풀리에 제동기구가 있는 것

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 공고번호, 공고일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표출원인, 출원인국적, 출원인주소, 발명자, 발명자E, 발명자코드, 발명자주소, 발명자 우편번호, 발명자국적, 대표IPC, IPC코드, 요약, 미국특허분류, 대리인주소, 대리인코드, 대리인(한글), 대리인(영문), 국제공개일자, 국제공개번호, 국제출원일자, 국제출원번호, 우선권, 우선권주장일, 우선권국가, 우선권출원번호, 원출원일자, 원출원번호, 지정국, Citing Patents, Cited Patents
저장형식	Text(ASCII format) Excel format PIAS분석(.xls)
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

Proxy server using a statistical model 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

이 특허에 인용된 특허 (10)

이 특허를 인용한 특허 (39)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

Proxy server using a statistical model 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

전체(0) 논문(0) 특허(0) 보고서(0)

전체(0) 논문(0) 특허(0) 보고서(0)

이 특허에 인용된 특허 (10)

이 특허를 인용한 특허 (39)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트