[특허]Identifying book title sets

IPC분류정보
국가/구분	United States(US) Patent 등록
국제특허분류(IPC7판)	G06F-017/30
출원번호	US-0048426 (2011-03-15)
등록번호	US-9881009 (2018-01-30)
발명자 / 주소	Weight, Christopher F. Birkett, Andrew D. Hamaker, Janna Killalea, Tom Nelson, Alexander William Robb
출원인 / 주소	Amazon Technologies, Inc.
대리인 / 주소	Lee & Hayes, PLLC
인용정보	피인용 횟수 : 1 인용 특허 : 31

초록 ▼

Techniques are described for identifying book title sets. The techniques may include a first-pass comparison with other books to identify other candidate title sets. A second-pass comparison may then be performed with respect to the candidate title sets. The first-pass comparison may be based on boo

Techniques are described for identifying book title sets. The techniques may include a first-pass comparison with other books to identify other candidate title sets. A second-pass comparison may then be performed with respect to the candidate title sets. The first-pass comparison may be based on book metadata such as titles and authorship. The second-pass comparison may include a more comprehensive content comparison, such as comparing the body text of the books.

대표청구항 ▼

1. A computer-implemented method, comprising: under control of one or more processors configured with executable instructions,receiving, from a device of an author and via a content ingestion service associated with a network, an electronic book having first body text and first metadata;normalizing

1. A computer-implemented method, comprising: under control of one or more processors configured with executable instructions,receiving, from a device of an author and via a content ingestion service associated with a network, an electronic book having first body text and first metadata;normalizing the electronic book by removing illustrations from the electronic book, removing extraneous characters from the electronic book, and converting characters of the electronic book to a single case;determining, in response to the normalizing of the electronic book, whether the first metadata of the electronic book matches metadata of any existing book title sets;based at least partly on a first determination that the first metadata of the electronic book matches second metadata of no more than a single existing book title set that includes at least one book, adding the electronic book to the single existing book title set such that the single existing book title set includes the at least one book and the electronic book;based at least partly on a second determination that the first metadata of the electronic book matches third metadata of multiple existing book title sets, calculating a text matching score corresponding to individual ones of the existing book title sets, the text matching score indicating a comparison of a first frequency of one or more words included in the first body text of the electronic book and a second frequency of the one or more words included in second body text of the corresponding existing book title set; andadding the electronic book to an existing book title set of the multiple existing book title sets based at least partly on the text matching score corresponding to the existing book title set being greater than a specified threshold, the existing book title set including the electronic book and one or more other books. 2. The computer-implemented method of claim 1, wherein the first metadata of the electronic book and the metadata of the existing book title sets indicates one or more of: title;authorship;publisher;publication date;copyright date; andInternational Standard Book Number (ISBN). 3. The computer-implemented method of claim 1, wherein calculating the text matching score comprises evaluating word alignment between the electronic book and the existing book title set. 4. The computer-implemented method of claim 1, wherein calculating the text matching score comprises evaluating page alignment between the electronic book and the existing book title set. 5. The computer-implemented method of claim 1, wherein calculating the text matching score comprises evaluating word frequencies of the electronic book and the existing book title set. 6. The computer-implemented method of claim 1, wherein calculating the text matching score comprises evaluating edit distances between the electronic book and the existing book title set. 7. A computer-implemented method, comprising: under control of one or more processors configured with executable instructions,receiving, from a device of an author and via a content ingestion service associated with a network, an electronic book having first body text and first metadata;normalizing the electronic book by at least one of removing illustrations from the electronic book, removing extraneous characters from the electronic book, or converting characters of the electronic book to a single case;comparing the first metadata of the electronic book with second metadata corresponding to other books to identify one or more candidate title sets of which the electronic book may be a member;determining that a number of the one or more candidate title sets meets or exceeds a pre-determined number of candidate title sets; andbased at least partly on the determining that the number of the one or more candidate title sets meets or exceeds the pre-determined number of candidate title sets, comparing the first body text of the electronic book with second body text of the one or more candidate title sets to determine that the electronic book is a member of the one or more candidate title sets. 8. The computer-implemented method of claim 7, wherein the second body text of the one or more candidate title sets comprises a canonical text corresponding to the one or more candidate title sets. 9. The computer-implemented method of claim 7, wherein the second body text of the one or more candidate title sets comprises body text of an existing member of the one or more candidate title sets. 10. The computer-implemented method of claim 7, wherein the first metadata of the electronic book and the second metadata corresponding to the other books comprises multiple data fields. 11. The computer-implemented method of claim 7, wherein the first metadata of the electronic book and the second metadata corresponding to the other books comprises at least an author field and a title field. 12. The computer-implemented method of claim 7, wherein: the first metadata of the electronic book comprises a first author field and the second metadata corresponding to the other books comprises a second author field; andcomparing the first metadata comprises determining whether there is common authorship between the electronic book and the other books based on the first author field and the second author field. 13. The computer-implemented method of claim 7, wherein: the first metadata of the electronic book and the second metadata corresponding to the other electronic books indicate respective titles of the electronic book and the other electronic books; andthe method further comprising normalizing the first metadata prior to comparing the first metadata. 14. The computer-implemented method of claim 7, wherein comparing the first metadata comprises calculating metadata similarity scores based at least in part on similarity between the first metadata of the electronic book and the second metadata corresponding to the other electronic books. 15. The computer-implemented method of claim 7, wherein comparing the first body text comprises evaluating word alignment between the electronic book and the one or more candidate title sets. 16. The computer-implemented method of claim 7, wherein comparing the first body text comprises evaluating page alignment between the electronic book and the one or more candidate title sets. 17. The computer-implemented method of claim 7, wherein comparing the first body text comprises evaluating word frequencies of the electronic book and the one or more candidate title sets. 18. The computer-implemented method of claim 7, wherein comparing the first body text comprises evaluating edit distances between the first body text of the electronic book and the second body text of the one or more candidate title sets. 19. An online electronic book service, comprising: one or more processors; andone or more non-transitory computer-readable storage media containing instructions that are executable by the one or more processors to perform actions comprising: receiving, from a device of an author and via a content ingestion service associated with a network, an electronic book;normalizing the electronic book by at least one of removing illustrations from the electronic book, removing extraneous characters from the electronic book, or converting characters of the electronic book to a single case;performing a first-pass comparison of metadata of the electronic book with metadata of different book title sets to identify one or more candidate title sets of which the electronic book may be a member; andbased at least partly on a determination that the first-pass comparison identifies a partial match for multiple candidate title sets, performing a second-pass comparison of first body text of the electronic book with second body text of the multiple candidate title sets to determine that the electronic book is a member of any of the multiple candidate title sets. 20. The online electronic book service of claim 19, wherein the second-pass comparison comprises comparing word frequencies of the electronic book and the multiple candidate title sets. 21. The online electronic book service of claim 19, wherein the second-pass comparison comprises comparing word alignment of the electronic book with the multiple candidate title sets. 22. The online electronic book service of claim 19, wherein the second-pass comparison comprises comparing page alignment of the electronic book with the multiple candidate title sets. 23. The online electronic book service of claim 19, wherein the second-pass comparison comprises comparing edit distances between the electronic book and the multiple candidate title sets. 24. The online electronic book service of claim 19, wherein the actions further comprise, based at least partly on a determination that the first-pass comparison does not identify any candidate title sets of the multiple candidate title sets, performing the first-pass comparison with respect to a different electronic book.

LOADING...

이 특허에 인용된 특허 (31) 인용/피인용 타임라인 분석

Divine, Marc; Maestrimi, Yves; Mazeiller, Dominique, Automated system for producing booklets on demand.
상세보기
Lifantsev, Maxim, Automatic metadata identification.
상세보기
Thibaux, Romain; Vincent, Luc; Uhlik, Christopher Richard; Manmatha, Raghavan; Wang, Xuefu, Automatic metadata identification.
상세보기
Baluja, Shumeet; Jing, Yushi, Book content item search.
상세보기
Zeidman, Robert Marc, Detecting plagiarism in computer source code.
상세보기
Shirai Noriaki,JPX ; Hoashi Yoshiaki,JPX ; Matsui Takeshi,JPX, Distance measurement apparatus.
상세보기
Clark, George Philip; Crawford, Jeffrey Walter; Marino, Edward John; Brewster, Laurance Holmes, Distributing electronic books over a computer network.
상세보기
Tibbetts, Timothy A.; Rigdon, Debra A., Document analysis.
상세보기
Green, Robin A. R., Document retrieval system and search method using word set and character look-up tables.
상세보기
Thacker,Charles P.; Sommerer,Ralph, Dynamic pagination of text and resizing of image to fit in a document.
상세보기
Hendricks John S., Electronic book selection and delivery system.
상세보기
Joshi, Ashutosh; Gupta, Aparna; Mohanty, Binay; Upadhyay, Jalvin; Arora, Rajiv; Betz, Martin; Prospero, Michael; Cooke, David; Rao, Prashant, Event naming.
상세보기
Basehore Paul M., Handwritten character translator using fuzzy logic.
상세보기
Park, Tim; Dolinsky, Dmitry, Identifying duplicate electronic content based on metadata.
상세보기
DeSmet Eric (Sint Niklaas BEX), Interactive talking book and audio player assembly.
상세보기
Ito Takahiro,JPX ; Takayama Yasuhiro,JPX ; Suzuki Katsushi,JPX, Keyword extraction apparatus, keyword extraction method, and computer readable recording medium storing keyword extraction program.
상세보기
Panelli, John D.; Jalagam, Sesh; Leung, Alan Kin Chung, Maintaining and using user-created mapping history for network resource mapping.
상세보기
Lam Christopher S., Memory sharing architecture for a decoding in a computer system.
상세보기
Combs J. Andrew, Method and apparatus for providing automated searching and linking of electronic documents.
상세보기
Lopresti Daniel P. ; Sandberg Jonathan S., Method and means for enhancing optical character recognition of printed documents.
상세보기
Cox Paula J. ; Gillihan Dana L. ; Hyatt Donald Ray ; Leone Paul T. ; Nordby Kenneth M. ; Pullizzi Victor Edward ; Rauch Thyra Lynne ; Rinda Robert W., Method and system for organizing on-line books using bookcases.
상세보기
Najmi,Farrukh S., Methods and apparatus for indexing content.
상세보기
Harrington, Steven J., Object identification method and system for an augmented-reality display.
상세보기
Knut Magne Risvik NO, Search system and method for retrieval of data, and the use thereof in a search engine.
상세보기
Levin, Eugene; Corey, Martha Elizabeth, System and method for pattern recognition in sequential data.
상세보기
Chambers, Mike; Cantrell, Christian, System and method for ranking information based on clickthroughs.
상세보기
Hay,George M.; Rasmussen,Gerald, System and method for the delivery of electronic books.
상세보기
Nielsen Jakob, System for reminding a sender of an email if recipient of the email does not respond by a selected time set by the sender.
상세보기
Anderson Brian,AUX, Thermal insulating container.
상세보기
Aiken, Alexander, User interface for displaying document comparison information.
상세보기
Fish David (Elkana P.O. Box 268 44814 D.N. Efraim ILX), Warning method and apparatus and parallel correlator particularly useful therein.
상세보기

이 특허를 인용한 특허 (1) 인용/피인용 타임라인 분석

Bodapati, Sravan Babu; Kalyanapasupathy, Venkatraman, Automated identification of start-of-reading location for ebooks.
상세보기

활용도 분석정보

상세보기

다운로드

내보내기

활용도 Top5 특허

해당 특허가 속한 카테고리에서 활용도가 높은 상위 5개 콘텐츠를 보여줍니다.
더보기 버튼을 클릭하시면 더 많은 관련자료를 살펴볼 수 있습니다.

IPC	Description
A	생활필수품
A62	인명구조; 소방(사다리 E06C)
A62B	인명구조용의 기구, 장치 또는 방법(특히 의료용에 사용되는 밸브 A61M 39/00; 특히 물에서 쓰이는 인명구조 장치 또는 방법 B63C 9/00; 잠수장비 B63C 11/00; 특히 항공기에 쓰는 것, 예. 낙하산, 투출좌석 B64D; 특히 광산에서 쓰이는 구조장치 E21F 11/00)
A62B-1/08	.. 윈치 또는 풀리에 제동기구가 있는 것

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 공고번호, 공고일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표출원인, 출원인국적, 출원인주소, 발명자, 발명자E, 발명자코드, 발명자주소, 발명자 우편번호, 발명자국적, 대표IPC, IPC코드, 요약, 미국특허분류, 대리인주소, 대리인코드, 대리인(한글), 대리인(영문), 국제공개일자, 국제공개번호, 국제출원일자, 국제출원번호, 우선권, 우선권주장일, 우선권국가, 우선권출원번호, 원출원일자, 원출원번호, 지정국, Citing Patents, Cited Patents
저장형식	Text(ASCII format) Excel format PIAS분석(.xls)
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

[미국특허] Identifying book title sets 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

이 특허에 인용된 특허 (31) 인용/피인용 타임라인 분석

이 특허를 인용한 특허 (1) 인용/피인용 타임라인 분석

활용도 분석정보

활용도 Top5 특허

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

[미국특허] Identifying book title sets 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

전체(0) 논문(0) 특허(0) 보고서(0)

전체(0) 논문(0) 특허(0) 보고서(0)

이 특허에 인용된 특허 (31) 인용/피인용 타임라인 분석

이 특허를 인용한 특허 (1) 인용/피인용 타임라인 분석

활용도 분석정보

활용도 Top5 특허 더보기

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

활용도 Top5 특허