[특허]Answering questions using environmental context

Answering questions using environmental context 원문보기

IPC분류정보
국가/구분	United States(US) Patent 등록
국제특허분류(IPC7판)	G10L-015/00 G10L-017/00 G10L-021/00 G10L-025/00 G10L-015/22 G10L-015/08 G10L-015/24 G10L-015/30 G06F-017/30
출원번호	US-0410180 (2017-01-19)
등록번호	US-9786279 (2017-10-10)
발명자 / 주소	Sharifi, Matthew Postelnicu, Gheorghe
출원인 / 주소	Google Inc.
대리인 / 주소	Fish & Richardson P.C.
인용정보	피인용 횟수 : 0 인용 특허 : 36

초록 ▼

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data encoding an utterance and environmental data, obtaining a transcription of the utterance, identifying an entity using the environmental data, submitting a query to a natural language query processing engine, wherein the query includes at least a portion of the transcription and data that identifies the entity, and obtaining one or more results of the query.

대표청구항 ▼

1. A computer-implemented method comprising: generating, by a mobile device, an audio recording of (i) a question about an unidentified item of media content that a different device is playing in a vicinity of the mobile device, and (ii) environmental audio;in response to forwarding the audio recording to a front end server of a natural language processing system, receiving an answer to the question that is based on processing different portions of the audio recording by a speech recognition engine server associated with the natural language processing system and a content identification engine server associated with the natural language processing system; andin response to the question, providing, by the mobile device, the answer to the question about the unidentified item of media content. 2. The computer-implemented method of claim 1, comprising: identifying one or more keywords corresponding to the question,associating the one or more keywords with one or more types of media content, andproviding the answer based on the question and the one or more types of media content. 3. The computer-implemented method of claim 2, wherein the one or more types of media content includes at least one of movie, music, television show, audio podcast, image, artwork, book, magazine, trailer, video, podcast, Internet video and video game. 4. The computer-implemented method of claim 2, wherein providing the answer based on the one or more types of media content further comprises: identifying two or more candidate answers of the question,generating ranked scores for each of the two or more candidate answers, the ranked scores based on the one or more types of media content, andproviding the answer based on the question and the ranked scores. 5. The computer-implemented method of claim 1, further comprising streaming the environmental audio. 6. The computer-implemented method of claim 1, wherein the speech recognition engine server associated with the natural language processing system and the content identification server associated with the natural language processing system are both the same server. 7. The computer-implemented method of claim 1, further comprising: detecting environmental image data associated with the item of media content, andproviding the answer based on the question and the environmental image data. 8. The computer-implemented method of claim 7, further comprising: identifying one or more types of media content based on the environmental image data, andproviding the answer based on the question, the environmental image data and the one or more types of media content. 9. A system comprising: one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising: generating, by a mobile device, an audio recording of (i) a question about an unidentified item of media content that a different device is playing in a vicinity of the mobile device, and (ii) environmental audio;in response to forwarding the audio recording to a front end server of a natural language processing system, receiving an answer to the question that is based on processing different portions of the audio recording by a speech recognition engine server associated with the natural language processing system and a content identification engine server associated with the natural language processing system; andin response to the question, providing, by the mobile device, the answer to the question about the unidentified item of media content. 10. The system of claim 9, wherein the operations comprise: identifying one or more keywords corresponding to the question,associating the one or more keywords with one or more types of media content, andproviding the answer based on the question and the one or more types of media content. 11. The system of claim 10, wherein the one or more types of media content includes at least one of movie, music, television show, audio podcast, image, artwork, book, magazine, trailer, video, podcast, Internet video and video game. 12. The system of claim 10, wherein providing the answer based on the one or more types of media content further comprises: identifying two or more candidate answers of the question,generating ranked scores for each of the two or more candidate answers, the ranked scores based on the one or more types of media content, andproviding the answer based on the question and the ranked scores. 13. The system of claim 9, wherein the operations comprise streaming the environmental audio. 14. The system of claim 9, wherein the speech recognition engine server associated with the natural language processing system and the content identification server associated with the natural language processing system are both the same server. 15. The system of claim 9, wherein the operations comprise: detecting environmental image data associated with the item of media content, andproviding the answer based on the question and the environmental image data. 16. The system of claim 15, wherein the operations comprise: identifying one or more types of media content based on the environmental image data, andproviding the answer based on the question, the environmental image data and the one or more types of media content. 17. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising: generating an audio recording of (i) a question about an unidentified item of media content that a different device is playing in a vicinity of a mobile device, and (ii) environmental audio;in response to forwarding the audio to a front end server of a natural language processing system, receiving an answer to the question that is based on processing different portions of the audio recording by a speech recognition engine server associated with the natural language processing system and a content identification engine server associated with the natural language processing system; andin response to the question, providing the answer to the question about the unidentified item of media content. 18. The non-transitory computer-readable medium of claim 17, wherein the operations comprise: identifying one or more keywords corresponding to the question,associating the one or more keywords with one or more types of media content, andproviding the answer based on the question and the one or more types of media content. 19. The non-transitory computer-readable medium of claim 17, wherein the operations comprise streaming the environmental audio. 20. The non-transitory computer-readable medium of claim 17, wherein the operations comprise: detecting environmental image data associated with the item of media content, andproviding the answer based on the question and the environmental image data.

이 특허에 인용된 특허 (36)

Toyama,Soichi, Apparatus and method for speech recognition.
상세보기
Chen,Alexander C.; Gill,Sanjivpal S., Apparatus for delivering music and information.
상세보기
VanLund, Peter Spalding; Piersol, Kurt Wesley; Meyers, James David; Simpson, Jacob Michael; Gundeti, Vikram Kumar; Thomas, David Robert; Miles, Andrew Christopher, Application focus in speech-based systems.
상세보기
Li, Yuan; Adam, Hartwig, Automatic learning of logos for visual recognition.
상세보기
Chiang, Alice; Hurd, John C., Automatically initiating an internet-based search from within a displayed document.
상세보기
Jeffrey C. Reynar ; David Allen Caulton ; Erik Rucker ; Paul Kyong Hwan Kim, Background audio recovery system.
상세보기
Weinstein, Eugene; Mengibar, Pedro J.; Schalkwyk, Johan, Context-based speech recognition.
상세보기
Keener, Jr., Ellis Barlow; Kumar, Vishal; Srivastav, Ram Ranjan, Interactive audio/video method on the internet.
상세보기
French-St. George Marilyn,CAX ; Fumai Nicola,CAX ; Pasternack Henry Adam,CAX, Management of speech and audio prompts in multimodal interfaces.
상세보기
Ha Yeong Ho,KRX ; Han Kyu Pill,KRX ; Lee Kwang Choon,KRX ; Jeon Sung Kyu,KRX, Method and apparatus for automatically compensating tone color.
상세보기
Kenyon,Stephen C.; Simkins,Laura, Method and apparatus for automatically recognizing input audio and/or video streams.
상세보기
Abe,Mototsugu; Nishiguchi,Masayuki, Method and apparatus for classifying signals method and apparatus for generating descriptors and method and apparatus for retrieving signals.
상세보기
Wasserblant, Moshe; Eilam, Barak; Lubowich, Yuval; Nissan, Maor, Method and apparatus for fast search in call-center monitoring.
상세보기
Wasserblat,Moshe; Eilam,Barak; Pereg,Oren; Kor,Ilan, Method and apparatus for fraud detection.
상세보기
Ikezoye, Vance E.; Schrempp, James B., Method and apparatus for identifying media content presented on a media playing device.
상세보기
Ikezoye,Vance E.; Schrempp,James B., Method and apparatus for identifying media content presented on a media playing device.
상세보기
Chacker, Aaron R., Method and system for an online talent business.
상세보기
Wang, Avery Li-Chun; Barton, Christopher Jacques Penrose; Mukherjee, Dheeraj Shankar; Inghelbrecht, Philip, Method and system for purchasing pre-recorded music.
상세보기
Perlmutter, S. Michael, Method for selecting interactive voice response modes using human voice detection analysis.
상세보기
Plastina, Daniel; Alkove, James M.; Debique, Kirt A.; Colville, Scott; DeBacker, Gabriel S., Methods and systems for processing playlists.
상세보기
Rhoads, Geoffrey B.; Conwell, William Y., Methods of interacting with audio and ambient music.
상세보기
Roy,Philippe, Multi-phoneme streamer and knowledge representation speech recognition system and method.
상세보기
Swierczek, Remi, Music identification system.
상세보기
Deng, Li; Huang, Xuedong; Plumpe, Michael D., Pattern recognition training method and apparatus using inserted noise followed by noise reduction.
상세보기
Wang, Avery Li Chun; Culbert, Daniel, Robust and invariant audio pattern matching.
상세보기
Goldberg Randy G. ; Rosen Kenneth H. ; Sachs Richard M. ; Winthrop ; III Joel A., Selective noise/channel/coding models and recognizers for automatic speech recognition.
상세보기
Dimitri Kanevsky ; Vit V. Libal CZ; Jan Sedivy CZ; Wlodek W. Zadrozny, Speaker model adaptation via network of similar users.
상세보기
Hart, Gregory M.; Freed, Ian W.; Zehr, Gregg Elliott; Bezos, Jeffrey P., Speech-inclusive device interfaces.
상세보기
Petkovic Dragutin ; Ponceleon Dulce Beatriz ; Srinivasan Savitha, System and method for automatic audio content analysis for word spotting, indexing, classification and retrieval.
상세보기
Wang,Avery Li Chun; Smith, III,Julius O., System and methods for recognizing sound and music signals in high noise and distortion.
상세보기
Pitman, Michael C.; Fitch, Blake G.; Abrams, Steven; Germain, Robert S., System for selling a product utilizing audio content identification.
상세보기
Logan, James D.; Goessling, Daniel F.; Goldhor, Richard S., Systems and methods for modifying broadcast programming.
상세보기
Zuckerberg, Mark; Sittig, Aaron; Marlette, Scott, Tagging digital media.
상세보기
Alanara Seppo,FIX ; Kapanen Pekka,FIX, Transmission of comfort noise parameters during discontinuous transmission.
상세보기
Mozer, Todd F.; Rogers, Jeff; Vermeulen, Pieter J.; Shaw, Jonathan, Truly handsfree speech recognition in high noise environments.
상세보기
Kaye,Evan John, Voice clip identification method.
상세보기

IPC	Description
A	생활필수품
A62	인명구조; 소방(사다리 E06C)
A62B	인명구조용의 기구, 장치 또는 방법(특히 의료용에 사용되는 밸브 A61M 39/00; 특히 물에서 쓰이는 인명구조 장치 또는 방법 B63C 9/00; 잠수장비 B63C 11/00; 특히 항공기에 쓰는 것, 예. 낙하산, 투출좌석 B64D; 특히 광산에서 쓰이는 구조장치 E21F 11/00)
A62B-1/08	.. 윈치 또는 풀리에 제동기구가 있는 것

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 공고번호, 공고일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표출원인, 출원인국적, 출원인주소, 발명자, 발명자E, 발명자코드, 발명자주소, 발명자 우편번호, 발명자국적, 대표IPC, IPC코드, 요약, 미국특허분류, 대리인주소, 대리인코드, 대리인(한글), 대리인(영문), 국제공개일자, 국제공개번호, 국제출원일자, 국제출원번호, 우선권, 우선권주장일, 우선권국가, 우선권출원번호, 원출원일자, 원출원번호, 지정국, Citing Patents, Cited Patents
저장형식	Text(ASCII format) Excel format PIAS분석(.xls)
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

Answering questions using environmental context 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

이 특허에 인용된 특허 (36)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

Answering questions using environmental context 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

전체(0) 논문(0) 특허(0) 보고서(0)

전체(0) 논문(0) 특허(0) 보고서(0)

이 특허에 인용된 특허 (36)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트