A smart phone senses audio, imagery, and/or other stimulus from a user's environment, and acts autonomously to fulfill inferred or anticipated user desires. In one aspect, the detailed technology concerns phone-based cognition of a scene viewed by the phone's camera. The image processing tasks applied to the scene can be selected from among various alternatives by reference to resource costs, resource constraints, other stimulus information (e.g., audio), task substitutability, etc. The phone can apply more or less resources to an image processing task depending on how successfully the task is proceeding, or based on the user's apparent interest in the task. In some arrangements, data may be referred to the cloud for analysis, or for gleaning. Cognition, and identification of appropriate device response(s), can be aided by collateral information, such as context. A great number of other features and arrangements are also detailed.
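The abstract's idea of selecting among alternative image-processing tasks "by reference to resource costs, resource constraints … task substitutability," and of applying more or fewer resources depending on task progress and apparent user interest, can be sketched as a simple budgeted scheduler. The task names, cost units, and scoring rule below are illustrative assumptions for exposition, not the patent's actual implementation.

```python
# Hypothetical sketch: rank candidate recognition tasks by a score combining
# apparent user interest and how well the task is proceeding, per unit of
# resource cost, then greedily fund tasks within a resource budget.
from dataclasses import dataclass


@dataclass
class Task:
    name: str
    cost: int             # abstract resource units (CPU, battery, bandwidth)
    progress: float       # 0.0-1.0: how successfully the task is proceeding
    user_interest: float  # 0.0-1.0: the user's apparent interest in the task


def allocate(tasks, budget):
    """Greedily fund the tasks with the best interest/progress per unit cost."""
    ranked = sorted(
        tasks,
        key=lambda t: (t.user_interest * (0.5 + t.progress)) / t.cost,
        reverse=True,
    )
    funded = []
    for t in ranked:
        if t.cost <= budget:
            funded.append(t.name)
            budget -= t.cost
    return funded


tasks = [
    Task("barcode_reading", cost=2, progress=0.8, user_interest=0.9),
    Task("face_recognition", cost=5, progress=0.2, user_interest=0.4),
    Task("ocr", cost=3, progress=0.6, user_interest=0.7),
]
print(allocate(tasks, budget=6))  # ['barcode_reading', 'ocr']
```

A real system would update `progress` and `user_interest` continuously and could also refer stalled tasks to the cloud, as the abstract suggests.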
Representative Claims
1. A method of declarative reconfiguration of a smart phone system, said system having a processor configured to perform one or more acts of the method, said system also including at least first and second sensors for capturing, respectively, first and second different types of media content from a user's environment, and for producing, respectively, first and second different types of sensor output data, one of said sensors comprising a microphone for sensing audio content and producing audio output data, and another of said sensors comprising an image sensor for sensing visual content and producing image output data, the method comprising the acts:
(a) applying, to a speech recognition module, audio output data corresponding to user speech received by the microphone;
(b) receiving, from the speech recognition module, recognized verb data and recognized noun data corresponding, respectively, to a verb and a noun included in said user speech, the noun data identifying a subject in the user's environment from which sensor data is captured;
(c) based on said recognized verb data, determining that the user is either interested in the first type of media content or in the second type of media content;
(d) based on said recognized noun data, tuning a content recognition operation of the system in accordance with a determined user interest, said tuning comprising establishing a set of one or more audio or image processing operations to be performed on output data from the first sensor or the second sensor based on the determined user interest in the first type of media content or in the second type of media content, said set being selected from a larger set of signal processing operations comprising image or audio processing operations, said tuning including accessing a data structure using said recognized noun data to obtain data identifying said set of one or more signal processing operations to be performed on said output data from the first sensor or the second sensor based on the determined user interest in the first type of media content or in the second type of media content;
(e) performing said tuned content recognition operation on the first sensor output data or on the second sensor output data; and
(f) providing results based on said tuned content recognition operation to the user;
wherein speech recognition is employed both (1) in identifying a type of media content of interest to the user, and (2) in tuning content recognition processing of said identified type of media content.

2. The method of claim 1 in which the recognized verb data comprises data corresponding to a verb from the list consisting of: look, watch, view, see, and read.

3. The method of claim 1 in which the recognized verb data comprises data corresponding to a verb from the list consisting of: listen, and hear.

4. The method of claim 1 in which the recognized noun data comprises data corresponding to a noun from the list consisting of: newspaper, book, magazine, poster, text, printing, ticket, box, package, carton, wrapper, product, barcode, watermark, photograph, person, man, boy, woman, girl, people, display, screen, monitor, video, movie, television, radio, iPhone, iPad, and Kindle.

5. The method of claim 1 that includes determining, by reference to the recognized verb data, that visual content, rather than audio content, is of interest to the user, and the method includes determining a type of image processing to be applied to the image output data.

6. The method of claim 5 wherein the type of image processing comprises digital watermark decoding.

7. The method of claim 5 wherein the type of image processing comprises image fingerprinting.

8. The method of claim 5 wherein the type of image processing comprises optical character recognition.

9. The method of claim 5 wherein the type of image processing comprises barcode reading.

10. The method of claim 1 that includes: determining, by reference to the recognized verb data, that visual content, rather than audio content, is of interest to the user; and determining, by reference to the recognized noun data, a filtering function to be applied to the image output data.

11. The method of claim 1 that includes: determining, by reference to the recognized verb data, that visual content, rather than audio content, is of interest to the user; and determining, by reference to the recognized noun data, an optical focusing function to be applied to the image output data.

12. The method of claim 1 in which the user speech data includes a negation from the list: not, no, and ignore.

13. The method of claim 1 in which said recognized verb data directs the system that the user is interested in audio content rather than visual content, and said recognized noun data establishes an audio filtering function that is to be applied to said audio output data.

14. The method of claim 13 in which a passband of said audio filtering function depends on said recognized noun data.

15. The method of claim 13 that includes establishing a male voice-tailored audio filtering passband function in response to first recognized noun data, and establishing a female voice-tailored audio filtering passband function in response to second recognized noun data.

16. The method of claim 13 that includes, as a consequence of first user speech, processing audio output data with an audio filtering function having a first passband, and as a consequence of second user speech, processing audio output data with an audio filtering function having a second passband different than the first passband.

17. The method of claim 1 that includes: as a consequence of first user speech, including a first verb and a first noun, directing the system to process audio output data with a first signal processing operation; and as a consequence of second user speech, including a second verb and a second noun, directing the system to process image output data with a second signal processing operation; wherein the first verb is different than the second verb, and the first noun is different than the second noun.

18. The method of claim 1 that further includes, before act (c), detecting a keyword in the user speech, said keyword detection serving as a cue to the system to perform acts (c) through (e).

19. The method of claim 1 in which the first sensor comprises the microphone and the second sensor comprises the image sensor, and the determined user interest comprises an indication of an interest in the first type of media content, in which the first type of media content comprises audio content, and in which act (e) performs said tuned content recognition operation on the audio output data.

20. The method of claim 1 in which the first sensor comprises the microphone and the second sensor comprises the image sensor, and the determined user interest comprises an indication of an interest in the second type of media content, in which the second type of media content comprises visual content, and in which act (e) performs said tuned content recognition operation on the image output data.

21. A non-transitory computer readable medium containing programming instructions for configuring a smart phone system that includes a processor and at least first and second sensors for capturing, respectively, first and second different types of media content from a user's environment, and for producing, respectively, first and second different types of sensor output data, one of said sensors comprising a microphone for sensing audio content and producing audio output data, and another of said sensors comprising an image sensor for sensing visual content and producing image output data, said instructions configuring the system programmed thereby to perform acts including:
(a) applying, to a speech recognition module, audio output data corresponding to user speech received by the microphone;
(b) receiving, from the speech recognition module, recognized verb data and recognized noun data corresponding, respectively, to a verb and a noun included in said user speech, the noun data identifying a subject in the user's environment from which sensor data is captured;
(c) based on said recognized verb data, determining that the user is either interested in the first type of media content or in the second type of media content;
(d) based on said recognized noun data, tuning a content recognition operation of the system in accordance with a determined user interest, said tuning comprising establishing a set of one or more audio or image processing operations to be performed on output data from the first sensor or the second sensor based on the determined user interest in the first type of media content or in the second type of media content, said set being selected from a larger set of signal processing operations comprising image or audio processing operations, said tuning including accessing a data structure using said recognized noun data to obtain data identifying said set of one or more signal processing operations to be performed on said output data from the first sensor or the second sensor based on the determined user interest in the first type of media content or in the second type of media content;
(e) performing said tuned content recognition operation on the first sensor output data or on the second sensor output data; and
(f) providing results based on said tuned content recognition operation to the user;
wherein speech recognition is employed both (1) in identifying a type of media content of interest to the user, and (2) in tuning content recognition processing of said identified type of media content.

22. A smart phone system including: a processor; a memory; at least first and second sensors for capturing, respectively, first and second different types of media content from a user's environment, and for producing, respectively, first and second different types of sensor output data, one of said sensors comprising a microphone for sensing audio content and producing audio output data, and another of said sensors comprising an image sensor for sensing visual content and producing image output data; and instructions in said memory that configure the system to perform:
(a) applying, to a speech recognition module, audio output data corresponding to user speech received by the microphone;
(b) receiving, from the speech recognition module, recognized verb data and recognized noun data corresponding, respectively, to a verb and a noun included in said user speech, the noun data identifying a subject in the user's environment from which sensor data is captured;
(c) based on said recognized verb data, determining that the user is either interested in the first type of media content or in the second type of media content;
(d) based on said recognized noun data, tuning a content recognition operation of the system in accordance with a determined user interest, said tuning comprising establishing a set of one or more audio or image processing operations to be performed on output data from the first sensor or the second sensor based on the determined user interest in the first type of media content or in the second type of media content, said set being selected from a larger set of signal processing operations comprising image or audio processing operations, said tuning including accessing a data structure using said recognized noun data to obtain data identifying said set of one or more signal processing operations to be performed on said output data from the first sensor or the second sensor based on the determined user interest in the first type of media content or in the second type of media content;
(e) performing said tuned content recognition operation on the first sensor output data or on the second sensor output data; and
(f) providing results based on said tuned content recognition operation to the user;
wherein speech recognition is employed both (1) in identifying a type of media content of interest to the user, and (2) in tuning content recognition processing of said identified type of media content.
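The flow of independent claim 1 — a recognized verb selecting the media type of interest (acts (b)–(c)), and a recognized noun indexing a data structure that identifies the tuned signal-processing operations (act (d)) — can be sketched as follows. The verb and noun vocabularies are taken from claims 2–4; the operation names and dictionary layout are assumptions for illustration, not the patent's disclosed implementation.

```python
# Illustrative sketch of claim 1's verb/noun-driven tuning.
# Verb vocabularies from claims 2 and 3:
VISUAL_VERBS = {"look", "watch", "view", "see", "read"}
AUDIO_VERBS = {"listen", "hear"}

# The "data structure" of act (d), mapping recognized noun data to a set of
# signal-processing operations (operation names are hypothetical labels for
# the processing types named in claims 6-9 and 15):
NOUN_TO_OPS = {
    "newspaper": ["optical_character_recognition"],
    "barcode": ["barcode_reading"],
    "photograph": ["image_fingerprinting", "digital_watermark_decoding"],
    "man": ["male_voice_passband_filter"],
    "woman": ["female_voice_passband_filter"],
}


def tune(verb, noun):
    """Return (media type of interest, selected operations) per acts (c)-(d)."""
    if verb in VISUAL_VERBS:
        media = "visual"          # act (c): verb indicates visual content
    elif verb in AUDIO_VERBS:
        media = "audio"           # act (c): verb indicates audio content
    else:
        raise ValueError(f"unrecognized verb: {verb}")
    # Act (d): access the data structure with the recognized noun data.
    ops = NOUN_TO_OPS.get(noun, [])
    return media, ops


print(tune("read", "barcode"))   # ('visual', ['barcode_reading'])
print(tune("listen", "man"))     # ('audio', ['male_voice_passband_filter'])
```

Acts (e)–(f) would then run the selected operations on the corresponding sensor's output data and present the results to the user.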
Patents Cited by This Patent (11)
McCune, Timothy S., Autonomous integrated headset and sound processing system for tactical applications.
Narayanaswami, Chandrasekhar; Kirkpatrick, Edward Scott, Image capturing system and method for automatically watermarking recorded parameters for providing digital image verification.
Nelson, Paul E.; Anderson, Christopher H.; Whitman, Ronald M.; Gardner, Paul C.; David, Mark R., Multimedia document retrieval by application of multimedia queries to a unified index of multimedia data for a plurality of multimedia data types.
Kawasaki, Toshinobu; Komoda, Yoshiyuki; Tokunaga, Yoshihiko; Okada, Yukio; Shinomiya, Hirotatsu; Hayami, Takehito, Voice control system for operating home electrical appliances.
Heck, Larry Paul; Chinthakunta, Madhusudan; Mitby, David; Stifelman, Lisa, Augmented conversational understanding agent to identify conversation context between two humans and taking an agent action thereof.
Ligman, Joseph W.; Pistoia, Marco; Ponzo, John J.; Thomas, Gegi, Automatic extraction, modeling, and code mapping of application user interface display screens and components.