$\require{mediawiki-texvc}$

연합인증

연합인증 가입 기관의 연구자들은 소속기관의 인증정보(ID와 암호)를 이용해 다른 대학, 연구기관, 서비스 공급자의 다양한 온라인 자원과 연구 데이터를 이용할 수 있습니다.

이는 여행자가 자국에서 발행 받은 여권으로 세계 각국을 자유롭게 여행할 수 있는 것과 같습니다.

연합인증으로 이용이 가능한 서비스는 NTIS, DataON, Edison, Kafe, Webinar 등이 있습니다.

한번의 인증절차만으로 연합인증 가입 서비스에 추가 로그인 없이 이용이 가능합니다.

다만, 연합인증을 위해서는 최초 1회만 인증 절차가 필요합니다. (회원이 아닐 경우 회원 가입이 필요합니다.)

연합인증 절차는 다음과 같습니다.

최초이용시에는
ScienceON에 로그인 → 연합인증 서비스 접속 → 로그인 (본인 확인 또는 회원가입) → 서비스 이용

그 이후에는
ScienceON 로그인 → 연합인증 서비스 접속 → 서비스 이용

연합인증을 활용하시면 KISTI가 제공하는 다양한 서비스를 편리하게 이용하실 수 있습니다.

Speech-inclusive device interfaces

IPC분류정보
국가/구분 United States(US) Patent 등록
국제특허분류(IPC7판)
  • G10L-015/00
출원번호 US-0879981 (2010-09-10)
등록번호 US-8700392 (2014-04-15)
발명자 / 주소
  • Hart, Gregory M.
  • Freed, Ian W.
  • Zehr, Gregg Elliott
  • Bezos, Jeffrey P.
출원인 / 주소
  • Amazon Technologies, Inc.
대리인 / 주소
    Novak Druce Connolly Bove + Quigg LLP
인용정보 피인용 횟수 : 39  인용 특허 : 15

초록

A user can provide input to a computing device through various combinations of speech, movement, and/or gestures. A computing device can analyze captured audio data and analyze that data to determine any speech information in the audio data. The computing device can simultaneously capture image or v

대표청구항

1. A method of determining user input to a computing device, comprising: capturing audio data using at least one audio capture element of the computing device;concurrent with capturing audio data, capturing image data of the user using at least one image capture element of the computing device; andu

이 특허에 인용된 특허 (15)

  1. Birchfield,Stanley T.; Gillmor,Daniel K., Acoustic source localization system and method.
  2. Mao, Xiadong, Audio input system.
  3. Pereira, Mark, Audio-based position tracking.
  4. Klein, Christian; Vassigh, Ali M.; Flaks, Jason S.; Larco, Vanessa; Soemo, Thomas M., Compound gesture-speech commands.
  5. Lahr,Roy J., Head-worn, trimodal device to increase transcription accuracy in a voice recognition system and to process unvocalized speech.
  6. Baker James K. (West Newton MA), Interactive speech recognition apparatus.
  7. Aaron, Joseph D.; Brunet, Peter Thomas; Kjeldsen, Frederik C. M.; Luther, Paul S.; Mahaffey, Robert Bruce, Method and apparatus for providing visual feedback of speed production.
  8. Woodcock, Ashley Arthur; Smith, Jaclyn Ann; McGuinness, Kevin, Method for generating output data.
  9. Gould Joel M. ; Steele Elizabeth E. ; McGrath Frank J. ; Squires Steven D. ; Parke Joel W., Method of speech command recognition with dynamic assignment of probabilities according to the state of the controlled a.
  10. Basu, Sankar; de Cuetos, Philippe Christian; Maes, Stephane Herman; Neti, Chalapathy Venkata; Senior, Andrew William, Methods and apparatus for audio-visual speech detection and recognition.
  11. Stork David G. (Stanford CA) Wolff Gregory J. (Mountain View CA), Neural network acoustic and visual speech recognition system training method and apparatus.
  12. Marks, Richard L.; Mao, Xiadong, Selective sound source listening in conjunction with computer interactive processing.
  13. Chen Chengjun Julian ; Wu Frederick Yung-Fung ; Yeh James T., Speech recognition aided by lateral profile image.
  14. Hashimoto Hideki (Kanagawa-ken JPX) Nagata Yoshifumi (Kanagawa-ken JPX) Seto Shigenobu (Kanagawa-ken JPX) Takebayashi Yoichi (Kanagawa-ken JPX) Shinchi Hideaki (Kanagawa-ken JPX) Yamaguchi Koji (Chib, Speech recognition interface system suitable for window systems and speech mail systems.
  15. Chen, Shaohai; Tamchina, Phillip George; Lee, Jae Han, Stabilizing directional audio input from a moving microphone array.

이 특허를 인용한 특허 (39)

  1. Bowen, Donald J.; Dimitriadis, Dimitrios B.; Ji, Lusheng; Schroeter, Horst J., Acoustic enhancement by leveraging metadata to mitigate the impact of noisy environments.
  2. Bowen, Donald J.; Dimitriadis, Dimitrios B.; Ji, Lusheng; Schroeter, Horst J., Acoustic enhancement by leveraging metadata to mitigate the impact of noisy environments.
  3. Lloyd, Matthew I., Adjusting language models based on topics identified using context.
  4. Lloyd, Matthew I., Adjusting language models using context information.
  5. Sharifi, Matthew; Postelnicu, Gheorghe, Answering questions using environmental context.
  6. Sanders, Jason; Taubman, Gabriel; Lee, John J., Background audio identification for speech disambiguation.
  7. Sanders, Jason; Taubman, Gabriel; Lee, John J., Background audio identification for speech disambiguation.
  8. Sanders, Jason; Taubman, Gabriel; Lee, John J., Background audio identification for speech disambiguation.
  9. Froelich, Raymond J., Conversational software agent.
  10. Aleksic, Petar; Moreno Mengibar, Pedro J., Determining dialog states for language models.
  11. Yuan, Alvin; Yamamoto, Stuart, Dynamic geo-fencing for voice recognition dictionary.
  12. Okada, Takeshi, Electronic apparatus, system, storage control method, and storage medium.
  13. Lyle, Ruthie D.; O'Sullivan, Patrick Joseph; Sun, Lin, Enhancing comprehension in voice communications.
  14. Shreve, Matthew Adam; Mongeon, Michael C.; Loce, Robert P.; Bernal, Edgar A., Heuristic-based approach for automatic payment gesture classification and detection.
  15. Adams, Jeffrey P.; Salvador, Stan W.; Kneser, Reinhard, Household agent learning.
  16. Rhoads, Geoffrey B., Intuitive computing methods and systems.
  17. Rodriguez, Tony F.; Rhoads, Geoffrey B., Intuitive computing methods and systems.
  18. Celikyilmaz, Fethiye Asli; Feizollahi, Zhaleh; Hakkani-Tur, Dilek; Sarikaya, Ruhi, Language and domain independent model based approach for on-screen item selection.
  19. Lim, Suk Hwan; Aguera-Arcas, Blaise, Low power framework for controlling image sensor mode in a mobile image capture device.
  20. Lim, Suk Hwan; Aguera-Arcas, Blaise, Low power framework for processing, compressing, and transmitting images at a mobile image capture device.
  21. Takayanagi, Yuichiro; Kusaka, Masashi, Method and apparatus for recognizing speech by lip reading.
  22. Zurek, Robert A; Schuster, Adrian M; Shau, Fu-Lin; Wu, Jincheng, Method and apparatus for using image data to aid voice recognition.
  23. Sarikaya, Ruhi; Celikyilmaz, Fethiye Asli; Feizollahi, Zhaleh; Heck, Larry Paul; Hakkani-Tur, Dilek Z., Model based approach for on-screen item selection and disambiguation.
  24. Bezos, Jeffrey P., Movement recognition as input mechanism.
  25. Varthakavi, Mohan; Nanduri, Jayaram N M; Kothari, Nikhil, Multi-mode text input.
  26. Varthakavi, Mohan; Nanduri, Jayaram; Kothari, Nikhil, Multi-mode text input.
  27. Panainte, Sorin M.; Hughes, David J., Multi-pass vehicle voice recognition systems and methods.
  28. Venkatesha, Sharath; Pham, Hai D., Noise cancellation for voice activation.
  29. Rifkin, Ryan M.; Ramage, Daniel, Speech and computer vision-based control.
  30. Rifkin, Ryan M.; Ramage, Daniel, Speech and computer vision-based control.
  31. Froelich, Raymond J., Speech recognition.
  32. Froelich, Raymond J., Speech recognition.
  33. Iwai, Shiro, Speech recognition method and speech recognition device.
  34. Heiman, Arie; Yehuday, Uri, System, device and method for detecting speech.
  35. Donsbach, Aaron Michael; Vanik, Benjamin; Clapper, Jon Gabriel; Lentz, Alison; Lovejoy, Joshua Denali; Fritz, III, Robert Douglas; Duleba, Krzysztof; Zhang, Li; Payne, Juston; Fortuna, Emily Anne; Bialynicka-Birula, Iwona; Aguera-Arcas, Blaise; Ramage, Daniel; McMahan, Hugh Brendan; Lange, Oliver Fritz; Holbrook, Jess, Systems and methods for selective retention and editing of images captured by mobile image capture device.
  36. Bialynicka-Birula, Iwona; Aguera-Arcas, Blaise; Ramage, Daniel; McMahan, Hugh Brendan; Lange, Oliver Fritz; Fortuna, Emily Anne; Tyamagundlu, Divya; Holbrook, Jess; Kohlhepp, Kristine; Payne, Juston; Duleba, Krzysztof; Vanik, Benjamin; Lentz, Alison; Clapper, Jon Gabriel; Lovejoy, Joshua Denali; Donsbach, Aaron Michael, Systems and methods that leverage deep learning to selectively store images at a mobile image capture device.
  37. Drescher, Susan Adelle; Desai, Tejas Bhupendra, Vehicle infotainment and connectivity system.
  38. Graumann, David L.; Rosario, Barbara, Vehicular speech recognition grammar selection based upon captured or proximity information.
  39. Higgins, Krystal Rose; Farraro, Eric J.; Tapley, John; Manickavelu, Kumaresan; Mukherjee, Saurav, Virtual dressing room.
섹션별 컨텐츠 바로가기

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

AI-Helper 아이콘
AI-Helper
안녕하세요, AI-Helper입니다. 좌측 "선택된 텍스트"에서 텍스트를 선택하여 요약, 번역, 용어설명을 실행하세요.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.

선택된 텍스트