Neural network acoustic and visual speech recognition system training method and apparatus

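IPC classification information
Country / Type: United States (US) Patent, granted
International Patent Classification (IPC 7th edition)
  • G01L-005/06
  • G01L-009/00
Application number: US-0137318 (1993-10-14)
Inventors / Address
  • Stork David G. (Stanford CA)
  • Wolff Gregory J. (Mountain View CA)
Applicants / Address
  • Ricoh Corporation (Menlo Park CA 02)
  • Ricoh Company, Ltd. (Tokyo JPX 03)
Citation information: cited by 37 patents; cites 7 patents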

Abstract

The apparatus for the recognition of speech includes an acoustic preprocessor, a visual preprocessor, and a speech classifier that operates on the acoustic and visual preprocessed data. The acoustic preprocessor comprises a log mel spectrum analyzer that produces an equal mel bandwidth log power spe

Representative claim

A training system for a speech recognition system comprising: (a) a speech recognition system for recognizing utterances belonging to a pre-established set of allowable candidate utterances using acoustic speech signals and selected concomitant dynamic visual facial feature motion between selected f

Patents cited by this patent (7)

  1. Beadles Robert L. (Durham NC), Audio visual speech recognition.
  2. Baji Toru (Burlingame CA) Noguchi Kouki (Kokubunji CA JPX) Nakagawa Tetsuya (Millbrae CA) Tonomura Motonobu (Kodaira JPX) Akimoto Hajime (Mobara JPX) Masuhara Toshiaki (Tokyo JPX), Customized personal terminal device.
  3. Petajan Eric D. (25 Cypress St. Millburn NJ 07041), Electronic facial tracking and detection system and method and apparatus for automated speech recognition.
  4. Roberts Jed (Cambridge MA) Baker James K. (West Newton MA) Porter Edward W. (Boston MA), Method for interactive speech recognition and training.
  5. Hopfield John J. (Pasadena CA) Tank David W. (Maplewood NJ), Neural computation by time concentration.
  6. Smith Allen R. (Shelton CT) Tan Chuan-Chieh (Orange CT) Slack Thomas B. (Oxford CT) Denenberg Jeffrey N. (Trumbull CT), Probabilistic learning element.
  7. Sakamoto Kenji (Nara JPX) Yamaguchi Kouichi (Tenri JPX), Recognition apparatus using articulation positions for recognizing a voice.

Patents citing this patent (37)

  1. Velusamy, Kavitha; Chu, Wai C.; Gopalan, Ramya; Chhetri, Amit S., Acoustic echo cancellation using visual cues.
  2. Velusamy, Kavitha; Chu, Wai C.; Gopalan, Ramya; Chhetri, Amit S., Acoustic echo cancellation using visual cues.
  3. Tan, Bozhao, Acoustic sound signature detection based on sparse features.
  4. Burke, Paul M.; Yacoub, Sherif, Allocation of speech recognition tasks and combination of results thereof.
  5. Carey, Ryan Michael; Chan, Victor Hokkiu, Analog signal reconstruction and recognition via sub-threshold modulation.
  6. Cho, Jeong-Mi; Kim, Jeong-Su; Bang, Won-Chul; Kim, Nam-Hoon, Apparatus and method for predicting user's intention based on multimodal information.
  7. Deligne, Sabine; Neti, Chalapathy V.; Potamianos, Gerasimos, Audio-visual codebook dependent cepstral normalization.
  8. Deligne,Sabine; Neti,Chalapathy V.; Potamianos,Gerasimos, Audio-visual codebook dependent cepstral normalization.
  9. Marcheret, Etienne; Vopicka, Josef; Goel, Vaibhava, Audio-visual speech recognition with scattering operators.
  10. Marcheret, Etienne; Vopicka, Josef; Goel, Vaibhava, Audio-visual speech recognition with scattering operators.
  11. Morrison, Andrew R., Camera-assisted noise cancellation and speech recognition.
  12. Lahr,Roy J., Head-worn, trimodal device to increase transcription accuracy in a voice recognition system and to process unvocalized speech.
  13. Zhou, Dong; Hovden, Gunnar; Noble, Isaac S.; Ivanchenko, Volodymyr V.; Karakotsios, Kenneth M., Managing resource usage for task performance.
  14. Chen Tsuhan ; Rao Ram R., Method and apparatus for cross-modal predictive coding for talking head sequences.
  15. Geppert, Nicolas Andre; Sattler, Jürgen, Method and system for the processing and storing of voice information and corresponding timeline information.
  16. Geppert, Nicolas Andre; Sattler, Jürgen, Method and system for the processing of voice data and for the recognition of a language.
  17. Choo,Ki hyun; Kim,Jeong su; Lee,Jae won; Lee,Ki seung, Method of setting optimum-partitioned classified neural network and method and apparatus for automatic labeling using optimum-partitioned classified neural network.
  18. Peterson Richard John ; Russell Dale William ; Karaali Orhan ; Bliss Harry Martin, Method, device and system for noise-tolerant language understanding.
  19. Wagner Thomas,DEX ; Boebel Friedrich G.,FRX ; Bauer Norbert,DEX, Person identification based on movement information.
  20. Hart, Gregory M.; Bezos, Jeffrey P.; Kwee, Frances MHH; Brown, James Samuel, Relative position-inclusive device interfaces.
  21. Capless, Jonathan, Scrolling display of electronic program guide utilizing images of user lip movements.
  22. Colmenarez,Antonio; Kellner,Andreas, Speech activity detection using acoustic and facial characteristics in an automatic speech recognition system.
  23. Campbell William Michael, Speech classifier and method using delay elements.
  24. Harada Masaaki,JPX ; Takeuchi Shin,JPX ; Fukui Motofumi,JPX ; Shimizu Tadashi,JPX, Speech detection apparatus using specularly reflected light.
  25. Hart, Gregory M.; Freed, Ian W.; Zehr, Gregg Elliott; Bezos, Jeffrey P., Speech-inclusive device interfaces.
  26. Atal, Bishnu Saroop, System and method of pattern recognition in very high dimensional space.
  27. Atal,Bishnu Saroop, System and method of pattern recognition in very high-dimensional space.
  28. Atal,Bishnu Saroop, System and method of pattern recognition in very high-dimensional space.
  29. Atal,Bishnu Saroop, System and method of pattern recognition in very high-dimensional space.
  30. Thomas, David R., Telescopic reconstruction of facial features from a speech pattern.
  31. Gruenstein, Alexander H., Training multiple neural networks with different accuracy.
  32. Costello, Kevin Robert, User interface techniques for simulating three-dimensional depth.
  33. White, Marc, Using a physical phenomenon detector to control operation of a speech recognition engine.
  34. White, Marc, Using a physical phenomenon detector to control operation of a speech recognition engine.
  35. Bernd Girod DE, Video-assisted audio signal processing system and method.
  36. Girod,Bernd, Video-assisted audio signal processing system and method.
  37. Margolis, Jeffrey, Virtual object.