Patent Details

Neural network acoustic and visual speech recognition system training method and apparatus

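Country / Type: United States (US) Patent, Granted
International Patent Classification (IPC, 7th edition): G01L-005/06; G01L-009/00
U.S. Patent Classification (USC): 395/241; 395/239; 382/156
Application number: US-0137318 (1993-10-14)
Inventor / Address:
Applicant / Address:
Citation information: cited by 37 patents; cites 7 patents
Abstract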

The apparatus for the recognition of speech includes an acoustic preprocessor, a visual preprocessor, and a speech classifier that operates on the acoustic and visual preprocessed data. The acoustic preprocessor comprises a log mel spectrum analyzer that produces an equal mel bandwidth log power spectrum. The visual processor detects the motion of a set of fiducial markers on the speaker's face and extracts a set of normalized distance vectors describing lip and mouth movement. The speech classifier uses a multilevel time-delay neural network operating o...
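The "equal mel bandwidth log power spectrum" mentioned in the abstract can be illustrated with a minimal sketch. It is not the patent's implementation. The mel mapping used here (2595 · log10(1 + f/700)) is one common convention and is an assumption, as are the rectangular, non-overlapping bands, the natural logarithm, and all function and parameter names (`equal_mel_band_edges`, `log_band_powers`, `bin_hz`, `floor`), none of which appear in the record.

```python
import math


def hz_to_mel(f_hz):
    """Map frequency in Hz to mels (common 2595*log10 convention; an assumption)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def equal_mel_band_edges(f_low, f_high, n_bands):
    """Return n_bands + 1 band edges in Hz that are equally spaced on the mel scale."""
    lo, hi = hz_to_mel(f_low), hz_to_mel(f_high)
    step = (hi - lo) / n_bands
    return [mel_to_hz(lo + i * step) for i in range(n_bands + 1)]


def log_band_powers(power_spectrum, bin_hz, edges, floor=1e-12):
    """Sum the power of the bins falling in each band, then take the natural log.

    power_spectrum[k] is the power at frequency k * bin_hz.
    Bands are rectangular and half-open: [edges[i], edges[i + 1]).
    The floor avoids log(0) for empty or silent bands.
    """
    result = []
    for lo, hi in zip(edges, edges[1:]):
        total = sum(p for k, p in enumerate(power_spectrum) if lo <= k * bin_hz < hi)
        result.append(math.log(max(total, floor)))
    return result


# Hypothetical usage: 14 bands from 0 to 4000 Hz over a flat spectrum with 10 Hz bins.
edges = equal_mel_band_edges(0.0, 4000.0, 14)
features = log_band_powers([1.0] * 400, 10.0, edges)
```

Because the bands have equal width in mels, they are narrow at low frequencies and wide at high frequencies when measured in Hz, so the flat spectrum above yields log powers that grow with band index.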

Representative Claim

A training system for a speech recognition system comprising: (a) a speech recognition system for recognizing utterances belonging to a pre-established set of allowable candidate utterances using acoustic speech signals and selected concomitant dynamic visual facial feature motion between selected facial features associated with acoustic speech generation, comprising, (i) an acoustic feature extraction apparatus for converting signals representative of dynamic acoustic speech into a corresponding dynamic acoustic feature vector set of signals, (ii) a dyn...

Patents citing this patent (cited-by count: 37)

  1. Velusamy, Kavitha; Chu, Wai C.; Gopalan, Ramya; Chhetri, Amit S.. Acoustic echo cancellation using visual cues. USP2017099767828.
  2. Velusamy, Kavitha; Chu, Wai C.; Gopalan, Ramya; Chhetri, Amit S.. Acoustic echo cancellation using visual cues. USP20190310242695.
  3. Tan, Bozhao. Acoustic sound signature detection based on sparse features. USP2017109785706.
  4. Burke, Paul M.; Yacoub, Sherif. Allocation of speech recognition tasks and combination of results thereof. USP2013118589156.
  5. Carey, Ryan Michael; Chan, Victor Hokkiu. Analog signal reconstruction and recognition via sub-threshold modulation. USP2017059652711.
  6. Cho, Jeong-Mi; Kim, Jeong-Su; Bang, Won-Chul; Kim, Nam-Hoon. Apparatus and method for predicting user's intention based on multimodal information. USP2013128606735.
  7. Deligne, Sabine; Neti, Chalapathy V.; Potamianos, Gerasimos. Audio-visual codebook dependent cepstral normalization. USP2010027664637.
  8. Deligne, Sabine; Neti, Chalapathy V.; Potamianos, Gerasimos. Audio-visual codebook dependent cepstral normalization. USP2008017319955.
  9. Marcheret, Etienne; Vopicka, Josef; Goel, Vaibhava. Audio-visual speech recognition with scattering operators. USP20190110181325.
  10. Marcheret, Etienne; Vopicka, Josef; Goel, Vaibhava. Audio-visual speech recognition with scattering operators. USP2017079697833.
  11. Morrison, Andrew R.. Camera-assisted noise cancellation and speech recognition. USP2014018635066.
  12. Lahr, Roy J.. Head-worn, trimodal device to increase transcription accuracy in a voice recognition system and to process unvocalized speech. USP2006077082393.
  13. Zhou, Dong; Hovden, Gunnar; Noble, Isaac S.; Ivanchenko, Volodymyr V.; Karakotsios, Kenneth M.. Managing resource usage for task performance. USP2015129223415.
  14. Chen Tsuhan ; Rao Ram R.. Method and apparatus for cross-modal predictive coding for talking head sequences. USP1999055907351.
  15. Geppert, Nicolas Andre; Sattler, Jürgen. Method and system for the processing and storing of voice information and corresponding timeline information. USP2008037343288.
  16. Geppert, Nicolas Andre; Sattler, Jürgen. Method and system for the processing of voice data and for the recognition of a language. USP2008077406413.
  17. Choo, Ki hyun; Kim, Jeong su; Lee, Jae won; Lee, Ki seung. Method of setting optimum-partitioned classified neural network and method and apparatus for automatic labeling using optimum-partitioned classified neural network. USP2008107444282.
  18. Peterson Richard John ; Russell Dale William ; Karaali Orhan ; Bliss Harry Martin. Method, device and system for noise-tolerant language understanding. USP2001016178398.
  19. Wagner Thomas,DEX ; Boebel Friedrich G.,FRX ; Bauer Norbert,DEX. Person identification based on movement information. USP2000086101264.
  20. Hart, Gregory M.; Bezos, Jeffrey P.; Kwee, Frances MHH; Brown, James Samuel. Relative position-inclusive device interfaces. USP2016039274744.
  21. Capless, Jonathan. Scrolling display of electronic program guide utilizing images of user lip movements. USP2014088798311.
  22. Colmenarez, Antonio; Kellner, Andreas. Speech activity detection using acoustic and facial characteristics in an automatic speech recognition system. USP2007057219062.
  23. Campbell William Michael. Speech classifier and method using delay elements. USP2000036038535.
  24. Harada Masaaki,JPX ; Takeuchi Shin,JPX ; Fukui Motofumi,JPX ; Shimizu Tadashi,JPX. Speech detection apparatus using specularly reflected light. USP2001086272466.
  25. Hart, Gregory M.; Freed, Ian W.; Zehr, Gregg Elliott; Bezos, Jeffrey P.. Speech-inclusive device interfaces. USP2014048700392.
  26. Atal, Bishnu Saroop. System and method of pattern recognition in very high dimensional space. USP2011017869997.
  27. Atal, Bishnu Saroop. System and method of pattern recognition in very high-dimensional space. USP2007057216076.
  28. Atal, Bishnu Saroop. System and method of pattern recognition in very high-dimensional space. USP2008057369993.
  29. Atal, Bishnu Saroop. System and method of pattern recognition in very high-dimensional space. USP2006027006969.
  30. Thomas, David R.. Telescopic reconstruction of facial features from a speech pattern. USP2003096614466.
  31. Gruenstein, Alexander H.. Training multiple neural networks with different accuracy. USP2016119484022.
  32. Costello, Kevin Robert. User interface techniques for simulating three-dimensional depth. USP2016069367203.
  33. White, Marc. Using a physical phenomenon detector to control operation of a speech recognition engine. USP2012128326636.
  34. White, Marc. Using a physical phenomenon detector to control operation of a speech recognition engine. USP2015059037473.
  35. Bernd Girod DE. Video-assisted audio signal processing system and method. USP2002116483532.
  36. Girod, Bernd. Video-assisted audio signal processing system and method. USP200802RE40054.
  37. Margolis, Jeffrey. Virtual object. USP2013088509479.