[특허]Speaker adaptation based on lateral tying for large-vocabulary continuous speech recognition

Speaker adaptation based on lateral tying for large-vocabulary continuous speech recognition 원문보기

IPC분류정보
국가/구분	United States(US) Patent 등록
국제특허분류(IPC7판)	G10L-005/06
출원번호	US-0600859 (1996-02-13)
발명자 / 주소	Bellegarda Jerome R. Butzberger John W. Chow Yen-Lu
출원인 / 주소	Apple Computer, Inc.
대리인 / 주소	Sawyer & Associates
인용정보	피인용 횟수 : 115 인용 특허 : 4

초록 ▼

A system and method for performing speaker adaptation in a speech recognition system which includes a set of reference models corresponding to speech data from a plurality of speakers. The speech data is represented by a plurality of acoustic models and corresponding sub-events, and each sub-event includes one or more observations of speech data. A degree of lateral tying is computed between each pair of sub-events, wherein the degree of tying indicates the degree to which a first observation in a first sub-event contributes to the remaining sub-events. When adaptation data from a new speaker becomes available, a new observation from adaptation data is assigned to one of the sub-events. Each of the sub-events is then populated with the observations contained in the assigned sub-event based on the degree of lateral tying that was computed between each pair of sub-events. The reference models corresponding to the populated sub-events are then adapted to account for speech pattern idiosyncrasies of the new speaker, thereby reducing the error rate of the speech recognition system.

대표청구항 ▼

[ What is claimed is:] [1.] A method of performing speaker adaptation in a speech recognition system which includes a set of reference models corresponding to speech data from a plurality of speakers, the speech data represented by a plurality of acoustic models and corresponding sub-events, wherein each sub-event includes one or more observations of speech data, the method comprising the steps of:(a) computing a degree of lateral tying between each pair of sub-events, wherein the degree of tying indicates the degree to which a first observation in a first sub-event contributes to the remaining sub-events;(b) assigning a new observation from adaptation data of a new speaker to one of the sub-events;(c) populating each of the sub-events with a transformed version of the observation contained in the assigned sub-event based on the degree of lateral tying computed between each pair of sub-events;(d) adapting the reference models that correspond to the populated sub-events to account for speech pattern idiosyncrasies of the new speaker, thereby reducing the error rate of the speech recognition system.

이 특허에 인용된 특허 (4)

Bimbot Frdric (Fontenay-Aux-Roses FRX) Mathan Luc (Lannion FRX), Process for measuring the resemblance between sound samples and apparatus for performing this process.
상세보기
Chou Wu (Piscataway NJ) Juang Biing-Hwang (Warren NJ), Recognition unit model training based on competing word and word string models.
상세보기
Rosenberg Aaron E. (Berkeley Heights NJ) Soong Frank K.-P. (Fanwood NJ), Technique for modifying reference vector quantized speech feature signals.
상세보기
Picone Joseph (Plano TX) Wheatley Barbara J. (Plano TX), Voice log-in using spoken name input.
상세보기

이 특허를 인용한 특허 (115)

Gruber, Thomas R.; Sabatelli, Alessandro F.; Aybes, Alexandre A.; Pitschel, Donald W.; Voas, Edward D.; Anzures, Freddy A.; Marcos, Paul D., Actionable reminder entries.
상세보기
Gruber, Thomas Robert; Sabatelli, Alessandro F.; Aybes, Alexandre A.; Pitschel, Donald W.; Voas, Edward D.; Anzures, Freddy A.; Marcos, Paul D., Active transport based notifications.
상세보기
Mason, Henry, Analyzing audio input for efficient speech and music recognition.
상세보기
Perrone Michael P. ; Subrahmonia Jayashree, Apparatus and method for augmenting data in handwriting recognition system.
상세보기
Huang, Rongqing; Oparin, Ilya, Applying neural network language models to weighted finite state transducers for automatic speech recognition.
상세보기
Nallasamy, Udhyakumar; Kajarekar, Sachin S.; Paulik, Matthias; Seigel, Matthew, Automatic accent detection using acoustic models.
상세보기
Phillips, Michael S.; Govindarajan, Krishna K.; Fanty, Mark; Barnard, Etienne, Automatically retraining a speech recognition system.
상세보기
Kohavi Ron, Bayes rule based and decision tree hybrid classifier.
상세보기
Giuli, Richard D.; Treadgold, Nicholas K., Better resolution when referencing to concepts.
상세보기
Giuli, Richard D.; Treadgold, Nicholas K., Better resolution when referencing to concepts.
상세보기
Naik, Devang K.; Mohamed, Ali S.; Chen, Hong M., Caching apparatus for serving phonetic pronunciations.
상세보기
Newendorp, Brandon J.; Dibiase, Evan S., Competing devices responding to voice triggers.
상세보기
Sang'udi, Gerald P.; Bott, Ross A.; Tesler, Joel D.; Hawkes, John R.; Xiong, Rebecca W.; Schkolnick, Mario, Computer-related method and system for controlling data visualization in external dimension(s).
상세보기
Gerald P. Sang'udi ; Ross A. Bott ; Joel D. Tesler ; John R. Hawkes ; Rebecca W. Xiong ; Mario Schkolnick, Computer-related method, system, and program product for controlling data visualization in external dimension(s).
상세보기
Williams, Shaun E.; Mason, Henry G.; Krishnamoorthy, Mahesh; Paulik, Matthias; Agrawal, Neha; Kajarekar, Sachin S.; Uguroglu, Selen; Mohamed, Ali S., Context-based endpoint detection.
상세보기
Larson, Anthony L.; Dave, Swapnil R.; Varoglu, Devrim, Context-sensitive handling of interruptions.
상세보기
van Os, Marcel, Context-sensitive handling of interruptions by intelligent digital assistant.
상세보기
Gruber, Thomas R.; Cheyer, Adam John; Pitschel, Donald W., Crowd sourcing information to fulfill user requests.
상세보기
Rhoten, George; Treadgold, Nicholas K., Determining domain salience ranking from ambiguous words in natural speech.
상세보기
Cheyer, Adam John; Brigham, Christopher Dean; Guzzoni, Didier Rene, Determining user intent based on ontologies of domains.
상세보기
Cheyer, Adam J., Device access using voice authentication.
상세보기
Cheyer, Adam John, Device access using voice authentication.
상세보기
Piernot, Philippe P.; Binder, Justin G., Device voice control for selecting a displayed affordance.
상세보기
Carson, David A.; Keen, Daniel; Dibiase, Evan; Saddler, Harry J.; Iacono, Marco; Lemay, Stephen O.; Pitschel, Donald W.; Gruber, Thomas R., Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant.
상세보기
Fleizach, Christopher Brian; Gruber, Thomas Robert, Device, method, and user interface for voice-activated navigation and browsing of a document.
상세보기
Raitio, Tuomo J.; Hunt, Melvyn J.; Richards, Hywel B.; Chinthakunta, Madhusudan, Digital assistant providing whispered speech.
상세보기
Henton, Caroline; Naik, Devang, Disambiguating heteronyms in speech synthesis.
상세보기
Bellegarda, Jerome R., Entropy-guided text prediction using combined word and character n-gram language models.
상세보기
Futrell, Richard L.; Gruber, Thomas R., Exemplar-based natural language processing.
상세보기
Futrell, Richard L.; Gruber, Thomas R., Exemplar-based natural language processing.
상세보기
Huang Xuedong D. ; Rozak Michael J. ; Jiang Li, Extensible speech recognition system that provides a user with audio feedback.
상세보기
Bellegarda, Jerome R.; Silverman, Kim E. A., Fast, language-independent method for user authentication by voice.
상세보기
Bellegarda, Jerome R.; Silverman, Kim E. A., Fast, language-independent method for user authentication by voice.
상세보기
Fleizach, Christopher Brian; Minifie, Darren C., Handling speech synthesis of content for multiple languages.
상세보기
Washio, Nobuyuki, Information processing apparatus, method and recording medium for generating acoustic model.
상세보기
Bellegarda, Jerome R., Integrated word N-gram and class M-gram language models.
상세보기
Orr, Ryan M.; Nell, Garett R.; Brumbaugh, Benjamin L., Intelligent assistant for home automation.
상세보기
Gruber, Thomas Robert; Cheyer, Adam John; Kittlaus, Dag; Guzzoni, Didier Rene; Brigham, Christopher Dean; Giuli, Richard Donald; Bastea-Forte, Marcello; Saddler, Harry Joseph, Intelligent automated assistant.
상세보기
Gruber, Thomas Robert; Cheyer, Adam John; Kittlaus, Dag; Guzzoni, Didier Rene; Brigham, Christopher Dean; Giuli, Richard Donald; Bastea-Forte, Marcello; Saddler, Harry Joseph, Intelligent automated assistant.
상세보기
Os, Marcel Van; Saddler, Harry J.; Napolitano, Lia T.; Russell, Jonathan H.; Lister, Patrick M.; Dasari, Rohit, Intelligent automated assistant for TV user interactions.
상세보기
Van Os, Marcel; Saddler, Harry J.; Napolitano, Lia T.; Russell, Jonathan H.; Lister, Patrick M.; Dasari, Rohit, Intelligent automated assistant for TV user interactions.
상세보기
Orr, Ryan M.; Bernardo, Matthew P.; Mandel, Daniel J., Intelligent automated assistant for media exploration.
상세보기
Piersol, Kurt W.; Orr, Ryan M.; Mandel, Daniel J., Intelligent device arbitration and control.
상세보기
Booker, Susan L.; Krishnan, Murali; Weinberg, Garrett L.; Piercy, Aimee, Intelligent list reading.
상세보기
Fleizach, Christopher Brian; Hudson, Reginald Dean, Intelligent text-to-speech conversion.
상세보기
Fleizach, Christopher Brian; Hudson, Reginald Dean, Intelligent text-to-speech conversion.
상세보기
Lemay, Stephen O.; Sabatelli, Alessandro Francesco; Anzures, Freddy Allen; Chaudhri, Imran; Forstall, Scott; Novick, Gregory, Interface for a virtual digital assistant.
상세보기
Cash, Jesse R.; Dave, Swapnil R.; Varoglu, Devrim, Interpreting and acting upon commands that involve sharing information with remote devices.
상세보기
Bellegarda, Jerome R.; Barman, Bishal, Language identification from short strings.
상세보기
Hatori, Jun; Yu, Dominic, Language input correction.
상세보기
Paulik, Matthias; Evermann, Gunnar; Gillick, Laurence S., Method and apparatus for discovering trending terms in speech requests.
상세보기
Choi,In jeong; Kim,Sang ryong, Method and apparatus for discriminative estimation of parameters in maximum a posteriori (MAP) speaker adaptation condition and voice recognition method and apparatus including these.
상세보기
Kompe, Ralf; Goronzy, Silke; Marasek, Krzysztof, Method for recognizing speech to avoid over-adaptation during online speaker adaptation.
상세보기
Morii,Keiko; Ohno,Yoshio, Method for speech recognition, apparatus for the same, and voice controller.
상세보기
Paulik, Matthias; Huang, Rongqing, Method for supporting dynamic grammars in WFST-based ASR.
상세보기
Tesler Joel D., Method, system and computer program product for navigating through partial hierarchies.
상세보기
Barry Glenn Becker ; Roger A. Crawfis, Method, system and computer program product for visually approximating scattered data using color to represent values of a categorical variable.
상세보기
Rathmann Peter K. ; Haber Eben M., Method, system, and computer program product for computing histogram aggregations.
상세보기
Tesler Joel D., Method, system, and computer program product for mapping between an overview and a partial hierarchy.
상세보기
Becker Barry G., Method, system, and computer program product for visualizing a data structure.
상세보기
Kohavi Ron ; Tesler Joel D., Method, system, and computer program product for visualizing a decision-tree classifier.
상세보기
Tesler Joel D., Method, system, and computer program product for visualizing data using partial hierarchies.
상세보기
Lee, Michael M., Methods and apparatus for altering audio output signals.
상세보기
Vysotsky George J. ; Raman Vijay R., Methods and apparatus for generating and using out of vocabulary word models for speaker dependent speech recognition.
상세보기
Vysotsky George J. ; Raman Vijay R., Methods and apparatus for generating and using speaker independent garbage models for speaker dependent speech recognit.
상세보기
Lee, Michael M.; Gregg, Justin; Seguin, Chad G., Mobile device having human language translation capability with positional feedback.
상세보기
Lee, Michael M.; Gregg, Justin; Seguin, Chad G., Mobile device having human language translation capability with positional feedback.
상세보기
Gruber, Thomas R.; Saddler, Harry J.; Bellegarda, Jerome Rene; Nyeggen, Bryce H.; Sabatelli, Alessandro, Multi-command single utterance input method.
상세보기
Bellegarda, Jerome R.; Davidson, Douglas R., Multilingual word prediction.
상세보기
Naik, Devang K., Name recognition system.
상세보기
Gruber, Thomas Robert; Saddler, Harry Joseph; Cheyer, Adam John; Kittlaus, Dag; Brigham, Christopher Dean; Giuli, Richard Donald; Guzzoni, Didier Rene; Bastea-Forte, Marcello, Paraphrasing of user requests and results by automated digital assistant.
상세보기
Bellegarda, Jerome R., Parsimonious continuous-space phrase representations for natural language processing.
상세보기
Bellegarda, Jerome R.; Yaman, Sibel, Parsimonious handling of word inflection via categorical stem + suffix N-gram language models.
상세보기
Chen, Lik Harry; Cheyer, Adam John; Guzzoni, Didier Rene; Gruber, Thomas Robert, Personalized vocabulary for digital assistant.
상세보기
Wang, Xin; Ramerth, Brent D., Predictive conversion of language input.
상세보기
Dolfing, Jannes; Ramerth, Brent; Davidson, Douglas; Bellegarda, Jerome; Moore, Jennifer; Eminidis, Andreas; Shaffer, Joshua, Predictive text input.
상세보기
Paulik, Matthias; Mason, Henry G.; Seigel, Matthew S., Privacy preserving distributed evaluation framework for embedded personalized systems.
상세보기
Martel, Mathieu Jean; Deniau, Thomas, Proactive assistance based on dialog communication between devices.
상세보기
Chau, Tom; Sejdic, Ervin, Procedure for denoising dual-axis swallowing accelerometry signals.
상세보기
Kim, Yoon, Providing an indication of the suitability of speech recognition.
상세보기
Piernot, Philippe P.; Binder, Justin G., Reducing the need for manual start/end-pointing and trigger phrases.
상세보기
Gillick Laurence S. ; Corrada-Emmanuel Andres ; Newman Michael J. ; Peskin Barbara R., Sequential, nonparametric speech recognition and speaker identification.
상세보기
Cheyer, Adam John; Guzzoni, Didier Rene; Gruber, Thomas Robert; Brigham, Christopher Dean, Service orchestration for intelligent automated assistant.
상세보기
Naik, Devang K.; Piernot, Philippe P., Social reminders.
상세보기
Naik, Devang K.; Piernot, Philippe P., Social reminders.
상세보기
Kuhn Roland ; Junqua Jean-Claude, Speaker and environment adaptation based on eigenvoices.
상세보기
Kim, Yoon; Kajarekar, Sachin S., Speaker identification and unsupervised speaker adaptation techniques.
상세보기
Hab-Umbach, Reinhold, Specifying a tree structure for speech recognizers using correlation between regression classes.
상세보기
Hunt, Melvyn; Bridle, John, Speech recognition involving a mobile device.
상세보기
Sumner, Michael R.; Newendorp, Brandon J.; Orr, Ryan M., Structured dictation using intelligent automated assistants.
상세보기
Sinha, Anoop K., System and method for detecting errors in interactions with a voice-based digital assistant.
상세보기
Roberts, Andrew J.; Martin, David L.; Saddler, Harry J., System and method for emergency calls initiated by voice command.
상세보기
Evermann, Gunnar, System and method for inferring user intent from speech inputs.
상세보기
Kohavi Ron ; Sommerfield Daniel A., System and method for selection of important attributes.
상세보기
Naik, Devang K.; Tackin, Onur E., System and method for updating an adaptive speech recognition model.
상세보기
Naik, Devang K.; Gruber, Thomas R.; Weiner, Liam; Binder, Justin G.; Srisuwananukorn, Charles; Evermann, Gunnar; Williams, Shaun Eric; Chen, Hong; Napolitano, Lia T., System and method for user-specified pronunciation of words for speech synthesis and recognition.
상세보기
Naik, Devang K.; Gruber, Thomas R.; Weiner, Liam; Binder, Justin G.; Srisuwananukorn, Charles; Evermann, Gunnar; Williams, Shaun Eric; Chen, Hong; Napolitano, Lia T., System and method for user-specified pronunciation of words for speech synthesis and recognition.
상세보기
Naik, Devang K., Systems and methods for name pronunciation.
상세보기
Bellegarda, Jerome R.; Yaman, Sibel, Systems and methods for structured stem and suffix language models.
상세보기
Lee Chin-Hui ; Shinoda Koichi,JPX, Technique for adaptation of hidden markov models for speech recognition.
상세보기
Boyce Susan J. ; Brotman Lynne Shapiro ; Brown Deborah W. ; Goldberg Randy G. ; Haszto Edward D. ; Marcus Stephen M. ; Rosinski Richard R. ; Wetzel William R., Telephone-based speech recognition for data collection.
상세보기
Neels, Alice E.; Jong, Nicholas K., Text correction processing.
상세보기
Willmore, Christopher P.; Jong, Nicholas K.; Hogg, Justin S., Text prediction using combined word N-gram and unigram language models.
상세보기
Pitschel, Donald W.; Cheyer, Adam J.; Brigham, Christopher D.; Gruber, Thomas R., Training an at least partial voice command system.
상세보기
Bellegarda, Jerome R., Unified ranking with entropy-weighted information for phrase-based semantic auto-completion.
상세보기
Raitio, Tuomo J.; Prahallad, Kishore Sunkeswari; Conkie, Alistair D.; Golipour, Ladan; Winarsky, David A., Unit-selection text-to-speech synthesis based on predicted concatenation parameters.
상세보기
Jeon, Woojay, Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks.
상세보기
Haughay, Allen P., User profiling for voice input processing.
상세보기
Haughay, Allen P., User profiling for voice input processing.
상세보기
Gruber, Thomas Robert; Brigham, Christopher Dean; Keen, Daniel S.; Novick, Gregory; Phipps, Benjamin S., Using context information to facilitate processing of commands in a virtual assistant.
상세보기
Gruber, Thomas Robert; Cheyer, Adam John; Guzzoni, Didier Rene, Using event alert text as input to an automated assistant.
상세보기
Lemay, Stephen O.; Newendorp, Brandon J.; Dascola, Jonathan R., Virtual assistant activation.
상세보기
Junqua, Jean-Claude; Perronnin, Florent; Kuhn, Roland; Nguyen, Patrick, Voice personalization of speech synthesizer.
상세보기
Binder, Justin; Post, Samuel D.; Tackin, Onur; Gruber, Thomas R., Voice trigger for a digital assistant.
상세보기
Badaskar, Sameer, Voice-based media searching.
상세보기

IPC	Description
A	생활필수품
A62	인명구조; 소방(사다리 E06C)
A62B	인명구조용의 기구, 장치 또는 방법(특히 의료용에 사용되는 밸브 A61M 39/00; 특히 물에서 쓰이는 인명구조 장치 또는 방법 B63C 9/00; 잠수장비 B63C 11/00; 특히 항공기에 쓰는 것, 예. 낙하산, 투출좌석 B64D; 특히 광산에서 쓰이는 구조장치 E21F 11/00)
A62B-1/08	.. 윈치 또는 풀리에 제동기구가 있는 것

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 공고번호, 공고일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표출원인, 출원인국적, 출원인주소, 발명자, 발명자E, 발명자코드, 발명자주소, 발명자 우편번호, 발명자국적, 대표IPC, IPC코드, 요약, 미국특허분류, 대리인주소, 대리인코드, 대리인(한글), 대리인(영문), 국제공개일자, 국제공개번호, 국제출원일자, 국제출원번호, 우선권, 우선권주장일, 우선권국가, 우선권출원번호, 원출원일자, 원출원번호, 지정국, Citing Patents, Cited Patents
저장형식	Text(ASCII format) Excel format PIAS분석(.xls)
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

Speaker adaptation based on lateral tying for large-vocabulary continuous speech recognition 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

이 특허에 인용된 특허 (4)

이 특허를 인용한 특허 (115)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

Speaker adaptation based on lateral tying for large-vocabulary continuous speech recognition 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

전체(0) 논문(0) 특허(0) 보고서(0)

전체(0) 논문(0) 특허(0) 보고서(0)

이 특허에 인용된 특허 (4)

이 특허를 인용한 특허 (115)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트