Free form input field support for automated voice enablement of a web page
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G06F-017/27
G10L-015/04
G10L-015/00
G10L-015/18
G10L-021/00
G10L-025/00
G10L-015/22
G06F-017/24
G06F-003/16
G10L-015/183
G10L-015/197
H04M-003/493
출원번호
US-0099041
(2008-04-07)
등록번호
US-9047869
(2015-06-02)
발명자
/ 주소
Moore, Victor S.
Nusbickel, Wendi L.
출원인 / 주소
Nuance Communications, Inc.
대리인 / 주소
Wolf, Greenfield & Sacks, P.C.
인용정보
피인용 횟수 :
0인용 특허 :
31
초록▼
Embodiments of the present invention provide a method and computer program product for the automated voice enablement of a Web page with free form input field support. In an embodiment of the invention, a method for voice enabling a Web page with free form input field support can be provided. The me
Embodiments of the present invention provide a method and computer program product for the automated voice enablement of a Web page with free form input field support. In an embodiment of the invention, a method for voice enabling a Web page with free form input field support can be provided. The method can include receiving speech input for an input field in a Web page, parsing a core attribute for the input field and identifying an external statistical language model (SLM) referenced by the core attribute of the input field, posting the received speech input and the SLM to an automatic speech recognition (ASR) engine, and inserting a textual equivalent to the speech input provided by the ASR engine in conjunction with the SLM into the input field.
대표청구항▼
1. A method for voice enabling a Web page, the method comprising: receiving speech input for an input field in the Web page; determining whether the input field is a free form input field; and performing a plurality of first actions in response to determining that the input field is not a free form
1. A method for voice enabling a Web page, the method comprising: receiving speech input for an input field in the Web page; determining whether the input field is a free form input field; and performing a plurality of first actions in response to determining that the input field is not a free form input field, wherein the plurality of first actions includes: generating a speech grammar for the input field based upon terms associated with the input field, wherein the terms associated with the input field comprise terms in a hidden title attribute of the input field, and wherein generating the speech grammar comprises generating the speech grammar for the input field based upon the terms in the hidden title attribute; providing the received speech input and the generated speech grammar to an ASR engine configured to recognize the received speech input to produce a first textual equivalent to the received speech input using the generated speech grammar; andinserting the first textual equivalent into the input field. 2. The method of claim 1, further comprising: performing a plurality of second actions in response to determining that the input field is a free form input field, wherein the plurality of second actions includes: identifying an external statistical language model (SLM) specified in a markup language element associated with the input field, wherein the markup language element is an HTML element; providing, by using at least one processor, the received speech input and the SLM to an automatic speech recognition (ASR) engine configured to recognize the received speech input to produce a second textual equivalent to the received speech input using the SLM; andinserting the second textual equivalent into the input field. 3. The method of claim 1, wherein the terms associated with the input field comprise permitted terms for a corresponding grammar and at least one prefix for the permitted terms, and wherein generating the speech grammar comprises generating the speech grammar for the input field based upon the permitted terms and the at least one prefix for the permitted terms. 4. The method of claim 1, wherein the method further comprises determining whether a speech grammar exists for the input field and performing the act of determining whether the input field is a free form input field only if it is determined that the speech grammar does not exist for the input field. 5. The method of claim 1, wherein the terms associated with the input field are in the Web page. 6. The method of claim 2, wherein the external SLM is specified in the markup language element using a network address identifying a location of the external SLM. 7. An article of manufacture comprising a computer-readable medium storing a computer program that, when executed by at least one processor, causes the at least one processor to perform a method for voice enabling a Web page, the method comprising: receiving speech input for an input field in the Web page; determining whether the input field is a free form input field; and performing a plurality of first actions in response to determining that the input field is not a free form input field, wherein the plurality of first actions includes: generating a speech grammar for the input field based upon terms associated with the input field, wherein the terms associated with the input field comprise terms in a hidden title attribute of the input field, and wherein generating the speech grammar comprises generating the speech grammar for the input field based upon the terms in the hidden title attribute; providing the received speech input and the generated speech grammar to an ASR engine configured to recognize the received speech input to produce a first textual equivalent to the received speech input using the generated speech grammar; and inserting the first textual equivalent into the input field. 8. The article of manufacture of claim 7, wherein the method further comprises: performing a plurality of second actions in response to determining that the input field is a free form field, wherein the plurality of second actions includes: identifying an external statistical language model (SLM) specified in a markup language element associated with the input field, wherein the markup language element is an HTML element;providing, by using at least one processor, the received speech input and the SLM to an automatic speech recognition (ASR) engine configured to recognize the received speech input to produce a second textual equivalent to the received speech input using the SLM; andinserting the second textual equivalent into the input field. 9. The article of manufacture of claim 7, wherein the terms associated with the input field comprise permitted terms for a corresponding grammar and at least one prefix for the permitted terms, and wherein generating the speech grammar comprises generating the speech grammar for the input field based upon the permitted terms and the at least one prefix for the permitted terms. 10. The article of manufacture of claim 7, wherein the method further comprises determining whether a speech grammar exists for the input field and performing the act of determining whether the input field is a free form input field only if it is determined that the speech grammar does not exists for the input field. 11. The article of manufacture of claim 7, wherein the terms associated with the input field are in the Web page. 12. The article of manufacture of claim 8, wherein the external SLM is specified in the markup language element using a network address identifying a location of the external SLM. 13. A system comprising: at least one processor configured to: receive speech input for an input field in a Web page;determine whether the input field is a free form input field; and perform a plurality of first actions in response to determining that the input field is not a free form input field, wherein the plurality of first actions includes:generating a speech grammar for the input field based upon terms associated with the input field, wherein the terms associated with the input field comprise terms in a hidden title attribute of the input field, and wherein generating the speech grammar comprises generating the speech grammar for the input field based upon the terms in the hidden title attribute;providing the received speech input and the generated speech grammar to an ASR engine configured to recognize the received speech input to produce a first textual equivalent to the received speech input using the generated speech grammar; and inserting the first textual equivalent into the input field. 14. The system of claim 13, wherein the at least one processor is further configured to: perform a plurality of second actions in response to determining that the input field is a free form field, wherein the plurality of second actions includes: identifying an external statistical language model (SLM) specified in a markup language element associated with the input field, wherein the markup language element is an HTML element;providing the received s speech input and the SLM to an automatic speech recognition (ASR) engine configured to recognize the received speech input to produce a second textual equivalent to the received speech input using the SLM; andinserting the second textual equivalent into the input field. 15. The system of claim 13, wherein the terms associated with the input field comprise permitted terms for a corresponding grammar and at least one prefix for the permitted terms, and wherein generating the speech grammar comprises generating a speech grammar for the input field based upon the permitted terms and the at least one prefix for the permitted terms. 16. The system of claim 14, wherein the at least one processor is perform the act of determining whether the input field is a free form input field only if it is determined that the speech grammar does not exist for the input field. 17. The system of claim 13, wherein the terms associated with the input field are in the Web page.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (31)
Junqua Jean-Claude ; Contolini Matteo, Apparatus and method using speech understanding for automatic channel selection in interactive television.
Potter, Stephen F.; Bowker, Anthony W., Computer implemented method of analyzing recognition results between a user and an interactive application utilizing inferred values instead of transcribed speech.
Yamamoto, Hiroki; Kuboyama, Hideo; Fukada, Toshiaki, Information-processing device and method that attains speech-recognition to recognize data input via speech.
Ehsani, Farzad; Knodt, Eva M.; Master, Demitrios L., Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface.
Stifelman,Lisa Joy; Partovi,Hadi; Partovi,Haleh; Alpert,David Bryan; Marx,Matthew Talin; Bailey,Scott James; Sims,Kyle D.; Bailey,Darby McDonough; Brathwaite,Roderick Steven; Koh,Eugene; Davis,Angus Macdonald, Providing services for an information processing system using an audio interface.
Dantzig,Paul M.; Filepp,Robert; Liu,Yew Huey, System and method for generating and presenting multi-modal applications from intent-based markup scripts.
Groner, Gabriel F.; Kundin, Jane I., System and method for user controlled insertion of standardized text in user selected fields while dictating text entries for completing a form.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.