Embodiments of the present invention provide a method, system and computer program product for the automated voice enablement of a Web page. In an embodiment of the invention, a method for voice enabling a Web page can include selecting an input field of a Web page for speech input, generating a spe
Embodiments of the present invention provide a method, system and computer program product for the automated voice enablement of a Web page. In an embodiment of the invention, a method for voice enabling a Web page can include selecting an input field of a Web page for speech input, generating a speech grammar for the input field based upon terms in a core attribute of the input field, receiving speech input for the input field, posting the received speech input and the grammar to an automatic speech recognition (ASR) engine and inserting a textual equivalent to the speech input provided by the ASR engine into a document object model (DOM) for the Web page.
대표청구항▼
1. A method for voice enabling a Web page, the method comprising: receiving a selection of an input field of the Web page;determining whether a speech grammar exists for the input field;generating, by using at least one processor, a speech grammar for the input field in response to determining that
1. A method for voice enabling a Web page, the method comprising: receiving a selection of an input field of the Web page;determining whether a speech grammar exists for the input field;generating, by using at least one processor, a speech grammar for the input field in response to determining that the speech grammar does not exist for the input field, wherein generating the speech grammar comprises: identifying a plurality of terms associated with the input field, wherein the plurality of terms represent permissible input for the input field and are specified in an attribute of a markup language element for the input field; andgenerating, the speech grammar for the input field based, at least in part, upon the plurality of terms identified in the attribute of the markup language element;receiving speech input for the input field;providing the received speech input and the generated speech grammar to an automatic speech recognition (ASR) engine configured to recognize the received speech input to produce a textual equivalent to the received speech input using the generated speech grammar; andinserting the textual equivalent into the input field. 2. The method of claim 1, wherein the plurality of terms associated with the input field comprise terms in a title attribute of the markup language element for the input field, and wherein generating the speech grammar for the input field comprises generating the speech grammar for the input field based upon the terms in the title attribute of the markup language element for the input field. 3. The method of claim 1, wherein the attribute of the markup language element for the input field specifies the plurality of terms and at least one prefix for one or more of the plurality of terms that may be combined with the one or more of the plurality of terms as permissible input for the input field, and wherein generating the speech grammar for the input field comprises generating the speech grammar for the input field based upon the plurality of terms and the at least one prefix. 4. The method of claim 1, wherein the attribute of the markup language element for the input field specifies the plurality of terms and a semantic indicator for one or more of the plurality of terms, and wherein generating the speech grammar for the input field comprises generating the speech grammar based upon the permitted terms and the semantic indicator for the permitted terms. 5. The method of claim 1, wherein providing the received speech input and the generated speech grammar to the ASR engine, comprises providing the received speech input and the generated speech grammar to a communicatively coupled remote representational state transfer (REST) compliant ASR engine. 6. A system comprising: at least one processor configured to: determine that an input field of the Web page has been selected;determine whether a speech grammar exists for the input field;generate a speech grammar for the input field in response to determining that the speech grammar does not exist for the input field, wherein generating the speech grammar comprises:identifying a plurality of terms associated with the input field wherein the plurality of terms represent permissible input for the input field and are specified in an attribute of a markup language element for the input field; andgenerating the speech grammar for the input field based, at least in part, upon the plurality of terms identified in the attribute of the markup language element;determining that speech input has been received for the input field;provide the received speech input and the generated speech grammar to a communicatively coupled automatic speech recognition (ASR) engine configured to recognize the received speech input to produce a textual equivalent to the received speech input using the generated speech grammar; andinsert the textual equivalent into the input field. 7. The system of claim 6, wherein the plurality of terms associated with the input field comprise terms in a title attribute of the markup language element for the input field, and wherein generating the speech grammar for the input field comprises generating the speech grammar for the input field based upon the terms in the title attribute of the markup language element for the input field. 8. The system of claim 6, wherein the attribute of the markup language element for the input field specifies the plurality of terms and at least one prefix for one or more of the plurality of terms that may be combined with the one or more of the plurality of terms as permissible input for the input field, and wherein generating the speech grammar for the input field comprises generating the speech grammar for the input field based upon the plurality of terms and the at least one prefix. 9. The system of claim 6, wherein the attribute of the markup language element for the input field specifies the plurality of terms and a semantic indicator for one or more of the plurality of terms, and wherein generating the speech grammar for the input field comprises generating the speech grammar based upon the permitted terms and the semantic indicator for the permitted terms. 10. The system of claim 6, wherein the ASR engine is a representational state transfer (REST) compliant ASR engine. 11. An article of manufacture comprising a computer-readable medium storing a computer program that, when executed by at least one processor, causes the at least one processor to perform a method for voice enabling a Web page, the method comprising: receiving a selection of an input field of the Web page,determining whether a speech grammar exists for the input field;generating a speech grammar for the input field in response to determining that the speech grammar does not exist for the input field, wherein generating the speech grammar comprises: identifying a plurality of terms associated with the input field, wherein the plurality of terms represent permissible input for the input field and are specified in an attribute of a markup language element for the input field; andgenerating, by the at least one processor, the speech grammar for the input field based, at least in part, upon the plurality of terms identified in the attribute of the markup language element;receiving speech input for the input field;providing the received speech input and the generated speech grammar to an automatic speech recognition (ASR) engine configured to recognize the received speech input to produce a textual equivalent to the received speech input using the generated speech grammar; andinserting the textual equivalent into the input field. 12. The article of manufacture of claim 11, wherein the plurality of terms associated with the input field comprise terms in a hidden title attribute of the markup language element for the input field, and wherein generating the speech grammar for the input field comprises generating the speech grammar for the input field based upon the terms in the hidden title attribute of the markup language element for the input field. 13. The article of manufacture of claim 11, wherein the attribute of the markup language element for the input field specifies the plurality of terms and at least one prefix for one or more of the plurality of terms that may be combined with the one or more of the plurality of terms as permissible input for the input field, and wherein generating the speech grammar for the input field comprises generating the speech grammar for the input field based upon the plurality of terms and the at least one prefix. 14. The article of manufacture of claim 11, wherein the attribute of the markup language element for the input field specifies the plurality of terms and a semantic indicator for one or more of the plurality of terms, and wherein generating the speech grammar for the input field comprises generating the speech grammar based upon the permitted terms and the semantic indicator for the permitted terms. 15. The method of claim 1, further comprising generating the speech grammar for the input field only if it is determined that the speech grammar does not exist for the input field. 16. The system of claim 6, wherein the at least one processor is further configured to generate the speech grammar for the input field only if it is determined that the speech grammar does not exist for the input field. 17. The article of manufacture of claim 11, wherein the method further comprises generating the speech grammar for the input field only if it is determined that the speech grammar does not exist for the input field. 18. The method of claim 1, wherein the plurality of terms associated with the input field are in the Web page. 19. The system of claim 6, wherein the plurality of terms associated with the input field are in the Web page. 20. The article of manufacture of claim 11, wherein the plurality of terms associated with the input field are in the Web page.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (29)
Junqua Jean-Claude ; Contolini Matteo, Apparatus and method using speech understanding for automatic channel selection in interactive television.
Potter, Stephen F.; Bowker, Anthony W., Computer implemented method of analyzing recognition results between a user and an interactive application utilizing inferred values instead of transcribed speech.
Yamamoto, Hiroki; Kuboyama, Hideo; Fukada, Toshiaki, Information-processing device and method that attains speech-recognition to recognize data input via speech.
Stifelman,Lisa Joy; Partovi,Hadi; Partovi,Haleh; Alpert,David Bryan; Marx,Matthew Talin; Bailey,Scott James; Sims,Kyle D.; Bailey,Darby McDonough; Brathwaite,Roderick Steven; Koh,Eugene; Davis,Angus Macdonald, Providing services for an information processing system using an audio interface.
Dantzig,Paul M.; Filepp,Robert; Liu,Yew Huey, System and method for generating and presenting multi-modal applications from intent-based markup scripts.
Groner, Gabriel F.; Kundin, Jane I., System and method for user controlled insertion of standardized text in user selected fields while dictating text entries for completing a form.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.