[Patent] Automated voice enablement of a web page

IPC classification information
Country / Type	United States (US) Patent, granted
International Patent Classification (IPC, 7th edition)	G01L-021/00 G10L-019/00 G10L-025/90 G06F-015/16 G06F-003/048 G06F-003/00 H04M-003/493 G10L-015/26
Application number	US-0099028 (2008-04-07)
Registration number	US-8831950 (2014-09-09)
Inventor / Address	Moore, Victor S.; Nusbickel, Wendi L.
Applicant / Address	Nuance Communications, Inc.
Agent / Address	Wolf, Greenfield & Sacks, P.C.
Citation information	Times cited: 1; Patents cited: 29

Abstract

Embodiments of the present invention provide a method, system and computer program product for the automated voice enablement of a Web page. In an embodiment of the invention, a method for voice enabling a Web page can include selecting an input field of a Web page for speech input, generating a speech grammar for the input field based upon terms in a core attribute of the input field, receiving speech input for the input field, posting the received speech input and the grammar to an automatic speech recognition (ASR) engine and inserting a textual equivalent to the speech input provided by the ASR engine into a document object model (DOM) for the Web page.
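
The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the comma-separated `title` format, the JSGF-style grammar string, the `stub_asr` function (which takes already-transcribed text in place of audio), and every function name here are assumptions made only for illustration. A parsed attribute dictionary stands in for the DOM.

```python
from html.parser import HTMLParser


class InputFieldParser(HTMLParser):
    """Collects the attributes of every <input> element, keyed by its name."""

    def __init__(self):
        super().__init__()
        self.fields = {}

    def handle_starttag(self, tag, attrs):
        if tag == "input":
            attr_map = dict(attrs)
            self.fields[attr_map.get("name", "")] = attr_map


def generate_grammar(field_name, terms):
    """Build a JSGF-style grammar string listing the permitted terms."""
    alternatives = " | ".join(terms)
    return (
        "#JSGF V1.0;\n"
        f"grammar {field_name};\n"
        f"public <{field_name}> = {alternatives};"
    )


def stub_asr(speech_text, grammar):
    """Hypothetical stand-in for an ASR engine.

    It accepts text instead of audio and returns the matching grammar
    term, or None when the phrase is not permitted by the grammar.
    """
    rule = grammar.rsplit("=", 1)[1].rstrip(";")
    allowed = [alt.strip() for alt in rule.split("|")]
    spoken = speech_text.strip().lower()
    for term in allowed:
        if term.lower() == spoken:
            return term
    return None


def voice_enable(page_html, field_name, speech_text):
    """Select a field, build its grammar, recognize, and insert the text."""
    parser = InputFieldParser()
    parser.feed(page_html)
    field = parser.fields[field_name]  # the selected input field
    # Assumed convention: permitted terms are comma-separated in `title`.
    terms = [t.strip() for t in field.get("title", "").split(",") if t.strip()]
    grammar = generate_grammar(field_name, terms)
    text = stub_asr(speech_text, grammar)
    if text is not None:
        field["value"] = text  # insert the textual equivalent into the field
    return field


page = '<form><input type="text" name="city" title="Boston, Chicago, New York"></form>'
print(voice_enable(page, "city", "new york"))
```

Restricting recognition to the terms listed on the field is what allows the grammar to be derived from the page itself rather than authored separately.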

Representative claims

1. A method for voice enabling a Web page, the method comprising: receiving a selection of an input field of the Web page; determining whether a speech grammar exists for the input field; generating, by using at least one processor, a speech grammar for the input field in response to determining that the speech grammar does not exist for the input field, wherein generating the speech grammar comprises: identifying a plurality of terms associated with the input field, wherein the plurality of terms represent permissible input for the input field and are specified in an attribute of a markup language element for the input field; and generating, the speech grammar for the input field based, at least in part, upon the plurality of terms identified in the attribute of the markup language element; receiving speech input for the input field; providing the received speech input and the generated speech grammar to an automatic speech recognition (ASR) engine configured to recognize the received speech input to produce a textual equivalent to the received speech input using the generated speech grammar; and inserting the textual equivalent into the input field.
2. The method of claim 1, wherein the plurality of terms associated with the input field comprise terms in a title attribute of the markup language element for the input field, and wherein generating the speech grammar for the input field comprises generating the speech grammar for the input field based upon the terms in the title attribute of the markup language element for the input field.
3. The method of claim 1, wherein the attribute of the markup language element for the input field specifies the plurality of terms and at least one prefix for one or more of the plurality of terms that may be combined with the one or more of the plurality of terms as permissible input for the input field, and wherein generating the speech grammar for the input field comprises generating the speech grammar for the input field based upon the plurality of terms and the at least one prefix.
4. The method of claim 1, wherein the attribute of the markup language element for the input field specifies the plurality of terms and a semantic indicator for one or more of the plurality of terms, and wherein generating the speech grammar for the input field comprises generating the speech grammar based upon the permitted terms and the semantic indicator for the permitted terms.
5. The method of claim 1, wherein providing the received speech input and the generated speech grammar to the ASR engine, comprises providing the received speech input and the generated speech grammar to a communicatively coupled remote representational state transfer (REST) compliant ASR engine.
6. A system comprising: at least one processor configured to: determine that an input field of the Web page has been selected; determine whether a speech grammar exists for the input field; generate a speech grammar for the input field in response to determining that the speech grammar does not exist for the input field, wherein generating the speech grammar comprises: identifying a plurality of terms associated with the input field wherein the plurality of terms represent permissible input for the input field and are specified in an attribute of a markup language element for the input field; and generating the speech grammar for the input field based, at least in part, upon the plurality of terms identified in the attribute of the markup language element; determining that speech input has been received for the input field; provide the received speech input and the generated speech grammar to a communicatively coupled automatic speech recognition (ASR) engine configured to recognize the received speech input to produce a textual equivalent to the received speech input using the generated speech grammar; and insert the textual equivalent into the input field.
7. The system of claim 6, wherein the plurality of terms associated with the input field comprise terms in a title attribute of the markup language element for the input field, and wherein generating the speech grammar for the input field comprises generating the speech grammar for the input field based upon the terms in the title attribute of the markup language element for the input field.
8. The system of claim 6, wherein the attribute of the markup language element for the input field specifies the plurality of terms and at least one prefix for one or more of the plurality of terms that may be combined with the one or more of the plurality of terms as permissible input for the input field, and wherein generating the speech grammar for the input field comprises generating the speech grammar for the input field based upon the plurality of terms and the at least one prefix.
9. The system of claim 6, wherein the attribute of the markup language element for the input field specifies the plurality of terms and a semantic indicator for one or more of the plurality of terms, and wherein generating the speech grammar for the input field comprises generating the speech grammar based upon the permitted terms and the semantic indicator for the permitted terms.
10. The system of claim 6, wherein the ASR engine is a representational state transfer (REST) compliant ASR engine.
11. An article of manufacture comprising a computer-readable medium storing a computer program that, when executed by at least one processor, causes the at least one processor to perform a method for voice enabling a Web page, the method comprising: receiving a selection of an input field of the Web page, determining whether a speech grammar exists for the input field; generating a speech grammar for the input field in response to determining that the speech grammar does not exist for the input field, wherein generating the speech grammar comprises: identifying a plurality of terms associated with the input field, wherein the plurality of terms represent permissible input for the input field and are specified in an attribute of a markup language element for the input field; and generating, by the at least one processor, the speech grammar for the input field based, at least in part, upon the plurality of terms identified in the attribute of the markup language element; receiving speech input for the input field; providing the received speech input and the generated speech grammar to an automatic speech recognition (ASR) engine configured to recognize the received speech input to produce a textual equivalent to the received speech input using the generated speech grammar; and inserting the textual equivalent into the input field.
12. The article of manufacture of claim 11, wherein the plurality of terms associated with the input field comprise terms in a hidden title attribute of the markup language element for the input field, and wherein generating the speech grammar for the input field comprises generating the speech grammar for the input field based upon the terms in the hidden title attribute of the markup language element for the input field.
13. The article of manufacture of claim 11, wherein the attribute of the markup language element for the input field specifies the plurality of terms and at least one prefix for one or more of the plurality of terms that may be combined with the one or more of the plurality of terms as permissible input for the input field, and wherein generating the speech grammar for the input field comprises generating the speech grammar for the input field based upon the plurality of terms and the at least one prefix.
14. The article of manufacture of claim 11, wherein the attribute of the markup language element for the input field specifies the plurality of terms and a semantic indicator for one or more of the plurality of terms, and wherein generating the speech grammar for the input field comprises generating the speech grammar based upon the permitted terms and the semantic indicator for the permitted terms.
15. The method of claim 1, further comprising generating the speech grammar for the input field only if it is determined that the speech grammar does not exist for the input field.
16. The system of claim 6, wherein the at least one processor is further configured to generate the speech grammar for the input field only if it is determined that the speech grammar does not exist for the input field.
17. The article of manufacture of claim 11, wherein the method further comprises generating the speech grammar for the input field only if it is determined that the speech grammar does not exist for the input field.
18. The method of claim 1, wherein the plurality of terms associated with the input field are in the Web page.
19. The system of claim 6, wherein the plurality of terms associated with the input field are in the Web page.
20. The article of manufacture of claim 11, wherein the plurality of terms associated with the input field are in the Web page.
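
Claims 3, 4 and 15 add prefixes, semantic indicators, and generation only when no grammar exists yet. The claims do not say how these are encoded in the attribute, so the sketch below invents a syntax purely for illustration: a `title` of the form `prefixes: ...; terms: Term=VALUE, ...`. Reading the semantic indicator as a value returned in place of the spoken term is likewise an assumption, as are all function names and the in-memory cache.

```python
# Hypothetical per-field grammar store; a real system might keep this
# on the server or alongside the page.
_grammar_cache = {}


def parse_title(title):
    """Parse the assumed 'prefixes: a, b; terms: X=V, Y' title syntax."""
    sections = {}
    for part in title.split(";"):
        key, _, body = part.partition(":")
        sections[key.strip()] = [
            item.strip() for item in body.split(",") if item.strip()
        ]
    prefixes = sections.get("prefixes", [])
    terms = {}
    for item in sections.get("terms", []):
        term, _, meaning = item.partition("=")
        # Without an explicit indicator, the term is its own meaning.
        terms[term.strip()] = (meaning or term).strip()
    return prefixes, terms


def build_grammar(title):
    """Map each permitted phrase (lower-cased) to its semantic value."""
    prefixes, terms = parse_title(title)
    grammar = {}
    for term, meaning in terms.items():
        grammar[term.lower()] = meaning
        for prefix in prefixes:
            # A prefix may be combined with a term as permissible input.
            grammar[f"{prefix} {term}".lower()] = meaning
    return grammar


def grammar_for(field_name, title):
    """Return the stored grammar, generating one only if none exists."""
    if field_name not in _grammar_cache:
        _grammar_cache[field_name] = build_grammar(title)
    return _grammar_cache[field_name]


title = "prefixes: to, fly to; terms: Boston=BOS, Chicago=ORD, Denver"
grammar = grammar_for("destination", title)
print(grammar["fly to chicago"])
print(grammar["denver"])
```

With this encoding, "to Boston", "fly to Boston" and "Boston" all resolve to the same value, so the field receives a consistent entry regardless of how the user phrases it.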

Patents cited by this patent (29)

Junqua, Jean-Claude; Contolini, Matteo, Apparatus and method using speech understanding for automatic channel selection in interactive television.
Galanes, Francisco M.; Hon, Hsiao-Wuen; Jacoby, James D.; Lecoueche, Renaud J.; Potter, Stephen F., Application abstraction with dialog purpose.
Potter, Stephen F.; Bowker, Anthony W., Computer implemented method of analyzing recognition results between a user and an interactive application utilizing inferred values instead of transcribed speech.
Gould, Joel M., Continuous speech recognition.
Ferrans, James; Engelsma, Jonathan; Pearce, Michael; Randolph, Mark; Vogedes, Jerome, Dialog recognition and control in a voice browser.
Gong, Li; Weng, Jie; Raiyani, Samir; Swan, Richard J.; Vogler, Hartmut K., Dynamic grammar for voice-enabled applications.
Ativanichayaphong, Soonthorn; Cross, Jr., Charles W.; McCobb, Gerald M., Enabling grammars in web page frames.
Lee, Nicholas J.; Frederick, Robert; Schoenbaum, Ronald J., Generation and selection of voice recognition grammars for conducting database searches.
Yamamoto, Hiroki; Kuboyama, Hideo; Fukada, Toshiaki, Information-processing device and method that attains speech-recognition to recognize data input via speech.
Bennett, Ian M., Internet based speech recognition system with dynamic grammars.
Wang, Kuansan; Hon, Hsiao Wuen, Markup language extensions for web enabled recognition.
Bangalore, Srinivas; Feng, Junlan; Rahim, Mazin G., Method and apparatus for automatically building conversational systems.
Charney, Michael L.; Starren, Justin, Method and system for voice activating web pages.
Moore, Victor S.; Nusbickel, Wendi L., Proactive completion of input fields for automated voice enablement of a web page.
Stifelman, Lisa Joy; Partovi, Hadi; Partovi, Haleh; Alpert, David Bryan; Marx, Matthew Talin; Bailey, Scott James; Sims, Kyle D.; Bailey, Darby McDonough; Brathwaite, Roderick Steven; Koh, Eugene; Davis, Angus Macdonald, Providing services for an information processing system using an audio interface.
Wang, Kuansan; Hon, Hsiao Wuen, Servers for web enabled speech recognition.
Doyle, Sean, Speech recognition application grammar modeling.
Dantzig, Paul M.; Filepp, Robert; Liu, Yew Huey, System and method for generating and presenting multi-modal applications from intent-based markup scripts.
Julia, Luc E.; Bing, Jehan G.; Dubreuil, Jerome, System and method for speech activated navigation.
Groner, Gabriel F.; Kundin, Jane I., System and method for user controlled insertion of standardized text in user selected fields while dictating text entries for completing a form.
Chidlovskii, Boris, System and method of automatic wrapper grammar generation.
Patch, Kimberly, Systems and methods of a structured grammar for a speech recognition command system.
Kadashevich, A. Julie (Tyngsboro, MA); Harvey, Mary F. (Reading, MA); Clark, Cheryl (Arlington, MA), Text searching system.
Zimmerman, Roger S.; Egerman, Paul; Zavaliagkos, George, Transcription data extraction.
Arning, Andreas; Seiffert, Roland, Using a prediction algorithm on the addressee field in electronic mail systems.
Wang, Kuansan; Hon, Hsiao Wuen, Web enabled recognition architecture.
Galanes, Francisco M.; Lecoeuche, Renaud J.; Su, Fei, Web server controls for web enabled recognition and/or audible prompting for call controls.
Brown, Michael Kenneth; Rehor, Kenneth G.; Schmult, Brian Carl; Tuckey, Curtis Duane, Web-based platform for interactive voice response (IVR).
Brown, Michael Kenneth; Glinski, Stephen Charles; Schmult, Brian Carl, Web-based voice dialog interface.

Patents citing this patent (1)

Sadkin, Eric; Kaushik, Lakshmish; Gill, Jasjeet; Luz, Etay, Method and system for dynamic speech recognition and tracking of prewritten script.