Method for processing the output of a speech recognizer
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G10L-021/00
G10L-015/28
G10L-015/34
G06F-017/27
G10L-015/14
G10L-015/19
G10L-015/16
G10L-015/22
출원번호
US-0940253
(2015-11-13)
등록번호
US-9805723
(2017-10-31)
발명자
/ 주소
Roy, Philippe
Lagassey, Paul J.
출원인 / 주소
Great Northern Research, LLC
대리인 / 주소
Hoffberg, Esq., Steven M.
인용정보
피인용 횟수 :
0인용 특허 :
199
초록▼
A method for processing speech, comprising semantically parsing a received natural language speech input with respect to a plurality of predetermined command grammars in an automated speech processing system; determining if the parsed speech input unambiguously corresponds to a command and is suffic
A method for processing speech, comprising semantically parsing a received natural language speech input with respect to a plurality of predetermined command grammars in an automated speech processing system; determining if the parsed speech input unambiguously corresponds to a command and is sufficiently complete for reliable processing, then processing the command; if the speech input ambiguously corresponds to a single command or is not sufficiently complete for reliable processing, then prompting a user for further speech input to reduce ambiguity or increase completeness, in dependence on a relationship of previously received speech input and at least one command grammar of the plurality of predetermined command grammars, reparsing the further speech input in conjunction with previously parsed speech input, and iterating as necessary. The system also monitors abort, fail or cancel conditions in the speech input.
대표청구항▼
1. A method for processing speech, comprising: receiving an input represent human speech;processing the received input within alternate concurrent instances of a virtual processing environment executing on at least one automated data processing system;automatically communicating, between respective
1. A method for processing speech, comprising: receiving an input represent human speech;processing the received input within alternate concurrent instances of a virtual processing environment executing on at least one automated data processing system;automatically communicating, between respective alternate instances of the virtual processing environment, at least one status message to coordinate a reduction in a number of the alternate instances of the virtual processing environment;analyzing:(1) the received input to determine a human-initiated command;(2) the received input to determine data associated with the human initiated command;(3) a completeness and unambiguity of the received input with respect to an ability to at least partially process the human-initiated command; andif the received input is complete and unambiguous with respect to the ability to at least partially process the human initiated command, commencing processing, andif the received input is incomplete or ambiguous with respect to the ability to at least partially process the human initiated command, prompting for additional input. 2. The method according to claim 1, further comprising parsing the received input according to at least one predetermined grammar prior to analyzing. 3. The method according to claim 2, wherein said analyzing a completeness and unambiguity comprises determining a statistical reliability of a correspondence of the received input to the at least one predetermined grammar. 4. The method according to claim 3, wherein said determining a statistical reliability of a correspondence of the received input to the at least one predetermined grammar comprises determining a statistical reliability of a correspondence of the received input to each of a plurality of grammars corresponding to a predetermined plurality of available commands, and determining whether a single unambiguous predetermined command is indicated by the received input. 5. The method according to claim 1, wherein said prompting for additional input comprises determining a question to prompt the user, the answer to which is predicted to increase a statistical reliability of a correspondence of the received input to the at least one predetermined grammar. 6. The method according to claim 1, further comprising analyzing received input for presence of a human-initiated abort command, and ceasing said analyzing upon recognition of the abort command. 7. The method according to claim 1, wherein the received input comprises a plurality of commands, further comprising determining a completeness and unambiguity of the received speech with respect to the plurality of commands concurrently. 8. The method according to claim 7, wherein the plurality of commands are directed toward a single software application. 9. The method according to claim 7, wherein, if the received input corresponds to valid commands of a plurality of different applications, storing a state of each application, testing the received input as an input to the plurality of different applications, determining at least one application command execution failure, and restoring the prior stored state of the respective application subject to the command execution failure. 10. The method according to claim 7, wherein the plurality of commands are directed toward a plurality of different software applications. 11. The method according to claim 10, wherein the plurality of different applications comprise a messaging application and a contact database application. 12. The method according to claim 10, wherein the plurality of different applications comprise a calendar application and a contact database application. 13. A method, comprising: hosting, in an automated data processing system, at least one virtual workspace configured to support instances of a virtual processing environment;processing the received input within alternate concurrent instances of the virtual processing environment executing on at least one automated data processing system;automatically communicating, between respective alternate instances of the virtual processing environment, at least one status message to coordinate a reduction in a number of the alternate instances of the virtual processing environment;analyzing a completeness and unambiguity of the received input with respect to an ability to at least partially process a human-initiated command;if the received input is complete and unambiguous with respect to the ability to at least partially process the human initiated command, commencing processing;if the received input is incomplete or ambiguous with respect to the ability to at least partially process the human initiated command, prompting for additional input, by formulating a question to prompt the user, the answer to which is predicted to increase a statistical reliability of a correspondence to the at least one predetermined grammar, wherein the determination of the statistical reliability of the correspondence of the received input to the at least one predetermined grammar comprises a determination of a statistical reliability of a correspondence of the received input to each of a plurality of grammars corresponding to a predetermined plurality of available commands, and a determination of whether a single unambiguous predetermined available is indicated by the received input; andanalyze the received input for presence of a human-initiated abort command, and cease the analysis upon recognition of the abort command. 14. A method for processing speech, comprising: receiving speech as an input;processing the received speech within alternate concurrent instances of a virtual processing environment executing on at least one automated data processing system;automatically communicating, between respective alternate instances of the virtual processing environment, at least one status message to coordinate a reduction in a number of the alternate instances of the virtual processing environment;analyzing:(1) the received speech to determine a command;(2) the received speech to determine data associated with the command;(3) a completeness and unambiguity of the received speech with respect to an ability to execute the command; andif the received speech is complete and unambiguous with respect to the ability to execute the command, executing the command, within at least one virtual processing environment, in conjunction with data associated with the command, andif the received speech is incomplete or ambiguous with respect to the ability to execute the command, prompting for additional input. 15. The method according to claim 14, further comprising parsing the received speech according to at least one predetermined grammar prior to analyzing. 16. The method according to claim 15, wherein said analyzing a completeness and unambiguity comprises determining a statistical reliability of a correspondence of the received speech to the at least one predetermined grammar. 17. The method according to claim 16, wherein said determining a statistical reliability of a correspondence of the received speech to the at least one predetermined grammar comprises determining a statistical reliability of a correspondence of the received speech to each of a plurality of grammars corresponding to a predetermined plurality of available commands, and determining whether a single unambiguous predetermined command is indicated by the received speech. 18. The method according to claim 14, wherein said prompting for additional input comprises communicating a question as a prompt, having an answer predicted to increase a statistical reliability of a correspondence of the received speech to the at least one predetermined grammar. 19. The method according to claim 14, further comprising analyzing received speech for presence of a human-initiated abort command, and ceasing said analyzing upon recognition of the abort command. 20. The method according to claim 14, wherein the received speech comprises a plurality of commands, further comprising determining a completeness and unambiguity of the received speech with respect to the plurality of commands concurrently. 21. The method according to claim 20, wherein the plurality of commands are directed toward a plurality of different applications. 22. The method according to claim 21, wherein, if the received speech corresponds to valid commands of a plurality of different applications, storing a state of each application, testing the received input as an input to the plurality of different applications, determining at least one application command execution failure, and restoring the prior stored state of the respective application subject to the command execution failure.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (199)
Julia, Luc; Voutsas, Dimitris; Cheyer, Adam, Accessing network-based electronic information through scripted online interfaces using spoken input.
Van Vliembergen Eduardus J. W. (Venlo NLX) Heemels Robertus P. E. H. (Roermond NLX) Cremers Louis M. G. (Venlo NLX) Hommersom Frederik J. (Venlo NLX) Gerritsen Jan (Nijmegen NLX), Apparatus and method for syntactic signal analysis.
Deligne Sabine ; Sagisaka Yoshinori,JPX ; Nakajima Hideharu,JPX, Apparatus for generating a statistical sequence model called class bi-multigram model with bigram dependencies assumed between adjacent sequences.
Kamatani, Satoshi; Chino, Tetsuro; Furihata, Kentaro, Apparatus, system, method, and computer program product for resolving ambiguities in translations.
Shieber Stuart M. ; Armstrong John ; Baptista Rafael Jose ; Bentz Bryan A. ; Ganong ; III William F. ; Selesky Donald Bryant, Command parsing and rewrite system.
Kamatani, Satoshi; Chino, Tetsuo, Communication support apparatus and computer program product for supporting communication by performing translation between languages.
Kamatani, Satoshi; Chino, Tetsuro; Kuroda, Yuka, Communication support apparatus and computer program product for supporting communication by performing translation between languages.
Steel,Samuel W. D.; Kruschwitz,Udo; Webb,Nicholas J.; De Roeck,Anne N.; Scott,Paul D.; Turner,Raymond; Tsui,Kwok C.; Wobcke,Wayne R.; Azvine,Behnam, Index to a semi-structured database.
Bennett, Ian M.; Babu, Bandi Ramesh; Morkhandikar, Kishor; Gururaj, Pallaki, Interactive speech based learning/training system formulating search queries based on natural language parsing of recognized user queries.
Matheson,Caroline, Man-machine dialogue system, controls dialogue between system and user using dialogue specification employing augmented transition networks propagating tokens.
Colier Ronald L. (T/L Apartments ; 16 Cheshire Dr. ; Apt. 121 Pittsfield MA 01201), Method and apparatus adapted for an audibly-driven, handheld, keyless and mouseless computer for performing a user-cente.
Monaco Peter C. ; Ehrlich Steven C. ; Ghosh Debajit ; Klenk Mark ; Sinai Julian ; Thirumalai Madhavan ; Gupta Sundeep, Method and apparatus for creating modifiable and combinable speech objects for acquiring information from a speaker in an interactive voice response system.
Tal Peter (53 Driftwood Dr. Port Washington NY 11050), Method and apparatus for uniquely identifying individuals by particular physical characteristics and security system uti.
Fischer,Uwe; Hoffmann,Stefan; Kriechbaum,Werner; Stenzel,Gerhard, Method and system for the automatic generation of multi-lingual synchronized sub-titles for audiovisual data.
Michael S. Phillips ; Mark A. Fanty ; Krishna K. Govindarajan, Method and system of reviewing the behavior of an interactive speech recognition application.
Noyes Dallas B. (2500 George Washington Way ; #124 Richland WA 99352), Method for representation of knowledge in a computer as a network database system.
Battle James Thomas ; Hung Andy C. ; Purcell Stephen C., Multimedia processor using variable length instructions with opcode specification of source operand as result of prior i.
Loatman Robert B. (Vienna VA) Post Stephen D. (McLean VA) Yang Chih-King (Rockville MD) Hermansen John C. (Catharpin VA), Natural language understanding system.
Janek Gabor,HUX ; Wutte Heribert,AUX ; Grabherr Manfred, Product including a speech recognition device and method of generating a command lexicon for a speech recognition device.
Ostendorf Mari ; Singer Harald,JPX, Speaker-independent model generation apparatus and speech recognition apparatus each equipped with means for splitting.
Morgan Scott Anthony ; Roberts David John,GBX ; Swearingen Craig Ardner ; Tannenbaum Alan Richard, Speech command input recognition system for interactive computer display with term weighting means used in interpreting potential commands from relevant speech terms.
Gagnon, Jean; Roy, Philippe; Lagassey, Paul J., Speech interface system and method for control and interaction with applications on a computing system.
Ohmori,Kumiko; Higashida,Masanobu; Mizusawa,Noriko, Speech recognition based interactive information retrieval scheme using dialogue control to reduce user stress.
Tang Donald T.,CNX ; Shen Li Qin,CNX ; Zhu Xiao Jin,CNX, Speech recognition method and system for recognizing single or un-correlated Chinese characters.
Marx Matthew T. ; Carter Jerry K. ; Phillips Michael S. ; Holthouse Mark A. ; Seabury Stephen D. ; Elizondo-Cecenas Jose L. ; Phaneuf Brett D., System and method for developing interactive speech applications.
Dantzig,Paul M.; Filepp,Robert; Liu,Yew Huey, System and method for generating and presenting multi-modal applications from intent-based markup scripts.
Maes, Stephane Herman; Neti, Chalapathy Venkata, System and method for multi-modal focus detection, referential ambiguity resolution and mood classification using multi-modal input.
Coffman, Daniel M.; Hosn, Rafah A.; Kleindienst, Jan; Maes, Stephane H.; Raman, Thiruvilwamalai V., System and method for providing dialog management and arbitration in a multi-modal environment.
Eberle, Hannes; Leon, Christopher S.; Maass, Bodo; Patnaik, Anurag; Santa Ana, Alberto; Zirngibl, Michael, System and method for the creation and automatic deployment of personalized, dynamic and interactive inbound and outbound voice services, with real-time interactive voice database queries.
Maes, Stephane H.; Lubensky, David M.; Sakrajda, Andrzej, Universal IP-based and scalable architectures across conversational applications using web services for speech and audio processing resources.
Ackley H. Sprague ; Maltsev Pavel A. ; Ohanian Michael, Universal data input and processing device, such as universal point-of-sale device for inputting and processing bar code symbols, document images, and other data.
Ball, Thomas J.; Cox, Kenneth Charles; Grinter, Rebecca Elizabeth; Hibino, Stacie Lynn; Jagadeesan, Lalita Jategaonkar; Mantilla, David Alejandro, User interface for translating natural language inquiries into database queries and data presentations.
Brant Arthur ; Mandell Kenneth ; Rader R. Scott ; Walsh Alexander ; deJuan ; Jr. Eugene ; Greenberg Robert, Voice command and control medical care system.
Alpdemir, Ahmet; James, Arthur, Voice-interactive marketplace providing promotion and promotion tracking, loyalty reward and redemption, and other features.
Lim, Kang S.; Nguyen, Joseph A., Voice-recognition-based methods for establishing outbound communication through a unified messaging system including intelligent calendar interface.
Mori,Shinsuke; Nishimura,Masafumi; Itoh,Nobuyasu, Word predicting method, voice recognition method, and voice recognition apparatus and program using the same methods.
Kaplan, Boris, Working method for treatment of abstract objects (the thought-substances) of the system of artificial intelligence of a cyborg or an android for the pointer-oriented object acquisition method for abstract treatment of information of this system based on a natural language.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.