IPC Classification Information
Country / Type |
United States (US) Patent
Granted
|
International Patent Classification (IPC 7th edition) |
|
Application number |
US-0241037
(2008-09-30)
|
Registration number |
US-8219407
(2012-07-10)
|
Inventor / Address |
- Roy, Philippe
- Lagassey, Paul J.
|
Applicant / Address |
- Great Northern Research, LLC
|
Agent / Address |
|
Citation information |
Times cited: 166
Patents cited: 116 |
Abstract
A system and method for processing speech input comprising a speech recognizer and a logical command processor which facilitates additional processing of speech input beyond the speech recognizer level. A speech recognizer receives input from a user, and when a command is identified in the speech input, if the command meets conditions that require additional processing, a representation of the speech input is stored for subsequent processing. A logical command processor performs additional processing of command input by analyzing the command and its elements, determining which elements are required for successfully processing the command and which elements are present and which are lacking. The user is prompted to supply missing information, and subsequent user input is added to the command structure until the command input is aborted or the command structure reaches sufficient completeness to enable execution of the command. Thereby, speech input of complex commands in natural language in a system running a plurality of applications and processes is made possible.
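The prompt-until-complete loop described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the `REQUIRED_ELEMENTS` table, the `CommandStructure` class, the `process_command` function, and the "cancel" abort word are all hypothetical names chosen for the example, and the user's spoken replies are simulated with a scripted iterator.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional

# Hypothetical commands dictionary: command name -> elements it needs.
REQUIRED_ELEMENTS: Dict[str, List[str]] = {
    "send_email": ["recipient", "subject"],
}


@dataclass
class CommandStructure:
    """Stored representation of a recognized command and its elements."""
    name: str
    elements: Dict[str, str] = field(default_factory=dict)

    def missing(self) -> List[str]:
        """Return the required elements that have not been supplied yet."""
        return [e for e in REQUIRED_ELEMENTS[self.name] if e not in self.elements]


def process_command(
    cmd: CommandStructure,
    ask: Callable[[str], Optional[str]],
    execute: Callable[[CommandStructure], None],
    abort_word: str = "cancel",  # assumed abort condition for this sketch
) -> bool:
    """Loop until the command is complete (execute it) or aborted."""
    while True:
        missing = cmd.missing()
        if not missing:
            execute(cmd)          # sufficiently complete: run the command
            return True
        reply = ask(f"Please provide the {missing[0]}.")
        if reply is None or reply.strip().lower() == abort_word:
            return False          # abort condition: leave the loop
        cmd.elements[missing[0]] = reply.strip()


# Usage with scripted replies standing in for recognized speech.
replies = iter(["alice@example.com"])
executed: List[CommandStructure] = []
cmd = CommandStructure("send_email", {"subject": "Lunch"})
done = process_command(cmd, lambda prompt: next(replies, None), executed.append)
```

Passing `ask` and `execute` as callables keeps the loop independent of any particular recognizer or target application, which is one simple way to reflect the abstract's separation between the speech recognizer and the logical command processor.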
Representative claims
1. A method for processing the output of a speech recognizer comprising: determining that a recognized command by a speech recognizer requires additional processing; storing a representation of the output of the speech recognizer in a command structure; iteratively determining if the command is sufficiently complete and ready for processing, and if so executing the command in a respective application or process and exiting said iteratively determining step; if the command is insufficiently complete or not ready for processing, prompting a user for further input; receiving, processing and storing in the command structure prompted user command-related input; and determining an abort condition, and if the abort condition exists, exiting the iterative determining, else continuing said iteratively determining step.
2. The method according to claim 1, wherein at least one of: a command status with respect to at least one of a context, an entry in a commands dictionary and a status with respect to a status flag; and analysis of the speech input with respect to at least one of a context and a completeness are used to determine if a command requires additional processing.
3. The method according to claim 1, wherein the speech is processed in a processing environment having a speech recognizer and a logical command processor, wherein the speech recognizer generates a first outcome, and wherein the logical command processor provides input to the speech recognizer to alter a statistical process, to produce a second outcome of the speech processing, and at least one of the first outcome and the second outcome are used for subsequent processing by the logical command processor.
4. The method according to claim 1, wherein the speech recognizer processes speech input with both an HMM and context free grammars; and at least a portion of said speech processing is performed by a plurality of analyzers using context free grammars in parallel, each analyzer analyzing according to a different set of criteria.
5. The method according to claim 1, wherein said representation of a speech input is at least one of a set of potentially recognized words, a data matrix, a context based data construct, a command structure and a textual representation of the speech input.
6. The method according to claim 1, wherein the speech is processed in an environment having a speech recognizer software process and a logical command processor software process, wherein upon determination that an input needs additional processing at least a portion of an output of the speech recognizer software process is further processed by the logical command processor software process.
7. The method according to claim 6, wherein at least one non-linguistic implicit user input is employed as a cue to influence processing by the logical command processor, and at least one of a natural language analysis and a syntactic analysis are used by the logical command processor to determine a context of a speech input.
8. A method for processing a stored output of a speech recognizer comprising: determining a context of a speech input to a speech recognizer; and a) if the speech input is in the context of a command, then for each command present in the stored output; i) determining a set of elements required for executing the command; ii) determining if the stored output is sufficiently complete for executing the command, and if so executing the command in a respectively targeted application or process; otherwise: iii) prompting a user for further input and receiving, processing and storing subsequent user input until the stored output and stored subsequent user input are together sufficiently complete for execution of the command or until the process is aborted; and iv) if sufficiently complete for processing, executing the command in a respectively targeted application or process; and b) if the speech input is in the context of a data input, passing a representation of the stored output or an identifier thereof to a contextually appropriate data sink.
9. The method according to claim 8, wherein if the speech input is not a command or a data input, at least one of notifying the user and prompting the user for input.
10. The method according to claim 8, wherein at least one of: a command status with respect to at least one of a context, an entry in a commands dictionary and a status with respect to a status flag; and analysis of the speech input with respect to at least one of a context and a completeness are used to determine if a command requires additional processing.
11. The method according to claim 8, wherein the speech is processed in a processing environment having a speech recognizer and a logical command processor, wherein the speech recognizer generates a first outcome, and wherein the logical command processor provides input to the speech recognizer to alter a statistical process, to produce a second outcome of the speech processing, and at least one of the first outcome and the second outcome are used for subsequent processing by the logical command processor.
12. The method according to claim 8, wherein the speech recognizer processes speech input with both an HMM and context free grammars; and at least a portion of said speech processing is performed by a plurality of analyzers using context free grammars in parallel, each analyzer analyzing according to a different set of criteria.
13. The method according to claim 8, wherein said representation of a speech input is at least one of a set of potentially recognized words, a data matrix, a context based data construct, a command structure and a textual representation of the speech input.
14. The method according to claim 8, wherein the speech is processed in an environment having a speech recognizer software process and a logical command processor software process, wherein upon determination that an input needs additional processing, at least a portion of an output of the speech recognizer software process is further processed by the logical command processor software process.
15. The method according to claim 8, wherein at least one of a non-linguistic implicit user input is employed as a cue to influence processing by the logical command processor, and at least one of a natural language analysis and syntactic analysis are used by the logical command processor to determine a context of a speech input.
16. A method for processing an output of a speech recognizer, comprising: (a) storing a representation of the output of the speech recognizer as a representation of the speech input in a memory; (b) determining if at least one command is present in the stored representation; (c) if at least one command is present in the stored representation, determining a context of the speech input; (d) if the speech input is not in context of a command, at least one of notifying a user, prompting the user for input, and passing a data input representing the stored representation to a contextually appropriate data sink; (e) if the determined context of the speech input is in a command context, then for each command present in the speech input: i) determining the elements required for processing each respective command; and ii) determining if the stored representation comprises all the elements required for executing the command, and if so executing the command; otherwise: iii) prompting the user for further input and receiving and processing subsequent input from the user until aborted or the received and processed information renders the command sufficiently complete for execution; and iv) if a command becomes sufficiently complete for execution, executing the command in an appropriate application or process.
17. The method according to claim 16, wherein the speech is processed in a processing environment having a speech recognizer and a logical command processor, wherein the speech recognizer generates a first outcome, and wherein the logical command processor provides input to the speech recognizer to alter a statistical process, to produce a second outcome of the speech processing, and at least one of the first outcome and the second outcome are used for subsequent processing by the logical command processor.
18. The method according to claim 16, wherein the speech recognizer processes speech input with both an HMM and context free grammars; and at least a portion of said speech processing is performed by a plurality of analyzers using context free grammars in parallel, each analyzer analyzing according to a different set of criteria.
19. The method according to claim 16, wherein said representation of a speech input is at least one of a set of potentially recognized words, a data matrix, a context based data construct, a command structure and a textual representation of the speech input.
20. The method according to claim 16, wherein the speech is processed in an environment having a speech recognizer software process and a logical command processor software process, wherein upon determination that an input needs additional processing, at least a portion of an output of the speech recognizer software process is further processed by the logical command processor software process.
21. The method according to claim 16, wherein at least one of a non-linguistic implicit user input is employed as a cue to influence processing by the logical command processor, and at least one of a natural language analysis and a syntactic analysis are used by the logical command processor to determine a context of a speech input.
22. A method for processing speech, comprising the steps of: (a) determining if an output of a speech recognizer contains an available command; (b) if an available command is present, but sufficient explicit conditions are not fulfilled for processing of the command, performing a context sensitive process having as available options at least: (i) determining if sufficient conditions for processing of the command, and if so presenting the command to a command processor for execution; (ii) determining if the output of the speech recognizer represents data, and if so passing a representation of the speech input to an appropriate data sink; and (iii) prompting the user with a request for further input, wherein the prompting is automatically adapted to seek information from the user to at least one of increase a completeness and reduce an ambiguity of the available command.
23. The method according to claim 22, wherein if an available command is present, but sufficient explicit conditions are not fulfilled for processing of the command, selectively performing a context sensitive process comprising: (i) determining if sufficient conditions for processing of the command, and presenting the command to a command processor for execution; (ii) determining if the output of the speech recognizer represents data, and if the output of the natural language speech recognizer represents data, passing a representation of the speech input to an appropriate data sink; and (iii) prompting the user with a request for further input, wherein the prompting is automatically adapted to seek information from the user to at least one of increase a completeness and reduce an ambiguity of the available command.
24. The method according to claim 22, wherein the speech recognizer employs a natural language speech process.
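Claims 8, 16 and 22 distinguish between speech in a command context, speech that represents data to be passed to a data sink, and speech that is neither. A rough sketch of that routing decision is given below. It is an assumption-laden illustration only: the claims do not specify how context is determined, so the `dictation_active` flag, the `KNOWN_COMMANDS` set, and the `classify` and `route` functions are hypothetical stand-ins.

```python
from typing import Callable, List

# Hypothetical set of available command words (assumed for this sketch).
KNOWN_COMMANDS = {"open", "send", "delete"}


def classify(tokens: List[str], dictation_active: bool) -> str:
    """Very simple context test: dictation mode treats all speech as data."""
    if not tokens:
        return "unknown"
    if dictation_active:
        return "data"
    if tokens[0].lower() in KNOWN_COMMANDS:
        return "command"
    return "unknown"


def route(
    text: str,
    dictation_active: bool,
    on_command: Callable[[List[str]], None],
    data_sink: Callable[[str], None],
    notify: Callable[[str], None],
) -> str:
    """Send recognizer output to the command processor, a data sink, or the user."""
    tokens = text.split()
    context = classify(tokens, dictation_active)
    if context == "command":
        on_command(tokens)        # hand over for completeness checking
    elif context == "data":
        data_sink(text)           # e.g. the document being dictated into
    else:
        notify("Input was not recognized as a command or data.")
    return context


# Usage: collect what each destination receives.
commands: List[List[str]] = []
sink: List[str] = []
notices: List[str] = []
first = route("open report", False, commands.append, sink.append, notices.append)
second = route("open the window please", True, commands.append, sink.append, notices.append)
third = route("hmm", False, commands.append, sink.append, notices.append)
```

The same phrase beginning with "open" is routed differently depending on the assumed context flag, which is the point of the context determination step in the claims.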