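Country / Type | United States (US) Patent, Granted |
---|---|
International Patent Classification (IPC, 7th edition) | |
Application number | US-0341621 (2011-12-30) |
Registration number | US-8370147 (2013-02-05) |
Inventor / Address | |
Applicant / Address | |
Citation information | Times cited: 42; Patents cited: 374 |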
A conversational, natural language voice user interface may provide an integrated voice navigation services environment. The voice user interface may enable a user to make natural language requests relating to various navigation services, and further, may interact with the user in a cooperative, conversational dialogue to resolve the requests. Through dynamic awareness of context, available sources of information, domain knowledge, user behavior and preferences, and external systems and devices, among other things, the voice user interface may provide an integrated environment in which the user can speak conversationally, using natural language, to issue queries, commands, or other requests relating to the navigation services provided in the environment.
1. A method for providing a natural language voice user interface, comprising: receiving a natural language utterance from an input device associated with a computing device, wherein the natural language utterance relates to navigation, and wherein the computing device is moving; determining a current location and direction of travel of the computing device; selecting, from among a plurality of sets of location-specific grammar information, a set of location-specific grammar information based on proximity between the current location and a location associated with the set of location-specific grammar information and based on whether the direction of travel of the computing device corresponds with movement towards the location associated with the set of location-specific grammar information; generating a recognition grammar with the set of location-specific grammar information; generating one or more interpretations of the natural language utterance using the recognition grammar; identifying, by a navigation agent executing on the computing device, one or more requests in the natural language utterance that relate to navigation from the one or more interpretations of the natural language utterance; and resolving, by the navigation agent executing on the computing device, the one or more requests.
2. The method of claim 1, wherein resolving the one or more requests includes: determining, by the navigation agent executing on the computing device, that the one or more requests in the natural language utterance approximate requested information; and causing one or more subsequent interactions with a user to successively refine and resolve the requested information approximated in the one or more requests, wherein the one or more subsequent interactions in the dialogue include one or more output prompts and one or more subsequent multi-modal inputs to successively refine and resolve the requested information approximated in the one or more requests.
3. The method of claim 1, wherein resolving the one or more requests includes: determining, by the navigation agent executing on the computing device, that the one or more requests identified in the natural language utterance include a navigation request to calculate a route to a full or partial address; calculating the route from the current location of the computing device to a destination having an address that corresponds to the full or partial address; and generating directions from the current location to the destination, wherein the navigation agent generates the directions using the information associated with a navigation-specific information source.
4. The method of claim 3, wherein the directions include information about one or more destinations, one or more points of interest, traffic, parking, weather, or one or more events relevant to the route and the current location associated with the location device.
5. The method of claim 3, further comprising: receiving a multi-modal input from the input device subsequent to calculating the route, wherein the multi-modal input includes a subsequent request; and invoking one or more domain agents to resolve the subsequent request in the multi-modal input, wherein the one or more domain agents filter results associated with the subsequent request according to the calculated route.
6. The method of claim 3, further comprising: determining, with the navigation agent executing on the computing device and based on the navigation-specific information source, that the one or more requests identified in the natural language utterance further include a multi-modal request to control a map display associated with the route; associating a non-voice component in the multi-modal input with the one or more interpretations of the natural language utterance, wherein the non-voice component in the multi-modal input identifies a portion of the map display; and issuing a command to control the identified portion of the map display in accordance with the one or more interpretations.
7. The method of claim 1, wherein each of the plurality of sets of location-specific grammar information corresponds to a different subdivision, neighborhood, city, or county, and wherein the selected set of location-specific grammar information corresponds to a subdivision, neighborhood, city, or county that is most proximal, among the plurality of sets, to the current location of the computing device.
8. The method of claim 7, wherein generating the one or more interpretations of the natural language utterance includes: recognizing one or more words in the natural language utterance that define a navigation command; recognizing one or more additional words in the natural language utterance that define a location associated with the navigation command; and generating the recognition grammar based on the location defined in the one or more additional words recognized in the natural language utterance.
9. The method of claim 1, wherein the generating the one or more interpretations of the natural language utterance is further based on shared knowledge, and wherein the shared knowledge includes one or more inferences generated by an inferencing engine associated with the computing device.
10. The method of claim 9, further comprising generating, in response to the natural language utterance, a suggestion of a one or more available navigation services.
11. The method of claim 1, wherein the computing device comprises a memory having a memory capacity allocated to store grammar information, and wherein the generating the recognition grammar comprises swapping out another of the plurality of sets of location-specific information that was stored in the memory with the selected set of location-specific grammar information.
12. The method of claim 1, wherein the computing device comprises a memory, wherein the generating the recognition grammar comprises retrieving the selected set of location-specific grammar information and storing the selected set in memory, the method further comprising: detecting a change in the current location of the computing device; selecting another set of location-specific grammar information from the plurality of sets of location-specific grammar information based on the change in the current location; and replacing the set of location-specific grammar information in the memory with the another set of location-specific grammar information.
13. The method of claim 12, wherein the set corresponds to a first location and the another set corresponds to a second location, and wherein the selecting the another set comprises determining that the changed current location is more proximal to the second location than to the first location.
14. The method of claim 1, wherein the computing device comprises a memory that stores one or more sets of locations-specific grammar information, the method further comprising changing a number of sets of location-specific grammar information stored in the memory.
15. The method of claim 1, wherein the computing device comprises a memory that stores one or more sets of location-specific grammar information, and wherein the generating the recognition grammar comprises: retrieving the selected set of location-specific grammar information; detecting redundant information within the selected set of location-specific grammar information or among the one or more of sets of location-specific grammar information; and storing the selected set of location-specific grammar information in the memory without having the redundant information stored in the memory.
16. The method of claim 1, wherein the computing device comprises a memory and wherein the selected set of location-specific grammar information comprises a plurality of geographic elements having a same name, and wherein the generating the recognition grammar comprises storing the selected set of location-specific grammar information in the memory without having at least one of the plurality of geographic elements stored in the memory.
17. The method of claim 1, wherein the plurality of geographic elements comprises a plurality of street elements, and wherein the generating the recognition grammar comprises removing at least one of the street elements from memory.
18. The method of claim 1, wherein the computing devices stores or is configured to access a context stack, wherein the generating the one or more interpretations is further based on the context stack.
19. The method of claim 18, wherein the generating the one or more interpretations comprises determining, from the context stack, a current conversation topic or a previously invoked domain agent.
20. A system for providing a natural language voice user interface, comprising: one or more processors configured to: receive a natural language utterance from an input device associated with a computing device, wherein the natural language utterance relates to navigation, and wherein the computing device is moving; determine a current location and direction of travel of the computing device; select, from among a plurality of sets of location-specific grammar information, a set of location-specific grammar information based on proximity between the current location and a location associated with the set of location-specific grammar information and based on whether the direction of travel of the computing device corresponds with movement towards the location associated with the set of location-specific grammar; generate a recognition grammar with the set of location-specific grammar information; generate one or more interpretations of the natural language utterance using the recognition grammar; identify one or more requests in the natural language utterance that relate to navigation from the one or more interpretations of the natural language utterance; and resolve the one or more requests.
21. The system of claim 20, wherein the one or more processors are configured to resolve the one or more requests by: determining that the one or more requests in the natural language utterance approximate requested information; and causing one or more subsequent interactions with a user to successively refine and resolve the requested information approximated in the one or more requests, wherein the one or more subsequent interactions in the dialogue include one or more output prompts and one or more subsequent multi-modal inputs to successively refine and resolve the requested information approximated in the one or more requests.
22. The system of claim 20, wherein the one or more processors are configured to resolve the one or more requests by: determining that the one or more requests identified in the natural language utterance include a navigation request to calculate a route to a full or partial address; calculating the route from the current location to a destination having an address that corresponds to the full or partial address; and generating directions from the current location to the destination, wherein the navigation agent generates the directions using the information associated with a navigation-specific information source.
23. The system of claim 22, wherein the directions include information about one or more destinations, one or more points of interest, traffic, parking, weather, or one or more events relevant to the route and the current location associated with the location device.
24. The system of claim 22, further comprising one or more domain agents configured to: resolve a subsequent request in a multi-modal input received from the input device subsequent to the navigation agent having calculated the route; and filter results associated with the subsequent request according to the calculated route to resolve the subsequent request.
25. The system of claim 22, wherein the one or more processors are configured to: determine that the one or more requests identified in the natural language utterance further include a multi-modal request to control a map display associated with the route; associate a non-voice component in the multi-modal input with the one or more interpretations, wherein the non-voice component in the multi-modal input identifies a portion of the map display; and issue a command to control the identified portion of the map display in accordance with the one or more interpretations to resolve the one or more requests identified in the natural language utterance.
26. The method of claim 20, wherein each of the plurality of sets of location-specific grammar information corresponds to a different subdivision, neighborhood, city, or county, and wherein the selected set of location-specific grammar information corresponds to a subdivision, neighborhood, city, or county that is most proximal, among the plurality of sets, to the current location of the computing device.
27. The system of claim 26, further comprising a multi-pass speech recognition module configured to: recognize one or more words in the natural language utterance that define a navigation command; recognize one or more additional words in the natural language utterance that define a location associated with the command; and generate the recognition grammar based on the location defined in the one or more additional words recognized in the natural language utterance.
28. The system of claim 20, wherein the one or more processors are configured to generate the one or more interpretations further based on shared knowledge, and wherein the one or more interpretations includes the one or more inferences.
29. The system of claim 28, wherein the one or more processors are further configured to generate a response to suggest one or more available navigation services.
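The selection step recited in claims 1 and 20, together with the in-memory replacement of claim 12, can be illustrated with a minimal Python sketch. This is not the patented implementation: the `GrammarSet` and `ActiveGrammar` types, the 90-degree heading tolerance, the "prefer sets ahead of travel, then nearest" ranking, and the sample coordinates are all hypothetical choices made for illustration. The claims only state that selection depends on proximity and on whether the direction of travel corresponds with movement towards the set's location.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class GrammarSet:
    """Hypothetical container for one set of location-specific grammar data."""
    name: str
    lat: float
    lon: float
    phrases: tuple = ()


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two lat/lon points."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlam / 2) ** 2
    return 2 * 6371.0 * math.asin(math.sqrt(a))


def bearing_deg(lat1, lon1, lat2, lon2):
    """Initial compass bearing (0 = north, 90 = east) from point 1 to point 2."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dlam = math.radians(lon2 - lon1)
    y = math.sin(dlam) * math.cos(p2)
    x = math.cos(p1) * math.sin(p2) - math.sin(p1) * math.cos(p2) * math.cos(dlam)
    return (math.degrees(math.atan2(y, x)) + 360.0) % 360.0


def is_moving_towards(heading_deg, target_bearing_deg, tolerance_deg=90.0):
    """True if the heading is within tolerance_deg of the bearing to the target.

    The tolerance is an assumed parameter, not something the claims specify.
    """
    diff = abs((target_bearing_deg - heading_deg + 180.0) % 360.0 - 180.0)
    return diff <= tolerance_deg


def select_grammar_set(sets, lat, lon, heading_deg):
    """Pick a set using both proximity and direction of travel.

    Assumed ranking: sets the device is moving towards come first,
    and ties are broken by distance from the current location.
    """
    def rank(gs):
        towards = is_moving_towards(heading_deg, bearing_deg(lat, lon, gs.lat, gs.lon))
        return (not towards, haversine_km(lat, lon, gs.lat, gs.lon))
    return min(sets, key=rank)


class ActiveGrammar:
    """Holds the one set currently in memory and replaces it on change."""

    def __init__(self):
        self.current = None

    def update(self, sets, lat, lon, heading_deg):
        chosen = select_grammar_set(sets, lat, lon, heading_deg)
        swapped = chosen != self.current
        self.current = chosen  # replace the stored set with the newly selected one
        return swapped


# Illustrative data only: two made-up areas north and south of the device.
SETS = [
    GrammarSet("north_area", 47.61, -122.33, ("Pine Street",)),
    GrammarSet("south_area", 47.25, -122.44, ("Pacific Avenue",)),
]

if __name__ == "__main__":
    grammar = ActiveGrammar()
    grammar.update(SETS, 47.45, -122.30, heading_deg=180.0)
    print(grammar.current.name)
```

In this sketch a device that is closer to `north_area` but travelling south still loads `south_area`, which is the kind of behavior the direction-of-travel condition appears intended to allow. A real system would presumably weigh the two factors in a more nuanced way.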
Copyright KISTI. All Rights Reserved.