최소 단어 이상 선택하여야 합니다.
최대 10 단어까지만 선택 가능합니다.
다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
NTIS 바로가기다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
DataON 바로가기다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
Edison 바로가기다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
Kafe 바로가기국가/구분 | United States(US) Patent 등록 |
---|---|
국제특허분류(IPC7판) |
|
출원번호 | US-0643741 (2017-07-07) |
등록번호 | US-10079014 (2018-09-18) |
발명자 / 주소 |
|
출원인 / 주소 |
|
대리인 / 주소 |
|
인용정보 | 피인용 횟수 : 0 인용 특허 : 2224 |
A speech recognition system uses, in one embodiment, an extended phonetic dictionary that is obtained by processing words in a user's set of databases, such as a user's contacts database, with a set of pronunciation guessers. The speech recognition system can use a conventional phonetic dictionary a
A speech recognition system uses, in one embodiment, an extended phonetic dictionary that is obtained by processing words in a user's set of databases, such as a user's contacts database, with a set of pronunciation guessers. The speech recognition system can use a conventional phonetic dictionary and the extended phonetic dictionary to recognize speech inputs that are user requests to use the contacts database, for example, to make a phone call, etc. The extended phonetic dictionary can be updated in response to changes in the contacts database, and the set of pronunciation guessers can include pronunciation guessers for a plurality of locales, each locale having its own pronunciation guesser.
1. A non-transitory machine-readable storage medium storing one or more programs, comprising instructions, which when executed by one or more processors of an electronic device, cause the electronic device to: detect a change in a user's set of one or more databases;responsive to detecting the chang
1. A non-transitory machine-readable storage medium storing one or more programs, comprising instructions, which when executed by one or more processors of an electronic device, cause the electronic device to: detect a change in a user's set of one or more databases;responsive to detecting the change in the user's set of one or more databases,obtain words from the user's set of one or more databases; and process, using a set of one or more pronunciation guessers, the words from the user's set of one or more databases, the processing producing additional phonetic data derived from the words, the additional phonetic data forming an extended phonetic dictionary for the user;receive a speech input;perform speech recognition on the speech input to determine phonemes in the speech input and to determine a set of one or more matches using the determined phonemes, a phonetic dictionary, and the extended phonetic dictionary;determine a best match from the set of one or more matches; andprovide an output associated with the best match. 2. The non-transitory medium as in claim 1, wherein providing the output associated with the best match comprises displaying a speech recognition output associated with the best match on a screen of the electronic device. 3. The non-transitory medium as in claim 1, wherein the change in the user's set of one or more databases comprises a change in at least one of: a contacts database, a calendar database, and a media database. 4. The non-transitory medium as in claim 1, wherein the user's set of one or more databases comprises a contacts database and wherein the change in the user's set of one or more databases comprises an addition of a name of a contact to the contacts database. 5. The non-transitory medium as in claim 1, wherein the user's set of one or more databases comprises a contacts database with names of people and telephone numbers, wherein the change comprises a change in the contacts database, wherein the phonetic dictionary is for natural language speech recognition, and wherein the speech input is for calling a telephone number or sending a text message to another device. 6. The non-transitory medium as in claim 5, wherein the non-transitory computer-readable storage medium further comprises instructions, which when executed by the one or more processors of the electronic device, cause the device to: obtain the change in the contacts database in response to the change being made; andprocess, using the set of one or more pronunciation guessers, the change in the contacts database to update the extended phonetic dictionary based on the change in the contacts database. 7. The non-transitory medium as in claim 6, wherein the set of one or more pronunciation guessers comprise pronunciation guessers for a plurality of locales, each locale having its own pronunciation guesser. 8. The non-transitory medium as in claim 7, wherein the non-transitory computer-readable storage medium further comprises instructions, which when executed by the one or more processors of the electronic device, cause the device to: automatically determine a set of languages or locales from the words in the user's set of one or more databases;select a subset of the set of one or more pronunciation guessers, wherein the subset is selected based upon the set of languages or locales that were automatically determined from the words in the user's set of one or more databases; andwherein the processing which produces the additional phonetic data uses only the subset of the set of one or more pronunciation guessers when producing the additional phonetic data. 9. The non-transitory medium as in claim 6, wherein the set of one or more pronunciation guessers comprises different pronunciation guessers for a same locale. 10. The non-transitory medium as in claim 6, wherein obtaining the words from the user's set of one or more databases comprises obtaining the words in the contacts database from an external electronic device, wherein receiving the speech input comprises receiving the speech input from the external electronic device, and wherein providing the output associated with the best match comprises transmitting the best match to the external electronic device. 11. The non-transitory medium as in claim 10, wherein the electronic device executes the one or more programs for a plurality of users, and each user of the plurality of users has a dedicated extended phonetic dictionary that is unique to each user. 12. The non-transitory medium as in claim 1, wherein determining the best match from the set of one or more matches is based on phonetic fuzzy matching. 13. The non-transitory medium as in claim 1, wherein the non-transitory computer-readable storage medium further comprises instructions, which when executed by the one or more processors of the electronic device, cause the device to: identify a portion of the determined phonemes using contextual information associated with the speech input, wherein performing speech recognition on the speech input to determine the set of one or more matches includes performing speech recognition on the portion of the determined phonemes using the extended phonetic dictionary. 14. The non-transitory medium as in claim 1, wherein the non-transitory computer-readable storage medium further comprises instructions, which when executed by the one or more processors of the electronic device, cause the device to: based on the best match, play a media item or send a communication to another electronic device. 15. A machine implemented method comprising: detecting a change in a user's set of one or more databases;responsive to detecting the change in the user's set of one or more databases, obtaining words from the user's set of one or more databases; andprocessing, using a set of one or more pronunciation guessers, the words from the user's set of one or more databases, the processing producing additional phonetic data derived from the words, the additional phonetic data forming an extended phonetic dictionary for the user;receiving a speech input;performing speech recognition on the speech input to determine phonemes in the speech input and to determine a set of one or more matches using the determined phonemes, a phonetic dictionary, and the extended phonetic dictionary:determining a best match from the set of one or more matches; andproviding an output associated with the best match. 16. The method of claim 15, wherein the method is performed by an electronic device and wherein providing the output associated with the best match comprises displaying a speech recognition output associated with the best match on a screen of the electronic device. 17. The method of claim 15, wherein the change in the user's set of one or more databases comprises a change in at least one of: a contacts database, a calendar database, and a media database. 18. The method of claim 15, wherein the user's set of one or more databases comprises a contacts database and wherein the change in the user's set of one or more databases comprises an addition of a name of a contact to the contacts database. 19. The method of claim 15, wherein the user's set of one or more databases comprises a contacts database with names of people and telephone numbers, wherein the change comprises a change in the contacts database, wherein the phonetic dictionary is for natural language speech recognition, and wherein the speech input is for calling a telephone number or sending a text message to another device. 20. The method of claim 19, further comprising: obtaining the change in the contacts database in response to the change being made; andprocessing, using the set of one or more pronunciation guessers, the change in the contacts database to update the extended phonetic dictionary based on the change in the contacts database. 21. The method of claim 20, wherein the set of one or more pronunciation guessers comprise pronunciation guessers for a plurality of locales, each locale having its own pronunciation guesser. 22. The method of claim 21, further comprising: automatically determining a set of languages or locales from the words in the user's set of one or more databases;selecting a subset of the set of one or more pronunciation guessers, wherein the subset is selected based upon the set of languages or locales that were automatically determined from the words in the user's set of one or more databases; andwherein the processing which produces the additional phonetic data uses only the subset of the set of one or more pronunciation guessers when producing the additional phonetic data. 23. The method of claim 20, wherein the set of one or more pronunciation guessers comprises different pronunciation guessers for a same locale. 24. The method of claim 20, wherein the method is performed by a server, wherein obtaining the words from the user's set of one or more databases comprises obtaining the words in the contacts database from an external electronic device, wherein receiving the speech input comprises receiving the speech input from the external electronic device, and wherein providing the output associated with the best match comprises transmitting the best match to the external electronic device. 25. The method of claim 24, wherein the method is performed for a plurality of users, and each user of the plurality of users has a dedicated extended phonetic dictionary that is unique to each user. 26. The method of claim 15, wherein determining the best match from the set of one or more matches is based on phonetic fuzzy matching. 27. The method of claim 15, further comprising: identifying a portion of the determined phonemes using contextual information associated with the speech input, wherein performing speech recognition on the speech input to determine the set of one or more matches includes performing speech recognition on the portion of the determined phonemes using the extended phonetic dictionary. 28. The method of claim 15, wherein the method is performed by an electronic device, the method further comprising: based on the best match, playing a media item or sending a communication to another electronic device. 29. A data processing system comprising: an input device for receiving a speech input from a user;a set of one or more processors coupled to the input device; anda memory coupled to the set of one or more processors, the memory storing a phonetic dictionary for speech recognition and storing an extended phonetic dictionary for the user;wherein the extended phonetic dictionary is produced by:responsive to detecting a change in a user's set of one or more databases, processing, using the set of one or more processors, words in the user's set of one or more databases using a set of one or more pronunciation guessers, andwherein the set of one or more processors performs speech recognition on the speech input to determine phonemes in the speech input and to determine a best match using the determined phonemes, the phonetic dictionary, and the extended phonetic dictionary. 30. The system of claim 29, wherein the memory stores one or more programs, comprising instructions, which when executed by the set of one or more processors, cause the system to: provide an output associated with the best match, wherein providing the output associated with the best match comprises displaying a speech recognition output associated with the best match on a screen of the system. 31. The system of claim 29, wherein the change in the user's set of one or more databases comprises a change in at least one of: a contacts database, a calendar database, and a media database. 32. The system of claim 29, wherein the user's set of one or more databases comprises a contacts database and wherein the change in the user's set of one or more databases comprises an addition of a name of a contact to the contacts database. 33. The system of claim 29, wherein the user's set of one or more databases comprises a contacts database with names of people and telephone numbers, wherein the change comprises a change in the contacts database, wherein the phonetic dictionary is for natural language speech recognition, and wherein the speech input is for calling a telephone number or sending a text message to another device. 34. The system of claim 33, wherein the memory stores one or more programs, comprising instructions, which when executed by the set of one or more processors, cause the system to: obtain the change in the contacts database in response to the change being made; andprocess, using the set of one or more pronunciation guessers, the change in the contacts database to update the extended phonetic dictionary based on the change in the contacts database. 35. The system of claim 34, wherein the set of one or more pronunciation guessers comprise pronunciation guessers for a plurality of locales, each locale having its own pronunciation guesser. 36. The system of claim 35, wherein the one or more programs further comprise instructions, which when executed by the set of one or more processors, cause the system to: automatically determine a set of languages or locales from the words in the user's set of one or more databases;select a subset of the set of one or more pronunciation guessers, wherein the subset is selected based upon the set of languages or locales that were automatically determined from the words in the user's set of one or more databases; andwherein processing the words in the user's set of one or more databases uses only the subset of the set of one or more pronunciation guessers. 37. The system of claim 35, wherein the system executes the one or more programs for a plurality of users, and each user of the plurality of users has a dedicated extended phonetic dictionary that is unique to each user. 38. The system of claim 34, wherein the set of one or more pronunciation guessers comprises different pronunciation guessers for a same locale. 39. The system of claim 34, wherein receiving the speech input comprises receiving the speech input from an external electronic device. 40. The system of claim 29, wherein determining the best match is based on phonetic fuzzy matching. 41. The system of claim 29, wherein the memory stores one or more programs, comprising instructions, which when executed by the set of one or more processors, cause the system to: identify a portion of the determined phonemes using contextual information associated with the speech input, wherein performing speech recognition on the speech input to determine the best match includes performing speech recognition on the portion of the determined phonemes using the extended phonetic dictionary. 42. The system of claim 29, wherein the memory stores one or more programs, comprising instructions, which when executed by the set of one or more processors, cause the system to: based on the best match, play a media item or send a communication to an electronic device external to the system.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.