최소 단어 이상 선택하여야 합니다.
최대 10 단어까지만 선택 가능합니다.
다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
NTIS 바로가기다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
DataON 바로가기다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
Edison 바로가기다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
Kafe 바로가기국가/구분 | United States(US) Patent 등록 |
---|---|
국제특허분류(IPC7판) |
|
출원번호 | US-0492720 (2012-06-08) |
등록번호 | US-9721563 (2017-08-01) |
발명자 / 주소 |
|
출원인 / 주소 |
|
대리인 / 주소 |
|
인용정보 | 피인용 횟수 : 0 인용 특허 : 517 |
A speech recognition system uses, in one embodiment, an extended phonetic dictionary that is obtained by processing words in a user's set of databases, such as a user's contacts database, with a set of pronunciation guessers. The speech recognition system can use a conventional phonetic dictionary a
A speech recognition system uses, in one embodiment, an extended phonetic dictionary that is obtained by processing words in a user's set of databases, such as a user's contacts database, with a set of pronunciation guessers. The speech recognition system can use a conventional phonetic dictionary and the extended phonetic dictionary to recognize speech inputs that are user requests to use the contacts database, for example, to make a phone call, etc. The extended phonetic dictionary can be updated in response to changes in the contacts database, and the set of pronunciation guessers can include pronunciation guessers for a plurality of locales, each locale having its own pronunciation guesser.
1. A machine readable non-transitory storage medium storing executable instructions which, when executed by a data processing system, cause the data processing system to perform a method comprising: storing a phonetic dictionary for speech recognition;obtaining words from a user's set of one or more
1. A machine readable non-transitory storage medium storing executable instructions which, when executed by a data processing system, cause the data processing system to perform a method comprising: storing a phonetic dictionary for speech recognition;obtaining words from a user's set of one or more databases;receiving a speech input from the user;responsive to the speech input, processing, using a plurality of pronunciation guessers, the words from the user's set of one or more databases, the processing producing additional phonetic data derived from the words, the additional phonetic data forming an extended phonetic dictionary unique to the user;processing the speech input by comparing phonemes detected in the speech input to the phonetic dictionary and to the extended phonetic dictionary to determine a set of one or more matches; anddetermining a best match from the set of one or more matches. 2. The medium as in claim 1 wherein the user's set of one or more databases comprises a contacts database with names of people and telephone numbers, and wherein the phonetic dictionary is for natural language speech recognition and wherein the speech input is for calling a telephone number or sending a text message to another user's device. 3. The medium as in claim 2, wherein the method further comprises: obtaining changes in the contacts database and processing, using the plurality of pronunciation guessers, the changes to update the extended phonetic dictionary based on the changes, wherein the obtaining of the changes occurs in response to the changes being made. 4. The medium as in claim 3, wherein the plurality of pronunciation guessers comprise pronunciation guessers for a plurality of locales, each locale having its own pronunciation guesser. 5. The medium as in claim 4, wherein the method further comprises: determining automatically a set of languages or locales from the words in the user's set of one or more databases;selecting a subset of the plurality of pronunciation guessers, the subset selected based upon the set of languages or locales that were automatically determined from the words in the user's set of one or more databases; andwherein the processing which produces the additional phonetic data uses only the subset of the plurality of pronunciation guessers when producing the additional phonetic data. 6. The medium as in claim 4 wherein the method is performed by a server that is coupled through a wireless network to the user's device which includes the contacts database and wherein the server obtains the words in the contacts database from the user's device through the wireless network and wherein the server receives the speech input from the user's device through the wireless network and wherein the server transmits the best match to the user's device through the wireless network. 7. The medium as in claim 6 wherein the server performs the method for a plurality of users, and each user in the plurality of users has a dedicated extended phonetic dictionary that is unique to each user. 8. The medium as in claim 3, wherein the plurality of pronunciation guessers comprises different pronunciation guessers for the same locale. 9. The medium as in claim 3 wherein the method is performed by the user's device which includes the contacts database. 10. A machine implemented method comprising: storing a phonetic dictionary for speech recognition;obtaining words from a user's set of one or more databases;receiving a speech input from the user;responsive to the speech input, processing, using a plurality of pronunciation guessers, the words from the user's set of one or more databases, the processing producing additional phonetic data derived from the words, the additional phonetic data forming an extended phonetic dictionary unique to the user;processing the speech input by comparing phonemes detected in the speech input to the phonetic dictionary and to the extended phonetic dictionary to determine a set of one or more matches; anddetermining a best match from the set of one or more matches. 11. The method as in claim 10 wherein the user's set of one or more databases comprises a contacts database with names of people and telephone numbers, and wherein the phonetic dictionary is for natural language speech recognition and wherein the speech input is for calling a telephone number or sending a text message to another user's device. 12. The method as in claim 11, wherein the method further comprises: obtaining changes in the contacts database and processing, using the plurality of pronunciation guessers, the changes to update the extended phonetic dictionary based on the changes, wherein the obtaining of the changes occurs in response to the changes being made. 13. The method as in claim 12, wherein the plurality of pronunciation guessers comprise pronunciation guessers for a plurality of locales, each locale having its own pronunciation guesser. 14. The method as in claim 13, wherein the method further comprises: determining automatically a set of languages or locales from the words in the user's set of one or more databases;selecting a subset of the plurality of pronunciation guessers, the subset selected based upon the set of languages or locales that were automatically determined from the words in the user's set of one or more databases; andwherein the processing which produces the additional phonetic data uses only the subset of the plurality of pronunciation guessers when producing the additional phonetic data. 15. The method as in claim 13 wherein the method is performed by a server that is coupled through a wireless network to the user's device which includes the contacts database and wherein the server obtains the words in the contacts database from the user's device through the wireless network and wherein the server receives the speech input from the user's device through the wireless network and wherein the server transmits the best match to the user's device through the wireless network. 16. The method as in claim 12, wherein the plurality of pronunciation guessers comprises different pronunciation guessers for the same locale. 17. The method as in claim 12 wherein the method is performed by the user's device which includes the contacts database. 18. A data processing system comprising: an input device for receiving a speech input from a user;a set of one or more processors coupled to the input device; anda memory coupled to the set of one or more processors, the memory storing a phonetic dictionary for speech recognition and storing an extended phonetic dictionary unique to the user,wherein the extended phonetic dictionary is produced by: responsive to the speech input, processing, using the set of one or more processors, words in a user's set of one or more databases using a plurality of pronunciation guessers,wherein the set of one or more processors processes the speech input by comparing phonemes detected in the speech input to the phonetic dictionary and to the extended phonetic dictionary to determine a best match. 19. The system as in claim 18 wherein the system is a server that is coupled through a wireless network to a user's device which stores the set of one or more databases which include a contacts database, and wherein the input device is a network interface that is coupled to the wireless network and wherein the server transmits the best match to the user's device through the wireless network, and wherein the speech input is for calling a telephone number in the contacts database or for sending a message to another user in the contacts database. 20. The system as in claim 18 wherein the user's set of one or more databases includes a contacts database and changes in the contacts database are processed using the plurality of pronunciation guessers to update the extended phonetic dictionary based on the changes, and wherein the plurality of pronunciation guessers comprise pronunciation guessers for a plurality of locales, each locale having its own pronunciation guesser.
Copyright KISTI. All Rights Reserved.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.