최소 단어 이상 선택하여야 합니다.
최대 10 단어까지만 선택 가능합니다.
다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
NTIS 바로가기다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
DataON 바로가기다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
Edison 바로가기다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
Kafe 바로가기국가/구분 | United States(US) Patent 등록 |
---|---|
국제특허분류(IPC7판) |
|
출원번호 | US-0445863 (2017-02-28) |
등록번호 | US-9966060 (2018-05-08) |
발명자 / 주소 |
|
출원인 / 주소 |
|
대리인 / 주소 |
|
인용정보 | 피인용 횟수 : 0 인용 특허 : 2181 |
The method is performed at an electronic device with one or more processors and memory storing one or more programs for execution by the one or more processors. A first speech input including at least one word is received. A first phonetic representation of the at least one word is determined, the f
The method is performed at an electronic device with one or more processors and memory storing one or more programs for execution by the one or more processors. A first speech input including at least one word is received. A first phonetic representation of the at least one word is determined, the first phonetic representation comprising a first set of phonemes selected from a speech recognition phonetic alphabet. The first set of phonemes is mapped to a second set of phonemes to generate a second phonetic representation, where the second set of phonemes is selected from a speech synthesis phonetic alphabet. The second phonetic representation is stored in association with a text string corresponding to the at least one word.
1. A method for learning word pronunciations, comprising: at an electronic device with one or more processors and memory storing one or more programs for execution by the one or more processors: detecting an error in a speech based interaction with a digital assistant based on detecting a user input
1. A method for learning word pronunciations, comprising: at an electronic device with one or more processors and memory storing one or more programs for execution by the one or more processors: detecting an error in a speech based interaction with a digital assistant based on detecting a user input other than the speech based interaction;in response to detecting the error, receiving a speech input from a user, the speech input including a pronunciation of one or more words;determining, based on the pronunciation of the one or more words, a first set of phonemes from a speech recognition phonetic alphabet and a second set of phonemes from a speech synthesis phonetic alphabet;updating one or more databases to include the first set of phonemes and the second set of phonemes in association with a text string corresponding to the one or more words; andperforming speech recognition or speech synthesis using the updated one or more databases. 2. The method of claim 1, wherein the one or more words were received in a prior speech input provided by the user, and wherein the error is an error in speech recognition of the one or more words. 3. The method of claim 1, wherein the one or more words were output in a speech output by the electronic device, and wherein the error is an error in speech synthesis of the one or more words. 4. The method of claim 1, further comprising: receiving the speech input including the one or more words;performing speech recognition on the speech input to generate the text string corresponding to the one or more words;determining a confidence metric of the text string; anddetecting the error based on a determination that the confidence metric does not meet a predetermined threshold. 5. The method of claim 1, further comprising: synthesizing a speech output including the one or more words; anddetecting the error based on an indication from the user that the one or more words were pronounced incorrectly. 6. The method of claim 1, further comprising: performing speech recognition on the speech input to generate the text string corresponding to the one or more words; and wherein updating the one or more databases comprisesupdating a speech recognizer to associate the first set of phonemes with the text string. 7. The method of claim 6, further comprising: receiving a second speech input including the one or more words;determining a third set of phonemes for the one or more words;determining that the one or more words in the second speech input correspond to the text string based on comparing the first set of phonemes and the third set of phonemes. 8. The method of claim 1, further comprising: prior to receiving the speech input from the user and after detecting the error, prompting the user to provide the speech input, the speech input including a preferred pronunciation of the one or more words. 9. The method of claim 1, further comprising: synthesizing a speech output including the one or more words; anddisplaying a textual version of the speech output on a display of the electronic device. 10. A non-transitory computer readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by an electronic device, cause the device to: detect an error in a speech based interaction with a digital assistant based on detecting a user input other than the speech based interaction;in response to detecting the error, receive a speech input from a user, the speech input including a pronunciation of one or more words;determine, based on the pronunciation of the one or more words, a first set of phonemes from a speech recognition phonetic alphabet and a second set of phonemes from a speech synthesis phonetic alphabet;update one or more databases to include the first set of phonemes and the second set of phonemes in association with a text string corresponding to the one or more words; andperform speech recognition or speech synthesis using the updated one or more databases. 11. The computer readable storage medium of claim 10, wherein the one or more words were received in a prior speech input provided by the user, and wherein the error is an error in speech recognition of the one or more words. 12. The computer readable storage medium of claim 10, wherein the one or more words were output in a speech output by the electronic device, and wherein the error is an error in speech synthesis of the one or more words. 13. The computer readable storage medium of claim 10, wherein the instructions further cause the device to: receive the speech input including the one or more words;perform speech recognition on the speech input to generate the text string corresponding to the one or more words;determine a confidence metric of the text string; anddetect the error based on a determination that the confidence metric does not meet a predetermined threshold. 14. The computer readable storage medium of claim 10, wherein the instructions further cause the device to: synthesize a speech output including the one or more words; anddetect the error based on an indication from the user that the one or more words were pronounced incorrectly. 15. An electronic device, comprising: one or more processors; andmemory storing one or more programs, the one or more programs including instructions, which when executed by the one or more processors, cause the one or more processors to: detect an error in a speech based interaction with a digital assistant based on detecting a user input other than the speech based interaction;in response to detecting the error, receive a speech input from a user, the speech input including a pronunciation of one or more words;determine, based on the pronunciation of the one or more words, a first set of phonemes from a speech recognition phonetic alphabet and a second set of phonemes from a speech synthesis phonetic alphabet;update one or more databases to include the first set of phonemes and the second set of phonemes in association with a text string corresponding to the one or more words; andperform speech recognition or speech synthesis using the updated one or more databases. 16. The device of claim 15, wherein the one or more words were received in a prior speech input provided by the user, and wherein the error is an error in speech recognition of the one or more words. 17. The device of claim 15, wherein the one or more words were output in a speech output by the electronic device, and wherein the error is an error in speech synthesis of the one or more words. 18. The device of claim 15, wherein the instructions further cause the one or more processors to: receive the speech input including the one or more words;perform speech recognition on the speech input to generate the text string corresponding to the one or more words;determine a confidence metric of the text string; anddetect the error based on a determination that the confidence metric does not meet a predetermined threshold. 19. The device of claim 15, wherein the instructions further cause the one or more processors to: synthesize a speech output including the one or more words; anddetect the error based on an indication from the user that the one or more words were pronounced incorrectly.
Copyright KISTI. All Rights Reserved.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.