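Country / Type | United States (US) Patent, Granted |
---|---|
International Patent Classification (IPC, 7th edition) | |
Application No. | US-0834239 (2015-08-24) |
Registration No. | US-10074360 (2018-09-11) |
Inventor / Address | |
Applicant / Address | |
Agent / Address | |
Citation information | Times cited: 0; Patents cited: 1951 |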
This relates to providing an indication of the suitability of an acoustic environment for performing speech recognition. One process can include receiving an audio input and determining a speech recognition suitability based on the audio input. The speech recognition suitability can include a numerical, textual, graphical, or other representation of the suitability of an acoustic environment for performing speech recognition. The process can further include displaying a visual representation of the speech recognition suitability to indicate the likelihood that a spoken user input will be interpreted correctly. This allows a user to determine whether to proceed with the performance of a speech recognition process, or to move to a different location having a better acoustic environment before performing the speech recognition process. In some examples, the user device can disable operation of a speech recognition process in response to determining that the speech recognition suitability is below a threshold suitability.
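The flow described in the abstract (receive audio, derive a suitability value, gate speech recognition on a threshold) can be sketched as follows. This is a minimal illustration and not the patented method: the function names, the RMS-level heuristic, the dBFS calibration points, and the 0.5 threshold are all assumptions introduced here, and the input is assumed to be ambient audio in the range [-1.0, 1.0] that contains no user speech.

```python
import math


def noise_level_dbfs(samples):
    """Return the RMS level of the samples in dB relative to full scale (1.0)."""
    if not samples:
        raise ValueError("samples must not be empty")
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    # Floor the RMS so that digital silence does not produce log10(0).
    return 20.0 * math.log10(max(rms, 1e-10))


def suitability_value(samples, quiet_dbfs=-60.0, loud_dbfs=-20.0):
    """Map the ambient level to [0, 1].

    Assumed calibration (hypothetical): 1.0 at or below quiet_dbfs,
    0.0 at or above loud_dbfs, linear in between.
    """
    level = noise_level_dbfs(samples)
    score = (loud_dbfs - level) / (loud_dbfs - quiet_dbfs)
    return min(1.0, max(0.0, score))


def speech_recognition_enabled(score, threshold=0.5):
    """Recognition stays enabled only when the score is not below the threshold."""
    return score >= threshold


if __name__ == "__main__":
    ambient = [0.002, -0.001, 0.0015, -0.002] * 100  # hypothetical quiet room
    score = suitability_value(ambient)
    print("suitability:", round(score, 2))
    print("recognition enabled:", speech_recognition_enabled(score))
```

A real device would more plausibly combine several acoustic characteristics rather than a single level measurement; the single-feature mapping is used here only to keep the gating logic visible.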
1. A method for operating a virtual assistant, the method comprising: at an electronic device: receiving an audio input from an acoustic environment; determining a speech recognition suitability value based on the audio input, wherein the speech recognition suitability value represents a suitability of the acoustic environment of the electronic device for speech recognition; in accordance with a determination of the speech recognition suitability value, displaying a visual representation of the speech recognition suitability value; determining whether the speech recognition suitability value satisfies a predetermined criterion; and in accordance with a determination that the speech recognition suitability value does not satisfy the predetermined criterion, disabling, by the electronic device, speech recognition functionality on the electronic device.
2. The method of claim 1, wherein determining the speech recognition suitability value based on the audio input comprises: determining one or more characteristics of the acoustic environment based on the audio input; and determining the speech recognition suitability based on the one or more characteristics of the acoustic environment.
3. The method of claim 2, wherein the one or more characteristics of the acoustic environment comprises a signal to noise ratio for a first frequency band of the acoustic environment.
4. The method of claim 3, wherein the one or more characteristics of the acoustic environment comprises a type of noise detected in the first frequency band.
5. The method of claim 2, wherein the one or more characteristics of the acoustic environment comprises a signal to noise ratio for a second frequency band of the acoustic environment.
6. The method of claim 5, wherein the one or more characteristics of the acoustic environment comprises a type of noise detected in the second frequency band.
7. The method of claim 2, wherein the one or more characteristics of the acoustic environment comprises a number of transient noises detected in a buffer comprising previously recorded audio of the acoustic environment.
8. The method of claim 2, wherein determining the speech recognition suitability value comprises: determining a speech recognition suitability vector based on the audio input, wherein the speech recognition suitability vector comprises one or more elements that represent the one or more characteristics of the acoustic environment; and using a neural network to determine the speech recognition suitability value based on the speech recognition suitability vector.
9. The method of claim 1, wherein the visual representation comprises one or more bars, and wherein a value of the speech recognition suitability value is represented by a number of the one or more bars.
10. The method of claim 1, wherein the visual representation comprises an icon, and wherein the speech recognition suitability value is represented by a color of the icon.
11. The method of claim 10, wherein the icon comprises an image of a microphone.
12. The method of claim 10, wherein displaying the visual representation of the speech recognition suitability value comprises: determining whether the speech recognition suitability value is less than a threshold value; in accordance with a determination that the speech recognition suitability value is less than the threshold value, displaying the icon in a grayed out state; and in accordance with a determination that the speech recognition suitability value is not less than the threshold value, displaying the icon in a non-grayed out state.
13. The method of claim 1, wherein the method further comprises: receiving a user selection of the visual representation of the speech recognition suitability value; in accordance with a determination that the speech recognition suitability value is not less than a threshold value, performing speech recognition on an audio input received subsequent to receiving the user selection; and in accordance with a determination that the speech recognition suitability value is less than the threshold value, forgoing the performance of speech recognition on the audio input received subsequent to receiving the user selection.
14. The method of claim 12, wherein the method further comprises: in accordance with a determination that the speech recognition suitability value is less than the threshold value, outputting a message indicating a low suitability of the acoustic environment of the electronic device for speech recognition.
15. The method of claim 1, wherein the visual representation comprises a textual representation of the speech recognition suitability value.
16. The method of claim 1, wherein determining the speech recognition suitability value comprises periodically determining the speech recognition suitability value, and wherein displaying the visual representation of the speech recognition suitability value comprises updating the display of the visual representation of the speech recognition suitability value in accordance with the periodically determined speech recognition suitability value.
17. The method of claim 1, wherein the speech recognition suitability value comprises a numerical value.
18. The method of claim 1, wherein the audio input does not include speech from a user of the electronic device.
19. A non-transitory computer-readable storage medium for operating a virtual assistant, the computer-readable storage medium comprising instructions for: receiving an audio input from an acoustic environment; determining a speech recognition suitability value based on the audio input, wherein the speech recognition suitability value represents a suitability of the acoustic environment of the electronic device for speech recognition; in accordance with a determination of the speech recognition suitability value, displaying a visual representation of the speech recognition suitability value; determining whether the speech recognition suitability value satisfies a predetermined criterion; and in accordance with a determination that the speech recognition suitability value does not satisfy the predetermined criterion, disabling, by the electronic device, speech recognition functionality on the electronic device.
20. The storage medium of claim 19, wherein the visual representation comprises an icon, and wherein the speech recognition suitability value is represented by a color of the icon.
21. The storage medium of claim 20, wherein displaying the visual representation of the speech recognition suitability value comprises: determining whether the speech recognition suitability value is less than a threshold value; in accordance with a determination that the speech recognition suitability value is less than the threshold value, displaying the icon in a grayed out state; and in accordance with a determination that the speech recognition suitability value is not less than the threshold value, displaying the icon in a non-grayed out state.
22. The storage medium of claim 21, further comprising: in accordance with a determination that the speech recognition suitability value is less than the threshold value, outputting a message indicating a low suitability of the acoustic environment of the electronic device for speech recognition.
23. The storage medium of claim 20, wherein the icon comprises an image of a microphone.
24. The storage medium of claim 19, wherein determining the speech recognition suitability value comprises periodically determining the speech recognition suitability value, and wherein displaying the visual representation of the speech recognition suitability value comprises updating the display of the visual representation of the speech recognition suitability value in accordance with the periodically determined speech recognition suitability value.
25. The storage medium of claim 19, wherein the speech recognition suitability value is determined based on a signal to noise ratio for a first frequency band of the acoustic environment.
26. The storage medium of claim 19, wherein determining the speech recognition suitability value based on the audio input comprises: determining one or more characteristics of the acoustic environment based on the audio input; and determining the speech recognition suitability based on the one or more characteristics of the acoustic environment.
27. The storage medium of claim 26, further comprising: determining a speech recognition suitability vector based on the audio input, wherein the speech recognition suitability vector comprises one or more elements that represent the one or more characteristics; and using a neural network to determine the speech recognition suitability value based on the speech recognition suitability vector.
28. The storage medium of claim 26, wherein the one or more characteristics of the acoustic environment comprises a type of noise detected in a first frequency band.
29. The storage medium of claim 26, wherein the one or more characteristics of the acoustic environment comprises a number of transient noises detected in a buffer comprising previously recorded audio of the acoustic environment.
30. The storage medium of claim 19, wherein the visual representation comprises one or more bars, and wherein a value of the speech recognition suitability value is represented by a number of the one or more bars.
31. The storage medium of claim 19, wherein the visual representation comprises a textual representation of the speech recognition suitability value.
32. The storage medium of claim 19, wherein the speech recognition suitability value comprises a numerical value.
33. A system for operating a virtual assistant, the system comprising: one or more processors; memory; and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for: receiving an audio input from an acoustic environment; determining a speech recognition suitability value based on the audio input, wherein the speech recognition suitability value represents a suitability of acoustic environment of the electronic device for speech recognition; in accordance with a determination of the speech recognition suitability value, displaying a visual representation of the speech recognition suitability value; determining whether the speech recognition suitability value satisfies a predetermined criterion; and in accordance with a determination that the speech recognition suitability value does not satisfy the predetermined criterion, disabling, by the electronic device, speech recognition functionality on the electronic device.
34. The system of claim 33, wherein determining the speech recognition suitability value based on the audio input comprises: determining one or more characteristics of the acoustic environment based on the audio input; and determining the speech recognition suitability based on the one or more characteristics of the acoustic environment.
35. The system of claim 34, further comprising: determining a speech recognition suitability vector based on the audio input, wherein the speech recognition suitability vector comprises one or more elements that represent the one or more characteristics; and using a neural network to determine the speech recognition suitability value based on the speech recognition suitability vector.
36. The system of claim 34, wherein the one or more characteristics of the acoustic environment comprises a type of noise detected in a first frequency band.
37. The system of claim 34, wherein the one or more characteristics of the acoustic environment comprises a number of transient noises detected in a buffer comprising previously recorded audio of the acoustic environment.
38. The system of claim 33, wherein the visual representation comprises an icon, and wherein the speech recognition suitability value is represented by a color of the icon.
39. The system of claim 38, wherein displaying the visual representation of the speech recognition suitability value comprises: determining whether the speech recognition suitability value is less than a threshold value; in accordance with a determination that the speech recognition suitability value is less than the threshold value, displaying the icon in a grayed out state; and in accordance with a determination that the speech recognition suitability value is not less than the threshold value, displaying the icon in a non-grayed out state.
40. The system of claim 39, further comprising: in accordance with a determination that the speech recognition suitability value is less than the threshold value, outputting a message indicating a low suitability of the acoustic environment of the electronic device for speech recognition.
41. The system of claim 38, wherein the icon comprises an image of a microphone.
42. The system of claim 33, wherein determining the speech recognition suitability value comprises periodically determining the speech recognition suitability value, and wherein displaying the visual representation of the speech recognition suitability value comprises updating the display of the visual representation of the speech recognition suitability value in accordance with the periodically determined speech recognition suitability value.
43. The system of claim 33, wherein the speech recognition suitability value is determined based on a signal to noise ratio for a first frequency band of the acoustic environment.
44. The system of claim 33, wherein the visual representation comprises one or more bars, and wherein a value of the speech recognition suitability value is represented by a number of the one or more bars.
45. The system of claim 33, wherein the visual representation comprises a textual representation of the speech recognition suitability value.
46. The system of claim 33, wherein the speech recognition suitability value comprises a numerical value.
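
Several dependent claims describe building a suitability vector from acoustic characteristics (per-band signal to noise ratios, a transient noise count), passing it through a neural network, and rendering the result as bars or as a grayed out icon. The sketch below shows one way those pieces might fit together. Everything specific in it is an assumption made for illustration: the feature scaling, the network shape, the untrained placeholder weights, the five-bar display, and the 0.5 threshold are not taken from the patent.

```python
import math


def band_snr_db(signal_power, noise_power):
    """Signal to noise ratio of one frequency band, in dB."""
    return 10.0 * math.log10(signal_power / noise_power)


def suitability_vector(snr_band1_db, snr_band2_db, transient_count):
    """Pack the characteristics into a vector (scaling factors are assumed)."""
    return [snr_band1_db / 30.0, snr_band2_db / 30.0, transient_count / 10.0]


# Placeholder weights for a tiny 3-2-1 feedforward network. They are
# hand-picked for illustration, not trained on any data.
HIDDEN_W = [[1.5, 0.5, -1.0], [0.5, 1.5, -1.0]]
HIDDEN_B = [0.0, 0.0]
OUT_W = [2.0, 2.0]
OUT_B = -2.0


def suitability_from_vector(vec):
    """Forward pass: tanh hidden layer, sigmoid output in (0, 1)."""
    hidden = [
        math.tanh(sum(w * x for w, x in zip(row, vec)) + b)
        for row, b in zip(HIDDEN_W, HIDDEN_B)
    ]
    z = sum(w * h for w, h in zip(OUT_W, hidden)) + OUT_B
    return 1.0 / (1.0 + math.exp(-z))


def bars_for(value, max_bars=5):
    """Represent the value as a number of bars, rounding to the nearest bar."""
    value = min(1.0, max(0.0, value))
    return int(value * max_bars + 0.5)


def icon_state(value, threshold=0.5):
    """Gray the icon out only when the value is less than the threshold."""
    return "grayed" if value < threshold else "normal"


if __name__ == "__main__":
    vec = suitability_vector(band_snr_db(100.0, 1.0), band_snr_db(50.0, 1.0), 2)
    value = suitability_from_vector(vec)
    print("bars:", bars_for(value), "icon:", icon_state(value))
```

Calling these functions on a timer and redrawing the bars or icon each time would correspond to the periodic update described in claims 16, 24, and 42.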