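IPC classification information
Country / Type | United States (US) Patent, Registered
International Patent Classification (IPC 7th edition) |
Application number | UP-0020596 (2004-12-22)
Registration number | US-7634064 (2009-12-24)
Inventor / Address |
Applicant / Address |
Agent / Address |
Citation information | Times cited: 6; Cited patents: 120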
Abstract
A system and method for improving voice recognition processing at a server system that receives voice input from a remotely located user system. The user system includes a microphone, a processor that performs front-end voice recognition processing of the received user voice input, and a communication component configured to send the front-end processed user voice input to a destination wirelessly over a network. The server system includes a communication component configured to receive the sent front-end processed user voice input, and a processor configured to complete voice recognition processing of the sent front-end processed user voice input.
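The division of work described in the abstract can be illustrated with a minimal sketch. The abstract does not say what the front-end processing consists of, so everything here is an assumption made for illustration: the frame size, the log-energy feature, the JSON serialization standing in for the wireless link, and the function names `front_end`, `send`, and `back_end` are all hypothetical. The server step is a placeholder, not a recognizer.

```python
import json
import math

FRAME_SIZE = 160  # hypothetical: 20 ms frames at an assumed 8 kHz sample rate


def front_end(samples):
    """User-system side: reduce raw samples to one log-energy value per frame.

    Log energy is an assumed stand-in for whatever front-end voice
    recognition processing the user system actually performs.
    """
    features = []
    for start in range(0, len(samples) - FRAME_SIZE + 1, FRAME_SIZE):
        frame = samples[start:start + FRAME_SIZE]
        energy = sum(s * s for s in frame) / FRAME_SIZE
        features.append(round(math.log10(energy + 1e-9), 3))
    return features


def send(features):
    """Stand-in for the wireless network link: serialize features to bytes."""
    return json.dumps({"features": features}).encode("utf-8")


def back_end(payload, threshold=-2.0):
    """Server side: receive the front-end output and finish processing.

    Placeholder logic only: counts frames whose log energy exceeds an
    assumed threshold instead of running real recognition.
    """
    features = json.loads(payload.decode("utf-8"))["features"]
    active = [f for f in features if f > threshold]
    return {"frames": len(features), "active_frames": len(active)}


# One silent frame followed by one frame of a 440 Hz tone.
silence = [0.0] * FRAME_SIZE
tone = [0.5 * math.sin(2 * math.pi * 440 * n / 8000) for n in range(FRAME_SIZE)]
result = back_end(send(front_end(silence + tone)))
```

The point of the split is that only the compact feature list crosses the network, while the server receives data that is already partly processed.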
Representative claims
The invention claimed is:
1. A method for digital signal manipulation, comprising: receiving an acoustic analog signal at a user system; converting the analog signal to a digital signal; canceling noise from the digital signal to form a processed digital signal; detecting user speech in the processed digital signal by evaluating change in amplitude sign of the processed digital signal; and if user speech is detected in the processed digital signal, packaging the user speech into speech packets to form a packaged voice signal; selecting a transmission format compatible with the packaged voice signal; and transmitting the packaged voice signal to a server; receiving non acoustic data at the user system; and if user speech is not detected in the processed digital signal, packaging the non acoustic data into data packets to form a packaged data signal; selecting a transmission format compatible with the packaged data signal; and transmitting the packaged data signal to a server.
2. The method of claim 1, wherein the packaged data signal is transmitted to the server using a maximum possible bandwidth.
3. A method for digital signal manipulation, comprising: receiving an acoustic analog signal at a user system; converting the analog signal to a digital signal; canceling noise from the digital signal to form a processed digital signal; detecting user speech in the processed digital signal by evaluating change in amplitude sign of the processed digital signal; and if user speech is detected in the processed digital signal, packaging the user speech into speech packets to form a packaged voice signal; selecting a transmission format compatible with the packaged voice signal; transmitting the packaged voice signal to a server; evaluating the processed digital signal to determine whether data exist that enhances speech detection and matching at the server; and if data from the processed digital signal exist to be transmitted to the server to enhance speech detection and matching at the server, packaging the data from the processed digital signal into data packets; and interspersing data packets with the voice packets.
4. A method for digital signal manipulation, comprising: receiving an acoustic analog signal at a user system; converting the analog signal to a digital signal; canceling noise and echoes from the digital signal to form a processed digital signal; detecting user speech in the processed digital signal by evaluating change in amplitude sign of the processed digital signal; and if user speech is detected in the processed digital signal, packaging the user speech into speech packets to form a packaged voice signal; selecting a transmission format compatible with the packaged voice signal; and transmitting the packaged voice signal to a server; if user speech is not detected in the processed digital signal, receiving non acoustic data at the user system; packaging the processed digital signal into data packets to form a packaged data signal; selecting a transmission format compatible with the packaged data signal; and transmitting the packaged data signal to a server.
5. The method of claim 4, wherein the packaged data signal is transmitted to the server using a maximum possible bandwidth.
6. A method for digital signal manipulation, comprising: receiving an acoustic analog signal at a user system; converting the analog signal to a digital signal; canceling noise and echoes from the digital signal to form a processed digital signal; detecting user speech in the processed digital signal by evaluating change in amplitude sign of the processed digital signal; and if user speech is detected in the processed digital signal, packaging the user speech into speech packets to form a packaged voice signal; selecting a transmission format compatible with the packaged voice signal; and transmitting the packaged voice signal to a server; evaluating the processed digital signal to determine whether data exist that enhances speech detection and matching at the server; and if data from the processed digital signal exists to be transmitted to the server to enhance speech detection and matching at the server, packaging the data from the processed digital signal into data packets; and interspersing data packets with the voice packets.
7. A system comprising: a user system configured for receiving an acoustic analog signal and converting to a digital signal, where at the user system the system further comprises: a processor having: a first algorithm for canceling noise and echoes from the digital signal to form a processed digital signal; a second algorithm for detecting speech in the processed digital signal by examining for the change in amplitude sign and the rate of amplitude change in the processed digital signal; a third algorithm for packaging the processed digital signal with data or speech packets in accordance with the detected user speech to form a packaged voice signal; a fourth algorithm for selecting a transmission format in accord with the packaged voice signal; and means for transmitting the digital signal to a server, the server having a plurality of algorithms, wherein the plurality of algorithms match the speech content of the packaged voice signal with instructions stored in the server database to execute the instructions, wherein if user speech is detected, a determination is made whether data needs to be added to the packaged voice signal to enhance speech detection and matching at the server.
8. The system of claim 7, wherein if the determination is negative, the transmission format for the packaged voice signal is a maximum packet bandwidth.
9. The system of claim 7, wherein if the determination is affirmative, the transmission format for the packaged voice signal includes interspersing data packets with voice packets.
10. A system comprising: means for receiving at a user an acoustic analog signal and converting to a digital signal, where at the user system the system further comprises: means for canceling noise and echoes from the digital signal to form a processed digital signal; means for detecting user speech in the processed digital signal by examining for the change in amplitude sign and the rate of amplitude change in the processed digital signal; means for packaging the processed digital signal with data or speech packets in accordance with the user detected user speech to form a packaged voice signal; means for selecting a transmission format in accord with the packaged voice signal; means for transmitting the packaged voice signal to a server system, where the server system further includes: means for matching the speech content of the packaged voice signal with instructions stored in the server database; and means for executing the instructions; if user speech is not detected in the processed digital signal, further comprising: means for receiving non acoustic data at the user system; means for packaging the processed digital signal into data packets to form a packaged data signal; means for selecting a transmission format compatible with the packaged data signal; and means for transmitting the packaged data signal to a server.
11. A system comprising: means for receiving at a user an acoustic analog signal and converting to a digital signal, where at the user system the system further comprises: means for canceling noise and echoes from the digital signal to form processed digital signal; means for detecting user speech in the processed digital signal by examining for the change in amplitude sign and the rate of amplitude change in the processed digital signal; means for packaging the processed digital signal with data or speech packets in accordance with the user detected user speech to form a packaged voice signal; means for selecting a transmission format in accord with the packaged voice signal; means for transmitting the packaged voice signal to a server system, where the server system further includes: means for matching the speech content of the packaged voice signal with instructions stored in the server database; and means for executing the instructions; if user speech is detected in the processed digital signal, further comprising: means for evaluating the additional digital data to determine whether data exist that enhances speech detection and matching at the server; and if data from the processed digital signal exists to be transmitted to the server to enhance speech detection and matching at the server, means for packaging the data from the processed digital signal into data packets, and means for interspersing data packets with the voice packets.
12. A system comprising: means for receiving at a user an acoustic analog signal and converting to a digital signal, where at the user system the system further comprises: means for canceling noise and echoes from the digital signal to form a processed digital signal; means for detecting user speech in the processed digital signal by examining for the change in amplitude sign and the rate of amplitude change in the processed digital signal; means for packaging the processed digital signal with data or speech packets in accordance with the user detected user speech to form a packaged voice signal; means for selecting a transmission format in accord with the packaged voice signal; means for transmitting the packaged voice signal to a server system, where the server system further includes: means for matching the speech content of the packaged voice signal with instructions stored in the server database; and means for executing the instructions; wherein if user speech is detected, a determination is made whether data needs to be added to the packaged voice signal to enhance speech detection and matching at the server.
13. The system of claim 12, wherein if the determination is negative, the transmission format for the packaged voice signal is a maximum packet bandwidth.
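The packet handling recited in the method claims can be sketched roughly as follows. This is an illustrative reading, not the patented implementation: the claims state only that speech is detected "by evaluating change in amplitude sign", so the sign-change count band (`min_changes`, `max_changes`), the packet size, the interleaving interval `every`, and all names (`Packet`, `count_sign_changes`, `speech_detected`, `package`, `intersperse`) are assumptions. Noise cancellation and transmission-format selection are omitted.

```python
from dataclasses import dataclass


@dataclass
class Packet:
    kind: str      # "voice" or "data"
    payload: list  # samples or non-acoustic values carried by this packet


def count_sign_changes(frame):
    """Count adjacent sample pairs where the amplitude sign flips."""
    return sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))


def speech_detected(frame, min_changes=4, max_changes=60):
    """Assumed rule: treat the frame as speech when the number of sign
    changes falls inside a band. The thresholds are hypothetical."""
    return min_changes <= count_sign_changes(frame) <= max_changes


def package(frame, non_acoustic, packet_size=4):
    """Routing in the style of claim 1: speech becomes voice packets;
    otherwise the non-acoustic data becomes data packets."""
    if speech_detected(frame):
        source, kind = frame, "voice"
    else:
        source, kind = non_acoustic, "data"
    return [Packet(kind, source[i:i + packet_size])
            for i in range(0, len(source), packet_size)]


def intersperse(voice_packets, data_packets, every=2):
    """Interleaving in the style of claim 3: place one data packet after
    every `every` voice packets, then append any data packets left over."""
    out, pending = [], list(data_packets)
    for i, vp in enumerate(voice_packets, start=1):
        out.append(vp)
        if pending and i % every == 0:
            out.append(pending.pop(0))
    out.extend(pending)
    return out


speech_frame = [1, -1] * 8          # alternating signs: 15 sign changes
silent_frame = [0] * 16             # no sign changes
voice = package(speech_frame, non_acoustic=[10, 20, 30, 40, 50])
data = package(silent_frame, non_acoustic=[10, 20, 30, 40, 50])
stream = intersperse(voice, data)
```

With these inputs, `voice` holds four voice packets, `data` holds two data packets, and `stream` alternates two voice packets with one data packet.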