IPC분류정보
국가/구분 |
United States(US) Patent
등록
|
국제특허분류(IPC7판) |
|
출원번호 |
US-0229324
(2008-08-20)
|
등록번호 |
US-8650028
(2014-02-11)
|
발명자
/ 주소 |
|
출원인 / 주소 |
- Mindspeed Technologies, Inc.
|
대리인 / 주소 |
|
인용정보 |
피인용 횟수 :
2 인용 특허 :
148 |
초록
▼
A method comprises analyzing each frame of a plurality of frames of the speech signal to determine one or more speech parameters for the speech signal; deciding, for each frame of the plurality of frames of the speech signal, based on the one or more speech parameters of the speech signal, to select
A method comprises analyzing each frame of a plurality of frames of the speech signal to determine one or more speech parameters for the speech signal; deciding, for each frame of the plurality of frames of the speech signal, based on the one or more speech parameters of the speech signal, to select one of a plurality of encoding modes including a first encoding mode and a second encoding mode for encoding each frame of the plurality of frames of the speech signal; encoding each frame of the plurality of frames of the speech signal according to the selected one of the plurality of encoding modes for each frame of the plurality of frames in the deciding; the first encoding mode supports a first encoding rate and the second encoding mode supports a second encoding rate, wherein the first encoding rate is the same encoding rate as the encoding rate.
대표청구항
▼
1. A method of encoding a speech signal, the method comprising: analyzing each frame of a plurality of frames of the speech signal to determine one or more speech parameters for the speech signal, wherein one parameter of the one or more speech parameters includes one or more pitch lags;deciding, fo
1. A method of encoding a speech signal, the method comprising: analyzing each frame of a plurality of frames of the speech signal to determine one or more speech parameters for the speech signal, wherein one parameter of the one or more speech parameters includes one or more pitch lags;deciding, for each frame of the plurality of frames of the speech signal, based on the one or more speech parameters of the speech signal, to select one of a plurality of encoding modes including a first encoding mode, a second encoding mode and a third encoding mode for encoding each frame of the plurality of frames of the speech signal;converting the speech signal into an encoded speech by encoding each frame of the plurality of frames of the speech signal according to the selected one of the plurality of encoding modes for each frame of the plurality of frames in the deciding;wherein the first encoding mode supports a first encoding rate, the second encoding mode supports a second encoding rate and the third encoding mode supports a third encoding rate, wherein the first encoding rate is the same encoding rate as the second encoding rate, wherein the third encoding rate is different than the first encoding rate and the second encoding rate, and wherein the converting of the speech signal to the encoded speech signal for each frame of the plurality of frames of the speech signal further comprises encoding a single pitch lag of the one or more pitch lags if the encoding mode is one of the second encoding mode and the third encoding mode;wherein the converting of the speech signal to an encoded speech signal for each frame of the plurality of frames of the speech signal comprises encoding of a single pitch lag of the one or more pitch lags if the encoding mode is the second encoding mode or the third encoding mode. 2. The method of claim 1, wherein the first encoding rate and the second encoding rate are both at 6.65 kbps, and the third encoding rate is 5.80 kbps. 3. The method of claim 1, wherein the first encoding mode is long-term prediction mode (LTP_mode) and the second encoding mode and the third encoding mode are pitch preprocessing mode (PP_mode). 4. The method of claim 1, wherein the deciding is based on the one or more speech parameters of the speech signal including a pitch lag parameter. 5. The method of claim 1, wherein the deciding is based on the one or more speech parameters of the speech signal including a pitch gain parameter. 6. The method of claim 1, wherein the deciding is based on the one or more speech parameters of the speech signal including a line spectrum frequency LSF parameter. 7. The method of claim 1, wherein the deciding is based on the one or more speech parameters of the speech signal including a pitch correlation parameter. 8. The method of claim 1, wherein the deciding is based on the one or more speech parameters of the speech signal including linear prediction analysis parameters. 9. The method of claim 8, wherein the deciding is based on the one or more speech parameters of the speech signal including a distance measure between linear prediction analysis parameters. 10. The method of claim 1, wherein the first encoding mode supports a plurality of encoding rates including the first encoding rate and the second encoding mode supports a plurality of encoding rates including the second encoding rate. 11. A speech encoding system for encoding a speech signal, the speech encoding system comprising: an encoder processing circuit configured to: analyze each frame of a plurality of frames of the speech signal to determine one or more speech parameters for the speech signal, wherein one parameter of the one or more speech parameters includes one or more pitch lags;decide, for each frame of the plurality of frames of the speech signal, based on the one or more speech parameters of the speech signal, to select one of a plurality of encoding modes including a first encoding, a second encoding mode and a third encoding mode for encoding each frame of the plurality of frames of the speech signal;convert the speech signal into an encoded speech by encoding each frame of the plurality of frames of the speech signal according to the selected one of the plurality of encoding modes for each frame of the plurality of frames, thereby converting the speech signal into an encoded speech;wherein the first encoding mode supports a first encoding rate, the second encoding mode supports a second encoding rate and the third encoding mode supports a third encoding rate, wherein the first encoding rate is the same encoding rate as the second encoding rate, wherein the third encoding rate is different than the first encoding rate and the second encoding rate, wherein converting the speech signal to the encoded speech signal for each frame of the plurality of frames of the speech signal further comprises encoding a single pitch lag of the one or more pitch lags if the encoding mode is one of the second encoding mode and the third encoding mode. 12. The speech encoding system of claim 11, wherein the first encoding rate and the second encoding rate are both at 6.65 kbps, and the third encoding rate is 5.80 kbps. 13. The speech encoding system of claim 11, wherein the first encoding mode is long-term prediction mode (LTP_mode) and the second encoding mode and the third encoding mode are pitch preprocessing mode (PP_mode). 14. The speech encoding system of claim 11, wherein the encoder processing circuit is configured to decide based on the one or more speech parameters of the speech signal including a pitch lag parameter. 15. The speech encoding system of claim 11, wherein the encoder processing circuit is configured to decide based on the one or more speech parameters of the speech signal including a pitch gain parameter. 16. The speech encoding system of claim 11, wherein the encoder processing circuit is configured to decide based on the one or more speech parameters of the speech signal including a line spectrum frequency LSF parameter. 17. The speech encoding system of claim 11, wherein the encoder processing circuit is configured to decide based on the one or more speech parameters of the speech signal including a pitch correlation parameter. 18. The speech encoding system of claim 11, wherein the encoder processing circuit is configured to decide based on the one or more speech parameters of the speech signal including linear prediction analysis parameters. 19. The speech encoding system of claim 18, wherein the encoder processing circuit is configured to decide based on the one or more speech parameters of the speech signal including a distance measure between linear prediction analysis parameters. 20. The speech encoding system of claim 11, wherein the first encoding mode supports a plurality of encoding rates including the first encoding rate and the second encoding mode supports a plurality of encoding rates including the second encoding rate. 21. The method of claim 1, wherein the converting of the speech signal to the encoded speech signal for each frame of the plurality of frames of the speech signal uses Code Excited Linear Prediction (CELP) if the encoding mode is the first encoding mode. 22. The speech encoding system of claim 11, wherein converting the speech signal to the encoded speech signal for each frame of the plurality of frames of the speech signal uses Code Excited Linear Prediction (CELP) if the encoding mode is the first encoding mode.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.