Adaptive gain reduction for encoding a speech signal
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G10L-019/00
G10L-019/20
G10L-019/09
G10L-019/18
G10L-025/90
출원번호
US-0218242
(2008-07-11)
등록번호
US-9269365
(2016-02-23)
발명자
/ 주소
Su, Huan-Yu
Gao, Yang
출원인 / 주소
Mindspeed Technologies, Inc.
대리인 / 주소
Farjami & Farjami LLP
인용정보
피인용 횟수 :
1인용 특허 :
153
초록▼
There is provided a method of encoding an input speech signal. The method comprises identifying a fixed codebook vector from a fixed codebook; identifying an adaptive codebook vector from a adaptive codebook; calculating an adaptive codebook gain; reducing the adaptive codebook gain by an amount; op
There is provided a method of encoding an input speech signal. The method comprises identifying a fixed codebook vector from a fixed codebook; identifying an adaptive codebook vector from a adaptive codebook; calculating an adaptive codebook gain; reducing the adaptive codebook gain by an amount; optimally selecting a fixed codebook gain based on the adaptive codebook gain while both the fixed codebook vector and the adaptive codebook vector remain fixed; and converting the input speech signal into an encoded speech using the fixed codebook gain, the adaptive codebook gain, the fixed codebook vector and the adaptive codebook vector. The amount of reducing the adaptive codebook gain may be varied.
대표청구항▼
1. A method of encoding a speech signal using a multi-rate encoder having a plurality of encoding rates, the method comprising: converting the speech signal from an analog form to a digitized speech signal;identifying a fixed codebook vector from a fixed codebook using the digitized speech signal;id
1. A method of encoding a speech signal using a multi-rate encoder having a plurality of encoding rates, the method comprising: converting the speech signal from an analog form to a digitized speech signal;identifying a fixed codebook vector from a fixed codebook using the digitized speech signal;identifying an adaptive codebook vector from an adaptive codebook using the digitized speech signal;calculating an adaptive codebook gain;reducing the adaptive codebook gain by an amount, wherein the amount is based on one encoding rate of the plurality of encoding rates;optimally selecting a fixed codebook gain based on the adaptive codebook gain while both the fixed codebook vector and the adaptive codebook vector remain fixed; andgenerating an encoded speech using the fixed codebook gain, the adaptive codebook gain, the fixed codebook vector and the adaptive codebook vector;wherein the reducing of the adaptive codebook gain reduces the adaptive codebook gain more when the encoding rate is a first encoding rate of the plurality of encoding rates than when the encoding rate is a second encoding rate of the plurality of encoding rates, and wherein the first encoding rate is lower than the second encoding rate. 2. The method of claim 1, wherein the encoding rate is adaptively selected from the plurality of encoding rates. 3. The method of claim 2, wherein the encoding rate is adaptively selected on a frame-by-frame basis. 4. The method of claim 1, wherein the amount is further based on a correlation value. 5. The method of claim 4, wherein the correlation value is based on an original target signal. 6. The method of claim 4, wherein the correlation value is based on a filtered signal from the adaptive codebook. 7. A multi-rate speech encoding device having a plurality of encoding rates for encoding a speech signal, the device comprising: a fixed codebook;an adaptive codebook; anda processing circuit configured to: convert the speech signal from an analog form to a digitized speech signal;identify a fixed codebook vector from the fixed codebook using the digitized speech signal;identify an adaptive codebook vector from the adaptive codebook using the digitized speech signal;calculate an adaptive codebook gain;reduce the adaptive codebook gain by an amount, wherein the amount is based on one encoding rate from the plurality of encoding rates;optimally select a fixed codebook gain based on the adaptive codebook gain while both the fixed codebook vector and the adaptive codebook vector remain fixed; andgenerate an encoded speech using the fixed codebook gain, the adaptive codebook gain, the fixed codebook vector and the adaptive codebook vector;wherein reducing the adaptive codebook gain by the processing circuit reduces the adaptive codebook gain more when the encoding rate is a first encoding rate of the plurality of encoding rates than when the encoding rate is a second encoding rate of the plurality of encoding rates, and wherein the first encoding rate is lower than the second encoding rate. 8. The device of claim 7, wherein the encoding rate is adaptively selected from the plurality of encoding rates. 9. The device of claim 8, wherein the encoding rate is adaptively selected on a frame-by-frame basis. 10. The device of claim 7, wherein the amount is further based on a correlation value. 11. The device of claim 10, wherein the correlation value is based on an original target signal. 12. The device of claim 10, wherein the correlation value is based on a filtered signal from the adaptive codebook. 13. A method of encoding a speech signal using a multi-rate encoder having a plurality of encoding rates, the method comprising: converting the speech signal from an analog form to a digitized speech signal;identifying a plurality of pitch parameters using the digitized speech signal;applying a perceptual weighting filter to the digitized speech signal to generate a first target signal;identifying an adaptive codebook vector from an adaptive codebook using the plurality of pitch parameters;filtering the adaptive codebook vector to generate a filtered adaptive codebook vector;calculating an adaptive codebook gain for the adaptive codebook vector;adaptively reducing the adaptive codebook gain based on a correlation between the first target speech and the filtered adaptive codebook vector to generate a reduced adaptive codebook gain, wherein the adaptively reducing the adaptive codebook gain is further based on one encoding rate from the plurality of encoding rates;generating a second target signal based on the first target signal, the filtered adaptive codebook vector and the reduced adaptive codebook gain; andgenerating an encoded speech using the second target signal:wherein the reducing of the adaptive codebook gain reduces the adaptive codebook gain more when the encoding rate is a first encoding rate of the plurality of encoding rates than when the encoding rate is a second encoding rate of the plurality of encoding rates, and wherein the first encoding rate is lower than the second encoding rate. 14. The method of claim 13, wherein the encoding rate is adaptively selected from the plurality of encoding rates. 15. A multi-rate speech encoding device having a plurality of encoding rates for encoding a speech signal, the device comprising: an adaptive codebook; anda processing circuit configured to: convert the speech signal from an analog form to a digitized speech signal;identify a plurality of pitch parameters using the digitized speech signal;apply a perceptual weighting filter to the digitized speech signal to generate a first target signal;identify an adaptive codebook vector from the adaptive codebook using the plurality of pitch parameters;filter the adaptive codebook vector to generate a filtered adaptive codebook vector;calculate an adaptive codebook gain for the adaptive codebook vector;adaptively reduce the adaptive codebook gain based on a correlation between the first target speech and the filtered adaptive codebook vector to generate a reduced adaptive codebook gain, wherein the processing circuit is further configured to adaptively reduce the adaptive codebook gain based on one encoding rate from the plurality of encoding rates;generate a second target signal based on the first target signal, the filtered adaptive codebook vector and the reduced adaptive codebook gain; andgenerate an encoded speech using the second target signal;wherein reducing the adaptive codebook gain by the processing circuit reduces the adaptive codebook gain more when the encoding rate is a first encoding rate of the plurality of encoding rates than when the encoding rate is a second encoding rate of the plurality of encoding rates, and wherein the first encoding rate is lower than the second encoding rate. 16. The device of claim 15, wherein the encoding rate is adaptively selected from the plurality of encoding rates. 17. The method of claim 14, wherein the encoding rate is adaptively selected on a frame-by-frame basis. 18. The device of claim 16, wherein the encoding rate is adaptively selected on a frame-by-frame basis. 19. A method of encoding a speech signal using a multi-rate encoder having a plurality of encoding rates, the method comprising: converting the speech signal from an analog form to a digitized speech signal;identifying a fixed codebook vector from a fixed codebook using the digitized speech signal;identifying an adaptive codebook vector from an adaptive codebook using the digitized speech signal;calculating an adaptive codebook gain;reducing the adaptive codebook gain by an amount, wherein the amount varies based on the plurality of encoding rates;optimally selecting a fixed codebook gain based on the adaptive codebook gain while both the fixed codebook vector and the adaptive codebook vector remain fixed; andgenerating an encoded speech using the fixed codebook gain, the adaptive codebook gain, the fixed codebook vector and the adaptive codebook vector;wherein the reducing of the adaptive codebook gain reduces the adaptive codebook gain more when the encoding rate is a first encoding rate of the plurality of encoding rates than when the encoding rate is a second encoding rate of the plurality of encoding rates, and wherein the first encoding rate is lower than the second encoding rate. 20. The method of claim 19 further comprising adaptively selecting one of the plurality of encoding rates. 21. The method of claim 20, wherein the adaptively selecting is performed on a frame-by-frame basis. 22. The method of claim 19, wherein the amount is further based on a correlation value. 23. The method of claim 22, wherein the correlation value is based on an original target signal. 24. The method of claim 22, wherein the correlation value is based on a filtered signal from the adaptive codebook. 25. A multi-rate speech encoding device having a plurality of encoding rates for encoding a speech signal, the device comprising: a fixed codebook;an adaptive codebook; anda processing circuit configured to: convert the speech signal from an analog form to a digitized speech signal;identify a fixed codebook vector from the fixed codebook using the digitized speech signal;identify an adaptive codebook vector from the adaptive codebook using the digitized speech signal;calculate an adaptive codebook gain;reduce the adaptive codebook gain by an amount, wherein the amount varies based on the plurality of encoding rates;optimally select a fixed codebook gain based on the adaptive codebook gain while both the fixed codebook vector and the adaptive codebook vector remain fixed; andgenerate an encoded speech using the fixed codebook gain, the adaptive codebook gain, the fixed codebook vector and the adaptive codebook vector;wherein reducing the adaptive codebook gain by the processing circuit reduces the adaptive codebook gain more when the encoding rate is a first encoding rate of the plurality of encoding rates than when the encoding rate is a second encoding rate of the plurality of encoding rates, and wherein the first encoding rate is lower than the second encoding rate. 26. The device of claim 25, wherein the processing circuit configured to adaptively select one of the plurality of encoding rates. 27. The device of claim 26, wherein the processing circuit configured to adaptively select one of the plurality of encoding rates on a frame-by-frame basis. 28. The device of claim 25, wherein the amount is further based on a correlation value. 29. The device of claim 28, wherein the correlation value is based on an original target signal. 30. The device of claim 28, wherein the correlation value is based on a filtered signal from the adaptive codebook. 31. A method of encoding a speech signal using a multi-rate encoder having a plurality of encoding rates, the method comprising: converting the speech signal from an analog form to a digitized speech signal;identifying a fixed codebook vector from a fixed codebook using the digitized speech signal;identifying an adaptive codebook vector from an adaptive codebook using the digitized speech signal;calculating an adaptive codebook gain;reducing the adaptive codebook gain by an amount, wherein the amount depends on the plurality of encoding rates;optimally selecting a fixed codebook gain based on the adaptive codebook gain while both the fixed codebook vector and the adaptive codebook vector remain fixed; andgenerating an encoded speech using the fixed codebook gain, the adaptive codebook gain, the fixed codebook vector and the adaptive codebook vector;wherein the reducing of the adaptive codebook gain reduces the adaptive codebook gain more when the encoding rate is a first encoding rate of the plurality of encoding rates than when the encoding rate is a second encoding rate of the plurality of encoding rates, and wherein the first encoding rate is lower than the second encoding rate. 32. The method of claim 31 further comprising adaptively selecting one of the plurality of encoding rates. 33. The method of claim 32, wherein the adaptively selecting is performed on a frame-by-frame basis. 34. The method of claim 31, wherein the amount further depends on a correlation value. 35. The method of claim 34, wherein the correlation value depends on an original target signal. 36. The method of claim 34, wherein the correlation value depends on a filtered signal from the adaptive codebook. 37. A multi-rate speech encoding device having a plurality of encoding rates for encoding a speech signal, the device comprising: a fixed codebook;an adaptive codebook; anda processing circuit configured to: convert the speech signal from an analog form to a digitized speech signal;identify a fixed codebook vector from the fixed codebook using the digitized speech signal;identify an adaptive codebook vector from the adaptive codebook using the digitized speech signal;calculate an adaptive codebook gain;reduce the adaptive codebook gain by an amount, wherein the amount depends on the plurality of encoding rates;optimally select a fixed codebook gain based on the adaptive codebook gain while both the fixed codebook vector and the adaptive codebook vector remain fixed; andgenerate an encoded speech using the fixed codebook gain, the adaptive codebook gain, the fixed codebook vector and the adaptive codebook vector;wherein reducing the adaptive codebook gain by the processing circuit reduces the adaptive codebook gain more when the encoding rate is a first encoding rate of the plurality of encoding rates than when the encoding rate is a second encoding rate of the plurality of encoding rates, and wherein the first encoding rate is lower than the second encoding rate. 38. The device of claim 37, wherein the processing circuit configured to adaptively select one of the plurality of encoding rates. 39. The device of claim 38, wherein the processing circuit configured to adaptively select one of the plurality of encoding rates on a frame-by-frame basis. 40. The device of claim 37, wherein the amount further depends on a correlation value. 41. The device of claim 40, wherein the correlation value depends on an original target signal. 42. The device of claim 40, wherein the correlation value depends on a filtered signal from the adaptive codebook. 43. A method of encoding a speech signal using a multi-rate encoder having a plurality of encoding rates, the method comprising: converting the speech signal from an analog form to a digitized speech signal;identifying a fixed codebook vector from a fixed codebook using the digitized speech signal;identifying an adaptive codebook vector from an adaptive codebook using the digitized speech signal;calculating an adaptive codebook gain;reducing the adaptive codebook gain by an amount, wherein the amount is based on an encoding rate selected from the plurality of encoding rates:optimally selecting a fixed codebook gain based on the adaptive codebook gain while both the fixed codebook vector and the adaptive codebook vector remain fixed; andgenerating an encoded speech using the fixed codebook gain, the adaptive codebook gain, the fixed codebook vector and the adaptive codebook vector;wherein the reducing of the adaptive codebook gain reduces the adaptive codebook gain more when the encoding rate is a first encoding rate of the plurality of encoding rates than when the encoding rate is a second encoding rate of the plurality of encoding rates, and wherein the first encoding rate is lower than the second encoding rate. 44. The method of claim 43 further comprising adaptively selecting the encoding rate from the plurality of encoding rates. 45. The method of claim 44, wherein the adaptively selecting is performed on a frame-by-frame basis. 46. The method of claim 43, wherein the amount is further based on a correlation value. 47. The method of claim 46, wherein the correlation value is based on an original target signal. 48. The method of claim 46, wherein the correlation value is based on a filtered signal from the adaptive codebook. 49. A multi-rate speech encoding device having a plurality of encoding rates for encoding a speech signal, the device comprising: a fixed codebook;an adaptive codebook; anda processing circuit configured to: convert the speech signal from an analog form to a digitized speech signal;identify a fixed codebook vector from the fixed codebook using the digitized speech signal;identify an adaptive codebook vector from the adaptive codebook using the digitized speech signal;calculate an adaptive codebook gain;reduce the adaptive codebook gain by an amount, wherein the amount is based on an encoding rate selected from the plurality of encoding rates;optimally select a fixed codebook gain based on the adaptive codebook gain while both the fixed codebook vector and the adaptive codebook vector remain fixed; andgenerate an encoded speech using the fixed codebook gain, the adaptive codebook gain, the fixed codebook vector and the adaptive codebook vector;wherein reducing the adaptive codebook gain by the processing circuit reduces the adaptive codebook gain more when the encoding rate is a first encoding rate of the plurality of encoding rates than when the encoding rate is a second encoding rate of the plurality of encoding rates, and wherein the first encoding rate is lower than the second encoding rate. 50. The device of claim 49, wherein the processing circuit configured to adaptively select the encoding rate from the plurality of encoding rates. 51. The device of claim 50, wherein the processing circuit configured to adaptively select one of the plurality of encoding rates on a frame-by-frame basis. 52. The device of claim 49, wherein the amount is further based on a correlation value. 53. The device of claim 52, wherein the correlation value is based on an original target signal.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (153)
Toyoshima Yasuo,JPX ; Kusagawa Tetsuya,JPX ; Hamajima Mitsugu,JPX ; Nakanishi Minoru,JPX ; Yana Yuji,JPX ; Saka Akihiko,JPX, Absorbent article.
Poulsen Steven P. ; Preston ; II James P., Apparatus and method for maintaining a constant speech envelope using variable coefficient automatic gain control.
Aoyagi Hiromi,JPX ; Ariyama Yoshihiro,JPX ; Hosoda Kenichiro,JPX, Code-excited linear predictive coder and decoder with conversion filter for converting stochastic and impulsive excitati.
Oshikiri Masahiro,JPX ; Miseki Kimio,JPX ; Akamine Masami,JPX ; Amada Tadashi,JPX, Coding apparatus having adaptive coding at different bit rates and pitch emphasis.
Ingemar Johansson SE; Erik Ekudden SE; Jonas Svedberg SE; Anders Uvliden SE, Complex signal activity detection for improved speech/noise classification of an audio signal.
Valenzuela Steude Reinaldo A. (Wellesley MA) Danisewicz Ronald G. (Methuen MA), Digital speech coder having improved long term lag parameter determination.
Gerson Ira A. (1120 Nottingham La. Hoffman Estates IL 60195) Jasiuk Mark A. (6611 N. Hiawatha Ave. Chicago IL 60646), Digital speech coder having optimized signal energy parameters.
Anderson Keith W. ; Toot ; Jr. Carl B., Integration of mobile station channel measurement with paging channel displacement operation in a wireless communicatio.
Patel Jayesh S. ; Kolb Douglas E., LPAS speech coder using vector quantized, multi-codebook, multi-tap pitch predictor and optimized ternary source excitation codebook derivation.
Galand Claude (Cagnes Sur Mer FRX) Menez Jean (Cagnes Sur Mer FRX) Rosso Michle (Nice FRX), Method and apparatus for dynamically adapting a vector-quantizing coder codebook.
Padovani, Roberto; Bender, Paul E.; Black, Peter J.; Grob, Matthew S.; Hinderling, Jurg K.; Sindhushayana, Nagabhushana T.; Wheatley, III, Charles E., Method and apparatus for high rate packet data transmission.
El Maleh, Khaled H.; Choy, Eddie Lun Tik; Ananthapadmanabhan, Arasanipalai K.; DeJaco, Andrew P.; Huang, Pengjun, Method and apparatus for improved detection of rate errors in variable rate receivers.
Nishiguchi,Masayuki; Iijima,Kazuyuki; Matsumoto,Jun; Omori,Shiro, Method and apparatus for speech encoding and decoding by sinusoidal analysis and waveform encoding with phase reproducibility.
Okajima, Ichiro; Maebara, Akihiro; Murase, Atsushi; Kobayashi, Katsumi; Takami, Tadao, Method and apparatus of a mobile communication system which reduces power consumption by observing only the starting slot on a TDMA radio channel.
Honkanen Tero,FIX ; LaFlamme Claude,CAX ; Adoul Jean-Pierre,CAX, Method and device for eradicating instability due to periodic signals in analysis-by-synthesis speech codecs.
Byun Kyung-Jin,KRX ; Yoo Ha-Young,KRX ; Kim Jong-Jae,KRX ; Han Ki-Chun,KRX ; Kim Jae-Suk,KRX ; Bae Myung-Jin,KRX, Method for reducing pitch search time for vocoder.
Olofsson H.ang.kan Gunnar,SEX ; Thielecke Jorn,DEX, Method for selecting a combination of modulation and channel coding schemes in a digital communication system.
Rotola-Pukkila, Jani; Mikkola, Hannu; Kapanen, Pekka; Vainio, Janne, Method for transmitting background noise information in data transmission in data frames.
Massingill Larry William ; Reinhold Stan L. ; Weiss Richard E. ; Solve Torbjorn Wilson, Methods and systems for reduced power operation of cellular mobile terminals.
Gupta Prabhat K. (Germantown MD) Lamkin Allan (Arlington VA) Kepley ; III Walter R. (Gaithersburg MD), Real-time implementation of a 8Kbps CELP coder on a DSP pair.
Gerson Ira A. (Schaumburg IL) Jasiuk Mark A. (Chicago IL) Hartman Matthew A. (Bloomington IL), Speech coding method and apparatus using mean squared error modifier for selected speech coder parameters using VSELP te.
Yoshizumi Yoshiyuki (Suita JPX) Mekata Tsuyoshi (Katano JPX) Yamada Yoshinori (Katano JPX) Suzuki Ryoji (Nara JPX), Speech signal processing apparatus for amplifying an input signal based upon consonant features of the signal.
Thyssen Jes, Synchronized encoder-decoder frame concealment using speech coding parameters including line spectral frequencies and filter coefficients.
Bartkowiak John G. ; Ireton Mark, System and method for performing pitch estimation and error checking on low estimated pitch values in a correlation base.
Pertrushin Valery A., System, method and article of manufacture for detecting emotion in voice signals through analysis of a plurality of voice signal parameters.
Jacobs Paul E. (San Diego CA) Gardner William R. (San Diego CA) Lee Chong U. (San Diego CA) Gilhousen Klein S. (San Diego CA) Lam S. Katherine (San Diego CA) Tsai Ming-Chang (San Diego CA), Variable rate vocoder.
Jacobs Paul E. (San Diego CA) Gardner William R. (San Diego CA) Lee Chong U. (San Diego CA) Gilhousen Klein S. (San Diego CA) Lam S. Katherine (San Diego CA) Tsai Ming-Chang (San Diego CA), Variable rate vocoder.
Nishiguchi Masayuki,JPX ; Matsumoto Jun,JPX ; Ono Shinobu,JPX, Voice analysis-synthesis method using noise having diffusion which varies with frequency band to modify predicted phase.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.