Adaptive tilt compensation for synthesized speech
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G10L-019/09
G10L-019/12
G10L-019/18
G10L-019/20
G10L-025/90
G10L-019/02
G10L-019/00
출원번호
US-0215649
(2008-06-27)
등록번호
US-9401156
(2016-07-26)
발명자
/ 주소
Su, Huan-Yu
Gao, Yang
출원인 / 주소
SAMSUNG ELECTRONICS CO., LTD.
대리인 / 주소
Sughrue Mion, PLLC
인용정보
피인용 횟수 :
0인용 특허 :
149
초록▼
There is provided a method of using an adaptive tilt compensation by a speech decoder. The method comprises receiving a bit stream including a plurality of parameters representative of a speech signal; identifying an adaptive code vector and a fixed code vector using the plurality of parameters; sca
There is provided a method of using an adaptive tilt compensation by a speech decoder. The method comprises receiving a bit stream including a plurality of parameters representative of a speech signal; identifying an adaptive code vector and a fixed code vector using the plurality of parameters; scaling the adaptive code vector and the fixed code vector to generate a scaled adaptive code vector and a scaled fixed code vector; summing the scaled adaptive code vector and the scaled fixed code vector to generate a synthesized output; calculating a first reflection coefficient based on the plurality of parameters representative of the speech signal; multiplying the first reflection coefficient by a factor to generate a tilt factor; and applying the tilt factor to the synthesized output based on an encoding bit rate.
대표청구항▼
1. A method of using an adaptive tilt compensation by a speech decoder and generating a speech signal, the method comprising: receiving a bit stream including a plurality of parameters representative of the speech signal;identifying an adaptive code vector and a fixed code vector using the plurality
1. A method of using an adaptive tilt compensation by a speech decoder and generating a speech signal, the method comprising: receiving a bit stream including a plurality of parameters representative of the speech signal;identifying an adaptive code vector and a fixed code vector using the plurality of parameters;scaling the adaptive code vector and the fixed code vector to generate a scaled adaptive code vector and a scaled fixed code vector;generating a first synthesized output using the scaled adaptive code vector and the scaled fixed code vector;generating a tilt factor based on the plurality of parameters representative of the speech signal and an encoding rate, wherein the tilt factor is generated by calculating a first reflection coefficient and multiplying the first reflection coefficient by a factor;applying the tilt factor to the first synthesized output to generate a second synthesized output; andconverting the second synthesized output into the speech signal;wherein the converting of the second synthesized output into the speech signal comprises applying a synthesis filter to the second synthesized output, andwherein the generating a tilt factor comprises increasing the tilt factor as the encoding rate decreases while decoding the speech signal. 2. The method of claim 1, wherein the speech decoder includes a first encoding bit rate and a second encoding bit rate, wherein the first encoding bit rate is higher than the second encoding bit rate, and wherein the method further comprises: determining whether the encoding bit rate is the first encoding bit rate or the second encoding bit rate;applying the tilt factor to the first synthesized output if the encoding bit rate is the second encoding bit rate; anddeciding to not apply the tilt factor to the first synthesized output if the encoding bit rate is the first encoding bit rate. 3. The method of claim 1, wherein generating the first synthesized output is by summing the scaled adaptive code vector and the scaled fixed code vector. 4. The method of claim 1, wherein the first synthesized output is a synthesized residual. 5. The method of claim 1, wherein the first synthesized output is a weighted synthesized residual. 6. The method of claim 1, wherein the first synthesized output is a signal in a residual domain. 7. The method of claim 1, wherein the first synthesized output is a weighted signal in a residual domain. 8. A speech decoder comprising: a receiver configured to receive a bit stream including a plurality of parameters representative of a speech signal;an adaptive codebook; anda fixed codebook;wherein the speech decoder is configured to identify an adaptive code vector and a fixed code vector using the plurality of parameters from the adaptive codebook and the fixed codebook, respectively;wherein the speech decoder is further configured to scale the adaptive code vector and the fixed code vector to generate a scaled adaptive code vector and a scaled fixed code vector;wherein the speech decoder is further configured to generate a first synthesized output using the scaled adaptive code vector and the scaled fixed code vector;wherein the speech decoder is further configured to generate a tilt factor based on the plurality of parameters representative of the speech signal and an encoding rate and wherein the tilt factor is generated by calculating a first reflection coefficient and multiplying the first reflection coefficient by a factor;wherein the speech decoder is further configured to apply the tilt factor to the first synthesized output to generate a second synthesized output;wherein the speech decoder is further configured to apply a synthesis filter to the second synthesized output; andwherein the speech decoder generates the tilt factor by increasing the tilt factor as the encoding rate decreases while decoding the speech signal. 9. The speech decoder of claim 8, wherein the speech decoder includes a first encoding bit rate and a second encoding bit rate, wherein the first encoding bit rate is higher than the second encoding bit rate, and wherein the speech decoder is further configured to determine whether the encoding bit rate is the first encoding bit rate or the second encoding bit rate, apply the tilt factor to the first synthesized output if the encoding bit rate is the second encoding bit rate, and decide to not apply the tilt factor to the first synthesized output if the encoding bit rate is the first encoding bit rate. 10. The speech decoder of claim 8, wherein the speech decoder is further configured to generate the first synthesized output by summing the scaled adaptive code vector and the scaled fixed code vector. 11. The speech decoder of claim 8, wherein the first synthesized output is a synthesized residual. 12. The speech decoder of claim 8, wherein the first synthesized output is a weighted synthesized residual. 13. The speech decoder of claim 8, wherein the first synthesized output is a signal in a residual domain. 14. The speech decoder of claim 8, wherein the first synthesized output is a weighted signal in a residual domain. 15. A method of using an adaptive tilt compensation by a multi-rate speech decoder and generating a speech signal, the method comprising: receiving a bit stream including a plurality of parameters representative of the speech signal;identifying an adaptive code vector and a fixed code vector using the plurality of parameters;scaling the adaptive code vector and the fixed code vector to generate a scaled adaptive code vector and a scaled fixed code vector;generating a first synthesized output using the scaled adaptive code vector and the scaled fixed code vector;determining an amount based on an encoding bit rate for a tilt factor by calculating a first reflection coefficient based on the plurality of parameters representative of the speech signal and multiplying the first reflection coefficient by a factor;applying the tilt factor to the first synthesized output to generate a second synthesized output; andconverting the second synthesized output into the speech signal;wherein the converting of the second synthesized output into the speech signal comprises applying a synthesis filter to the second synthesized output, andwherein the determining the amount based on the encoding bit rate for the tilt factor increases the tilt factor as the encoding rate decreases while decoding the speech signal. 16. The method of claim 15, wherein the first synthesized output is a synthesized residual. 17. The method of claim 15, wherein the first synthesized output is a weighted synthesized residual. 18. The method of claim 15, wherein the first synthesized output is a signal in a residual domain. 19. The method of claim 15, wherein the first synthesized output is a weighted signal in a residual domain. 20. A multi-rate speech decoder comprising: a receiver configured to receive a bit stream including a plurality of parameters representative of a speech signal;an adaptive codebook; anda fixed codebook;wherein the multi-rate speech decoder is configured to identify an adaptive code vector and a fixed code vector using the plurality of parameters from the adaptive codebook and the fixed codebook;wherein the multi-rate speech decoder is further configured to scale the adaptive code vector and the fixed code vector to generate a scaled adaptive code vector and a scaled fixed code vector;wherein the multi-rate speech decoder is further configured to generate a first synthesized output using the scaled adaptive code vector and the scaled fixed code vector;wherein the multi-rate speech decoder is further configured to determine an amount based on an encoding bit rate for a tilt factor by calculating a first reflection coefficient based on the plurality of parameters representative of the speech signal and multiplying the first reflection coefficient by a factor;wherein the multi-rate speech decoder is further configured to apply the tilt factor to the first synthesized output to generate a second synthesized output;wherein the multi-rate speech decoder is further configured to convert the second synthesized output into the speech signal; andwherein the multi-rate speech decoder is further configured to convert the second synthesized output into the speech signal by applying a synthesis filter to the second synthesized output; andwherein the multi-rate speech decoder determines the amount based on the encoding bit rate for the tilt factor by increasing the tilt factor as the encoding rate decreases while decoding the speech signal. 21. The multi-rate speech decoder of claim 20, wherein the first synthesized output is a synthesized residual. 22. The multi-rate speech decoder of claim 20, wherein the first synthesized output is a weighted synthesized residual. 23. The multi-rate speech decoder of claim 20, wherein the first synthesized output is a signal in a residual domain. 24. The multi-rate speech decoder of claim 20, wherein the first synthesized output is a weighted signal in a residual domain. 25. A method of using an adaptive tilt compensation by a multi-rate speech decoder and generating a speech signal, the method comprising: receiving a bit stream including a plurality of parameters representative of the speech signal, wherein the plurality of parameters include a first parameter and a second parameter;identifying an adaptive code vector using the first parameter;identifying a fixed code vector using the second parameter;scaling the adaptive code vector to generate a scaled adaptive code vector; scaling the fixed code vector to generate a scaled fixed code vector;generating a first synthesized output using the scaled adaptive code vector and the scaled fixed code vector;generating a tilt factor based on a bit rate of the multi-rate speech decoder;applying the tilt factor to the first synthesized output to generate a second synthesized output; andconverting the second synthesized output into the speech signal;wherein the converting of the second synthesized output into the speech signal comprises applying a synthesis filter to the second synthesized output, andwherein the generating the tilt factor comprises increasing the tilt factor as the bit rate decreases while decoding the speech signal. 26. The method of claim 25, wherein the plurality of parameters further include a third parameter and generating the tilt factor comprises: calculating a first reflection coefficient based on the third parameter; and multiplying the first reflection coefficient by a factor to generate the tilt factor. 27. The method of claim 26, wherein generating the first synthesized output comprises summing the scaled adaptive code vector and the scaled fixed code vector. 28. The method of claim 26, wherein the first synthesized output is a synthesized residual. 29. The method of claim 26, wherein the first synthesized output is a weighted synthesized residual. 30. The method of claim 26, wherein the first synthesized output is a signal in a residual domain. 31. The method of claim 26, wherein the first synthesized output is a weighted signal in a residual domain. 32. A multi-rate speech decoder comprising: a receiver configured to receive a bit stream including a plurality of parameters representative of a speech signal, wherein the plurality of parameters include a first parameter and a second parameter;an adaptive codebook; anda fixed codebook;wherein the multi-rate speech decoder is configured to identify an adaptive code vector from the adaptive codebook using the first parameter;wherein the multi-rate speech decoder is configured to identify a fixed code vector from the fixed codebook using the second parameter;wherein the multi-rate speech decoder is further configured to scale the adaptive code vector to generate a scaled adaptive code vector;wherein the multi-rate speech decoder is further configured to scale the fixed code vector to generate a scaled fixed code vector;wherein the multi-rate speech decoder is further configured to generate a first synthesized output using the scaled adaptive code vector and the scaled fixed code vector;wherein the multi-rate speech decoder is further configured to generate a tilt factor based on a bit rate of the multi-rate speech decoder;wherein the multi-rate speech decoder is further configured to apply the tilt factor to the first synthesized output to generate a second synthesized output; andwherein the multi-rate speech decoder is further configured to convert the second synthesized output into the speech signal by applying a synthesis filter to the second synthesized output, and wherein the multi-rate speech decoder is configured to generate the tilt factor by increasing the tilt factor as the bit rate decreases while decoding the speech signal. 33. The multi-rate speech decoder of claim 32, wherein the plurality of parameters further include a third parameter and wherein the multi-rate speech decoder is configured to generate the tilt factor by: calculating a first reflection coefficient based on the third parameter; andmultiplying the first reflection coefficient by a factor to generate the tilt factor. 34. The multi-rate speech decoder of claim 33, wherein the first synthesized output is a synthesized residual. 35. The multi-rate speech decoder of claim 33, wherein the first synthesized output is a weighted synthesized residual. 36. The multi-rate speech decoder of claim 33, wherein the first synthesized output is a signal in a residual domain. 37. The multi-rate speech decoder of claim 33, wherein the first synthesized output is' a weighted signal in a residual domain. 38. The multi-rate speech decoder of claim 33, wherein the multi-rate speech decoder is configured to generate the first synthesized output by summing the scaled adaptive code vector and the scaled fixed code vector. 39. A multi-rate speech decoder comprising: a receiver configured to receive a bit stream including a plurality of parameters representative of a speech signal, wherein the plurality of parameters include a first parameter and a second parameter;an adaptive code vector generator configured to generate an adaptive code vector using the first parameter;a fixed code vector generator configured to generate a fixed code vector using the second parameter;an adaptive codebook gain configured to scale the adaptive code vector to generate a scaled adaptive code vector;a fixed codebook gain configured to scale the fixed code vector to generate a scaled fixed code vector;a first synthesized output generator configured to generate a first synthesized output using the scaled adaptive code vector and the scaled fixed code vector;a tilt factor generator configured to generate a tilt factor based on a bit rate of the multi-rate speech decoder;a tilt compensator configured to apply the tilt factor to the first synthesized output to generate a second synthesized output; anda speech converter configured to apply a synthesis filter to the second synthesized output to generate the speech signal, andwherein the tilt factor generator is configured to increase the tilt factor as the bit rate decreases while decoding the speech signal. 40. The multi-rate speech decoder of claim 39, wherein the plurality of parameters further include a third parameter and wherein the tilt factor generator comprises: a first reflection coefficient calculator configured to calculate a first reflection coefficient based on the third parameter; anda multiplier configured to multiply the first reflection coefficient by a factor to generate the tilt factor. 41. The multi-rate speech decoder of claim 40, wherein the first synthesized output is a synthesized residual. 42. The multi-rate speech decoder of claim 40, wherein the first synthesized output is a weighted synthesized residual. 43. The multi-rate speech decoder of claim 40, wherein the first synthesized output is a signal in a residual domain. 44. The multi-rate speech decoder of claim 40, wherein the first synthesized output is a weighted signal in a residual domain. 45. The multi-rate speech decoder of claim 40, wherein a first synthesized output generator is configured to add the scaled adaptive code vector and the scaled fixed code vector to generate the first synthesized output. 46. A method of using an adaptive tilt compensation by a multi-rate speech decoder and generating a speech signal, the method comprising: receiving a bit stream including a plurality of parameters representative of the speech signal, wherein the plurality of parameters include at least an adaptive codebook index, an adaptive codebook gain, a fixed codebook index, a fixed codebook gain and synthesis filter parameters;identifying an adaptive code vector using at least one of the plurality of parameters;identifying a fixed code vector using at least one of the plurality of parameters; scaling the adaptive code vector to generate a scaled adaptive code vector;scaling the fixed code vector to generate a scaled fixed code vector;generating a first synthesized output using the scaled adaptive code vector and the scaled fixed code vector;generating a tilt factor based on a bit rate of the multi-rate speech decoder;generating a second synthesized output from the first synthesized output by applying the tilt factor; andconverting the second synthesized output into the speech signal;wherein the converting of the second synthesized output into the speech signal comprises applying a synthesis filter to the second synthesized output, andwherein the generating the tilt factor comprises increasing tilt factor as the bit rate decreases while decoding the speech signal. 47. The method of claim 46, wherein the generating the tilt factor comprises: calculating a first reflection coefficient based on at least one of the plurality of parameters; andmultiplying the first reflection coefficient by a factor to generate the tilt factor. 48. The method of claim 47, wherein generating the first synthesized output comprises summing the scaled adaptive code vector and the scaled fixed code vector. 49. The method of claim 47, wherein the first synthesized output is a synthesized residual. 50. The method of claim 47, wherein the first synthesized output is a weighted synthesized residual. 51. The method of claim 47, wherein the first synthesized output is a signal in a residual domain. 52. The method of claim 47, wherein the first synthesized output is a weighted signal in a residual domain. 53. A multi-rate speech decoder comprising: a receiver configured to receive a bit stream including a plurality of parameters representative of the speech signal, wherein the plurality of parameters include at least an adaptive codebook index, an adaptive codebook gain, a fixed codebook index, a fixed codebook gain and synthesis filter parameters;an adaptive codebook; anda fixed codebook;wherein the multi-rate speech decoder is configured to identify an adaptive code vector from the adaptive codebook using at least one of the plurality of parameters;wherein the multi-rate speech decoder is configured to identify a fixed code vector from the fixed codebook using at least one of the plurality of parameters;wherein the multi-rate speech decoder is further configured to scale the adaptive code vector to generate a scaled adaptive code vector;wherein the multi-rate speech decoder is further configured to scale the fixed code vector to generate a scaled fixed code vector;wherein the multi-rate speech decoder is further configured to generate a first synthesized output using the scaled adaptive code vector and the scaled fixed code vector;wherein the multi-rate speech decoder is further configured to generate a tilt factor based on a bit rate of the multi-rate speech decoder;wherein the multi-rate speech decoder is further configured to generate a second synthesized output from the first synthesized output by applying the tilt factor;wherein the multi-rate speech decoder is further configured to convert the second synthesized output into the speech signal by applying a synthesis filter to the second synthesized output; andwherein the multi-rate speech decoder is further configured to increase the tilt factor as the bit rate decreases while decoding the speech signal. 54. The multi-rate speech decoder of claim 53, wherein the multi-rate speech decoder is configured to generate the tilt factor by: calculating a first reflection coefficient based on at least one of the plurality of parameters; andmultiplying the first reflection coefficient by a factor to generate the tilt factor. 55. The multi-rate speech decoder of claim 54, wherein the first synthesized output is a synthesized residual. 56. The multi-rate speech decoder of claim 54, wherein the first synthesized output is a weighted synthesized residual. 57. The multi-rate speech decoder of claim 54, wherein the first synthesized output is a signal in a residual domain. 58. The multi-rate speech decoder of claim 54, wherein the first synthesized output is a weighted signal in a residual domain. 59. The multi-rate speech decoder of claim 54, wherein the multi-rate speech decoder is configured to generate the first synthesized output by summing the scaled adaptive code vector and the scaled fixed code vector.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (149)
Toyoshima Yasuo,JPX ; Kusagawa Tetsuya,JPX ; Hamajima Mitsugu,JPX ; Nakanishi Minoru,JPX ; Yana Yuji,JPX ; Saka Akihiko,JPX, Absorbent article.
Poulsen Steven P. ; Preston ; II James P., Apparatus and method for maintaining a constant speech envelope using variable coefficient automatic gain control.
Aoyagi Hiromi,JPX ; Ariyama Yoshihiro,JPX ; Hosoda Kenichiro,JPX, Code-excited linear predictive coder and decoder with conversion filter for converting stochastic and impulsive excitati.
Ingemar Johansson SE; Erik Ekudden SE; Jonas Svedberg SE; Anders Uvliden SE, Complex signal activity detection for improved speech/noise classification of an audio signal.
Valenzuela Steude Reinaldo A. (Wellesley MA) Danisewicz Ronald G. (Methuen MA), Digital speech coder having improved long term lag parameter determination.
Gerson Ira A. (1120 Nottingham La. Hoffman Estates IL 60195) Jasiuk Mark A. (6611 N. Hiawatha Ave. Chicago IL 60646), Digital speech coder having optimized signal energy parameters.
Anderson Keith W. ; Toot ; Jr. Carl B., Integration of mobile station channel measurement with paging channel displacement operation in a wireless communicatio.
Patel Jayesh S. ; Kolb Douglas E., LPAS speech coder using vector quantized, multi-codebook, multi-tap pitch predictor and optimized ternary source excitation codebook derivation.
Galand Claude (Cagnes Sur Mer FRX) Menez Jean (Cagnes Sur Mer FRX) Rosso Michle (Nice FRX), Method and apparatus for dynamically adapting a vector-quantizing coder codebook.
Padovani, Roberto; Bender, Paul E.; Black, Peter J.; Grob, Matthew S.; Hinderling, Jurg K.; Sindhushayana, Nagabhushana T.; Wheatley, III, Charles E., Method and apparatus for high rate packet data transmission.
El Maleh, Khaled H.; Choy, Eddie Lun Tik; Ananthapadmanabhan, Arasanipalai K.; DeJaco, Andrew P.; Huang, Pengjun, Method and apparatus for improved detection of rate errors in variable rate receivers.
Nishiguchi,Masayuki; Iijima,Kazuyuki; Matsumoto,Jun; Omori,Shiro, Method and apparatus for speech encoding and decoding by sinusoidal analysis and waveform encoding with phase reproducibility.
Okajima, Ichiro; Maebara, Akihiro; Murase, Atsushi; Kobayashi, Katsumi; Takami, Tadao, Method and apparatus of a mobile communication system which reduces power consumption by observing only the starting slot on a TDMA radio channel.
Honkanen Tero,FIX ; LaFlamme Claude,CAX ; Adoul Jean-Pierre,CAX, Method and device for eradicating instability due to periodic signals in analysis-by-synthesis speech codecs.
Byun Kyung-Jin,KRX ; Yoo Ha-Young,KRX ; Kim Jong-Jae,KRX ; Han Ki-Chun,KRX ; Kim Jae-Suk,KRX ; Bae Myung-Jin,KRX, Method for reducing pitch search time for vocoder.
Olofsson H.ang.kan Gunnar,SEX ; Thielecke Jorn,DEX, Method for selecting a combination of modulation and channel coding schemes in a digital communication system.
Rotola-Pukkila, Jani; Mikkola, Hannu; Kapanen, Pekka; Vainio, Janne, Method for transmitting background noise information in data transmission in data frames.
Massingill Larry William ; Reinhold Stan L. ; Weiss Richard E. ; Solve Torbjorn Wilson, Methods and systems for reduced power operation of cellular mobile terminals.
Gupta Prabhat K. (Germantown MD) Lamkin Allan (Arlington VA) Kepley ; III Walter R. (Gaithersburg MD), Real-time implementation of a 8Kbps CELP coder on a DSP pair.
Gerson Ira A. (Schaumburg IL) Jasiuk Mark A. (Chicago IL) Hartman Matthew A. (Bloomington IL), Speech coding method and apparatus using mean squared error modifier for selected speech coder parameters using VSELP te.
Yoshizumi Yoshiyuki (Suita JPX) Mekata Tsuyoshi (Katano JPX) Yamada Yoshinori (Katano JPX) Suzuki Ryoji (Nara JPX), Speech signal processing apparatus for amplifying an input signal based upon consonant features of the signal.
Thyssen Jes, Synchronized encoder-decoder frame concealment using speech coding parameters including line spectral frequencies and filter coefficients.
Bartkowiak John G. ; Ireton Mark, System and method for performing pitch estimation and error checking on low estimated pitch values in a correlation base.
Pertrushin Valery A., System, method and article of manufacture for detecting emotion in voice signals through analysis of a plurality of voice signal parameters.
Jacobs Paul E. (San Diego CA) Gardner William R. (San Diego CA) Lee Chong U. (San Diego CA) Gilhousen Klein S. (San Diego CA) Lam S. Katherine (San Diego CA) Tsai Ming-Chang (San Diego CA), Variable rate vocoder.
Jacobs Paul E. (San Diego CA) Gardner William R. (San Diego CA) Lee Chong U. (San Diego CA) Gilhousen Klein S. (San Diego CA) Lam S. Katherine (San Diego CA) Tsai Ming-Chang (San Diego CA), Variable rate vocoder.
Nishiguchi Masayuki,JPX ; Matsumoto Jun,JPX ; Ono Shinobu,JPX, Voice analysis-synthesis method using noise having diffusion which varies with frequency band to modify predicted phase.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.