Method and apparatus for predictively quantizing voiced speech
IPC classification information
Country / Type
United States (US) Patent
Granted
International Patent Classification (IPC, 7th edition)
G06F-015/00
G10L-021/00
G10L-021/02
G10L-019/00
Application number
US-0190524
(2008-08-12)
Registration number
US-8660840
(2014-02-25)
Inventor / Address
Ananthapadmanabhan, Arasanipalai K.
Manjunath, Sarath
Huang, Pengjun
Choy, Eddie-Lun Tik
Dejaco, Andrew P.
Applicant / Address
QUALCOMM Incorporated
Agent / Address
Yoo, Heejong
Citation information
Times cited: 2
Patents cited: 52
Abstract
A method and apparatus for predictively quantizing voiced speech includes a parameter generator and a quantizer. The parameter generator is configured to extract parameters from frames of predictive speech such as voiced speech, and to transform the extracted information to a frequency-domain representation. The quantizer is configured to subtract a weighted sum of the parameters for previous frames from the parameter for the current frame. The quantizer is configured to quantize the difference value. A prototype extractor may be added to first extract a pitch period prototype to be processed by the parameter generator.
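The quantizer step described in the abstract can be illustrated with a minimal sketch. It assumes a scalar parameter, a uniform rounding quantizer, and prediction from previously reconstructed values; none of these choices are stated in the abstract, and the names `uniform_quantize`, `predictive_quantize`, `weights`, and `step` are hypothetical.

```python
def uniform_quantize(x, step):
    """Round x to the nearest multiple of step (hypothetical stand-in quantizer)."""
    return step * round(x / step)


def predictive_quantize(values, weights, step):
    """Quantize each value as a residual against a weighted sum of earlier frames.

    values  -- one scalar parameter per frame
    weights -- prediction weights, most recent previous frame first (assumed)
    step    -- quantizer step size (assumed)
    Returns the reconstructed value for every frame.
    """
    history = [0.0] * len(weights)  # previous reconstructions, newest first
    reconstructed = []
    for value in values:
        # Weighted sum of the parameters for previous frames.
        prediction = sum(w * h for w, h in zip(weights, history))
        # Only the difference value is quantized.
        residual_q = uniform_quantize(value - prediction, step)
        recon = prediction + residual_q
        reconstructed.append(recon)
        if history:
            history = [recon] + history[:-1]
    return reconstructed


frames = [10.0, 10.4, 10.9, 11.1]
decoded = predictive_quantize(frames, weights=[0.6, 0.3], step=0.25)
```

Because only the residual passes through the quantizer, each reconstructed value differs from its input by at most half a quantizer step in this sketch.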
Representative claims
1. An apparatus comprising: a processor configured to: quantize a target error vector obtained from one or more parameters associated with a speech frame; quantize a difference between a pitch lag value for a current frame and a pitch lag value for a previous frame without quantizing the pitch lag value for the current frame; and form a set of quantized speech frame parameters from the quantized target error vector.

2. The apparatus of claim 1, wherein the one or more parameters include an amplitude component of the speech frame.

3. The apparatus of claim 1, wherein the one or more parameters include a phase value associated with the speech frame.

4. The apparatus of claim 1, wherein the one or more parameters include a linear spectral information component associated with the speech frame.

5. The apparatus of claim 1, wherein the processor is configured to transmit the set of quantized speech frame parameters across a wireless communication channel.

6. The apparatus of claim 1, wherein the one or more parameters have been extracted from a plurality of voiced speech frames.

7. The apparatus of claim 1, wherein the one or more parameters have been extracted from the speech frame, wherein the speech frame comprises a voiced speech frame.

8. The apparatus of claim 1, wherein the target error vector is defined by an equation:

$$T_M^n = \frac{L_M^n - \beta_1^n \hat{U}_{M-1}^n - \beta_2^n \hat{U}_{M-2}^n - \dots - \beta_P^n \hat{U}_{M-P}^n}{\beta_0^n}; \quad n = 0, 1, \dots, N-1,$$

wherein $L_M^n$ is an unquantized N-dimensional line spectral information (LSI) vector for an Mth frame, wherein $\hat{U}_{M-1}^n, \hat{U}_{M-2}^n, \dots, \hat{U}_{M-P}^n$ are contributions of LSI parameters of a number of frames, P, prior to a frame M, and wherein $\beta_0^n, \beta_1^n, \beta_2^n, \dots, \beta_P^n$ are respective weights such that $\beta_0^n + \beta_1^n + \beta_2^n + \dots + \beta_P^n = 1$.

9. The apparatus of claim 1, wherein a quantized pitch lag value is defined by an equation:

$$\hat{L}_m = \hat{\delta}L_m + \eta_{m_1} L_{m_1} + \eta_{m_2} L_{m_2} + \dots + \eta_{m_x} L_{m_x}$$

wherein $L_{m_1}, L_{m_2}, \dots, L_{m_x}$ are pitch lag values for frames $m_1, m_2, \dots, m_N$, respectively, and wherein $\eta_{m_1}, \eta_{m_2}, \dots, \eta_{m_x}$ are corresponding weights.

10. The apparatus of claim 1, wherein the processor is further configured to: quantize an amplitude prediction error vector obtained from the one or more parameters associated with the speech frame, wherein the quantized amplitude prediction error vector is defined by an equation:

$$\hat{A}_m = \hat{\delta}A_m + \alpha_{m_1}^T A_{m_1} + \alpha_{m_2}^T A_{m_2} + \dots + \alpha_{m_N}^T A_{m_N},$$

wherein $A_{m_1}, A_{m_2}, \dots, A_{m_N}$ are a subset of amplitude vectors for frames $m_1, m_2, \dots, m_N$, respectively, and wherein $\alpha_{m_1}^T, \alpha_{m_2}^T, \dots, \alpha_{m_N}^T$ are transposes of corresponding weight vectors.

11. A method of forming a set of quantized speech frame parameters, the method comprising: quantizing a target error vector obtained from one or more parameters associated with a speech frame; quantizing a difference between a pitch lag value for a current frame and a pitch lag value for a previous frame without quantizing the pitch lag value for the current frame; and forming a set of quantized speech frame parameters from the quantized target error vector.

12. The method of claim 11, wherein the one or more parameters include an amplitude component of the speech frame.

13. The method of claim 11, wherein the one or more parameters include a phase value associated with the speech frame.

14. The method of claim 11, wherein the one or more parameters include a linear spectral information component associated with the speech frame.

15. The method of claim 11, further comprising transmitting the set of quantized speech frame parameters across a wireless communication channel.

16. The method of claim 11, wherein the one or more parameters have been extracted from the speech frame, wherein the speech frame comprises a voiced speech frame.

17. An apparatus comprising: means for quantizing a target error vector obtained from one or more parameters associated with a speech frame; means for quantizing a difference between a pitch lag value for a current frame and a pitch lag value for a previous frame without quantizing the pitch lag value for the current frame; and means for forming a set of quantized speech frame parameters from the quantized target error vector.

18. The apparatus of claim 17, wherein the one or more parameters include an amplitude component of the speech frame.

19. The apparatus of claim 17, further comprising means to transmit the set of quantized speech frame parameters across a wireless communication channel.

20. A non-transitory computer-readable medium comprising instructions that upon execution in a processor cause the processor to: quantize a target error vector obtained from one or more parameters associated with a speech frame; quantize a difference between a pitch lag value for a current frame and a pitch lag value for a previous frame without quantizing the pitch lag value for the current frame; and form a set of quantized speech frame parameters from the quantized target error vector.

21. The computer-readable medium of claim 20, wherein the one or more parameters include a phase value associated with the speech frame.

22. The computer-readable medium of claim 20, wherein the one or more parameters include a linear spectral information component associated with the speech frame.

23. The computer-readable medium of claim 20, further comprising instructions to transmit the set of quantized speech frame parameters across a wireless communication channel.
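The equations in claims 8 and 9 can be exercised with a short sketch. It is illustrative only: the encoder-side pitch lag difference and the LSI reconstruction are obtained here by algebraically inverting the claimed equations, and rounding to an integer stands in for a quantizer the claims do not specify. All function and argument names are hypothetical.

```python
def lsi_target_vector(lsi, prev_contribs, betas):
    """Claim 8: T[n] = (L[n] - sum_p beta_p[n] * U_prev_p[n]) / beta_0[n].

    lsi           -- unquantized N-dimensional LSI vector for frame M
    prev_contribs -- P vectors, contributions of frames M-1 ... M-P
    betas         -- P+1 weight vectors; betas[0] is beta_0
    """
    target = []
    for n, l_n in enumerate(lsi):
        predicted = sum(betas[p + 1][n] * u[n] for p, u in enumerate(prev_contribs))
        target.append((l_n - predicted) / betas[0][n])
    return target


def lsi_reconstruct(target_q, prev_contribs, betas):
    """Inverse of the claim 8 equation (inferred, not quoted from the claims)."""
    return [
        betas[0][n] * t
        + sum(betas[p + 1][n] * u[n] for p, u in enumerate(prev_contribs))
        for n, t in enumerate(target_q)
    ]


def pitch_lag_delta(lag, prev_lags, etas):
    """Difference to be quantized; inferred by inverting the claim 9 equation."""
    return lag - sum(e * l for e, l in zip(etas, prev_lags))


def pitch_lag_reconstruct(delta_q, prev_lags, etas):
    """Claim 9: quantized lag = quantized delta + weighted previous lags."""
    return delta_q + sum(e * l for e, l in zip(etas, prev_lags))


# Example with made-up values: P = 2 previous frames, weights summing to 1.
betas = [[0.5] * 3, [0.3] * 3, [0.2] * 3]
previous = [[0.09, 0.21, 0.29], [0.11, 0.19, 0.31]]
target = lsi_target_vector([0.10, 0.20, 0.30], previous, betas)

# Only the delta is quantized (rounding is an assumed stand-in quantizer).
delta_q = round(pitch_lag_delta(52, [50], [1.0]))
lag_q = pitch_lag_reconstruct(delta_q, [50], [1.0])
```

In this sketch the current pitch lag is never quantized directly; only its difference from the weighted previous lags is, which matches the wording of claim 1.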
Patents cited by this patent (52)
Gao Yang ; Su Huan-Yu, Adaptive gain reduction to produce fixed codebook target signal.
McDonough John G. ; Chang Chienchung ; Singh Randeep ; Sakamaki Charles E. ; Tsai Ming-Chang ; Kantak Prashant, Application specific integrated circuit (ASIC) for performing rapid speech compression in a mobile telephone system.
Arasanipalai K. Ananthapadmanabhan ; Sharath Manjunath, Method and apparatus for interleaving line spectral information quantization methods in a speech coder.
Ananthapadmanabhan, Arasanipalai K.; Manjunath, Sharath; Huang, Pengjun; Choy, Eddie Lun Tik; DeJaco, Andrew P., Method and apparatus for quantizing pitch, amplitude, phase and linear spectrum of voiced speech.
Bergstrom Chad Scott ; Gifford Carl Steven ; Pattison Richard James ; Abousleman Glen Patrick, Method and apparatus for speech excitation waveform coding using multiple error waveforms.
Iijima Kazuyuki, JPX ; Nishiguchi Masayuki, JPX ; Matsumoto Jun, JPX, Speech decoding method and apparatus for selecting random noise codevectors as excitation signals for an unvoiced speech.
Gilhousen Klein S. (San Diego CA) Jacobs Irwin M. (La Jolla CA) Weaver ; Jr. Lindsay A. (San Diego CA), Spread spectrum multiple access communication system using satellite or terrestrial repeaters.
Thyssen Jes, Synchronized encoder-decoder frame concealment using speech coding parameters including line spectral frequencies and filter coefficients.
Gilhousen Klein S. (San Diego CA) Jacobs Irwin M. (La Jolla CA) Padovani Roberto (San Diego CA) Weaver ; Jr. Lindsay A. (San Diego CA) Wheatley ; III Charles E. (Del Mar CA) Viterbi Andrew J. (La Jol, System and method for generating signal waveforms in a CDMA cellular telephone system.
Jacobs Paul E. (San Diego CA) Gardner William R. (San Diego CA) Lee Chong U. (San Diego CA) Gilhousen Klein S. (San Diego CA) Lam S. Katherine (San Diego CA) Tsai Ming-Chang (San Diego CA), Variable rate vocoder.