[특허]Sub-sampled excitation waveform codebooks

Sub-sampled excitation waveform codebooks 원문보기

IPC분류정보
국가/구분	United States(US) Patent 등록
국제특허분류(IPC7판)	G10L-019/00 G10L-019/12 G10L-021/02
출원번호	UP-0322245 (2002-12-17)
등록번호	US-7698132 (2010-05-20)
발명자 / 주소	Kandhadai, Ananthapadamanabhan A. Manjunath, Sharath El-Maleh, Khaled
출원인 / 주소	QUALCOMM Incorporated
대리인 / 주소	Macek, Kyong
인용정보	피인용 횟수 : 3 인용 특허 : 47

초록 ▼

Methods and apparatus are presented for reducing the number of bits needed to represent an excitation waveform. An acoustic signal in an analysis frame is analyzed to determine whether it is a band-limited signal. A sub-sampled sparse codebook is used to generate the excitation waveform if the acoustic signal is a band-limited signal. The sub-sampled sparse codebook is generated by decimating permissible pulse locations from the codebook track in accordance with the frequency characteristic of the acoustic signal.

대표청구항 ▼

What is claimed is: 1. A method for forming an excitation waveform in a speech coder, the method comprising: determining whether an acoustic signal in an analysis frame is a band-limited signal; if the acoustic signal is a band-limited signal, then using a sub-sampled sparse codebook to generate the excitation waveform, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook; and if the acoustic signal is not a band-limited signal, then using a sparse codebook to generate the excitation waveform, wherein the sparse codebook comprises a set of predetermined possible positions and the sub-sampled sparse code book comprises a subset of the predetermined positions, such that the excitation waveform is generated through placement of pulses within the predetermined positions or the subset; and wherein using the sub-sampled sparse codebook to generate the excitation waveform comprises generating an initial excitation waveform, determining whether the initial excitation waveform comprises mostly odd track positions or mostly even track positions, and decimating the initial excitation waveform to generate the excitation waveform. 2. The method of claim 1, wherein determining whether an acoustic signal in an analysis frame is a band-limited signal comprises: determining a voice activity level of the acoustic signal; and using the voice activity level to determine whether the acoustic signal is a band-limited signal. 3. The method of claim 1, wherein determining whether an acoustic signal in an analysis frame is a band-limited signal comprises: comparing an energy level of a low frequency band of the acoustic signal to an energy level of a high frequency band of the acoustic signal; and if the energy level of the low frequency band of the acoustic signal is higher than the energy level of the high frequency band of the acoustic signal, then deciding that the acoustic signal is a band-limited signal. 4. The method of claim 1, wherein determining whether an acoustic signal in an analysis frame is a band-limited signal comprises: determining a zero-crossing rate for the acoustic signal; and if the zero-crossing rate is low, then deciding that the acoustic signal is a band-limited signal. 5. The method of claim 1, wherein determining whether an acoustic signal in an analysis frame is a band-limited signal comprises: determining the periodicity of a low frequency band of the acoustic signal; and if the periodicity of the low frequency band of the acoustic signal is high, then deciding that the acoustic signal is a band-limited signal. 6. The method of claim 1, wherein determining whether an acoustic signal in an analysis frame is a band-limited signal comprises: analyzing the spectral content of the acoustic signal for a significant band-limited component. 7. The method of claim 1, further comprising: determining at least one of a spectral content, voice activity and zero-crossing rate of the acoustic signal; and based on determining at least one of the spectral content, voice activity and zero-crossing rate of the acoustic signal, generating the sub-sampled sparse codebook. 8. The method of claim 1, further comprising excluding certain candidate excitation waveforms from a search through a stochastic excitation waveform codebook. 9. The method of claim 1, if the acoustic signal is band-limited, further comprising reallocating bits, which would have been used to represent an excitation waveform from the sparse codebook, to represent another speech encoding parameter. 10. The method of claim 9, wherein the speech encoding parameter comprises a linear predictive coding (LPC) filter coefficient. 11. The method of claim 1, further comprising: generating multiple candidate excitation waveforms based on different sub-sampled sparse codebooks; and determining which of the multiple candidate excitation waveforms is better suited for acting as the excitation waveform. 12. Apparatus for forming an excitation waveform, comprising: a memory element; and a processing element configured to execute a set of instructions stored on the memory element, the set of instructions for: determining whether an acoustic signal in an analysis frame is a band-limited signal; using a sub-sampled sparse codebook to generate the excitation waveform if the acoustic signal is a band-limited signal, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from a sparse codebook; and using the sparse codebook to generate the excitation waveform if the acoustic signal is not a band-limited signal, wherein the sparse codebook comprises a set of predetermined possible positions and the sub-sampled sparse code book comprises a subset of the predetermined positions, such that the excitation waveform is generated through placement of pulses within the predetermined positions or the subset; and wherein using the sub-sampled sparse codebook to generate the excitation waveform comprises generating an initial excitation waveform, determining whether the initial excitation waveform comprises mostly odd track positions or mostly even track positions, and decimating the initial excitation waveform to generate the excitation waveform. 13. The apparatus of claim 12, wherein the apparatus is a wideband vocoder. 14. The apparatus of claim 12, wherein the apparatus is a narrowband vocoder. 15. The apparatus of claim 12, wherein the apparatus is a variable rate vocoder. 16. The apparatus of claim 12, wherein the apparatus is a fixed rate vocoder. 17. An apparatus for forming an excitation waveform, comprising: means for determining whether an acoustic signal in an analysis frame is a band-limited signal; means for using a sub-sampled sparse codebook to generate the excitation waveform if the acoustic signal is a band-limited signal, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from a sparse codebook; and means for using the sparse codebook to generate the excitation waveform if die acoustic signal is not a band-limited signal, wherein the sparse codebook comprises a set of predetermined possible positions and the sub-sampled sparse codebook comprises a subset of the predetermined positions, such that die excitation waveform is generated through placement of pulses within the predetermined positions or the subset; wherein using the sub-sampled sparse codebook to generate the excitation waveform comprises generating an initial excitation waveform, determining whether the initial excitation waveform comprises mostly odd track positions or mostly even track positions, and decimating the initial excitation waveform to generate the excitation waveform. 18. The apparatus of claim 17, wherein the apparatus is a wideband vocoder. 19. A method for a signal coder to reduce the number of bits used to represent an excitation waveform, comprising: determining a frequency characteristic of an acoustic signal; generating a sub-sampled sparse codebook waveform from a sparse codebook if the frequency characteristic indicates that sub-sampling does not impair the perceptual quality of the acoustic signal, wherein the sparse codebook comprises a set of predetermined possible positions and the sub-sampled sparse code book comprises a subset of the predetermined positions, such that the excitation waveform is generated through placement of pulses within the predetermined positions or the subset, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook; and using the sub-sampled sparse codebook waveform to represent the excitation waveform rather than a waveform from the sparse codebook; wherein using the sub-sampled sparse codebook to represent the excitation waveform comprises generating an initial excitation waveform, determining whether the initial excitation waveform comprises mostly odd track positions or mostly even track positions, and decimating the initial excitation waveform. 20. Apparatus for reducing the number of bits used to represent an excitation waveform, comprising: a memory element; and a processing element configured to execute a set of instructions stored on the memory element, the set of instructions for: determining a frequency characteristic of an acoustic signal; generating a sub-sampled sparse codebook waveform from a sparse codebook if the frequency characteristic indicates that sub-sampling does not impair the perceptual quality of the acoustic signal, wherein the sparse codebook comprises a set of predetermined possible positions and the sub-sampled sparse code book comprises a subset of the predetermined positions, such that the excitation waveform is generated through placement of pulses within the predetermined positions or the subset, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook; and using the sub-sampled sparse codebook waveform to represent the excitation waveform rather than a waveform from the sparse codebook; wherein using the sub-sampled sparse codebook to represent the excitation waveform comprises generating an initial excitation waveform, determining whether the initial excitation waveform comprises mostly odd track positions or mostly even track positions, and decimating the initial excitation waveform. 21. An apparatus for reducing the number of bits used to represent an excitation waveform, comprising: means for determining a frequency characteristic of an acoustic signal; means for generating a sub-sampled sparse codebook waveform from a sparse codebook if the frequency characteristic indicates that sub-sampling does not impair the perceptual quality of the acoustic signal, wherein the sparse codebook comprises a set of predetermined possible positions and the sub-sampled sparse code book comprises a subset of the predetermined positions, such that the excitation waveform is generated through placement of pulses within the predetermined positions or the subset, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook; and means for using the sub-sampled sparse codebook waveform to represent the excitation waveform rather than a waveform from the sparse codebook; wherein using the sub-sampled sparse codebook waveform to represent the excitation waveform comprises generating an initial excitation waveform, determining whether the initial excitation waveform comprises mostly odd track positions or mostly even track positions, and decimating the initial excitation waveform. 22. The apparatus of claim 21, wherein the apparatus is a wideband vocoder. 23. The apparatus of claim 21, wherein the apparatus is a narrowband vocoder. 24. The apparatus of claim 21, wherein the apparatus is a variable rate vocoder. 25. The apparatus of claim 21, wherein the apparatus is a fixed rate vocoder. 26. A method for execution by a suitably programmed processor to generate a sub-sampled sparse codebook from a sparse codebook, wherein the sparse codebook comprises pulses at a set of permissible pulse locations, the method comprising: analyzing a frequency characteristic of an acoustic signal; determining whether an initial excitation waveform corresponding to the acoustic signal comprises mostly odd track positions or mostly even track positions; and decimating a subset of permissible pulse locations from the set of permissible pulse locations of the sparse codebook in accordance with the frequency characteristic of the acoustic signal to generate the sub-sampled sparse codebook, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook. 27. Apparatus for generating a sub-sampled sparse codebook from a sparse codebook, wherein the sparse codebook comprises pulses at a set of permissible pulse locations, the apparatus comprising: a memory element; and a processing element configured to execute a set of instructions stored on the memory element, the set of instructions for: analyzing a frequency characteristic of an acoustic signal; determining whether an initial excitation waveform corresponding to the acoustic signal comprises mostly odd track positions or mostly even track positions; and decimating a subset of permissible pulse locations from the set of permissible pulse locations of the sparse codebook in accordance with the frequency characteristic of the acoustic signal to generate the sub-sampled sparse codebook, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook. 28. Apparatus for generating a sub-sampled sparse codebook from a sparse codebook, wherein the sparse codebook comprises pulses at a set of permissible pulse locations, the apparatus comprising: means for analyzing a frequency characteristic of an acoustic signal; means for determining whether an initial excitation waveform corresponding to the acoustic signal comprises mostly odd track positions or mostly even track positions; and means for decimating a subset of permissible pulse locations from the set of permissible pulse locations of the sparse codebook in accordance with the frequency characteristic of the acoustic signal to generate the sub-sampled sparse codebook, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook. 29. The apparatus of claim 28, wherein the apparatus is a wideband vocoder. 30. The apparatus of claim 28, wherein the apparatus is a narrowband vocoder. 31. The apparatus of claim 28, wherein the apparatus is a variable rate vocoder. 32. The apparatus of claim 28, wherein the apparatus is a fixed rate vocoder. 33. A speech coder, comprising: a linear predictive coding (LPC) unit configured to determine LPC coefficients of an acoustic signal; a frequency analysis unit configured to determine whether the acoustic signal is band-limited; a quantizer unit configured to receive the LPC coefficients to and quantize the LPC coefficients; and an excitation parameter generator configured to receive a determination from the frequency analysis unit regarding whether the acoustic signal is band-limited and to implement a sub-sampled sparse codebook, the sparse codebook comprising a set of predetermined possible positions and the sub-sampled sparse code book comprising a subset of the predetermined positions, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook, and wherein implementing the sub-sampled sparse codebook comprises determining whether an initial excitation waveform comprises mostly odd track positions or mostly even track positions. 34. The speech coder of claim 33, wherein the quantizer unit is further configured to receive the determination from the frequency analysis unit regarding whether the acoustic signal is band-limited and to update the quantization scheme accordingly. 35. The speech coder of claim 33, wherein the quantizer unit is further configured to receive information from the excitation parameter generator regarding the implementation of the sub-sampled sparse codebook and to update the quantization scheme accordingly. 36. A computer-program product comprising a computer-readable medium having instructions thereon, the instructions comprising: code for determining whether an acoustic signal in an analysis frame is a band-limited signal; code for using a sub-sampled sparse codebook to generate an excitation waveform if the acoustic signal is a band-limited signal, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from a sparse codebook; and code for using the sparse codebook to generate the excitation waveform if the acoustic signal is not a band-limited signal, wherein the sparse codebook comprises a set of predetermined possible positions and the sub-sampled sparse code book comprises a subset of the predetermined positions, such that the excitation waveform is generated through placement of pulses within the predetermined positions or the subset; wherein using the sub-sampled sparse codebook to generate the excitation waveform comprises generating an initial excitation waveform, determining whether the initial excitation waveform comprises mostly odd track positions or mostly even track positions, and decimating the initial excitation waveform.

이 특허에 인용된 특허 (47)

Maung Tin, ACLEP codec with modified autocorrelation matrix storage and search.
상세보기
Huang Si J. (Singapore SGX) Tan Ah P. (Singapore SGX), Adaptive bit allocation for video and audio coding.
상세보기
Chhatwal Harprit S.,GBX, Adaptive speech coder having code excited linear predictor with multiple codebook searches.
상세보기
Yu Alfred, Adaptively compressing sound with multiple codebooks.
상세보기
Adoul Jean-Pierre,CAX ; Laflamme Claude,CAX, Algebraic codebook with signal-selected pulse amplitude/position combinations for fast coding of speech.
상세보기
Kannan,Karthik; Subramanian,Meenakshi Sundaram, Apparatus, methods and articles incorporating a fast algebraic codebook search technique.
상세보기
McDonough John G. ; Chang Chienchung ; Singh Randeep ; Sakamaki Charles E. ; Tsai Ming-Chang ; Kantak Prashant, Application specific integrated circuit (ASIC) for performing rapid speech compression in a mobile telephone system.
상세보기
McDonough John G. ; Lee Way-Shing, Application specific integrated circuit (ASIC) for performing rapid speech compression in a mobile telephone system.
상세보기
Urano Takashi,JPX ; Tsuchikane Koichi,JPX ; Kobayashi Satoko,JPX, Bit-rate conversion circuit for a compressed motion video bitstream.
상세보기
McDonough John G. ; Chang Chienchung ; Singh Randeep ; Sakamaki Charles E. ; Tsai Ming-Chang ; Kantak Prashant, Block normalization processor.
상세보기
Gao, Yang, Codebook structure and search for speech coding.
상세보기
Gao, Yang, Codebook structure for changeable pulse multimode speech coding.
상세보기
Su Huan-Yu, Comb codebook structure.
상세보기
Gao Yang, Completed fixed codebook for speech encoder.
상세보기
Benno, Steven A., Constraining pulse positions in CELP vocoding.
상세보기
Adoul Jean-Pierre (Sherbrooke CAX) Laflamme Claude (Sherbrooke CAX), Depth-first algebraic-codebook search for fast coding of speech.
상세보기
Bertrand John P. (Upper Nyack NY), Digital speech coding circuit.
상세보기
Adoul Jean-Pierre (Sherbrooke CAX) Laflamme Claude (Sherbrooke CAX), Dynamic codebook for efficient speech coding based on algebraic codes.
상세보기
Israelsen Paul D. ; Huang Chien-Min, Hierarchical adaptive multistage vector quantization.
상세보기
Stachurski,Jacek; McCree,Alan V., Hybrid speed coding and system.
상세보기
Lu Ning ; Kok Chi-Wah, Method and apparatus for designing a codebook for error resilient data transmission.
상세보기
DeJaco Andrew P., Method and apparatus for performing speech frame encoding mode selection in a variable rate encoding system.
상세보기
Kolesnik Victor D. (St. Petersburg RUX) Trofimov Andrey N. (St. Petersburg RUX) Bocharova Irina E. (St. Petersburg RUX) Krachkovsky Victor Y. (St. Petersburg RUX) Kudryashov Boris D. (St. Petersburg , Method and apparatus for speech compression using multi-mode code excited linear predictive coding.
상세보기
Das Amitav, Method and apparatus using multi-path multi-stage vector quantizer.
상세보기
Vainio, Janne; Mikkola, Hannu; Rotola-Pukkila, Jani, Method and arrangement for changing source signal bandwidth in a telecommunication connection with multiple bandwidth capability.
상세보기
Thyssen,Jes, Method for robust classification in speech coding.
상세보기
Kwon Soon Y., Method for speech coding based on a code excited linear prediction (CELP) model.
상세보기
Gortz Udo,DEX, Method of synthesizing a block of a speech signal in a celp-type coder.
상세보기
Bhattacharya Bhaskar,CAX, Method to suppress noise in digital voice processing.
상세보기
Tian Wenshun,SGX, Multi-pulse synthesis simplification in analysis-by-synthesis coders.
상세보기
Gao, Yang, Pitch determination using speech classification and prior pitch estimation.
상세보기
Mermelstein Paul (Cote St. Luc CAX), Reducing search complexity for code-excited linear prediction (CELP) coding.
상세보기
Huan-Yu Su ; Yang Gao, Speech classification and parameter weighting used in codebook search.
상세보기
Morii,Toshiyuki; Yasunaga,Kazutoshi, Speech coding apparatus and speech decoding apparatus.
상세보기
Hayashi Shinji,JPX ; Kurihara Sachiko,JPX ; Kataoka Akitoshi,JPX, Speech coding method.
상세보기
Su, Huan-Yu; Benyassine, Adil; Thyssen, Jes, Speech encoder using voice activity detection in coding noise.
상세보기
Gilhousen Klein S. (San Diego CA) Jacobs Irwin M. (La Jolla CA) Weaver ; Jr. Lindsay A. (San Diego CA), Spread spectrum multiple access communication system using satellite or terrestrial repeaters.
상세보기
Wang,Tian; Koishida,Kazuhito; Khalil,Hosam A.; Sun,Xiaoqin; Chen,Wei Ge, Sub-band voice codec with multi-stage codebooks and redundant coding.
상세보기
Fette Bruce Alan ; Jaskie Cynthia Ann, System and method for communicating a perceptually encoded speech spectrum signal.
상세보기
Gilhousen Klein S. (San Diego CA) Jacobs Irwin M. (La Jolla CA) Padovani Roberto (San Diego CA) Weaver ; Jr. Lindsay A. (San Diego CA) Wheatley ; III Charles E. (Del Mar CA) Viterbi Andrew J. (La Jol, System and method for generating signal waveforms in a CDMA cellular telephone system.
상세보기
Winger, Lowell, System and method for reduced codebook vector quantization.
상세보기
Gersho Allen ; Das Amitava ; Rao Ajit Venkat, Variable dimension vector quantization.
상세보기
Jacobs Paul E. (San Diego CA) Gardner William R. (San Diego CA) Lee Chong U. (San Diego CA) Gilhousen Klein S. (San Diego CA) Lam S. Katherine (San Diego CA) Tsai Ming-Chang (San Diego CA), Variable rate vocoder.
상세보기
Gupta Prabhat K. (Germantown MD) Jangi Shrirang (Germantown MD) Lamkin Allan B. (Arlington VA) Kepley ; III W. Robert (Gaithersburg ; MD) Morris Adrian J. (Gaithersburg ; MD), Voice activity detector for speech signals in variable background noise.
상세보기
Mai Don L. (Garland TX) Campbell Bruce W. (Richardson TX), Voice operated switch.
상세보기
Prezas Dimitrios P. (Park Ridge IL) Thomson David L. (Warrenville IL), Voice synthesis utilizing multi-level filter excitation.
상세보기
Anandakumar, Krishnasamy; Viswanathan, Vishu R.; McCree, Alan V., Wireless base station systems for packet communications.
상세보기

이 특허를 인용한 특허 (3)

Strommer, Stefan; Sorensen, Karsten Vandborg; Jensen, Soren Skak; Vos, Koen; Bergenheim, Jon, Encoding and decoding speech signals.
상세보기
Park, Hanjun; Kim, Youngtae; Kim, Kijun; Kim, Hyungtae, Method and apparatus for reporting downlink channel state.
상세보기
Grancharov, Volodya; Norvell, Erik; Sverrisson, Sigurdur, Method and arrangement for scalable low-complexity coding/decoding.
상세보기

IPC	Description
A	생활필수품
A62	인명구조; 소방(사다리 E06C)
A62B	인명구조용의 기구, 장치 또는 방법(특히 의료용에 사용되는 밸브 A61M 39/00; 특히 물에서 쓰이는 인명구조 장치 또는 방법 B63C 9/00; 잠수장비 B63C 11/00; 특히 항공기에 쓰는 것, 예. 낙하산, 투출좌석 B64D; 특히 광산에서 쓰이는 구조장치 E21F 11/00)
A62B-1/08	.. 윈치 또는 풀리에 제동기구가 있는 것

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 공고번호, 공고일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표출원인, 출원인국적, 출원인주소, 발명자, 발명자E, 발명자코드, 발명자주소, 발명자 우편번호, 발명자국적, 대표IPC, IPC코드, 요약, 미국특허분류, 대리인주소, 대리인코드, 대리인(한글), 대리인(영문), 국제공개일자, 국제공개번호, 국제출원일자, 국제출원번호, 우선권, 우선권주장일, 우선권국가, 우선권출원번호, 원출원일자, 원출원번호, 지정국, Citing Patents, Cited Patents
저장형식	Text(ASCII format) Excel format PIAS분석(.xls)
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

Sub-sampled excitation waveform codebooks 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

이 특허에 인용된 특허 (47)

이 특허를 인용한 특허 (3)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

Sub-sampled excitation waveform codebooks 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

전체(0) 논문(0) 특허(0) 보고서(0)

전체(0) 논문(0) 특허(0) 보고서(0)

이 특허에 인용된 특허 (47)

이 특허를 인용한 특허 (3)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트