[특허]Systems, methods, and apparatus for wideband encoding and decoding of active frames

Systems, methods, and apparatus for wideband encoding and decoding of active frames 원문보기

IPC분류정보
국가/구분	United States(US) Patent 등록
국제특허분류(IPC7판)	G10L-019/14 G10L-011/06
출원번호	US-0830842 (2007-07-30)
등록번호	US-8532984 (2013-09-10)
발명자 / 주소	Rajendran, Vivek Kandhadai, Ananthapadmanabhan A.
출원인 / 주소	QUALCOMM Incorporated
대리인 / 주소	Yoo, Heejong
인용정보	피인용 횟수 : 3 인용 특허 : 9

초록

Applications of dim-and-burst techniques to coding of wideband speech signals are described. Reconstruction of a highband portion of a frame of a wideband speech signal using information from a previous frame is also described.

대표청구항 ▼

1. A method of processing a speech signal, said method comprising: based on a first active frame of the speech signal, producing a first speech packet that includes a description of a spectral envelope, over (A) a first frequency band and (B) a second frequency band that extends above the first frequency band, of a portion of the speech signal that includes the first active frame;based on a second active frame of the speech signal that occurs in the speech signal immediately after said first active frame, producing a second speech packet that includes a description of a spectral envelope, over the first frequency band, of a portion of the speech signal that includes the second active frame; andproducing an encoded frame that contains (A) the second speech packet and (B) a burst of an information signal that is separate from the speech signal,wherein the second speech packet does not include a description of a spectral envelope over the second frequency band. 2. The method of processing a speech signal according to claim 1, wherein said method comprises, based on a third active frame of the speech signal, producing a third speech packet that includes a description of a spectral envelope, over the first frequency band and the second frequency band, of a portion of the speech signal that includes the third active frame, wherein said third active frame occurs in the speech signal immediately after said second active frame. 3. The method of processing a speech signal according to claim 1, wherein the description of a spectral envelope of a portion of the speech signal that includes the first active frame includes separate first and second descriptions, wherein the first description is a description of a spectral envelope, over the first frequency band, of a portion of the speech signal that includes the first active frame, and wherein the second description is a description of a spectral envelope, over the second frequency band, of a portion of the speech signal that includes the first active frame. 4. The method of processing a speech signal according to claim 1, wherein the first and second frequency bands overlap by at least two hundred Hertz. 5. The method of processing a speech signal according to claim 4, wherein said overlap occurs in the range of from 3.5 to 7 kilohertz. 6. The method of processing a speech signal according to claim 1, wherein the length of the burst is less than the length of the second speech packet. 7. The method of processing a speech signal according to claim 1, wherein the length of the burst is equal to the length of the second speech packet. 8. The method of processing a speech signal according to claim 1, wherein the length of the burst is greater than the length of the second speech packet. 9. The method of processing a speech signal according to claim 1, wherein said producing the first speech packet is performed in response to a first state of a rate control signal, and wherein said producing the second speech packet is performed in response to a second state of the rate control signal that is different than said first state. 10. The method of processing a speech signal according to claim 1, wherein said method comprises: generating a dimming control signal, based on information from a mask file;in response to a first state of said dimming control signal, producing a first encoded frame that includes the first speech packet; andin response to a second state of said dimming control signal that is different than said first state, producing a second encoded frame that includes the second speech packet and does not include a description of a spectral envelope over the second frequency band. 11. A speech encoder, said speech encoder comprising: a packet encoder configured to produce (A), based on a first active frame of a speech signal and in response to a first state of a rate control signal, a first speech packet that includes a description of a spectral envelope over (1) a first frequency band and (2) a second frequency band that extends above the first frequency band and (B), based on a second active frame of the speech signal and in response to a second state of the rate control signal different than the first state, a second speech packet that includes a description of a spectral envelope over the first frequency band; anda frame formatter arranged to receive the first and second speech packets and configured to produce (A), in response to a first state of a dimming control signal, a first encoded frame that contains the first speech packet and (B), in response to a second state of the dimming control signal different than the first state, a second encoded frame that contains the second speech packet and a burst of an information signal that is separate from the speech signal,wherein the first and second encoded frames have the same length, the first speech packet occupies at least eighty percent of the first encoded frame, and the second speech packet occupies not more than half of the second encoded frame, andwherein said second active frame occurs immediately after said first active frame in the speech signal, andwherein the second speech packet does not include a description of a spectral envelope over the second frequency band, andwherein at least one among said packet encoder and said frame formatter includes a processor. 12. The speech encoder according to claim 11, wherein an overlap of the first and second frequency bands occurs in the range of from 3.5 to 4 kilohertz. 13. An apparatus for processing a speech signal, said apparatus comprising: means for producing, based on a first active frame of the speech signal, a first speech packet that includes a description of a spectral envelope, over (A) a first frequency band and (B) a second frequency band that extends above the first frequency band, of a portion of the speech signal that includes the first active frame;means for producing, based on a second active frame of the speech signal that occurs in the speech signal immediately after said first active frame, a second speech packet that includes a description of a spectral envelope, over the first frequency band, of a portion of the speech signal that includes the second active frame; andmeans for producing an encoded frame that contains (A) the second speech packet and (B) a burst of an information signal that is separate from the speech signal,wherein the second speech packet does not include a description of a spectral envelope over the second frequency band. 14. The apparatus for processing a speech signal according to claim 13, wherein an overlap of the first and second frequency bands occurs in the range of from 3.5 to 4 kilohertz. 15. The apparatus for processing a speech signal according to claim 13, wherein said apparatus comprises means for producing a third speech packet, based on a third active frame of the speech signal, that includes a description of a spectral envelope, over the first frequency band and the second frequency band, of a portion of the speech signal that includes the third active frame, wherein said third active frame occurs in the speech signal immediately after said second active frame. 16. A non-transitory computer-readable medium, said medium comprising: code for causing at least one computer to produce, based on a first active frame of the speech signal, a first speech packet that includes a description of a spectral envelope, over (A) a first frequency band and (B) a second frequency band that extends above the first frequency band, of a portion of the speech signal that includes the first active frame;code for causing at least one computer to produce, based on a second active frame of the speech signal that occurs in the speech signal immediately after said first active frame, a second speech packet that includes a description of a spectral envelope, over the first frequency band, of a portion of the speech signal that includes the second active frame; andcode for causing at least one computer to produce an encoded frame that contains (A) the second speech packet and (B) a burst of an information signal that is separate from the speech signal,wherein the second speech packet does not include a description of a spectral envelope over the second frequency band. 17. The medium according to claim 16, wherein an overlap of the first and second frequency bands occurs in the range of from 3.5 to 4 kilohertz. 18. A method of processing speech packets, said method comprising: based on information from a first speech packet from an encoded speech signal, obtaining a description of a spectral envelope of a first frame of a speech signal over (A) a first frequency band and (B) a second frequency band different than the first frequency band;based on information from a second speech packet from the encoded speech signal, obtaining a description of a spectral envelope of a second frame of the speech signal over the first frequency band;obtaining, from an encoded frame of the encoded speech signal, a burst of an information signal that is separate from the speech signal, wherein the encoded frame includes the second speech packet; andbased on a presence of the burst in the encoded frame, and based on information from the first speech packet, obtaining a description of a spectral envelope of the second frame over the second frequency band; andbased on information from the second speech packet, obtaining information relating to a pitch component of the second frame for the first frequency band. 19. The method of processing speech packets according to claim 18, wherein the description of a spectral envelope of a first frame of a speech signal comprises a description of a spectral envelope of the first frame over the first frequency band and a description of a spectral envelope of the first frame over the second frequency band. 20. The method of processing speech packets according to claim 18, wherein the information relating to a pitch component of the second frame for the first frequency band includes a pitch lag value. 21. The method of processing speech packets according to claim 18, wherein said method comprises calculating, based on the information relating to a pitch component of the second frame for the first frequency band, an excitation signal of the second frame for the first frequency band. 22. The method of processing speech packets according to claim 21, wherein said calculating an excitation signal is based on information relating to a second pitch component for the first frequency band, and wherein the information relating to a second pitch component is based on information from the first speech packet. 23. The method of processing speech packets according to claim 21, wherein said method comprises calculating, based on the excitation signal of the second frame for the first frequency band, an excitation signal of the second frame for the second frequency band. 24. The method of processing speech packets according to claim 18, wherein said obtained description of the spectral envelope of the second frame over the second frequency band is based on said description of the spectral envelope of the first frame over the second frequency band. 25. The method of processing speech packets according to claim 18, wherein the first and second frequency bands overlap by at least two hundred Hertz, and wherein said overlap occurs in the range of from 3.5 to 7 kilohertz. 26. The method of processing speech packets according to claim 18, wherein said obtaining a description of a spectral envelope of the second frame over the second frequency band is based on an indication of a narrowband coding scheme for the second frame. 27. An apparatus for processing speech packets, said apparatus comprising: means for obtaining, based on information from a first speech packet from an encoded speech signal, a description of a spectral envelope of a first frame of a speech signal over (A) a first frequency band and (B) a second frequency band different than the first frequency band;means for obtaining, based on information from a second speech packet from the encoded speech signal, a description of a spectral envelope of a second frame of the speech signal over the first frequency band;means for obtaining, based on information from an encoded frame of the encoded speech signal, a burst of an information signal that is separate from the speech signal, wherein the encoded frame includes the second speech packet; andmeans for obtaining, based on a presence of the burst in the encoded frame, and based on information from the first speech packet, a description of a spectral envelope of the second frame over the second frequency band; andmeans for obtaining, based on information from the second speech packet, information relating to a pitch component of the second frame for the first frequency band. 28. The apparatus for processing speech packets according to claim 27, wherein the description of a spectral envelope of a first frame of a speech signal comprises separate first and second descriptions, wherein the first description is a description of a spectral envelope of the first frame over the first frequency band, and wherein the second description is a description of a spectral envelope of the first frame over the second frequency band. 29. The apparatus for processing speech packets according to claim 27, wherein the information relating to a pitch component of the second frame for the first frequency band includes a pitch lag value. 30. The apparatus for processing speech packets according to claim 27, wherein said apparatus comprises means for calculating, based on the information relating to a pitch component of the second frame for the first frequency band, an excitation signal of the second frame for the first frequency band, and wherein said apparatus comprises means for calculating, based on the excitation signal of the second frame for the first frequency band, an excitation signal of the second frame for the second frequency band. 31. The apparatus for processing speech packets according to claim 27, wherein an overlap of the first and second frequency bands occurs in the range of from 3.5 to 4 kilohertz. 32. The apparatus for processing speech packets according to claim 27, wherein said means for obtaining a description of a spectral envelope of the second frame over the second frequency band is configured to obtain said description if a narrowband coding scheme is indicated for the second frame. 33. A non-transitory computer-readable medium, said medium comprising: code for causing at least one computer to obtain, based on information from a first speech packet from an encoded speech signal, a description of a spectral envelope of a first frame of a speech signal over (A) a first frequency band and (B) a second frequency band different than the first frequency band;code for causing at least one computer to obtain, based on information from a second speech packet from the encoded speech signal, a description of a spectral envelope of a second frame of the speech signal over the first frequency band;code for causing at least one computer to calculate, based on information from an encoded frame of the encoded speech signal, a burst of an information signal that is separate from the speech signal, wherein the encoded frame includes the second speech packet; andcode for causing at least one computer to obtain, based on a presence of the burst in the encoded frame, and based on information from the first speech packet, a description of a spectral envelope of the second frame over the second frequency band; andcode for causing at least one computer to obtain, based on information from the second speech packet, information relating to a pitch component of the second frame for the first frequency band. 34. The computer program product according to claim 33, wherein the description of a spectral envelope of a first frame of a speech signal comprises separate first and second descriptions, wherein the first description is a description of a spectral envelope of the first frame over the first frequency band, and wherein the second description is a description of a spectral envelope of the first frame over the second frequency band. 35. The computer program product according to claim 33, wherein the information relating to a pitch component of the second frame for the first frequency band includes a pitch lag value. 36. The computer program product according to claim 33, wherein said medium comprises code for causing at least one computer to calculate, based on the information relating to a pitch component of the second frame for the first frequency band, an excitation signal of the second frame for the first frequency band, and wherein said medium comprises code for causing at least one computer to calculate, based on the excitation signal of the second frame for the first frequency band, an excitation signal of the second frame for the second frequency band. 37. A speech decoder configured to calculate a decoded speech signal based on an encoded speech signal, said speech decoder comprising: control logic configured to generate a control signal comprising a sequence of values that is based on coding indices of speech packets from the encoded speech signal, each value of the sequence corresponding to a frame period of the decoded speech signal; anda packet decoder configured(A) to calculate, in response to a value of the control signal having a first state, a corresponding decoded frame based on a description of a spectral envelope of the decoded frame over (1) a first frequency band and (2) a second frequency band that extends above the first frequency band, the description being based on information from a speech packet from the encoded speech signal, and(B) to calculate, in response to a value of the control signal having a second state different than the first state, a corresponding decoded frame based on (1) a description of a spectral envelope of the decoded frame over the first frequency band, the description being based on information from a speech packet from the encoded speech signal, and (2) a description of a spectral envelope of the decoded frame over the second frequency band, the description being based on information from at least one speech packet that occurs in the encoded speech signal before the speech packet,wherein said control logic is configured to set a value of the control signal to have the second state if a corresponding frame of the encoded speech signal includes a burst of an information signal that is separate from the decoded speech signal, andwherein at least one among said control logic and said packet decoder includes a processor. 38. The speech decoder according to claim 37, wherein the description of a spectral envelope of the decoded frame over (1) a first frequency band and (2) a second frequency band that extends above the first frequency band comprises separate first and second descriptions, wherein the first description is a description of a spectral envelope of the decoded frame over the first frequency band, and wherein the second description is a description of a spectral envelope of the decoded frame over the second frequency band. 39. The speech decoder according to claim 37, wherein the information relating to a pitch component of the second frame for the first frequency band includes a pitch lag value. 40. The speech decoder according to claim 37, wherein said packet decoder is configured to calculate, in response to a value of the control signal having a second state, and based on the information relating to a pitch component of the second frame for the first frequency band, an excitation signal of the second frame for the first frequency band, and wherein said apparatus comprises means for calculating, based on the excitation signal of the second frame for the first frequency band, an excitation signal of the second frame for the second frequency band. 41. The speech decoder according to claim 37, wherein said description of the spectral envelope of the decoded frame over the second frequency band is based on a description, from said at least one speech packet that occurs in the encoded speech signal before the speech packet, of a spectral envelope over the second frequency band. 42. The speech decoder according to claim 37, wherein an overlap of the first and second frequency bands occurs in the range of from 3.5 to 4 kilohertz. 43. The speech decoder according to claim 37, wherein said control logic is configured to set the value of the control signal to have the second state if a narrowband coding scheme is indicated for the frame. 44. A method of processing a speech signal, said method comprising: based on a first frame of the speech signal, generating a rate selection signal that indicates a wideband coding scheme;based on information from a mask file, generating a dimming control signal;based on a state of the dimming control signal that corresponds to the first frame, overriding the wideband coding scheme selection to select a narrowband coding scheme; andencoding the first frame according to the narrowband coding scheme. 45. The method of processing a speech signal according to claim 44, wherein said encoding the first frame according to the narrowband coding scheme comprises encoding the first frame into a first speech packet, and wherein said method comprises producing an encoded frame that includes the first speech packet and a burst of an information signal separate from the speech signal. 46. The method of processing a speech signal according to claim 44, wherein said method comprises encoding a second frame of the speech signal according to the wideband coding scheme, wherein said second frame immediately follows said first frame in the speech signal. 47. The method of processing a speech signal according to claim 44, wherein said method comprises encoding a previous frame of the speech signal according to the wideband coding scheme, wherein said previous frame immediately precedes said first frame in the speech signal.

이 특허에 인용된 특허 (9)

Zindler Mark O. ; Funyak David C. ; Hood Teresa I. ; Peifer Jonathan M. ; Helms Roger W. ; Little David E., Circuit interrupter providing improved securement of an electrical terminal within the housing.
상세보기
Manjunath Sharath ; Dejaco Andrew P., Method and apparatus for maintaining a target bit rate in a speech coder.
상세보기
Padovani Roberto (San Diego CA) Tiedemann ; Jr. Edward G. (San Diego CA) Odenwalder Joseph P. (Del Mar CA) Zehavi Ephraim (Haifa ILX) Wheatley ; III Charles E. (Del Mar CA), Method and apparatus for the formatting of data for transmission.
상세보기
Padovani Roberto (San Diego CA) Tiedemann ; Jr. Edward G. (San Diego CA) Weaver ; Jr. Lindsay A. (San Diego CA) Butler Brian K. (Cardiff CA), Method and apparatus for the formatting of data for transmission.
상세보기
Stefan Oestreich DE, Method and radio communication system for transmitting speech information using a broadband or a narrowband speech coding method depending on transmission possibilities.
상세보기
Kim, Young-Jin; Jee, Heon-Joo, Method for enhancing voice quality in CDMA communication system using variable rate vocoder.
상세보기
Manjunath, Sharath; Gardner, William, Multiple mode variable rate speech coding.
상세보기
Kleijn Willem Bastiaan (Basking Ridge NJ) Nahumi Dror (Ocean NJ), RCELP coder.
상세보기
Rao, Ajit V., Signal modification based on continuous time warping for low bit rate CELP coding.
상세보기

이 특허를 인용한 특허 (3)

Hiltner, Jeffrey A.; Matten, Alan H.; Schumacher, Dale R.; Such, Albert J., Encoded packet selection from a first voice stream to create a second voice stream.
상세보기
Rajendran, Vivek; Subasingha, Subasingha Shaminda; Krishnan, Venkatesh, Systems and methods for determining an interpolation factor set for synthesizing a speech signal.
상세보기
Rajendran, Vivek; Kandhadai, Ananthapadmanabhan A., Systems, methods, and apparatus for wideband encoding and decoding of inactive frames.
상세보기

IPC	Description
A	생활필수품
A62	인명구조; 소방(사다리 E06C)
A62B	인명구조용의 기구, 장치 또는 방법(특히 의료용에 사용되는 밸브 A61M 39/00; 특히 물에서 쓰이는 인명구조 장치 또는 방법 B63C 9/00; 잠수장비 B63C 11/00; 특히 항공기에 쓰는 것, 예. 낙하산, 투출좌석 B64D; 특히 광산에서 쓰이는 구조장치 E21F 11/00)
A62B-1/08	.. 윈치 또는 풀리에 제동기구가 있는 것

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 공고번호, 공고일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표출원인, 출원인국적, 출원인주소, 발명자, 발명자E, 발명자코드, 발명자주소, 발명자 우편번호, 발명자국적, 대표IPC, IPC코드, 요약, 미국특허분류, 대리인주소, 대리인코드, 대리인(한글), 대리인(영문), 국제공개일자, 국제공개번호, 국제출원일자, 국제출원번호, 우선권, 우선권주장일, 우선권국가, 우선권출원번호, 원출원일자, 원출원번호, 지정국, Citing Patents, Cited Patents
저장형식	Text(ASCII format) Excel format PIAS분석(.xls)
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

Systems, methods, and apparatus for wideband encoding and decoding of active frames 원문보기

초록

대표청구항 ▼

연구과제 타임라인

이 특허에 인용된 특허 (9)

이 특허를 인용한 특허 (3)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

Systems, methods, and apparatus for wideband encoding and decoding of active frames 원문보기

초록

대표청구항 ▼

연구과제 타임라인

전체(0) 논문(0) 특허(0) 보고서(0)

전체(0) 논문(0) 특허(0) 보고서(0)

이 특허에 인용된 특허 (9)

이 특허를 인용한 특허 (3)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트