[Patent] Systems, methods, and apparatus for signal change detection


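IPC classification information
Country / Type	United States (US) Patent, Granted
International Patent Classification (IPC 7th edition)	G10L-011/06
Application number	US-0830548 (2007-07-30)
Registration number	US-8725499 (2014-05-13)
Inventor / Address	Rajendran, Vivek; Kandhadai, Ananthapadmanabhan A.
Applicant / Address	QUALCOMM Incorporated
Agent / Address	Yoo, Heejong
Citation information	Times cited: 2; Patents cited: 15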

Abstract

Disclosed configurations include systems, methods, and apparatus arranged to generate a sequence of spectral tilt values that is based on inactive frames of a speech signal. For each of a plurality of inactive frames of the speech signal, a transmit decision is made according to a change calculated among at least two corresponding values of the sequence. The outcome of the transmit decision determines whether a silence description is transmitted for the corresponding inactive frame.
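
The decision flow summarized in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the threshold of 0.1, and the rule of always describing the first inactive frame are assumptions made for illustration. The spectral tilt is taken here as the normalized lag-1 autocorrelation of the frame, which matches the first reflection coefficient up to a sign convention.

```python
from typing import List, Sequence


def spectral_tilt(frame: Sequence[float]) -> float:
    """Normalized lag-1 autocorrelation r(1)/r(0) of one frame.

    Up to a sign convention this equals the first reflection coefficient,
    used here as the spectral tilt value (hypothetical helper).
    """
    r0 = sum(x * x for x in frame)
    if r0 == 0.0:
        return 0.0  # assumption: treat an all-zero frame as flat
    r1 = sum(a * b for a, b in zip(frame, frame[1:]))
    return r1 / r0


def transmit_decisions(
    inactive_frames: Sequence[Sequence[float]],
    threshold: float = 0.1,  # assumed value, not taken from the patent
) -> List[bool]:
    """Return one transmit decision per inactive frame.

    A silence description is flagged for transmission when the magnitude of
    the change between consecutive tilt values exceeds the threshold.
    """
    decisions: List[bool] = []
    previous = None
    for frame in inactive_frames:
        tilt = spectral_tilt(frame)
        if previous is None:
            decisions.append(True)  # assumption: always describe the first frame
        else:
            decisions.append(abs(tilt - previous) > threshold)
        previous = tilt
    return decisions
```

With two flat frames followed by two alternating-sign frames, the sketch flags the first frame and the frame where the tilt changes, and blanks the other two.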

Representative claims

1. A method of processing a speech signal, said method comprising: generating, by a sequence generator of a computer, a sequence of spectral tilt values that is based on a plurality of inactive frames of the speech signal, wherein the sequence of spectral tilt values comprises a sequence of reflection coefficients, wherein each of the spectral tilt values is based on at least one reflection coefficient of a corresponding inactive frame of the speech signal, the at least one reflection coefficient comprising at least one of a first reflection coefficient of the corresponding inactive frame or a second reflection coefficient of the corresponding inactive frame; calculating, by a calculator of the computer, a change among at least two of the reflection coefficient-based spectral tilt values; and for an inactive frame among the plurality of inactive frames, deciding, by a comparator of the computer, whether to transmit a description for the frame, wherein said deciding whether to transmit a description for the frame is based on the calculated change.
2. The method of processing a speech signal according to claim 1, wherein said generating a sequence of spectral tilt values comprises smoothing another sequence of spectral tilt values to generate the sequence of spectral tilt values, wherein each of the spectral tilt values of the other sequence indicates a spectral tilt of a corresponding one of the plurality of inactive frames.
3. The method of processing a speech signal according to claim 1, wherein each of a plurality of the spectral tilt values is based on at least another spectral tilt value in the sequence of spectral tilt values.
4. The method of processing a speech signal according to claim 1, wherein each of a plurality of the spectral tilt values is based on (A) a spectral tilt of a corresponding one of the plurality of inactive frames and (B) at least another spectral tilt value in the sequence of spectral tilt values.
5. The method of processing a speech signal according to claim 1, wherein the calculated change is based on a difference between consecutive values in the sequence of spectral tilt values.
6. The method of processing a speech signal according to claim 1, wherein said calculating a change comprises calculating a distance between adjacent values in the sequence of spectral tilt values.
7. The method of processing a speech signal according to claim 1, wherein said deciding whether to transmit a description for the frame comprises comparing the calculated change to a threshold value.
8. The method of processing a speech signal according to claim 1, wherein an outcome of said deciding whether to transmit a description for the frame is based on a relation between (A) a magnitude of the calculated change and (B) a threshold value.
9. The method of processing a speech signal according to claim 1, wherein said method comprises, if an outcome of said deciding whether to transmit a description for the frame is a decision to transmit a description for the frame, transmitting a silence description that includes at least one of a spectral envelope description and an energy envelope description.
10. The method of processing a speech signal according to claim 9, wherein said method comprises calculating the silence description based on at least one among (A) spectral envelope descriptions of each of a plurality of inactive frames and (B) energy envelope descriptions of each of a plurality of inactive frames.
11. The method of processing a speech signal according to claim 1, wherein said deciding whether to transmit a description for the frame is based on at least one among (A) a vector describing a spectral envelope of the frame, (B) a residual energy of the frame, (C) a distance in time to a most recent transmission of a description for an inactive frame, (D) a distance in time to a most recent active frame, (E) a description of an energy envelope of the frame, (F) a mean absolute value of the frame, and (G) an energy value of the frame.
12. The method of processing a speech signal according to claim 11, wherein said method comprises, if an outcome of said deciding whether to transmit a description for the frame is a decision to transmit a description for the frame, transmitting a silence description that includes at least one of a spectral envelope description and an energy envelope description.
13. The method of processing a speech signal according to claim 1, wherein said deciding whether to transmit a description for the frame comprises, in response to detecting that a change in a measure of coding gain exceeds a threshold value, deciding not to transmit a description for the frame.
14. The method of processing a speech signal according to claim 13, wherein each value of the measure of coding gain is based on the values of a plurality of reflection coefficients of a corresponding inactive frame of the speech signal.
15. The method of processing a speech signal according to claim 1, wherein said method comprises calculating, for each of a plurality of the spectral tilt values in the sequence of spectral tilt values, a change among the spectral tilt value and at least one other spectral tilt value in the sequence of spectral tilt values, and wherein said method comprises, for each of another plurality of inactive frames of the speech signal, deciding whether to transmit a description for the frame, and wherein, for each of the other plurality of inactive frames, an outcome of said deciding whether to transmit a description for the frame is based on at least one of the calculated changes.
16. The method of processing a speech signal according to claim 15, wherein, for at least some of the other plurality of inactive frames, an outcome of said deciding whether to transmit a description for the frame is a decision not to transmit a description for the frame.
17. The method of processing a speech signal according to claim 15, wherein, for each of the other plurality of inactive frames, said deciding whether to transmit a description for the frame comprises, in response to detecting that a change in a measure of coding gain exceeds a threshold value, deciding not to transmit a description for the frame.
18. The method of processing a speech signal according to claim 17, wherein, for each of the other plurality of inactive frames, said change in a measure of coding gain is based on (A) a value for the measure of coding gain for a first inactive frame of the speech signal that precedes the frame and (B) a value for the measure of coding gain for a second inactive frame of the speech signal that precedes the frame and is different from the first inactive frame.
19. The method of processing a speech signal according to claim 1, wherein said generating a sequence of spectral tilt values comprises, for at least some of the plurality of inactive frames, generating a corresponding spectral tilt value among the sequence of spectral tilt values according to a distance in time between the inactive frame and a preceding active frame of the speech signal.
20. The method of processing a speech signal according to claim 19, wherein said generating a corresponding spectral tilt value among the sequence of spectral tilt values comprises setting the spectral tilt value to a previous spectral tilt value among the sequence of spectral tilt values when the distance in time between the inactive frame and a preceding active frame of the speech signal is less than a threshold value.
21. The method of processing a speech signal according to claim 1, wherein said generating a sequence of spectral tilt values comprises, for at least some of the plurality of inactive frames, calculating a corresponding spectral tilt value among the sequence of spectral tilt values according to a measure of coding gain for the inactive frame.
22. The method of processing a speech signal according to claim 1, wherein said generating a sequence of spectral tilt values comprises, for at least one of the sequence of spectral tilt values, setting the spectral tilt value to a previous spectral tilt value among the sequence of spectral tilt values in response to detecting that a change in a measure of coding gain exceeds a threshold value.
23. The method of claim 1, further comprising: combining multiple transmit indications into a composite transmit indication, wherein each transmit indication is produced from a different blanking algorithm; and determining whether to transmit a description of an inactive frame based on the composite transmit indication.
24. A non-transitory computer-readable medium, said medium comprising instructions that when executed cause at least one computer to: generate a sequence of spectral tilt values that is based on a plurality of inactive frames of a speech signal, wherein the sequence of spectral tilt values comprises a sequence of reflection coefficients, wherein each of the spectral tilt values is based on at least one reflection coefficient of a corresponding inactive frame of the speech signal, the at least one reflection coefficient comprising at least one of a first reflection coefficient of the corresponding inactive frame or a second reflection coefficient of the corresponding inactive frame; calculate a change among at least two of the reflection coefficient-based spectral tilt values; and decide, for an inactive frame among the plurality of inactive frames, and based on the calculated change, whether to transmit a description for the frame.
25. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to generate a sequence of spectral tilt values are configured to cause the at least one computer to generate each of a plurality of the spectral tilt values based on at least another spectral tilt value in the sequence of spectral tilt values.
26. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to calculate a change are configured to cause the at least one computer to calculate the change based on a difference between consecutive values in the sequence of spectral tilt values.
27. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to decide whether to transmit a description for the frame are configured to cause the at least one computer to decide whether to transmit a description for the frame based on a relation between (A) a magnitude of the calculated change and (B) a threshold value.
28. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to decide whether to transmit a description for the frame include instructions for causing the at least one computer to decide, in response to a change in a measure of coding gain that exceeds a threshold value, not to transmit a description for the frame.
29. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to calculate a change are configured to cause the at least one computer to calculate, for each of a plurality of the spectral tilt values in the sequence of spectral tilt values, a change among the spectral tilt value and at least one other spectral tilt value in the sequence of spectral tilt values, and wherein said instructions for causing at least one computer to decide whether to transmit a description for the frame are configured to cause the at least one computer to decide, for each of another plurality of inactive frames of the speech signal, whether to transmit a description for the frame, and wherein said instructions for causing at least one computer to decide whether to transmit a description for the frame are configured such that, for each of the other plurality of inactive frames, the decision whether to transmit a description for the frame is based on at least one of the calculated changes.
30. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to generate a sequence of spectral tilt values comprise instructions for causing the at least one computer to generate, for at least some of the plurality of inactive frames, a corresponding spectral tilt value among the sequence of spectral tilt values according to a distance in time between the inactive frame and a preceding active frame of the speech signal.
31. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to generate a sequence of spectral tilt values are configured to cause the at least one computer, for at least one of the sequence of spectral tilt values, to set the spectral tilt value to a previous spectral tilt value among the sequence of spectral tilt values in response to detecting that a change in a measure of coding gain exceeds a threshold value.
32. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to generate a sequence of spectral tilt values are configured to cause the at least one computer to smooth another sequence of spectral tilt values to generate the sequence of spectral tilt values, wherein each of the spectral tilt values of the other sequence indicates a spectral tilt of a corresponding one of the plurality of inactive frames.
33. An apparatus for processing a speech signal, said apparatus comprising: a sequence generator configured to generate a sequence of spectral tilt values that is based on a plurality of inactive frames of the speech signal, wherein the sequence of spectral tilt values comprises a sequence of reflection coefficients, wherein each of the spectral tilt values is based on at least one reflection coefficient of a corresponding inactive frame of the speech signal, the at least one reflection coefficient comprising at least one of a first reflection coefficient of the corresponding inactive frame or a second reflection coefficient of the corresponding inactive frame; a calculator configured to calculate a change among at least two of the reflection coefficient-based spectral tilt values; and a comparator configured to decide, for an inactive frame among the plurality of inactive frames, and based on the calculated change, whether to transmit a description for the frame.
34. The apparatus for processing a speech signal according to claim 33, wherein said comparator is configured to decide whether to transmit a description for the frame based on a relation between (A) a magnitude of the calculated change and (B) a threshold value.
35. The apparatus for processing a speech signal according to claim 33, wherein the apparatus comprises a device for wireless communications that includes said sequence generator, said calculator, and said comparator, and wherein said device is configured to transmit, in response to a decision by said comparator to transmit a description for the frame, a silence description that includes at least one of a spectral envelope description and an energy envelope description.
36. The apparatus for processing a speech signal according to claim 33, wherein said comparator is configured to decide, in response to a change in a measure of coding gain that exceeds a threshold value, not to transmit a description for the frame.
37. The apparatus for processing a speech signal according to claim 33, wherein said calculator is configured to calculate, for each of a plurality of the spectral tilt values in the sequence of spectral tilt values, a change among the spectral tilt value and at least one other spectral tilt value in the sequence of spectral tilt values, and wherein said comparator is configured to decide, for each of another plurality of inactive frames of the speech signal, whether to transmit a description for the frame, and wherein said comparator is configured such that, for each of the other plurality of inactive frames, the decision whether to transmit a description for the frame is based on at least one of the calculated changes.
38. The apparatus for processing a speech signal according to claim 33, wherein said sequence generator is configured to generate, for at least some of the plurality of inactive frames, a corresponding spectral tilt value among the sequence of spectral tilt values according to a distance in time between the inactive frame and a preceding active frame of the speech signal.
39. The apparatus for processing a speech signal according to claim 33, wherein said sequence generator is configured, for at least one of the sequence of spectral tilt values, to set the spectral tilt value to a previous spectral tilt value among the sequence of spectral tilt values in response to detecting that a change in a measure of coding gain exceeds a threshold value.
40. The apparatus for processing a speech signal according to claim 33, wherein said sequence generator is configured to generate the sequence of spectral tilt values by smoothing another sequence of spectral tilt values, wherein each of the spectral tilt values of the other sequence indicates a spectral tilt of a corresponding one of the plurality of inactive frames.
41. An apparatus for processing a speech signal, said apparatus comprising: means for generating a sequence of spectral tilt values that is based on a plurality of inactive frames of the speech signal, wherein the sequence of spectral tilt values comprises a sequence of reflection coefficients, wherein each of the spectral tilt values is based on at least one reflection coefficient of a corresponding inactive frame of the speech signal, the at least one reflection coefficient comprising at least one of a first reflection coefficient of the corresponding inactive frame or a second reflection coefficient of the corresponding inactive frame; means for calculating a change among at least two of the reflection coefficient-based spectral tilt values; and means for deciding, for an inactive frame among the plurality of inactive frames, and based on the calculated change, whether to transmit a description for the frame.
42. The apparatus for processing a speech signal according to claim 41, wherein said apparatus comprises means for transmitting, in response to a decision by said means for deciding to transmit a description for the frame, a silence description that includes at least one of a spectral envelope description and an energy envelope description.
43. The apparatus for processing a speech signal according to claim 41, wherein said means for generating a sequence of spectral tilt values is configured to generate, for at least some of the plurality of inactive frames, a corresponding spectral tilt value among the sequence of spectral tilt values according to a distance in time between the inactive frame and a preceding active frame of the speech signal.
44. The apparatus for processing a speech signal according to claim 41, wherein said means for generating a sequence of spectral tilt values is configured, for at least one of the sequence of spectral tilt values, to set the spectral tilt value to a previous spectral tilt value among the sequence of spectral tilt values in response to detecting that a change in a measure of coding gain exceeds a threshold value.
45. The apparatus for processing a speech signal according to claim 41, wherein said means for generating a sequence of spectral tilt values is configured to generate the sequence of spectral tilt values by smoothing another sequence of spectral tilt values, wherein each of the spectral tilt values of the other sequence indicates a spectral tilt of a corresponding one of the plurality of inactive frames.
46. A method of processing a speech signal, said method comprising: generating, by a sequence generator of a computer, a sequence of spectral tilt values that is based on a plurality of inactive frames of the speech signal, wherein the sequence of spectral tilt values comprises a sequence of reflection coefficients, wherein each of the spectral tilt values is based on at least one reflection coefficient of a corresponding inactive frame of the speech signal, the at least one reflection coefficient comprising at least one of a first reflection coefficient of the corresponding inactive frame or a second reflection coefficient of the corresponding inactive frame; calculating, by a calculator of the computer, a change among at least two of the reflection coefficient-based spectral tilt values; and for an inactive frame among the plurality of inactive frames, deciding, by a comparator of the computer, whether to transmit a description for the frame, wherein said deciding whether to transmit a description for the frame is based on the calculated change, and wherein said generating a sequence of spectral tilt values comprises, for at least some of the plurality of inactive frames, generating a corresponding spectral tilt value among the sequence of spectral tilt values according to a distance in time between the inactive frame and a preceding active frame of the speech signal.
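
As a rough illustration of how the sequence-generation variants in claims 2 and 20 and the composite indication in claim 23 might fit together, the sketch below smooths a raw tilt sequence, holds the previous value while an inactive frame is still close to the last active frame, and merges several transmit indications. The claims do not specify a smoothing filter or a combining rule, so the first-order recursive filter, the logical OR, and all names and default values here are assumptions.

```python
from typing import Iterable, List, Sequence


def smoothed_tilt_sequence(
    raw_tilts: Sequence[float],
    frames_since_active: Sequence[int],
    alpha: float = 0.9,   # assumed smoothing factor
    hangover: int = 3,    # assumed threshold on distance to the last active frame
) -> List[float]:
    """Generate a tilt sequence from per-frame raw tilt values.

    raw_tilts[i] is the spectral tilt of the i-th inactive frame, and
    frames_since_active[i] is its distance in frames from the preceding
    active frame.
    """
    sequence: List[float] = []
    previous = None
    for tilt, distance in zip(raw_tilts, frames_since_active):
        if previous is None:
            value = tilt  # assumption: seed the sequence with the first raw value
        elif distance < hangover:
            value = previous  # hold the previous value shortly after active speech
        else:
            # First-order recursive smoothing (an assumed choice of filter).
            value = alpha * previous + (1.0 - alpha) * tilt
        sequence.append(value)
        previous = value
    return sequence


def composite_transmit(indications: Iterable[bool]) -> bool:
    """Combine indications from several blanking algorithms.

    Logical OR is an assumption; the claim only says they are combined.
    """
    return any(indications)
```

Holding the previous value during the hangover period keeps the tail of active speech from registering as a change in the background noise, which would otherwise trigger a description on nearly every transition.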

Patents cited by this patent (15)

Weimin Peng; James Patrick Ashley, Method and apparatus for coding and decoding speech.
Allen Gersho; Eyal Shlomot; Vladimir Cuperman; Chunyan Li, Method and apparatus for hybrid coding of speech at 4KBPS having phase alignment between mode-switched frames.
Michaelis, Paul Roller, Method and apparatus for improving the intelligibility of digitally compressed speech.
Manjunath Sharath; Dejaco Andrew P., Method and apparatus for maintaining a target bit rate in a speech coder.
Padovani, Roberto (San Diego, CA); Tiedemann, Jr., Edward G. (San Diego, CA); Weaver, Jr., Lindsay A. (San Diego, CA); Butler, Brian K. (Cardiff, CA), Method and apparatus for the formatting of data for transmission.
DeJaco Andrew P. (San Diego CA), Method for determining speech encoding rate in a variable rate vocoder.
Jarvinen, Kari; Kapanen, Pekka; Ruoppila, Vesa; Rotola-Pukkila, Jani, Methods for generating comfort noise during discontinuous transmission.
Ehara, Hiroyuki, Multimode speech coding apparatus and decoding apparatus.
Manjunath, Sharath; Gardner, William, Multiple mode variable rate speech coding.
Kleijn Willem Bastiaan (Basking Ridge NJ); Nahumi Dror (Ocean NJ), RCELP coder.
Li, Dunling; Sisli, Gokhan; Thomas, Daniel, SID frame detection with human auditory perception compensation.
Rao, Ajit V., Signal modification based on continuous time warping for low bit rate CELP coding.
Gao, Yang; Shlomot, Eyal; Benyassine, Adil, Tone detection algorithm for a voice activity detector.
Nakamura Kazuo, JPX, Voice-presence/absence discriminator having highly reliable lead portion detection.
Bhaskar, Udaya; Swaminathan, Kumar, Voicing measure for a speech CODEC system.

Patents citing this patent (2)

Choo, Ki-hyun; Miao, Lei; Oh, Eun-mi, Method and apparatus for encoding and decoding high frequency signal.
Choo, Ki-hyun; Miao, Lei; Oh, Eun-mi, Method and apparatus for encoding and decoding high frequency signal.

