IPC분류정보
국가/구분 |
United States(US) Patent
등록
|
국제특허분류(IPC7판) |
|
출원번호 |
US-0830548
(2007-07-30)
|
등록번호 |
US-8725499
(2014-05-13)
|
발명자
/ 주소 |
- Rajendran, Vivek
- Kandhadai, Ananthapadmanabhan A.
|
출원인 / 주소 |
|
대리인 / 주소 |
|
인용정보 |
피인용 횟수 :
2 인용 특허 :
15 |
초록
▼
Disclosed configurations include systems, methods, and apparatus arranged to generate a sequence of spectral tilt values that is based on inactive frames of a speech signal. For each of a plurality of inactive frames of the speech signal, a transmit decision is made according to a change calculated
Disclosed configurations include systems, methods, and apparatus arranged to generate a sequence of spectral tilt values that is based on inactive frames of a speech signal. For each of a plurality of inactive frames of the speech signal, a transmit decision is made according to a change calculated among at least two corresponding values of the sequence. The outcome of the transmit decision determines whether a silence description is transmitted for the corresponding inactive frame.
대표청구항
▼
1. A method of processing a speech signal, said method comprising: generating, by a sequence generator of a computer, a sequence of spectral tilt values that is based on a plurality of inactive frames of the speech signal, wherein the sequence of spectral tilt values comprises a sequence of reflecti
1. A method of processing a speech signal, said method comprising: generating, by a sequence generator of a computer, a sequence of spectral tilt values that is based on a plurality of inactive frames of the speech signal, wherein the sequence of spectral tilt values comprises a sequence of reflection coefficients, wherein each of the spectral tilt values is based on at least one reflection coefficient of a corresponding inactive frame of the speech signal, the at least one reflection coefficient comprising at least one of a first reflection coefficient of the corresponding inactive frame or a second reflection coefficient of the corresponding inactive frame;calculating, by a calculator of the computer, a change among at least two of the reflection coefficient-based spectral tilt values; andfor an inactive frame among the plurality of inactive frames, deciding, by a comparator of the computer, whether to transmit a description for the frame,wherein said deciding whether to transmit a description for the frame is based on the calculated change. 2. The method of processing a speech signal according to claim 1, wherein said generating a sequence of spectral tilt values comprises smoothing another sequence of spectral tilt values to generate the sequence of spectral tilt values, wherein each of the spectral tilt values of the other sequence indicates a spectral tilt of a corresponding one of the plurality of inactive frames. 3. The method of processing a speech signal according to claim 1, wherein each of a plurality of the spectral tilt values is based on at least another spectral tilt value in the sequence of spectral tilt values. 4. The method of processing a speech signal according to claim 1, wherein each of a plurality of the spectral tilt values is based on (A) a spectral tilt of a corresponding one of the plurality of inactive frames and (B) at least another spectral tilt value in the sequence of spectral tilt values. 5. The method of processing a speech signal according to claim 1, wherein the calculated change is based on a difference between consecutive values in the sequence of spectral tilt values. 6. The method of processing a speech signal according to claim 1, wherein said calculating a change comprises calculating a distance between adjacent values in the sequence of spectral tilt values. 7. The method of processing a speech signal according to claim 1, wherein said deciding whether to transmit a description for the frame comprises comparing the calculated change to a threshold value. 8. The method of processing a speech signal according to claim 1, wherein an outcome of said deciding whether to transmit a description for the frame is based on a relation between (A) a magnitude of the calculated change and (B) a threshold value. 9. The method of processing a speech signal according to claim 1, wherein said method comprises, if an outcome of said deciding whether to transmit a description for the frame is a decision to transmit a description for the frame, transmitting a silence description that includes at least one of a spectral envelope description and an energy envelope description. 10. The method of processing a speech signal according to claim 9, wherein said method comprises calculating the silence description based on at least one among (A) spectral envelope descriptions of each of a plurality of inactive frames and (B) energy envelope descriptions of each of a plurality of inactive frames. 11. The method of processing a speech signal according to claim 1, wherein said deciding whether to transmit a description for the frame is based on at least one among (A) a vector describing a spectral envelope of the frame, (B) a residual energy of the frame, (C) a distance in time to a most recent transmission of a description for an inactive frame, (D) a distance in time to a most recent active frame, (E) a description of an energy envelope of the frame, (F) a mean absolute value of the frame, and (G) an energy value of the frame. 12. The method of processing a speech signal according to claim 11, wherein said method comprises, if an outcome of said deciding whether to transmit a description for the frame is a decision to transmit a description for the frame, transmitting a silence description that includes at least one of a spectral envelope description and an energy envelope description. 13. The method of processing a speech signal according to claim 1, wherein said deciding whether to transmit a description for the frame comprises, in response to detecting that a change in a measure of coding gain exceeds a threshold value, deciding not to transmit a description for the frame. 14. The method of processing a speech signal according to claim 13, wherein each value of the measure of coding gain is based on the values of a plurality of reflection coefficients of a corresponding inactive frame of the speech signal. 15. The method of processing a speech signal according to claim 1, wherein said method comprises calculating, for each of a plurality of the spectral tilt values in the sequence of spectral tilt values, a change among the spectral tilt value and at least one other spectral tilt value in the sequence of spectral tilt values, and wherein said method comprises, for each of another plurality of inactive frames of the speech signal, deciding whether to transmit a description for the frame, andwherein, for each of the other plurality of inactive frames, an outcome of said deciding whether to transmit a description for the frame is based on at least one of the calculated changes. 16. The method of processing a speech signal according to claim 15, wherein, for at least some of the other plurality of inactive frames, an outcome of said deciding whether to transmit a description for the frame is a decision not to transmit a description for the frame. 17. The method of processing a speech signal according to claim 15, wherein, for each of the other plurality of inactive frames, said deciding whether to transmit a description for the frame comprises, in response to detecting that a change in a measure of coding gain exceeds a threshold value, deciding not to transmit a description for the frame. 18. The method of processing a speech signal according to claim 17, wherein, for each of the other plurality of inactive frames, said change in a measure of coding gain is based on (A) a value for the measure of coding gain for a first inactive frame of the speech signal that precedes the frame and (B) a value for the measure of coding gain for a second inactive frame of the speech signal that precedes the frame and is different from the first inactive frame. 19. The method of processing a speech signal according to claim 1, wherein said generating a sequence of spectral tilt values comprises, for at least some of the plurality of inactive frames, generating a corresponding spectral tilt value among the sequence of spectral tilt values according to a distance in time between the inactive frame and a preceding active frame of the speech signal. 20. The method of processing a speech signal according to claim 19, wherein said generating a corresponding spectral tilt value among the sequence of spectral tilt values comprises setting the spectral tilt value to a previous spectral tilt value among the sequence of spectral tilt values when the distance in time between the inactive frame and a preceding active frame of the speech signal is less than a threshold value. 21. The method of processing a speech signal according to claim 1, wherein said generating a sequence of spectral tilt values comprises, for at least some of the plurality of inactive frames, calculating a corresponding spectral tilt value among the sequence of spectral tilt values according to a measure of coding gain for the inactive frame. 22. The method of processing a speech signal according to claim 1, wherein said generating a sequence of spectral tilt values comprises, for at least one of the sequence of spectral tilt values, setting the spectral tilt value to a previous spectral tilt value among the sequence of spectral tilt values in response to detecting that a change in a measure of coding gain exceeds a threshold value. 23. The method of claim 1, further comprising: combining multiple transmit indications into a composite transmit indication, wherein each transmit indication is produced from a different blanking algorithm; anddetermining whether to transmit a description of an inactive frame based on the composite transmit indication. 24. A non-transitory computer-readable medium, said medium comprising instructions that when executed cause at least one computer to: generate a sequence of spectral tilt values that is based on a plurality of inactive frames of a speech signal, wherein the sequence of spectral tilt values comprises a sequence of reflection coefficients, wherein each of the spectral tilt values is based on at least one reflection coefficient of a corresponding inactive frame of the speech signal, the at least one reflection coefficient comprising at least one of a first reflection coefficient of the corresponding inactive frame or a second reflection coefficient of the corresponding inactive frame;calculate a change among at least two of the reflection coefficient-based spectral tilt values; anddecide, for an inactive frame among the plurality of inactive frames, and based on the calculated change, whether to transmit a description for the frame. 25. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to generate a sequence of spectral tilt values are configured to cause the at least one computer to generate each of a plurality of the spectral tilt values based on at least another spectral tilt value in the sequence of spectral tilt values. 26. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to calculate a change are configured to cause the at least one computer to calculate the change based on a difference between consecutive values in the sequence of spectral tilt values. 27. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to decide whether to transmit a description for the frame are configured to cause the at least one computer to decide whether to transmit a description for the frame based on a relation between (A) a magnitude of the calculated change and (B) a threshold value. 28. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to decide whether to transmit a description for the frame include instructions for causing the at least one computer to decide, in response to a change in a measure of coding gain that exceeds a threshold value, not to transmit a description for the frame. 29. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to calculate a change are configured to cause the at least one computer to calculate, for each of a plurality of the spectral tilt values in the sequence of spectral tilt values, a change among the spectral tilt value and at least one other spectral tilt value in the sequence of spectral tilt values, and wherein said instructions for causing at least one computer to decide whether to transmit a description for the frame are configured to cause the at least one computer to decide, for each of another plurality of inactive frames of the speech signal, whether to transmit a description for the frame, andwherein said instructions for causing at least one computer to decide whether to transmit a description for the frame are configured such that, for each of the other plurality of inactive frames, the decision whether to transmit a description for the frame is based on at least one of the calculated changes. 30. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to generate a sequence of spectral tilt values comprise instructions for causing the at least one computer to generate, for at least some of the plurality of inactive frames, a corresponding spectral tilt value among the sequence of spectral tilt values according to a distance in time between the inactive frame and a preceding active frame of the speech signal. 31. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to generate a sequence of spectral tilt values are configured to cause the at least one computer, for at least one of the sequence of spectral tilt values, to set the spectral tilt value to a previous spectral tilt value among the sequence of spectral tilt values in response to detecting that a change in a measure of coding gain exceeds a threshold value. 32. The computer-readable medium according to claim 24, wherein said instructions for causing at least one computer to generate a sequence of spectral tilt values are configured to cause the at least one computer to smooth another sequence of spectral tilt values to generate the sequence of spectral tilt values, wherein each of the spectral tilt values of the other sequence indicates a spectral tilt of a corresponding one of the plurality of inactive frames. 33. An apparatus for processing a speech signal, said apparatus comprising: a sequence generator configured to generate a sequence of spectral tilt values that is based on a plurality of inactive frames of the speech signal, wherein the sequence of spectral tilt values comprises a sequence of reflection coefficients, wherein each of the spectral tilt values is based on at least one reflection coefficient of a corresponding inactive frame of the speech signal, the at least one reflection coefficient comprising at least one of a first reflection coefficient of the corresponding inactive frame or a second reflection coefficient of the corresponding inactive frame;a calculator configured to calculate a change among at least two of the reflection coefficient-based spectral tilt values; anda comparator configured to decide, for an inactive frame among the plurality of inactive frames, and based on the calculated change, whether to transmit a description for the frame. 34. The apparatus for processing a speech signal according to claim 33, wherein said comparator is configured to decide whether to transmit a description for the frame based on a relation between (A) a magnitude of the calculated change and (B) a threshold value. 35. The apparatus for processing a speech signal according to claim 33, wherein the apparatus comprises a device for wireless communications that includes said sequence generator, said calculator, and said comparator, and wherein said device is configured to transmit, in response to a decision by said comparator to transmit a description for the frame, a silence description that includes at least one of a spectral envelope description and an energy envelope description. 36. The apparatus for processing a speech signal according to claim 33, wherein said comparator is configured to decide, in response to a change in a measure of coding gain that exceeds a threshold value, not to transmit a description for the frame. 37. The apparatus for processing a speech signal according to claim 33, wherein said calculator is configured to calculate, for each of a plurality of the spectral tilt values in the sequence of spectral tilt values, a change among the spectral tilt value and at least one other spectral tilt value in the sequence of spectral tilt values, and wherein said comparator is configured to decide, for each of another plurality of inactive frames of the speech signal, whether to transmit a description for the frame, andwherein said comparator is configured such that, for each of the other plurality of inactive frames, the decision whether to transmit a description for the frame is based on at least one of the calculated changes. 38. The apparatus for processing a speech signal according to claim 33, wherein said sequence generator is configured to generate, for at least some of the plurality of inactive frames, a corresponding spectral tilt value among the sequence of spectral tilt values according to a distance in time between the inactive frame and a preceding active frame of the speech signal. 39. The apparatus for processing a speech signal according to claim 33, wherein said sequence generator is configured, for at least one of the sequence of spectral tilt values, to set the spectral tilt value to a previous spectral tilt value among the sequence of spectral tilt values in response to detecting that a change in a measure of coding gain exceeds a threshold value. 40. The apparatus for processing a speech signal according to claim 33, wherein said sequence generator is configured to generate the sequence of spectral tilt values by smoothing another sequence of spectral tilt values, wherein each of the spectral tilt values of the other sequence indicates a spectral tilt of a corresponding one of the plurality of inactive frames. 41. An apparatus for processing a speech signal, said apparatus comprising: means for generating a sequence of spectral tilt values that is based on a plurality of inactive frames of the speech signal, wherein the sequence of spectral tilt values comprises a sequence of reflection coefficients, wherein each of the spectral tilt values is based on at least one reflection coefficient of a corresponding inactive frame of the speech signal, the at least one reflection coefficient comprising at least one of a first reflection coefficient of the corresponding inactive frame or a second reflection coefficient of the corresponding inactive frame;means for calculating a change among at least two of the reflection coefficient-based spectral tilt values; andmeans for deciding, for an inactive frame among the plurality of inactive frames, and based on the calculated change, whether to transmit a description for the frame. 42. The apparatus for processing a speech signal according to claim 41, wherein said apparatus comprises means for transmitting, in response to a decision by said means for deciding to transmit a description for the frame, a silence description that includes at least one of a spectral envelope description and an energy envelope description. 43. The apparatus for processing a speech signal according to claim 41, wherein said means for generating a sequence of spectral tilt values is configured to generate, for at least some of the plurality of inactive frames, a corresponding spectral tilt value among the sequence of spectral tilt values according to a distance in time between the inactive frame and a preceding active frame of the speech signal. 44. The apparatus for processing a speech signal according to claim 41, wherein said means for generating a sequence of spectral tilt values is configured, for at least one of the sequence of spectral tilt values, to set the spectral tilt value to a previous spectral tilt value among the sequence of spectral tilt values in response to detecting that a change in a measure of coding gain exceeds a threshold value. 45. The apparatus for processing a speech signal according to claim 41, wherein said means for generating a sequence of spectral tilt values is configured to generate the sequence of spectral tilt values by smoothing another sequence of spectral tilt values,wherein each of the spectral tilt values of the other sequence indicates a spectral tilt of a corresponding one of the plurality of inactive frames. 46. A method of processing a speech signal, said method comprising: generating, by a sequence generator of a computer, a sequence of spectral tilt values that is based on a plurality of inactive frames of the speech signal, wherein the sequence of spectral tilt values comprises a sequence of reflection coefficients, wherein each of the spectral tilt values is based on at least one reflection coefficient of a corresponding inactive frame of the speech signal, the at least one reflection coefficient comprising at least one of a first reflection coefficient of the corresponding inactive frame or a second reflection coefficient of the corresponding inactive frame;calculating, by a calculator of the computer, a change among at least two of the reflection coefficient-based spectral tilt values; andfor an inactive frame among the plurality of inactive frames, deciding, by a comparator of the computer, whether to transmit a description for the frame,wherein said deciding whether to transmit a description for the frame is based on the calculated change, andwherein said generating a sequence of spectral tilt values comprises, for at least some of the plurality of inactive frames, generating a corresponding spectral tilt value among the sequence of spectral tilt values according to a distance in time between the inactive frame and a preceding active frame of the speech signal.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.