A simple and efficient method for producing an obfuscated speech signal which may be used to mask a stream of speech, is disclosed. A speech signal representing the speech stream to be masked is obtained. The speech signal is then temporally partitioned into segments, preferably corresponding to p
A simple and efficient method for producing an obfuscated speech signal which may be used to mask a stream of speech, is disclosed. A speech signal representing the speech stream to be masked is obtained. The speech signal is then temporally partitioned into segments, preferably corresponding to phonemes within the speech stream. The segments are then stored in a memory, and some or all of the segments are subsequently selected, retrieved, and assembled into an obfuscated speech signal representing an unintelligble speech stream that, when combined with the speech signal or reproduced and combined with the speech stream, provides a masking effect. While the presently preferred embodiment finds application most readily in an open plan office, embodiments suitable for use in restaurants, classrooms, and in telecommunications systems are also disclosed.
대표청구항▼
The invention claimed is: 1. A method of producing a substantially unintelligible, obfuscated speech signal from intelligible speech, comprising the steps of: obtaining a speech signal representing a speech stream; temporally partitioning said speech signal into a plurality of variable length segm
The invention claimed is: 1. A method of producing a substantially unintelligible, obfuscated speech signal from intelligible speech, comprising the steps of: obtaining a speech signal representing a speech stream; temporally partitioning said speech signal into a plurality of variable length segments, each of said segments having a length determined by features of said speech signal, said segments occurring in an initial order within said speech signal; selecting a plurality of selected segments from among said segments; and assembling said selected segments, in an order different than said initial order, to produce said obfuscated speech signal, wherein said obfuscated speech signal is speech-like. 2. The method of claim 1, further comprising the step, immediately following said temporally partitioning step, of: storing said segments in a memory; and further comprising the step, immediately following said selecting step, of; retrieving said selected segments from said memory. 3. The method of claim 2, wherein said selected segments are selected from a plurality of segments within said memory comprising a recent history of segments present in said speech signal. 4. The method of claim 3, wherein said selected segments are selected randomly from said plurality of segments contained within said memory. 5. The method of claim 3, wherein each of said selected segments is selected with a relative frequency commensurate with a relative frequency of occurrence within said speech signal. 6. The method of claim 1, wherein said obfuscated speech signal is produced in substantially real time. 7. The method of claim 1, wherein said speech signal represents a previously recorded speech stream. 8. The method of claim 1, wherein said obfuscated speech signal simulates unintelligible background conversation. 9. The method of claim 1, wherein said obfuscated speech signal is transmitted through a telecommunications network. 10. The method of claim 1, further comprising the step, immediately following said assembling step, of: combining said speech signal and said obfuscated speech signal to produce a combined speech signal; wherein said combined signal comprises a speech stream that is substantially unintelligible. 11. The method of claim 1, further comprising the steps, immediately following said assembling step, of: reproducing said obfuscated speech signal to provide an obfuscated speech stream, and combining said speech stream and said obfuscated speech stream to produce a combined speech stream; wherein said combined speech stream is substantially unintelligible. 12. The method of claim 1, wherein said speech signal is obtained from a microphone. 13. The method of claim 1, wherein said obfuscated speech signal is reproduced by a loudspeaker. 14. The method of claim 1, wherein said speech signal is obtained from an office environment. 15. The method of claim 1, wherein said selected segments comprise each segment within said speech stream. 16. The method of claim 1, wherein said speech signal comprises a sequence of digital values. 17. The method of claim 1, wherein said segments represent phonemes within said speech stream. 18. The method of claim 17, wherein said phonemes are determined using a continuous speech recognition system. 19. The method of claim 1, wherein said assembling step comprises the step of: applying a shaping function to each of said selected segments; wherein said shaping function provides a smooth transition between successive segments in said obfuscated speech signal. 20. The method of claim 1, wherein said selecting and assembling steps concurrently produce a plurality of said obfuscated speech signals from said speech signal. 21. An apparatus for producing a substantially unintelligible, obfuscated speech signal from intelligible speech, comprising: a module for obtaining a speech signal representing a speech stream; a module for temporally partitioning said speech signal into a plurality of variable length segments, temporally partitioning said speech signal into a plurality of variable length segments, each of said segments having a length determined by features of said speech signal, said segments occurring in an initial order within said speech signal; a module for selecting a plurality of selected segments from among said segments; and a module for assembling said selected segments, in an order different than said initial order, to produce said obfuscated speech signal, wherein said obfuscated speech signal is speech-like. 22. The apparatus of claim 21, further comprising: a memory for storing said segments; and a module for retrieving said selected segments from said memory. 23. The apparatus of claim 22, wherein said selected segments are selected from a plurality of segments within said memory comprising a recent history of segments present in said speech signal. 24. The apparatus of claim 23, wherein said selected segments are selected randomly from said plurality of segments contained within said memory. 25. The apparatus of claim 23, wherein each of said selected segments is selected with a relative frequency commensurate with a relative frequency of occurrence within said speech signal. 26. The apparatus of claim 21, wherein said obfuscated speech signal is produced in substantially real time. 27. The apparatus of claim 21, wherein said speech signal represents a previously recorded speech stream. 28. The apparatus of claim 21, wherein said obfuscated speech signal simulates unintelligible background conversation. 29. The apparatus of claim 21, further comprising: a module for transmitting said obfuscated speech signal through a telecommunications network. 30. The apparatus of claim 21, further comprising: a module for combining said speech signal and said obfuscated speech signal to produce a combined speech signal; wherein said combined signal comprises a speech stream that is substantially unintelligible. 31. The apparatus of claim 21, further comprising: a module for reproducing said obfuscated speech signal to provide an obfuscated speech stream, and a module for combining said speech stream and said obfuscated speech stream to produce a combined speech stream; wherein said combined speech stream is substantially unintelligible. 32. The apparatus of claim 21, further comprising: a microphone for obtaining said speech signal. 33. The apparatus of claim 21, further comprising: a loudspeaker for reproducing said obfuscated speech. 34. The apparatus of claim 21, wherein said speech signal is obtained from an office environment. 35. The apparatus of claim 21, wherein said selected segments comprise each segment within said speech stream. 36. The apparatus of claim 21, wherein said speech signal comprises a sequence of digital values. 37. The apparatus of claim 21, wherein said segments represent phonemes within said speech stream. 38. The apparatus of claim 37, wherein said phonemes are determined using a continuous speech recognition system. 39. The apparatus of claim 21, wherein said module for assembling further comprises: a module for applying a shaping function to each of said selected segments; wherein said shaping function provides a smooth transition between successive segments in said obfuscated speech signal. 40. The apparatus of claim 21, wherein said modules for selecting and assembling concurrently produce a plurality of said obfuscated speech signals from said speech signal.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (18)
Gossman William (Silver Spring MD) Eatwell Graham (Cambridge GBX), Active high transmission loss panel.
Propst Robert L. (Ann Arbor MI) Randolph Travis M. (Saugatuck MI) Reyda Theodore M. (Saugatuck MI), Combination acoustic conditioner and light fixture.
Schmid ; Pierre ; Brunner ; Eduard ; Stofer ; Walter, Method and apparatus for the scrambled transmission of spoken information via a telephony channel.
Gruber, Thomas R.; Sabatelli, Alessandro F.; Aybes, Alexandre A.; Pitschel, Donald W.; Voas, Edward D.; Anzures, Freddy A.; Marcos, Paul D., Actionable reminder entries.
Gruber, Thomas Robert; Sabatelli, Alessandro F.; Aybes, Alexandre A.; Pitschel, Donald W.; Voas, Edward D.; Anzures, Freddy A.; Marcos, Paul D., Active transport based notifications.
Shechtman, Slava; Steinberg, Raphael, Deriving geographic distribution of physiological or psychological conditions of human speakers while preserving personal privacy.
Shechtman, Slava; Steinberg, Raphael, Deriving geographic distribution of physiological or psychological conditions of human speakers while preserving personal privacy.
Carson, David A.; Keen, Daniel; Dibiase, Evan; Saddler, Harry J.; Iacono, Marco; Lemay, Stephen O.; Pitschel, Donald W.; Gruber, Thomas R., Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant.
Gruber, Thomas Robert; Cheyer, Adam John; Kittlaus, Dag; Guzzoni, Didier Rene; Brigham, Christopher Dean; Giuli, Richard Donald; Bastea-Forte, Marcello; Saddler, Harry Joseph, Intelligent automated assistant.
Gruber, Thomas Robert; Cheyer, Adam John; Kittlaus, Dag; Guzzoni, Didier Rene; Brigham, Christopher Dean; Giuli, Richard Donald; Bastea-Forte, Marcello; Saddler, Harry Joseph, Intelligent automated assistant.
Os, Marcel Van; Saddler, Harry J.; Napolitano, Lia T.; Russell, Jonathan H.; Lister, Patrick M.; Dasari, Rohit, Intelligent automated assistant for TV user interactions.
Van Os, Marcel; Saddler, Harry J.; Napolitano, Lia T.; Russell, Jonathan H.; Lister, Patrick M.; Dasari, Rohit, Intelligent automated assistant for TV user interactions.
Gruber, Thomas Robert; Saddler, Harry Joseph; Cheyer, Adam John; Kittlaus, Dag; Brigham, Christopher Dean; Giuli, Richard Donald; Guzzoni, Didier Rene; Bastea-Forte, Marcello, Paraphrasing of user requests and results by automated digital assistant.
Naik, Devang K.; Gruber, Thomas R.; Weiner, Liam; Binder, Justin G.; Srisuwananukorn, Charles; Evermann, Gunnar; Williams, Shaun Eric; Chen, Hong; Napolitano, Lia T., System and method for user-specified pronunciation of words for speech synthesis and recognition.
Naik, Devang K.; Gruber, Thomas R.; Weiner, Liam; Binder, Justin G.; Srisuwananukorn, Charles; Evermann, Gunnar; Williams, Shaun Eric; Chen, Hong; Napolitano, Lia T., System and method for user-specified pronunciation of words for speech synthesis and recognition.
Gruber, Thomas Robert; Brigham, Christopher Dean; Keen, Daniel S.; Novick, Gregory; Phipps, Benjamin S., Using context information to facilitate processing of commands in a virtual assistant.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.