Techniques are described for automatically measuring fluency of a patient's speech based on prosodic characteristics thereof. The prosodic characteristics may include statistics regarding silent pauses, filled pauses, repetitions, or fundamental frequency of the patient's speech. The statistics may
Techniques are described for automatically measuring fluency of a patient's speech based on prosodic characteristics thereof. The prosodic characteristics may include statistics regarding silent pauses, filled pauses, repetitions, or fundamental frequency of the patient's speech. The statistics may include a count, average number of occurrences, duration, average duration, frequency of occurrence, standard deviation, or other statistics. In one embodiment, a method includes receiving an audio sample that includes speech of a patient, analyzing the audio sample to identify prosodic characteristics of the speech of the patient, and automatically measuring fluency of the speech of the patient based on the prosodic characteristics. These techniques may present several advantages, such as objectively measuring fluency of a patient's speech without requiring a manual transcription or other manual intervention in the analysis process.
대표청구항▼
1. A system comprising: a computer-readable medium having stored thereon model data that includes at least one model of a characteristic of a language spoken by a patient; anda speech analyzer configured to analyze an audio sample including speech of the patient, identify phonemes from the speech of
1. A system comprising: a computer-readable medium having stored thereon model data that includes at least one model of a characteristic of a language spoken by a patient; anda speech analyzer configured to analyze an audio sample including speech of the patient, identify phonemes from the speech of the patient, analyze the identified phonemes to identify prosodic characteristics of the speech of the patient using the model, and automatically measure fluency of the speech of the patient based on the prosodic characteristics. 2. The system of claim 1, wherein the speech analyzer comprises an analysis module configured to analyze speech of the patient of the audio sample to determine, from the speech of the patient of the audio sample, at least one of a count of total number of pauses, a count of a number of silent pauses, a count of a number of filled pauses, an average of the total number of pauses, a standard deviation in the total number of pauses, a count of a number of repetitions, and fluctuations in a fundamental frequency. 3. The system of claim 1, wherein the model data includes an acoustic model that models phoneme boundaries, and wherein the speech analyzer references the acoustic model to identify locations of phonemes, silences, and white noise. 4. The system of claim 3, wherein the model data includes a phoneme model that models phonemes, and wherein the speech analyzer references the phoneme model to identify particular phonemes. 5. The system of claim 1, wherein the model data includes a phoneme model that models phonemes, and wherein the speech analyzer references the phoneme model to identify particular phonemes. 6. The system of claim 1, wherein the speech analyzer comprises a recognition engine to identify pauses of the speech of the patient of the audio sample. 7. The system of claim 6, wherein the recognition engine is configured to produce a list of tagged symbols to identify phonemes, silent pauses, filled pauses, and a duration of each pause. 8. The system of claim 1, wherein the speech analyzer comprises a repetition detector configured to identify repetitions of the speech of the patient of the audio sample. 9. The system of claim 1, further comprising a spectrogram calculator to calculate a spectrogram from a portion of the audio sample. 10. A device comprising: a computer-readable medium having stored thereon model data that includes at least one model of a characteristic of a language spoken by a patient; andone or more processors configured to analyze an audio sample including speech of the patient, identify phonemes from the speech of the patient, analyze the identified phonemes to identify prosodic characteristics of the speech of the patient using the model, and automatically measure fluency of the speech of the patient based on the prosodic characteristics. 11. The device of claim 10, wherein the one or more processors are configured to analyze speech of the patient of the audio sample to determine, from the speech of the patient of the audio sample, at least one of a count of total number of pauses, a count of a number of silent pauses, a count of a number of filled pauses, an average of the total number of pauses, a standard deviation in the total number of pauses, a count of a number of repetitions, and fluctuations in a fundamental frequency. 12. The device of claim 10, wherein the one or more processors are further configured to calculate a spectrogram from a portion of the audio sample. 13. The device of claim 10, wherein the model data includes an acoustic model that models phoneme boundaries, and wherein the one or more processors reference the acoustic model to identify locations of phonemes, silences, and white noise. 14. The device of claim 13, wherein the model data includes a phoneme model that models phonemes, and wherein the speech analyzer references the phoneme model to identify particular phonemes. 15. The device of claim 10, wherein the model data includes a phoneme model that models phonemes, and wherein the speech analyzer references the phoneme model to identify particular phonemes. 16. The device of claim 10, wherein the one or more processors are further configured to identify pauses of the speech of the patient of the audio sample. 17. The device of claim 16, wherein the one or more processors are configured to produce a list of tagged symbols to identify phonemes, silent pauses, filled pauses, and a duration of each pause. 18. The device of claim 10, wherein the one or more processors are configured to identify repetitions of the speech of the patient of the audio sample.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (199)
Werbos Paul J., 3-brain architecture for an intelligent decision and control system.
Jenkins, William M.; Peterson, Bret E.; Miller, Steven; Merzenich, Michael M.; Tallal, Paula, Adaptive motivation for computer-assisted training system.
William M. Jenkins ; Bret E. Peterson ; Steven Miller ; Michael M. Merzenich ; Paula Tallal, Adaptive motivation for computer-assisted training system.
Silverman Kim Ernest Alexander (Danbury CT), Automated voice synthesis employing enhanced prosodic treatment of text, spelling of text and rate of annunciation.
Widrow,Bernard; Aragon,Juan Carlos; Percival,Brian Mitchell, Cognitive memory and auto-associative neural network based search engine for computer and network located images and photographs.
Kaye Jonathan (London GBX) Williams Geoffrey (Chelmsford Essex GBX), Computer system and computer-implemented process for phonology-based automatic speech recognition.
Wasowicz Janet Marie ; Maerlender Art Carl, Diagnostic system and method for phonological awareness, phonological processing, and reading skill testing.
Cohen, Charles J.; Beach, Glenn; Cavell, Brook; Foulk, Gene; Jacobus, Charles J.; Obermark, Jay; Paul, George, Gesture-controlled interfaces for self-service machines and other applications.
Cohen, Charles J.; Beach, Glenn; Cavell, Brook; Foulk, Gene; Jacobus, Charles J.; Obermark, Jay; Paul, George, Gesture-controlled interfaces for self-service machines and other applications.
Pedersen Jan O. (Palo Alto CA) Halvorsen Per-Kristian (Los Altos CA) Cutting Douglass R. (Menlo Park CA) Tukey John W. (Princeton NJ) Bier Eric A. (Los Altos CA) Bobrow Daniel G. (Palo Alto CA), Iterative technique for phrase query formation and an information retrieval system employing same.
Connolly,John F.; Marchand,Yannick; D'Arcy,Ryan C. N., Linking neurophysiological and neuropsychological measures for cognitive function assessment in a patient.
Lu Daozheng (Buffalo Grove IL) Kiewit David A. (Palm Harbor FL) Zhang Jia (Mundelein IL), Market research method and system for collecting retail store and shopper market research data.
Basu, Sankar; de Cuetos, Philippe Christian; Maes, Stephane Herman; Neti, Chalapathy Venkata; Senior, Andrew William, Method and apparatus for audio-visual speech detection and recognition.
Bernstein Jared C., Method and apparatus for estimating fitness to perform tasks based on linguistic and other aspects of spoken responses in constrained interactions.
Martinez Tony R. ; Moncur R. Brian ; Shepherd D. Lynn ; Parr Randall J. ; Wilson D. Randall ; Hansen Carl Hal, Method and apparatus for signal classification using a multilayer network.
Jenkins William M. ; Merzenich Michael M. ; Miller Steven Lamont ; Peterson Bret E. ; Tallal Paula, Method and apparatus for training of cognitive and memory systems in humans.
Jenkins, William M.; Merzenich, Michael M.; Miller, Steven Lamont; Peterson, Bret E.; Tallal, Paula, Method and apparatus for training of cognitive and memory systems in humans.
Jenkins William M. ; Merzenich Michael M. ; Miller Steven Lamont ; Peterson Bret E. ; Tallal Paula, Method and apparatus for training of sensory and perceptual systems in LLI subjects.
Jenkins William M. ; Merzenich Michael M. ; Miller Steven Lamont ; Peterson Bret E. ; Tallal Paula, Method and apparatus for training of sensory and perceptual systems in LLI systems.
Merzenich Michael Mathias ; Jenkins William Michael ; Schreiner Christoph E. ; Tallal Paula Anne ; Miller Steven Lamont, Method and device for enhancing the recognition of speech among speech-impaired individuals.
Paula Anne Tallal ; Michael Mathias Merzenich ; William Michael Jenkins ; Steven Lamont Miller ; Christoph E. Schreiner, Method and device for enhancing the recognition of speech among speech-impaired individuals.
Paula Anne Tallal ; Michael Mathias Merzenich ; William Michael Jenkins ; Steven Lamont Miller ; Christoph E. Schreiner, Method and device for enhancing the recognition of speech among speech-impaired individuals.
Paula Anne Tallal ; Michael Mathias Merzenich ; William Michael Jenkins ; Steven Lamont Miller ; Christoph E. Schreiner, Method and device for enhancing the recognition of speech among speech-impaired individuals.
Paula Anne Tallal ; Michael Mathias Merzenich ; William Michael Jenkins ; Steven Lamont Miller ; Christoph E. Schreiner, Method and device for enhancing the recognition of speech among speech-impaired individuals.
Paula Anne Tallal ; Michael Mathias Merzenich ; William Michael Jenkins ; Steven Lamont Miller ; Christoph E. Schreiner, Method and device for enhancing the recognition of speech among speech-impaired individuals.
Paula Anne Tallal ; Michael Mathias Merzenich ; William Michael Jenkins ; Steven Lamont Miller ; Christoph E. Schreiner, Method and device for enhancing the recognition of speech among speech-impaired individuals.
Paula Anne Tallal ; Michael Mathias Merzenich ; William Michael Jenkins ; Steven Lamont Miller ; Christoph E. Schreiner, Method and device for enhancing the recognition of speech among speech-impaired individuals.
Tallal Paula Anne ; Merzenich Mathias ; Jenkins William Michael ; Miller Steven Lamont ; Schreiner Christopher E., Method and device for enhancing the recognition of speech among speech-impaired individuals.
Tallal Paula Anne ; Merzenich Michael Mathias ; Jenkins William Michael ; Miller Steven Lamont ; Schreiner Christoph E., Method and device for enhancing the recognition of speech among speech-impaired individuals.
Tallal Paula Anne ; Miller Steven Lamont ; Merzenich Mathias ; Jenkins William Michael ; Schreiner Christoph E., Method and device for enhancing the recognition of speech among speech-impaired individuals.
Fedorovskaya,Elena A.; Endrikhovski,Serguei; Matraszek,Tomasz A.; Parulski,Kenneth A.; Mir,Jose M., Method for providing affective information in an imaging system.
Maples Creve ; Peterson Craig A., Method of using multiple perceptual channels to increase user absorption of an N-dimensional presentation environment.
Basu, Sankar; de Cuetos, Philippe Christian; Maes, Stephane Herman; Neti, Chalapathy Venkata; Senior, Andrew William, Methods and apparatus for audio-visual speech detection and recognition.
Loatman Robert B. (Vienna VA) Post Stephen D. (McLean VA) Yang Chih-King (Rockville MD) Hermansen John C. (Catharpin VA), Natural language understanding system.
Meyer Bernd J. (Athens GA) Sellers Jeffrey P. (Suwanee GA) Thomsen Jan U. (Fredricksberg DKX), Neural network system and methods for analysis of organic materials and structures using spectral data.
Jones Joseph M. (157 Monahan Dr. Fort Walton Beach FL 32549), Oral sound analysis method and apparatus for determining voice, speech and perceptual styles.
Cooper,Richard G., Organizing structured and unstructured database columns using corpus analysis and context modeling to extract knowledge from linguistic phrases in the database.
Giraldo,Carlos M.; Helal,Abdelsalam A.; Kaddoura,Youssef Omar; Mann,William C., Remote surveillance and assisted care using a mobile communication device.
Freiling Michael J. (Portland OR) Alexander James H. (Beaverton OR) Phillips Brian (Aloha OR) Messick Steven L. (Portland OR), Rule acquisition for expert systems.
Maes, Stephane Herman; Neti, Chalapathy Venkata, System and method for multi-modal focus detection, referential ambiguity resolution and mood classification using multi-modal input.
Fratkina,Raya; Anderson,Monica; Angel,Mark; Copperman,Max; Huffman,Scott B.; Kay,David B.; Stern,Robert; Rudy,Jeffrey, System and method for providing an intelligent multi-step dialog with a user.
Wheatley Barbara J. (2009 Pleasant Valley Dr. Plano TX 75023) Hemphill Charles T. (112 Greenwood Ct. Coppell TX 75019) Fisher Thomas D. (19251 Preston Rd. ; #723 Dallas TX 75252) Doddington George R., System and method for time aligning speech.
Ostermann, Joern; Civanlar, Mehmet Reha; Graf, Hans Peter; Isaacson, Thomas M., System and method of controlling sound in a multi-media communication application.
Ostermann,Joern; Civanlar,Mehmet Reha; del Valle,Ana Cristina Andres; Haffner,Patrick, System and method of customizing animated entities for use in a multi-media communication application.
Pertrushin Valery A., System method and article of manufacture for detecting emotion in voice signals by utilizing statistics for voice signal parameters.
Pertrushin Valery A., System, method and article of manufacture for detecting emotion in voice signals through analysis of a plurality of voice signal parameters.
Macleod,John B., Telephone call handling center where operators utilize synthesized voices generated or modified to exhibit or omit prescribed speech characteristics.
Nightingale Charles (Felixstowe GB2) Wyard Peter J. (Woodbridge GB2), Trainable neural network having short-term memory for altering input layer topology during training.
Jenkins William M. ; Peterson Bret E. ; Miller Steven ; Merzenich Michael M. ; Tallal Paula, Uniform motivation for multiple computer-assisted training systems.
Jenkins, William M.; Peterson, Bret E.; Miller, Steven; Merzenich, Michael M.; Tallal, Paula, Uniform motivation for multiple computer-assisted training systems.
Jenkins, William M.; Peterson, Bret E.; Miller, Steven; Merzenich, Michael M.; Tallal, Paula, Uniform motivation for multiple computer-assisted training systems.
Gilai Atzmon,ILX ; Resnekov Hezi,ILX, Word-containing database accessing system for responding to ambiguous queries, including a dictionary of database words, a dictionary searcher and a database searcher.
Gilai Atzmon,ILX ; Resnekov Hezi,ILX, Word-containing database accessing system for responding to ambiguous queries, including a dictionary of database words, a dictionary searcher and a database searcher.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.