System and method for efficiently transcribing verbal messages to text
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G10L-015/00
G10L-021/00
H04M-001/64
H04M-011/00
G06F-019/00
출원번호
US-0568065
(2012-08-06)
등록번호
US-8583433
(2013-11-12)
발명자
/ 주소
Webb, Mike O.
Peterson, Bruce J.
Kaseda, Janet S.
출원인 / 주소
Intellisist, Inc.
대리인 / 주소
Inouye, Patrick J.S.
인용정보
피인용 횟수 :
2인용 특허 :
75
초록▼
A system and method for efficiently transcribing verbal messages to text is provided. Verbal messages are received and at least one of the verbal messages is divided into segments. Automatically recognized text is determined for each of the segments by performing speech recognition and a confidence
A system and method for efficiently transcribing verbal messages to text is provided. Verbal messages are received and at least one of the verbal messages is divided into segments. Automatically recognized text is determined for each of the segments by performing speech recognition and a confidence rating is assigned to the automatically recognized text for each segment. A threshold is applied to the confidence ratings and those segments with confidence ratings that fall below the threshold are identified. The segments that fall below the threshold are assigned to one or more human agents starting with those segments that have the lowest confidence ratings. Transcription from the human agents is received for the segments assigned to that agent. The transcription is assembled with the automatically recognized text of the segments not assigned to the human agents as a text message for the at least one verbal message.
대표청구항▼
1. A system for transcribing verbal messages into text, comprising: verbal messages; anda processor to execute the following modules, comprising:a segment module to divide one such verbal message into segments;a speech recognizer module to determine automatically recognized text for each of the segm
1. A system for transcribing verbal messages into text, comprising: verbal messages; anda processor to execute the following modules, comprising:a segment module to divide one such verbal message into segments;a speech recognizer module to determine automatically recognized text for each of the segments and a confidence rating assigned to the automatically recognized text for that segment, wherein the confidence rating comprises a probability that the automatically recognized text is accurate;a threshold module to apply a threshold to the confidence ratings and identifying those segments with confidence ratings that fall below the threshold;an assignment module to provide the segments that fall below the threshold to a workbench partial message queue for assigning to one or more human agents starting with those segments that have the lowest confidence ratings, to withhold the segments that are above the threshold from the workbench partial message queue and to automatically output the withheld segments for assembly into a text message for the verbal message;a receipt module to receive transcription from the human agents for the segments assigned to that human agent; andan assembly module to assemble the received transcription with the automatically recognized text of the withheld segments in the text message. 2. A system according to claim 1, further comprising: an identification module to identify a rise in speech recognition performance; andthe assignment module to assign to the human agents only those segments that have the lowest confidence ratings and retaining the automatically recognized text of the remaining segments. 3. A system according to claim 1, further comprising: a factor assignment module to assign factors to each human agent comprising at least one of quality, fatigue, and performance factors; andthe assignment module to assign the segments that fall below the threshold to the human agents based on the factors. 4. A system according to claim 1, further comprising: a further assignment module to assign the verbal message to one or more processors that perform the speech recognition based on assignment rules comprising one or more of message content, message type, and priority level of the message. 5. A system according to claim 1, further comprising: a format identification module to identify common formats within the verbal message;the speech recognizer to determine automatically recognized text for the common formats; anda text assembly module to assemble the automatically recognized text of the common formats with the transcription and automatically recognized text of the withheld segments. 6. A system according to claim 5, wherein the common formats correspond to frequently used phrases and sentences. 7. A system according to claim 1, further comprising: a segment determination module to determine the segments of the verbal message based on one or more of a point in the message where silence is present and after a specified duration of time. 8. A system according to claim 1, further comprising: a highlight module to highlight the segments of the verbal message to be transcribed by the human agents. 9. A system according to claim 1, wherein the segments that fall below the threshold are assigned to the human agents based on at least one of message rank, agent availability, and message content. 10. A system according to claim 1, wherein the transcription by the human agents comprises at least one of editing the automatically recognized text and manual transcription of the segment. 11. A method for transcribing verbal messages into text, comprising the steps of: receiving verbal messages;dividing one such verbal message into segments;determining, via a processor, automatically recognized text for each of the segments by performing speech recognition and a confidence rating assigned to the automatically recognized text for that segment, wherein the confidence rating comprises a probability that the automatically recognized text is accurate;applying a threshold to the confidence ratings and identifying those segments with confidence ratings that fall below the threshold;providing the segments that fall below the threshold to a workbench partial message queue for assigning to one or more human agents starting with those segments that have the lowest confidence ratings;withholding the segments that are above the threshold from the workbench partial message queue and automatically outputting the withheld segments for assembly into a text message for the verbal message;receiving transcription from the human agents for the segments assigned to that human agent; andassembling the received transcription with the automatically recognized text of the withheld segments in the text message. 12. A method according to claim 11, further comprising: identifying a rise in speech recognition performance; andassigning to the human agents only those segments that have the lowest confidence ratings and retaining the automatically recognized text of the remaining segments. 13. A method according to claim 11, further comprising: assigning factors to each human agent comprising at least one of quality, fatigue, and performance factors; andassigning the segments that fall below the threshold to the human agents based on the factors. 14. A method according to claim 11, further comprising: assigning the verbal message to one or more processors that perform the speech recognition based on assignment rules comprising one or more of message content, message type, and priority level of the message. 15. A method according to claim 11, further comprising: identifying common formats within the verbal message;determining automatically recognized text for the common formats; andassembling the automatically recognized text of the common formats with the transcription and automatically recognized text of the withheld segments. 16. A method according to claim 15, wherein the common formats correspond to frequently used phrases and sentences. 17. A method according to claim 11, further comprising: determining the segments of the at least one verbal message based on one or more of a point in the message where silence is present and after a specified duration of time. 18. A method according to claim 11, further comprising: highlighting the segments of the verbal message to be transcribed by the human agents. 19. A s method according to claim 11, wherein the segments that fall below the threshold are assigned to the human agents based on at least one of message rank, agent availability, and message content. 20. A method according to claim 11, wherein the transcription by the human agents comprises at least one of editing the automatically recognized text and manual transcription of the segment.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (75)
Roland Kuhn ; Jean-Claude Junqua, Adaptation system and method for E-commerce and V-commerce applications.
Syed S. Ali ; Joseph M. Cannon ; James A. Johanson ; Joseph A. Sopko, Apparatus and method for grouping and prioritizing voice messages for convenient playback.
Deborah W. Brown ; Randy G. Goldberg ; Richard R. Rosinski ; William R. Wetzel, Distributed recognition system having multiple prompt-specific and response-specific speech recognizers.
Emerson William D. (Boulder CO) Hill Deborah J. (Denver CO) Loeb Karen C. (Englewood CO) Mizrahi Albert (Boulder CO) Schlegel Charles T. (Boulder CO) Scott Lowell C. (Old Bridge NJ), Integrated message service system.
James R. Lewis ; Kerry A. Ortega ; Ronald E. Van Buskirk ; Huifang Wang ; Amado Nassiff ; Barbara E. Ballard, Method and apparatus for improving speech command recognition accuracy using event-based constraints.
Bruckner Markus (Basel CH) Guanella Gustav (Zurich CH) Vouga Claude Andre (Baden CH), Method and apparatus for the secret transmission of speech signals.
Julia Skladman ; Robert J. Thornberry, Jr. ; Bruce A. Chatterley ; Alexander Siu-Kay Ng CA; Bruce L. Peterson, Method and system for interfacing systems unified messaging with legacy systems located behind corporate firewalls.
Grajski,Kamil, Method of and apparatus for improving productivity of human reviewers of automatically transcribed documents generated by media conversion systems.
Gillick,Laurence S.; Roth,Robert; Manganaro,Linda; Peskin,Barbara R.; Petty,David C.; Rao,Ashwin, Methods and systems of routing utterances based on confidence estimates.
Matsuura Yoshihiro (Funabashi OR JPX) Skinner Toby (Beaverton OR), Speaker independent speech recognition system and method using neural network and DTW matching technique.
Suzuki Matsumi (Ebina JA) Morino Tetsuro (Ebina JA) Yokota Shozo (Ebina JA), Speech recognition method and apparatus adapted to a plurality of different speakers.
Holmes,David William James, System and method for recognition of and automatic connection using spoken address information received in voice mails and live telephone conversations.
Arumainayagam Allen Theivendran (Malden MA) Penfield Robert Flagg (Maynard MA) Reppucci Stephen Gerard (Haverhill MA), Voice mail network and networking method.
Cheston ; III Frank C. ; Hatton Patricia V., Voice mail system for obtaining forwarding number information from directory assistance systems having speech recognition.
Zimmerman, Roger S.; Antunes, Christopher S.; Barron, Jeremy E.; Tomasulo, Sharon Lee; Fiore, Claudia W.; Johnson, Christopher E.; Khesin, Anatole; Miller, Joshua, Systems and methods for automated transcription training.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.