Computer-implemented system and method for transcribing verbal messages
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G10L-015/00
G06Q-010/06
G10L-015/26
H04M-003/533
출원번호
US-0074653
(2013-11-07)
등록번호
US-9418659
(2016-08-16)
발명자
/ 주소
Webb, Mike O.
출원인 / 주소
INTELLISIST, INC.
대리인 / 주소
Inouye, Patrick J. S.
인용정보
피인용 횟수 :
0인용 특허 :
76
초록▼
A computer-implemented system and method for transcribing verbal messages is provided. Verbal messages each comprising audio content are received. Automatically recognized text is generated for the audio content of at least one of the verbal messages. A turn-around processing time is applied to the
A computer-implemented system and method for transcribing verbal messages is provided. Verbal messages each comprising audio content are received. Automatically recognized text is generated for the audio content of at least one of the verbal messages. A turn-around processing time is applied to the verbal message. The automatically recognized text and verbal message are transferred to a human agent when an expected processing time of the verbal message satisfies the turn-around processing time. At least a portion of the automatically recognized text is replaced with manual transcription from the human agent. The automatically recognized text and manual transcription are provided as a text message.
대표청구항▼
1. A computer-implemented system for transcribing verbal messages, comprising: an inbound message processor to receive verbal messages each comprising audio content;a split and merge message processor to divide the verbal message into segments prior to generating automatically recognized text;a spee
1. A computer-implemented system for transcribing verbal messages, comprising: an inbound message processor to receive verbal messages each comprising audio content;a split and merge message processor to divide the verbal message into segments prior to generating automatically recognized text;a speech recognizer to generate the automatically recognized text for the audio content of at least one of the verbal messages and to obtain the automatically recognized text for each segment;a pattern matcher to determine whether two or more of the segments share a common format;a transmission module to provide the segments without a common format to a human agent and at least one other human agent;a time threshold module to apply a turn-around processing time to the verbal message;a transfer module to transfer the automatically recognized text and verbal message to the human agent when an expected processing time of the verbal message is below the turn-around processing time;an editing module to replace at least a portion of the automatically recognized text with manual transcription from the human agent;a post-processor to generate post-processed text from the automatically recognized text of the segments that share a common format, of the automatically recognized text of the segments without a common format that have not been replaced, and of the manual transcription from the human agent; anda transmission module to perform at least one of transmitting the post-processed text to a user via email, delivering the post-processed text to a user via short message service, and supplying the post-processed text to a user via an application program interface. 2. A system according to claim 1, further comprising: an agent selection module to select the human agent for sending the automatically recognized text and the audio content for the verbal message based on at least one of an agent rank, availability, and message content. 3. A system according to claim 2, wherein the rank comprises at least one of agent fatigue, performance, and speed. 4. A system according to claim 1, further comprising: a message transmission module to simultaneously send the verbal message to the speech recognition module for automatic speech recognition and to the human agent for manual transcription. 5. A system according to claim 1, wherein the manual transcription is performed by the human agent using at least one of a line mode that allows the human agent to focus on a portion of the verbal message, a word mode that includes shortcuts on a keyboard for editing or transcribing the verbal message, and a whole message mode that enables the human agent to edit or transcribe the entire verbal message. 6. A system according to claim 1, further comprising: a segment module to divide the verbal message into segments prior to generating the automatically recognized text;the speech recognition module to obtain the automatically recognized text for each segment;a confidence rating module to assign a confidence level to the automatically recognized text for each segment;a segment transmission module to transmit those segments with confidence levels below a confidence threshold to the human agent and at least one other human agent; anda text entry module to enter those segments of automatically recognized text with confidence levels that satisfy the confidence threshold into the text message. 7. A system according to claim 6, further comprising: a priority transmission module to first, send the segments with the lowest confidence levels to the human agent and later, to send the segments with higher confidence levels that do not satisfy the confidence threshold to the human agent. 8. A system according to claim 1, further comprising: a content determination module to determine a difficulty of processing each segment;a transmission module to assign those segments with the highest difficulty to the human agent and to subsequently assign those segments with lower difficulty to the human agent. 9. A computer-implemented method for transcribing verbal messages, comprising: receiving verbal messages each comprising audio content;generating automatically recognized text for the audio content of at least one of the verbal messages;dividing the verbal message into segments prior to generating the automatically recognized text;obtaining the automatically recognized text for each segment;determining whether two or more of the segments share a common format;providing the segments without a common format to a human agent and at least one other human agent; andapplying a turn-around processing time to the verbal message;transferring the automatically recognized text and the verbal message to the human agent when an expected processing time of the verbal message is below the turn-around processing time;replacing at least a portion of the automatically recognized text with manual transcription from the human agent;generating, via a post-processor, post-processed text from the automatically recognized text of the segments that share a common format, of the automatically recognized text of the segments without a common format that have not been replaced, and of the manual transcription from the human agent; andperforming at least one of transmitting the post-processed text to a user via email, delivering the post-processed text to a user via short message service, and supplying the post-processed text to a user via an application program interface. 10. A method according to claim 9, further comprising: selecting the human agent for sending the automatically recognized text and the audio content for the verbal message based on at least one of an agent rank, availability, and message content. 11. A method according to claim 10, wherein the rank comprises at least one of agent fatigue, performance, and speed. 12. A method according to claim 9, further comprising: simultaneously sending the verbal message to a speech recognition module for automatic speech recognition and to the human agent for manual transcription. 13. A method according to claim 9, wherein the manual transcription is performed by the human agent using at least one of a line mode that allows the human agent to focus on a portion of the verbal message, a word mode that includes shortcuts on a keyboard for editing or transcribing the verbal message, and a whole message mode that enables the human agent to edit or transcribe the entire verbal message. 14. A method according to claim 9, further comprising: dividing the verbal message into segments prior to generating the automatically recognized text;obtaining the automatically recognized text for each segment;assigning a confidence rating to the automatically recognized text for each segment;transmitting those segments with confidence levels below a confidence threshold to the human agent and at least one other human agent; andentering those segments of automatically recognized text with confidence levels that satisfy the confidence threshold into the text message. 15. A method according to claim 14, further comprising: first, sending the segments with the lowest confidence levels to the human agent; andlater, sending the segments with higher confidence levels that do not satisfy the confidence threshold to the human agent. 16. A method according to claim 9, further comprising: determining a difficulty of processing each segment;assigning those segments with the highest difficulty to the human agent; andsubsequently assigning those segments with lower difficulty to the human agent.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (76)
Roland Kuhn ; Jean-Claude Junqua, Adaptation system and method for E-commerce and V-commerce applications.
Syed S. Ali ; Joseph M. Cannon ; James A. Johanson ; Joseph A. Sopko, Apparatus and method for grouping and prioritizing voice messages for convenient playback.
Emerson William D. (Boulder CO) Hill Deborah J. (Denver CO) Loeb Karen C. (Englewood CO) Mizrahi Albert (Boulder CO) Schlegel Charles T. (Boulder CO) Scott Lowell C. (Old Bridge NJ), Integrated message service system.
James R. Lewis ; Kerry A. Ortega ; Ronald E. Van Buskirk ; Huifang Wang ; Amado Nassiff ; Barbara E. Ballard, Method and apparatus for improving speech command recognition accuracy using event-based constraints.
Bruckner Markus (Basel CH) Guanella Gustav (Zurich CH) Vouga Claude Andre (Baden CH), Method and apparatus for the secret transmission of speech signals.
Julia Skladman ; Robert J. Thornberry, Jr. ; Bruce A. Chatterley ; Alexander Siu-Kay Ng CA; Bruce L. Peterson, Method and system for interfacing systems unified messaging with legacy systems located behind corporate firewalls.
Grajski,Kamil, Method of and apparatus for improving productivity of human reviewers of automatically transcribed documents generated by media conversion systems.
Matsuura Yoshihiro (Funabashi OR JPX) Skinner Toby (Beaverton OR), Speaker independent speech recognition system and method using neural network and DTW matching technique.
Suzuki Matsumi (Ebina JA) Morino Tetsuro (Ebina JA) Yokota Shozo (Ebina JA), Speech recognition method and apparatus adapted to a plurality of different speakers.
Holmes,David William James, System and method for recognition of and automatic connection using spoken address information received in voice mails and live telephone conversations.
Arumainayagam Allen Theivendran (Malden MA) Penfield Robert Flagg (Maynard MA) Reppucci Stephen Gerard (Haverhill MA), Voice mail network and networking method.
Cheston ; III Frank C. ; Hatton Patricia V., Voice mail system for obtaining forwarding number information from directory assistance systems having speech recognition.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.