Text to speech conversion of text messages from mobile communication devices
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
H04L-012/66
H04M-011/00
G10L-013/08
출원번호
US-0628693
(2009-12-01)
등록번호
US-8345665
(2013-01-01)
우선권정보
IT-FI01A0199 (2001-10-22)
발명자
/ 주소
Vieri, Riccardo
Vieri, Flavio
출원인 / 주소
Apple Inc.
대리인 / 주소
Morgan, Lewis & Bockius LLP
인용정보
피인용 횟수 :
92인용 특허 :
177
초록▼
A method includes providing a user interface, at a mobile communication device, that includes a first area to receive text input and a second area to receive an identifier associated with an addressee device. The text input and the identifier are received via the user interface. A short message serv
A method includes providing a user interface, at a mobile communication device, that includes a first area to receive text input and a second area to receive an identifier associated with an addressee device. The text input and the identifier are received via the user interface. A short message service (SMS) message including the text input is transmitted to a Text to Speech (TTS) server for conversion into an audio message and for transmission of the audio message to the addressee device associated with the identifier. An acknowledge message transmitted from the TTS server permits the addressee device to allow delivery of the audio message or to decline delivery of the audio message. The TTS server transmits the audio message in response to the addressee device allowing delivery of the audio message. A confirmation message is received from the TTS server that indicates that a reply voice message has been received from the addressee device in response to the audio message.
대표청구항▼
1. A non-transitory computer-readable storage medium comprising instructions that, when executed by a processor, cause the processor to: provide a user interface at a mobile communication device, the user interface including a first area to receive text input and a second area to receive an identifi
1. A non-transitory computer-readable storage medium comprising instructions that, when executed by a processor, cause the processor to: provide a user interface at a mobile communication device, the user interface including a first area to receive text input and a second area to receive an identifier associated with an addressee device;receive the text input and the identifier via the user interface;transmit a short message service (SMS) message including the text input from the mobile communication device to a Text to Speech (TTS) server for conversion into an audio message and for transmission of the audio message to the addressee device associated with the identifier, wherein the TTS server transmits an acknowledge message that permits the addressee device to allow delivery of the audio message or to decline delivery of the audio message, and wherein the TTS server transmits the audio message in response to the addressee device allowing delivery of the audio message; andreceive a confirmation message from the TTS server, wherein the confirmation message indicates that a reply voice message has been received from the addressee device in response to the audio message. 2. The non-transitory computer-readable storage medium of claim 1, further comprising instructions that, when executed by the processor, cause the processor to transmit a request from the mobile communication device to the TTS server to receive the reply voice message. 3. The non-transitory computer-readable storage medium of claim 2, further comprising instructions that, when executed by the processor, cause the processor to play the reply voice message at the mobile communication device in response to receipt of the reply voice message from the TTS server. 4. The non-transitory computer-readable storage medium of claim 1, wherein the confirmation message includes an email message and wherein the reply voice message is attached to the email message. 5. The non-transitory computer-readable storage medium of claim 1, further comprising instructions that, when executed by the processor, cause the processor to: provide a repeat transmission option via the user interface in response to the addressee device not accepting or declining delivery of the audio message; andautomatically attempt one or more additional transmissions of the audio message to the addressee device in response to a selection of the repeat transmission option. 6. The non-transitory computer-readable storage medium of claim 5, wherein one or more additional transmissions are automatically attempted when the addressee device is busy, and wherein the one or more additional transmissions are automatically attempted when the addressee device is free but does not answer a call attempting delivery of the audio message. 7. The non-transitory computer-readable storage medium of claim 1, wherein the user interface further comprises a selectable input to specify a tone of voice associated with the audio message. 8. The non-transitory computer-readable storage medium of claim 1, wherein the identifier includes a telephone number, and wherein the second area is adapted to receive one or more telephone numbers. 9. The non-transitory computer-readable storage medium of claim 1, wherein the user interface further comprises a banner interface area configured to receive a selection of a pre-recorded communication, and wherein the selected pre-recorded communication is sent with the audio message. 10. The non-transitory computer-readable storage medium of claim 1, wherein the acknowledge message transmitted to the addressee device includes a vocal message requesting a telephone keypad response, wherein the telephone keypad response includes one of a first telephone keypad response to listen to the audio message and a second telephone keypad response to decline the audio message. 11. A method comprising: receiving, at a Text to Speech (TTS) server, a short message service (SMS) message and one or more destination addresses from a mobile communication device, the SMS message including text input received at the mobile communication device;converting the text input into an audio message;transmitting an acknowledge message from the TTS server to the one or more destination addresses via the Internet, wherein the acknowledge message permits a destination device to allow delivery of the audio message or to decline delivery of the audio message;transmitting the audio message from the TTS server to the one or more destination addresses via the Internet in response to the destination device accepting delivery of the audio message;receiving, at the TTS server, a reply voice message from the destination device in response to the audio message; andtransmitting a confirmation message from the TTS server to the mobile communication device in response to receiving the reply voice message, wherein the confirmation message indicates that the reply voice message has been received at the TTS server from the destination device. 12. The method of claim 11, further comprising: receiving a request from the mobile communication device for the reply voice message; andtransmitting the reply voice message from the TTS server to the mobile communication device in response to the request. 13. The method of claim 11, wherein the confirmation message includes an email message and wherein the reply voice message is attached to the email message. 14. The method of claim 11, wherein the SMS message further comprises a language identifier specifying a language to which the text input is to be translated, and wherein the method further comprises translating the text input from a first language to a second language before the text input is converted into the audio message. 15. The method of claim 11, wherein the SMS message received at the TTS server further includes information associated with a selection of a pre-recorded communication from a plurality of pre-recorded communications presented at the mobile communication device via a banner interface of a web site, wherein the selected pre-recorded communication is included in the audio message transmitted from the TTS server to the one or more destination addresses. 16. The method of claim 11, further comprising automatically attempting one or more additional transmissions of the audio message in response to the destination device not accepting or declining delivery of the audio message. 17. A method comprising: providing a user interface to a display device of a mobile communication device, the user interface including a first area to receive text input and a second area to receive an identifier associated with an addressee device;receiving the text input and the identifier via the user interface;transmitting a short message service (SMS) message including the text input from the mobile communication device to a Text to Speech (TTS) server for conversion into an audio message and for transmission of the audio message to the addressee device associated with the identifier, wherein the TTS server transmits an acknowledge message that permits the addressee device to allow delivery of the audio message or to decline delivery of the audio message, and wherein the TTS server transmits the audio message in response to the addressee device allowing delivery of the audio message; andreceiving, at the mobile communication device, an email message from the TTS server, the email message indicating that a reply voice message has been received at the TTS server from the addressee device in response to the audio message. 18. The method of claim 17, wherein the reply voice message is attached to the email message. 19. The method of claim 17, further comprising transmitting a request from the mobile communication device to the TTS server to receive the reply voice message. 20. The method of claim 19, wherein the reply voice message is received at the TTS server in a first language and wherein the TTS server converts the reply voice message from the first language to a second language before transmission to the mobile communication device.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (177)
Julia, Luc; Voutsas, Dimitris; Cheyer, Adam, Accessing network-based electronic information through scripted online interfaces using spoken input.
Conrad Donovan (19107 SE. 47th Pl. Issaquah WA 98027) Cosby Charles (10925 165th Pl. NE. Redmond WA 98052), Anticipated meaning natural language interface.
Alexander I. McAllister ; James McKinley, Apparatus for and method of disambiguation of directory listing searches utilizing multiple selectable secondary search keys.
Korall,Toby; Bregman,Zeev; Ben Moshe,Irit; Giladi,Miron, Automated database assistance using a telephone for a speech based or text based multimedia communication mode.
Hoffberg Steven Mark ; Hoffberg-Borghesani Linda Irene, Ergonomic man-machine interface incorporating adaptive pattern recognition based control system.
Cheyer, Adam J.; Martin, David L.; Moran, Douglas B.; Mark, William S., Extensible software-based architecture for communication and cooperation within and between communities of distributed agents and distributed objects.
Appelt, Douglas E.; Arnold, James Frederick; Bear, John S.; Hobbs, Jerry Robert; Israel, David J.; Kameyama, Megumi; Martin, David L.; Myers, Karen Louise; Ravichandran, Gopalan; Stickel, Mark Edward, Information retrieval by natural language querying.
Burton, David Kearney; Elam, Gayle Ayers; Becker, Simonie Justine Hodges; Loveland, Shawn Domenic, Integrated voice access to a variety of personal information services.
Allen, James F.; Chambers, Nathanael; Galescu, Lucian; Jung, Hyuckchul; Taysom, William, Interactive complex task teaching system that allows for natural language input, recognizes a user's intent, and automatically performs tasks in document object model (DOM) nodes.
Bennett, Ian M.; Babu, Bandi Ramesh; Morkhandikar, Kishor; Gururaj, Pallaki, Interactive speech based learning/training system formulating search queries based on natural language parsing of recognized user queries.
Lennig Matthew (Westmount CAX) Sharp Robert D. (Westmount CAX) Bielby Gregory J. (Pointe-Claire CAX), Method and apparatus for automation of directory assistance using speech recognition.
Healey, Jennifer A.; Hosn, Rafah A.; Maes, Stephane Herman; Raman, Thiruvilvama Lai V.; Tiwari, Alpana, Methods and systems for authoring of mixed-initiative multi-modal interactions and related browsing mechanisms.
Kennewick, Robert A.; Locke, David; Kennewick, Sr., Michael R.; Kennewick, Jr., Michael R.; Kennewick, Richard; Freeman, Tom; Elston, Stephen F., Mobile systems and methods for responding to natural language speech utterance.
Weider, Chris; Kennewick, Richard; Kennewick, Mike; Di Cristo, Philippe; Kennewick, Robert A.; Menaker, Samuel; Armstrong, Lynn Elise, Mobile systems and methods of supporting natural language human-machine interactions.
Hedin Erik B. (Lidingo SEX) Jonsson Gregor I. (Lidingo SEX) Olsson Lars E. (Kista SEX) Sanamrad Mohammad A. (Lidingo SEX) Westling Sven O. G. (Stockholm SEX), Natural language analyzing apparatus and method.
Suda Aruna Rohra,JPX ; Jeyachandran Suresh,JPX, Natural language processing system and method for processing input information by predicting kind thereof.
Gagnon, Jean; Roy, Philippe; Lagassey, Paul J., Speech interface system and method for control and interaction with applications on a computing system.
Mozer,Todd F.; Mozer,Forrest S., Speech recognition programming information retrieved from a remote source to a speech recognition system for performing a speech recognition method.
Della Pietra Stephen Andrew ; Epstein Mark Edward ; Franz Martin ; Koppelman Joshua David Sherer ; Roukos Salim Estephan ; Ward Robert Todd, Statistical natural language understanding using hidden clumpings.
Maes, Stephane Herman; Neti, Chalapathy Venkata, System and method for multi-modal focus detection, referential ambiguity resolution and mood classification using multi-modal input.
Fratkina, Raya; Anderson, Monica; Angel, Mark A.; Copperman, Max; Huffman, Scott B.; Kay, David; Stern, Robert, System and method for providing an intelligent multi-step dialog with a user.
Freeman, Tom; Kennewick, Mike, System and method for selecting and presenting advertisements based on natural language processing of voice-based input.
Kennewick, Robert A.; Locke, David; Kennewick, Sr., Michael R.; Kennewick, Jr., Michael R.; Kennewick, Richard; Freeman, Tom, System and method for user-specific speech recognition.
Halverson, Christine; Julia, Luc; Voutsas, Dimitris; Cheyer, Adam, System, method, and article of manufacture for agent-based navigation in a speech-based data navigation system.
Kennewick, Robert A.; Locke, David; Kennewick, Sr., Michael R.; Kennewick, Jr., Michael R.; Kennewick, Richard; Freeman, Tom, Systems and methods for processing natural language speech utterances with context-specific domain agents.
Kennewick, Robert A.; Locke, David; Kennewick, Sr., Michael R.; Kennewick, Jr., Michael R.; Kennewick, Richard; Freeman, Tom, Systems and methods for responding to natural language speech utterance.
Kennewick,Robert A.; Locke,David; Kennewick, Sr.,Michael R.; Kennewick, Jr.,Michael R.; Kennewick,Richard; Freeman,Tom, Systems and methods for responding to natural language speech utterance.
Kennewick,Robert A.; Locke,David; Kennewick, Sr.,Michael R.; Kennewick, Jr.,Michael R.; Kennewick,Richard; Freeman,Tom, Systems and methods for responding to natural language speech utterance.
Cheyer, Adam J.; Martin, David L., Unified messaging methods and systems for communication and cooperation among distributed agents in a computing environment.
Paek, Timothy S.; Chickering, David M., Using predictive user models for language modeling on a personal device with user behavior models based on statistical modeling.
Chad Burkey ; Sergei Lopatin ; Lucian Hughes, Voice interactive web-based agent system responsive to a user location for prioritizing and formatting information.
Surace Kevin J. ; White George M. ; Reeves Byron B. ; Nass Clifford I. ; Campbell Mark D. ; Albert Roy D. ; Giangola James P., Voice user interface with personality.
Surace Kevin J. ; White George M. ; Reeves Byron B. ; Nass Clifford I. ; Campbell Mark D. ; Albert Roy D. ; Giangola James P., Voice user interface with personality.
Gruber, Thomas R.; Sabatelli, Alessandro F.; Aybes, Alexandre A.; Pitschel, Donald W.; Voas, Edward D.; Anzures, Freddy A.; Marcos, Paul D., Actionable reminder entries.
Gruber, Thomas Robert; Sabatelli, Alessandro F.; Aybes, Alexandre A.; Pitschel, Donald W.; Voas, Edward D.; Anzures, Freddy A.; Marcos, Paul D., Active transport based notifications.
Carson, David A.; Keen, Daniel; Dibiase, Evan; Saddler, Harry J.; Iacono, Marco; Lemay, Stephen O.; Pitschel, Donald W.; Gruber, Thomas R., Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant.
Vadodaria, Vishal, Intelligent agent / personal virtual assistant with animated 3D persona, facial expressions, human gestures, body movements and mental states.
Gruber, Thomas Robert; Cheyer, Adam John; Kittlaus, Dag; Guzzoni, Didier Rene; Brigham, Christopher Dean; Giuli, Richard Donald; Bastea-Forte, Marcello; Saddler, Harry Joseph, Intelligent automated assistant.
Gruber, Thomas Robert; Cheyer, Adam John; Kittlaus, Dag; Guzzoni, Didier Rene; Brigham, Christopher Dean; Giuli, Richard Donald; Bastea-Forte, Marcello; Saddler, Harry Joseph, Intelligent automated assistant.
Os, Marcel Van; Saddler, Harry J.; Napolitano, Lia T.; Russell, Jonathan H.; Lister, Patrick M.; Dasari, Rohit, Intelligent automated assistant for TV user interactions.
Van Os, Marcel; Saddler, Harry J.; Napolitano, Lia T.; Russell, Jonathan H.; Lister, Patrick M.; Dasari, Rohit, Intelligent automated assistant for TV user interactions.
Gruber, Thomas Robert; Saddler, Harry Joseph; Cheyer, Adam John; Kittlaus, Dag; Brigham, Christopher Dean; Giuli, Richard Donald; Guzzoni, Didier Rene; Bastea-Forte, Marcello, Paraphrasing of user requests and results by automated digital assistant.
Naik, Devang K.; Gruber, Thomas R.; Weiner, Liam; Binder, Justin G.; Srisuwananukorn, Charles; Evermann, Gunnar; Williams, Shaun Eric; Chen, Hong; Napolitano, Lia T., System and method for user-specified pronunciation of words for speech synthesis and recognition.
Naik, Devang K.; Gruber, Thomas R.; Weiner, Liam; Binder, Justin G.; Srisuwananukorn, Charles; Evermann, Gunnar; Williams, Shaun Eric; Chen, Hong; Napolitano, Lia T., System and method for user-specified pronunciation of words for speech synthesis and recognition.
Vieri, Riccardo, System to generate and set up an advertising campaign based on the insertion of advertising messages within an exchange of messages, and method to operate said system.
Gruber, Thomas Robert; Brigham, Christopher Dean; Keen, Daniel S.; Novick, Gregory; Phipps, Benjamin S., Using context information to facilitate processing of commands in a virtual assistant.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.