[특허]Graphical user interface for determining speech recognition accuracy

Graphical user interface for determining speech recognition accuracy 원문보기

IPC분류정보
국가/구분	United States(US) Patent 등록
국제특허분류(IPC7판)	G10L-021/06 G10L-021/00 G10L-015/26 G10L-015/00
출원번호	US-0196017 (2002-07-16)
등록번호	US-7260534 (2007-08-21)
발명자 / 주소	Gandhi,Shailesh B. Jaiswal,Peeyush Moore,Victor S. Toon,Gregory L.
출원인 / 주소	International Business Machines Corporation
대리인 / 주소	Akerman Senterfitt
인용정보	피인용 횟수 : 55 인용 특허 : 9

초록 ▼

A solution for determining the accuracy of a speech recognition system. A first graphical user interface (GUI) is provided for selecting a transaction log. The transaction log has at least one entry that specifies a speech recognition text result. A second GUI is also provided for selecting at least one audio segment corresponding to the entry. The second GUI includes an activatable icon for initiating transcription of the audio segment through a reference speech recognition engine to generate a second text result.

대표청구항 ▼

What is claimed is: 1. A method of determining the accuracy of a speech recognition system comprising: providing a first graphical user interface (GUI) for selecting a transaction log wherein said transaction log has at least one entry, said entry specifying a speech recognition text result and a plurality of corresponding attributes comprising a first attribute specifying a sound processing filter associated with said audio segment, a second attribute specifying a configuration of a speech recognition system generating said speech recognition text result, a third attribute specifying an acoustic model on which said speech recognition text result is based, and a fourth attribute specifying a linguistic model on which said speech recognition text result is based; and providing a second GUI for selecting at least one audio segment corresponding to said entry; wherein said second GUI comprises an activatable icon for initiating transcription of said audio segment through a reference speech recognition engine to generate a second text result. 2. The method of claim 1, wherein said second GUI comprises an input portion for receiving user corrected transcribed text. 3. The method of claim 1, further comprising: providing a third GUI, wherein said third GUI comprises one or more controls to associate said audio segment with at least one condition. 4. The method of claim 3, wherein said condition specifies at least a person having generated said audio segment, a gender of said person, and ambient sounds influencing a recognizability of said audio segment. 5. The method of claim 4, wherein said ambient sounds are at least one of weather generated sound and background noise. 6. The method of claim 3, wherein said condition is stored in said transaction log and associated with said entry. 7. The method of claim 1 wherein, said second GUI is automatically presented upon a transaction log being selected. 8. The method of claim 1, further comprising the step of providing a fourth GUI, wherein said fourth GUI comprises one or more indicators to show an operational status of a software application used in determining the accuracy of a speech recognition system. 9. The method of claim 1, further comprising providing a fifth GUI displaying said text result and said second text result. 10. The method of claim 9, wherein said fifth GUI further displays manually entered text corresponding to said audio segment. 11. The method of claim 9, wherein said fifth GUI further displays data. 12. The method of claim 11 wherein said data is statistical data. 13. A machine-readable storage, having stored thereon a computer program having a plurality of code sections executable by a machine for causing the machine to perform the steps of: providing a first graphical user interface (GUI) for selecting a transaction log wherein said transaction log has at least one entry, said entry specifying a speech recognition text result and a plurality of corresponding attributes comprising a first attribute specifying a sound processing filter associated with said audio segment, a second attribute specifying a configuration of a speech recognition system generating said speech recognition text result, a third attribute specifying an acoustic model on which said speech recognition text result is based, and a fourth attribute specifying a linguistic model on which said speech recognition text result is based; and providing a second GUI for selecting at least one audio segment corresponding to said entry; wherein said second GUI comprises an activatable icon for initiating transcription of said audio segment through a reference speech recognition engine to generate a second text result. 14. The machine readable storage of claim 13, wherein said second GUI comprises an input portion for receiving user corrected transcribed text. 15. The machine readable storage of claim 13, further comprising: providing a third GUI, wherein said third GUI comprises one or more controls to associate said audio segment with at least one condition. 16. The machine readable storage of claim 15, wherein said condition specifies at least a person having generated said audio segment, a gender of said person, and ambient sounds influencing a recognizability of said audio segment. 17. The machine readable storage of claim 16, wherein said ambient sounds are at least one of weather generated sound and background noise. 18. The machine readable storage of claim 15, wherein said condition is stored in said transaction log and associated with said entry. 19. The machine readable storage of claim 13, said second GUI is automatically presented upon a transaction log being selected. 20. The machine readable storage of claim 13, further comprising the step of providing a fourth GUI, wherein said fourth GUI comprises one or more indicators to show an operational status of a software application used in determining the accuracy of a speech recognition system. 21. The machine readable storage of claim 13, further comprising providing a fifth GUI displaying said text result and said second text result. 22. The machine readable storage of claim 21, wherein said fifth GUI further displays manually entered text corresponding to said audio segment. 23. The machine readable storage of claim 21, wherein said fifth GUI further displays data. 24. The machine readable storage of claim 23, wherein said data is statistical data.

이 특허에 인용된 특허 (9)

Doyle,Sean, Automatically improving a voice recognition system.
상세보기
Young Jonathan Hood ; Parmenter David Wilsberg ; Roth Robert ; Dubach Joev ; Gadbois Gregory J. ; Van Even Stijn, Error correction in speech recognition.
상세보기
Brandow Ronald Lloyd ; Strzalkowski Tomasz, Improving speech recognition through text-based linguistic post-processing.
상세보기
Lewis James R. ; Ballard Barbara, Method and system for automatically determining whether to update a language model based upon user amendments to dictated text.
상세보기
Amado Nassiff ; Kerry A. Ortega, Smart correction of dictated speech.
상세보기
Baker James K., Speech recognition using multiple recognizers (selectively) applied to the same input sample.
상세보기
Bijl David,GBX ; Hyde-Thomson Henry,GBX, Speech to text conversion.
상세보기
Kahn Jonathan ; Flynn Thomas P. ; Qin Charles ; Tippe Robert J., System and method for automating transcription services.
상세보기
Cole Alan G. (Katonah NY) Riekert Robert H. (Ossining NY), Text editor for speech input.
상세보기

이 특허를 인용한 특허 (55)

Hoffberg, Steven M.; Hoffberg-Borghesani, Linda I., Adaptive pattern recognition based controller apparatus and method and human-interface therefore.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Adding information or functionality to a rendered document via association with an electronic counterpart.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Adding value to a rendered document.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Aggregate analysis of text captures performed by multiple users from rendered documents.
상세보기
Boes, Kirstin; Riggs, Curtis; Ford, Jon, Apparatus and method for queuing jobs in a distributed dictation /transcription system.
상세보기
Boes, Kirstin; Riggs, Curtis; Ford, Jon, Apparatus and method for queuing jobs in a distributed dictation/transcription system.
상세보기
King, Martin; Grover, Dale; Kushler, Clifford; Stafford-Fraser, James; Mannby, Claes-Fredrik, Archive of text captures from rendered documents.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Association of a portable scanner with input/output and storage devices.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Association of a portable scanner with input/output and storage devices.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Automatic modification of web pages.
상세보기
King, Martin T.; Stephens, Redwood; Mannby, Claes-Fredrik; Peterson, Jesse; Sanvitale, Mark; Smith, Michael J., Automatically capturing information, such as capturing information using a document-aware device.
상세보기
King, Martin T.; Stephens, Redwood; Mannby, Claes-Fredrik; Peterson, Jesse; Sanvitale, Mark; Smith, Michael J.; Daley-Watson, Christopher J., Automatically providing content associated with captured information, such as information captured in real-time.
상세보기
King, Martin Towle; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Quentin, Capturing text from rendered documents using supplement information.
상세보기
Terrell, II, James Richard; White, Marc; Jablokov, Igor Roditis, Continuous speech transcription performance indication.
상세보기
King, Martin Towle; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Quentin, Data capture from rendered documents using handheld device.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Determining actions involving captured information and electronic content associated with rendered documents.
상세보기
Beach, Richard; Butler, Christopher; Ford, Jon; Marquette, Brian; Omland, Christopher, Distributed dictation/transcription system.
상세보기
Beach, Richard; Butler, Christopher; Ford, Jon; Marquette, Brian; Omland, Christopher, Distributed dictation/transcription system.
상세보기
Beach, Richard; Butler, Christopher; Ford, Jon; Marquette, Brian; Omland, Christopher, Distributed dictation/transcription system.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Document enhancement system and method.
상세보기
Jablokov, Victor Roman; Jablokov, Igor Roditis; Terrell, II, James Richard; Paden, Scott Edward, Facilitating presentation by mobile device of additional content for a word or phrase upon utterance thereof.
상세보기
Jablokov, Victor Roditis; Jablokov, Igor Roditis; Terrell, II, James Richard; White, Marc; Paden, Scott Edward, Facilitating presentation of ads relating to words of a message.
상세보기
White, Marc; Strohofer, Cliff, Filtering transcriptions of utterances.
상세보기
White, Marc; Strohofer, Cliff, Filtering transcriptions of utterances.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Handheld device for capturing text from both a document printed on paper and a document displayed on a dynamic display device.
상세보기
Jablokov, Victor R.; Jablokov, Igor R.; White, Marc, Hosted voice recognition system for wireless devices.
상세보기
Jablokov, Victor R.; Jablokov, Igor R.; White, Marc, Hosted voice recognition system for wireless devices.
상세보기
King, Martin T.; Stephens, Redwood; Mannby, Claes-Fredrik; Peterson, Jesse; Sanvitale, Mark; Smith, Michael J., Identifying a document by performing spectral analysis on the contents of the document.
상세보기
King, Martin T.; Mannby, Claes-Fredrik; Smith, Michael J., Image search using text-based elements within the contents of images.
상세보기
Hoffberg, Steven M.; Hoffberg-Borghesani, Linda I., Internet appliance system and method.
상세보기
Vessiere, Gilles; Bachelerie, Joël, Lexical correction of erroneous text by transformation into a voice message.
상세보기
Kobal, Jeffrey S.; Dhanakshirur, Girish, Method and system for automatic transcription prioritization.
상세보기
Kobal, Jeffrey S.; Dhanakshirur, Girish, Method and system for automatic transcription prioritization.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Method and system for character recognition.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Quentin, Method and system for character recognition.
상세보기
Marquette, Brian; Corfield, Charles; Espy, Todd, Method and systems for measuring user performance with speech-to-text conversion for dictation systems.
상세보기
Marquette, Brian; Corfield, Charles; Espy, Todd, Method and systems for simplifying copying and pasting transcriptions generated from a dictation based speech-to-text system.
상세보기
Marquette, Brian; Corfield, Charles; Espy, Todd, Method and systems for simplifying copying and pasting transcriptions generated from a dictation based speech-to-text system.
상세보기
Jablokov, Victor Roman; Jablokov, Igor Roditis, Methods and systems for dynamically updating web service profile information by parsing transcribed message strings.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Methods and systems for initiating application processes by data capture from rendered documents.
상세보기
King, Martin T.; Mannby, Claes-Fredrik; Arends, Thomas C.; Bajorins, David P.; Fox, Daniel C., Optical scanners, such as hand-held optical scanners.
상세보기
King, Martin T.; Stephens, Redwood; Mannby, Claes-Fredrik; Peterson, Jesse; Sanvitale, Mark; Smith, Michael J., Performing actions based on capturing information from rendered documents, such as documents under copyright.
상세보기
King, Martin T.; Stephens, Redwood; Mannby, Claes-Fredrik; Peterson, Jesse; Sanvitale, Mark; Smith, Michael J., Performing actions based on capturing information from rendered documents, such as documents under copyright.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Portable scanning device.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Processing techniques for text capture from a rendered document.
상세보기
King, Martin T.; Kushler, Clifford A.; Stafford-Fraser, James Q.; Grover, Dale L., Processing techniques for visual capture data from a rendered document.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Publishing techniques for adding value to a rendered document.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Search engines and systems with handheld document data capture devices.
상세보기
Burckart, Erik J.; Grigsby, Travis M.; Ivory, Andrew; Shook, Aaron K., System for recording spoken phone numbers during a voice call.
상세보기
Zimmerman, Roger S.; Antunes, Christopher S.; Barron, Jeremy E.; Tomasulo, Sharon Lee; Fiore, Claudia W.; Johnson, Christopher E.; Khesin, Anatole; Miller, Joshua, Systems and methods for automated transcription training.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Triggering actions in response to optically or acoustically capturing keywords from a rendered document.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Triggering actions in response to optically or acoustically capturing keywords from a rendered document.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Triggering actions in response to optically or acoustically capturing keywords from a rendered document.
상세보기
King, Martin T.; Grover, Dale L.; Kushler, Clifford A.; Stafford-Fraser, James Q., Triggering actions in response to optically or acoustically capturing keywords from a rendered document.
상세보기
King, Martin T.; Mannby, Claes-Fredrik; Valenti, William, Using gestalt information to identify locations in printed information.
상세보기

IPC	Description
A	생활필수품
A62	인명구조; 소방(사다리 E06C)
A62B	인명구조용의 기구, 장치 또는 방법(특히 의료용에 사용되는 밸브 A61M 39/00; 특히 물에서 쓰이는 인명구조 장치 또는 방법 B63C 9/00; 잠수장비 B63C 11/00; 특히 항공기에 쓰는 것, 예. 낙하산, 투출좌석 B64D; 특히 광산에서 쓰이는 구조장치 E21F 11/00)
A62B-1/08	.. 윈치 또는 풀리에 제동기구가 있는 것

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 공고번호, 공고일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표출원인, 출원인국적, 출원인주소, 발명자, 발명자E, 발명자코드, 발명자주소, 발명자 우편번호, 발명자국적, 대표IPC, IPC코드, 요약, 미국특허분류, 대리인주소, 대리인코드, 대리인(한글), 대리인(영문), 국제공개일자, 국제공개번호, 국제출원일자, 국제출원번호, 우선권, 우선권주장일, 우선권국가, 우선권출원번호, 원출원일자, 원출원번호, 지정국, Citing Patents, Cited Patents
저장형식	Text(ASCII format) Excel format PIAS분석(.xls)
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

Graphical user interface for determining speech recognition accuracy 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

이 특허에 인용된 특허 (9)

이 특허를 인용한 특허 (55)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

Graphical user interface for determining speech recognition accuracy 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

전체(0) 논문(0) 특허(0) 보고서(0)

전체(0) 논문(0) 특허(0) 보고서(0)

이 특허에 인용된 특허 (9)

이 특허를 인용한 특허 (55)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트