[US Patent]
Context sensitive text recognition and marking from speech
Country/Type: United States (US) Patent, Granted
International Patent Classification (IPC, 7th ed.): G06F-003/00; G06F-003/13
Application No.: US-0421601 (filed 2006-06-01)
Patent No.: US-8171412 (granted 2012-05-01)
Inventors: Sand, Anne R.; Miller, Steven M.
Applicant: International Business Machines Corporation
Attorney/Agent: Bauer, Andrea
Citation information: cited by 1 patent; cites 12 patents
Abstract
A visual presentation system and method for synchronizing presentation data being viewed in a display with speech input. A system is disclosed that includes: a speech recognition system for recognizing speech input; an association system for determining a context of the speech input and matching the context with a relevant portion of the presentation data; and a visual coordination system for coordinating the display of a data item from the presentation data based on a match made by the association system.
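The abstract names three cooperating subsystems: a speech recognition system, an association system, and a visual coordination system. The following is a minimal Python sketch of how those parts might be wired together; the class names, the bag-of-words matching heuristic, and the sample slide data are illustrative assumptions, not details taken from the patent.

from __future__ import annotations

# Illustrative sketch of the three subsystems named in the abstract.
class SpeechRecognitionSystem:
    def recognize(self, audio_chunk: str) -> list[str]:
        # Stand-in: a real recognizer would decode audio; here we tokenize text.
        return audio_chunk.lower().split()

class AssociationSystem:
    def __init__(self, presentation_data: dict[str, str]):
        # Maps a slide or section id to its text content.
        self.presentation_data = presentation_data

    def match(self, words: list[str]) -> str | None:
        # Pick the portion sharing the most words with the recognized speech.
        best_id, best_score = None, 0
        for item_id, text in self.presentation_data.items():
            score = len(set(words) & set(text.lower().split()))
            if score > best_score:
                best_id, best_score = item_id, score
        return best_id

class VisualCoordinationSystem:
    def display(self, item_id: str | None) -> None:
        if item_id is not None:
            print(f"Now showing: {item_id}")

# Wiring the three systems together for one speech input.
slides = {"slide-1": "quarterly revenue growth", "slide-2": "hiring plan"}
recognizer = SpeechRecognitionSystem()
associator = AssociationSystem(slides)
coordinator = VisualCoordinationSystem()
words = recognizer.recognize("let us talk about revenue growth")
coordinator.display(associator.match(words))  # Now showing: slide-1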
Representative claims
1. A visual presentation system for synchronizing a display of a set of presentation data with a speech input of a speaker, the system comprising: a speech recognition system for processing the speech input of the speaker during a visual presentation, wherein the processing of the speech input of the speaker includes converting the speech input into a spoken information data set; an association system for processing the spoken information data set to determine a display order of portions of the set of presentation data, wherein relevant portions of the set of presentation data are displayed in an order based upon the speech input of the speaker, wherein the processing of the spoken information data set includes: preprocessing the set of presentation data to exclude a predetermined list of terms from consideration in a presentation data match determination; determining a context of the speech input by analyzing speech patterns in the speech input over a predetermined time interval; and determining a set of matches of the context with the relevant portions of the set of presentation data; and a visual coordination system for coordinating the display of the relevant portions of the set of presentation data based on the set of matches determined by the association system, wherein the coordinating of the display of the relevant portions includes adjusting the display order of the relevant portions to synchronize the visual presentation with the speech input, and wherein the visual coordination system includes a user selection system for allowing a user to select which of the relevant portions to display from among the set of matches during the visual presentation.
2. The visual presentation system of claim 1, wherein the association system includes a system for controlling a sensitivity for determining the context.
3. The visual presentation system of claim 1, wherein the preprocessing is performed prior to the beginning of the visual presentation and the relevant portion of the set of presentation data is selected from the group consisting of: data items, metadata, and location data.
4. The visual presentation system of claim 1, wherein the context is determined based on a frequency, volume, or speed of an uttered set of words in the speech input.
5. The visual presentation system of claim 1, wherein the visual coordination system selects and displays locations within the set of presentation data, wherein the locations are displayed independently to an audience on an audience display and to the user on a user display and are selected from the group consisting of: a view, a word, a phrase, a text segment, a graphic object, a visual element, a section, a slide, and a page.
6. The visual presentation system of claim 1, wherein the visual coordination system further includes a system for marking data items and saving an output of the presentation.
7. The visual presentation system of claim 6, wherein the visual coordination system includes a marking selected from the group consisting of: a first type of marking for visually identifying a data item in the display that is yet to be discussed; a second type of marking for visually identifying a data item in the display currently being discussed; and a third type of marking for visually identifying a data item in the display that was previously discussed.
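Claim 1 (echoed in claim 19) recites a user selection system that lets the presenter choose which relevant portion to display when the association step returns several matches. A hedged sketch of that coordination decision follows; the function shape, the single-match shortcut, and the callback-based chooser are assumptions for illustration.

from __future__ import annotations
from typing import Callable

def coordinate_display(matches: list[str],
                       choose: Callable[[list[str]], str]) -> str | None:
    # No relevant portion: leave the current display unchanged.
    if not matches:
        return None
    # Exactly one match: adjust the display automatically.
    if len(matches) == 1:
        return matches[0]
    # Several matches: defer to the presenter's selection (claims 1 and 19).
    return choose(matches)

# Example: a trivial chooser that always takes the first candidate.
print(coordinate_display(["slide-3", "slide-7"], choose=lambda ms: ms[0]))  # slide-3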
8. A method for synchronizing a display of a set of presentation data with a set of speech inputs of a speaker, the method comprising: preprocessing the set of presentation data to exclude a predetermined list of terms from consideration in a presentation data match determination; capturing a speech input of the speaker during a visual presentation; providing a speech recognition system to process the speech input, wherein the processing of the speech input includes converting the speech input into a spoken information data set; determining a context of the speech input based on the spoken information data set, wherein the context is determined in response to a word or a phrase being recognized a predetermined plurality of times over a predetermined time interval; matching the context with a relevant portion of the set of presentation data; and coordinating the display of the relevant portion of the set of presentation data based on the matching, wherein the coordinating of the display of the relevant portion includes automatically adjusting the display of the relevant portion to synchronize the visual presentation with the speech input.
9. The method of claim 8, wherein the predetermined plurality of times is adjustable via a sensitivity control.
10. The method of claim 8, wherein the preprocessing is performed prior to the beginning of the visual presentation and the relevant portion of the set of presentation data is selected from the group consisting of: data items, metadata, and location data.
11. The method of claim 8, wherein the context is determined based on a frequency, volume, or speed of an uttered set of words in the speech input.
12. The method of claim 8, wherein the coordinating step selects and displays locations within the set of presentation data, wherein the locations are displayed independently to an audience on an audience display and to the user on a user display and are selected from the group consisting of: a view, a word, a phrase, a text segment, a graphic object, a visual element, a section, a slide, and a page.
13. The method of claim 8, wherein the coordinating step further includes marking data items and saving an output of the presentation.
14. The method of claim 13, wherein the coordinating step includes a step selected from the group consisting of: using a first type of marking for visually identifying a data item in the display that is yet to be discussed; using a second type of marking for visually identifying a data item in the display currently being discussed; and using a third type of marking for visually identifying a data item in the display that was previously discussed.
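Claim 8 pins the context test to a word or phrase being recognized a predetermined number of times within a predetermined time interval, claim 9 makes that threshold adjustable via a sensitivity control, and the preprocessing step excludes a list of terms from matching. The sketch below shows one way to realize this, assuming a sliding time window; the window length, defaults, and data structure are illustrative, not specified by the patent.

from __future__ import annotations
from collections import deque

class ContextDetector:
    def __init__(self, window_seconds: float = 30.0, sensitivity: int = 2,
                 excluded_terms: frozenset[str] = frozenset()):
        self.window_seconds = window_seconds
        self.sensitivity = sensitivity        # claim 9: adjustable threshold
        self.excluded = set(excluded_terms)   # claim 8: preprocessed exclusion list
        self.events: deque[tuple[float, str]] = deque()

    def observe(self, timestamp: float, word: str) -> set[str]:
        if word not in self.excluded:
            self.events.append((timestamp, word))
        # Drop recognitions that fell outside the predetermined time interval.
        while self.events and timestamp - self.events[0][0] > self.window_seconds:
            self.events.popleft()
        # Context: terms recognized at least `sensitivity` times in the window.
        counts: dict[str, int] = {}
        for _, w in self.events:
            counts[w] = counts.get(w, 0) + 1
        return {w for w, c in counts.items() if c >= self.sensitivity}

detector = ContextDetector(excluded_terms=frozenset({"the", "and", "a"}))
detector.observe(0.0, "revenue")         # first mention: below threshold
print(detector.observe(5.0, "revenue"))  # {'revenue'}: threshold reached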
15. A computer program product stored on a computer useable medium for synchronizing a display of a set of presentation data with a set of speech inputs of a speaker, the program product comprising: program code configured to preprocess the set of presentation data to exclude a predetermined list of terms from consideration in a presentation data match determination; program code configured for determining a context of a speech input over a predetermined time interval, wherein the context is determined by analyzing speech patterns in the speech input of the speaker over the predetermined time interval; program code configured for matching the context with a plurality of the relevant portions of the set of presentation data; and program code configured for coordinating the display of the relevant portions of the set of presentation data during a visual presentation based on the matching, wherein the coordinating of the display of the relevant portions includes automatically adjusting the display of the relevant portions to synchronize the visual presentation with the speech input.
16. The computer program product of claim 15, wherein the program code for determining the context is adjustable via a sensitivity control.
17. The computer program product of claim 15, wherein the preprocessing is performed prior to the beginning of the visual presentation and the relevant portion of the set of presentation data is selected from the group consisting of: data items, metadata, and location data.
18. The computer program product of claim 15, wherein the context is determined based on a frequency, volume, or speed of an uttered set of words in the speech input.
19. The computer program product of claim 15, wherein the program code configured for coordinating the display of the presentation data selects and displays a plurality of matching data items, and provides a user selection system for allowing a user to select which relevant portions to display.
20. The computer program product of claim 15, wherein the program code configured for coordinating the display of the presentation data automatically selects which relevant portions to display from the plurality of matches.
21. The computer program product of claim 20, wherein the program code configured for coordinating the display of the presentation data includes a function selected from the group consisting of: using a first type of marking for visually identifying a data item in the display that is yet to be discussed; using a second type of marking for visually identifying a data item in the display currently being discussed; and using a third type of marking for visually identifying a data item in the display that was previously discussed.
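Claims 7, 14, and 21 each recite three distinct marking types: one for data items not yet discussed, one for the item currently being discussed, and one for items previously discussed. The sketch below assigns those three states to a list of items; the enum and its style strings are assumptions, not markings specified by the patent.

from __future__ import annotations
from enum import Enum

class Marking(Enum):
    NOT_YET_DISCUSSED = "dim"            # first type of marking
    BEING_DISCUSSED = "highlight"        # second type of marking
    PREVIOUSLY_DISCUSSED = "checkmark"   # third type of marking

def mark_items(items: list[str], current: str,
               discussed: set[str]) -> dict[str, Marking]:
    markings = {}
    for item in items:
        if item == current:
            markings[item] = Marking.BEING_DISCUSSED
        elif item in discussed:
            markings[item] = Marking.PREVIOUSLY_DISCUSSED
        else:
            markings[item] = Marking.NOT_YET_DISCUSSED
    return markings

print(mark_items(["a", "b", "c"], current="b", discussed={"a"}))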
22. A method for deploying a system for synchronizing a display of a set of presentation data with a set of speech inputs of a speaker, the method comprising: providing a computer infrastructure being operable to: preprocess the set of presentation data to exclude a predetermined list of terms from consideration in a presentation data match determination; determine a context of a speech input over a predetermined time interval, wherein the context is determined in response to a word or phrase being recognized a predetermined plurality of times over the predetermined time interval; match the context with a relevant portion of the set of presentation data; and coordinate the display of the set of presentation data based on the matching step, including displaying the relevant portion of the set of presentation data, wherein the coordinating of the display of the relevant portion includes automatically adjusting the display of the relevant portion to synchronize a visual presentation with the speech input.
Cited patents (selected):
Drake, Samuel (San Jose, CA); Griefer, Allan D. (San Jose, CA); Powers, Jr., John T. (Morgan Hill, CA); Thomas, John G. (Santa Cruz, CA), Automated presentation capture, storage and playback system.
Eastwood, Peter Rowland; Happ, Alan J.; Klein, Alice G.; Kruse, Daniel William; Milenkovic, Maria, Display indications of speech processing states in speech recognition system.
Rtischev, Dimitry (Menlo Park, CA); Bernstein, Jared C. (Palo Alto, CA); Chen, George T. (Menlo Park, CA); Butzberger, John W. (Foster City, CA), Method and apparatus for voice-interactive language instruction.
Brocious, Larry A.; Gabel, Jonathan L.; Loose, David C.; VanBuskirk, Ronald E.; Wang, Huifang; Woodward, Steven G., Method, system, and apparatus for limiting available selections in a speech recognition system.