Interactive robot, speech recognition method and computer program product
IPC Classification
Country / Type
United States (US) Patent
Granted
International Patent Classification (IPC, 7th edition)
G10L-015/00
G10L-021/00
G05B-019/19
B25J-005/00
B25J-009/18
Application number
UP-0311429
(2005-12-20)
Registration number
US-7680667
(2010-04-21)
Priority information
JP-2004-374946(2004-12-24)
Inventors / Address
Sonoura, Takafumi
Suzuki, Kaoru
Applicant / Address
Kabushiki Kaisha Toshiba
Agent / Address
Nixon & Vanderhye, PC
Citation information
Cited by: 27
Patents cited: 4
Abstract
An interactive robot capable of speech recognition includes a sound-source-direction estimating unit that estimates a direction of a sound source for target voices which are required to undergo speech recognition; a moving unit that moves the interactive robot in the sound-source direction; a target-voice acquiring unit that acquires the target voices at a position after moving; and a speech recognizing unit that performs speech recognition of the target voices.
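The four units named in the abstract form a simple pipeline: estimate the source direction, move, acquire the voice at the new position, recognize it. A minimal Python sketch of that data flow follows; every function name here is a hypothetical stand-in for a unit, not an API defined by the patent:

```python
def interaction_step(estimate_direction, move, acquire, recognize):
    """One pass of the pipeline described in the abstract: estimate the
    sound-source direction, move toward it, acquire the target voice at
    the new position, then run speech recognition on it."""
    direction = estimate_direction()   # sound-source-direction estimating unit
    position = move(direction)         # moving unit
    voice = acquire(position)          # target-voice acquiring unit
    return recognize(voice)            # speech recognizing unit

# Toy stand-ins to show the data flow end to end:
result = interaction_step(
    estimate_direction=lambda: 30.0,                 # bearing in degrees
    move=lambda bearing: ("new position", bearing),
    acquire=lambda pos: "hello robot",
    recognize=lambda voice: voice.upper(),
)
print(result)  # HELLO ROBOT
```

The point of moving before acquiring is that recognition runs on the voice captured at the closer position, not on the original distant capture.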
Representative Claims
What is claimed is:

1. An interactive robot capable of speech recognition, comprising: a sound-source-direction estimating unit that estimates a direction of a sound source for target voices which are required to undergo speech recognition; a moving unit that moves the interactive robot in the sound-source direction; a target-voice acquiring unit that acquires the target voices at a position after moving; a target-voice holding unit that holds voice patterns of the target voices, the target voices including misrecognition-notification voices signifying that speech recognition by the speech recognizing unit is erroneous; a speech recognizing unit that performs speech recognition of the target voices by pattern matching of the voice patterns of the target voices, which are held in the target-voice holding unit, with the target voices acquired by the target-voice acquiring unit; a recognition-accuracy evaluating unit that calculates, as an accuracy of recognition results, an agreement accuracy between the acquired target voices and the voice patterns of the target voices held in the target-voice holding unit; wherein the moving unit moves the interactive robot itself in the direction of the sound source when the recognition accuracy for results of speech recognition of the target voices is smaller than a predetermined recognition-accuracy threshold and when the misrecognition-notification voices held in the target-voice holding unit are recognized.

2. The interactive robot according to claim 1, further comprising a voice-producing directing unit by which the sound source of the target voices is directed to produce voices after the robot is moved in the direction of the sound source, wherein the speech recognizing unit performs speech recognition of the target voices produced according to the voice-producing direction.

3.
The interactive robot according to claim 1, further comprising: a signal-to-noise ratio calculating unit that calculates a signal-to-noise ratio of the target voices; and a signal-to-noise-ratio evaluating unit that compares the calculated signal-to-noise ratio and a predetermined threshold for the signal-to-noise ratio, wherein the moving unit moves the interactive robot itself in the direction of the sound source when the signal-to-noise ratio is smaller than the threshold for the signal-to-noise ratio.

4. The interactive robot according to claim 3, wherein the target voices are voices produced by an interlocutor communicating with the interactive robot, and the interactive robot further comprises an image acquiring unit that acquires images including the interlocutor as the sound source of the target voices; and a mouth-movement detecting unit that detects, from the images, mouth movement caused by voices produced by the interlocutor, wherein the moving unit moves the interactive robot itself in the direction of the sound source when the signal-to-noise ratio is smaller than the threshold for the signal-to-noise ratio, and the mouth movement of the interlocutor is detected.

5. The interactive robot according to claim 3, wherein the target voices are voices produced by an interlocutor communicating with the interactive robot, and the interactive robot further comprises an image acquiring unit that acquires images including the interlocutor as the sound source of the target voices; and a mouth-movement detecting unit that detects, from the images, mouth movement caused by voices produced by the interlocutor, wherein the moving unit moves the interactive robot itself in the direction of the sound source when the signal-to-noise ratio is equal to or larger than the threshold for the signal-to-noise ratio, and the mouth movement of the interlocutor is not detected.

6.
The interactive robot according to claim 1, wherein the target voices are voices produced by an interlocutor communicating with the interactive robot, and the interactive robot further comprises an image acquiring unit that acquires images including the interlocutor as the sound source of the target voices; and a mouth-movement detecting unit that detects, from the images acquired in the image acquiring unit, mouth movement caused by voices produced by the interlocutor, wherein the moving unit moves the interactive robot itself in the direction of the sound source when the recognition-accuracy is smaller than the threshold for the recognition accuracy, and the mouth movement of the interlocutor is detected.

7. The interactive robot according to claim 1, wherein the target voices are voices produced by an interlocutor communicating with the interactive robot, and the interactive robot further comprises an image acquiring unit that acquires images including the interlocutor as the sound source of the target voices; and a mouth-movement detecting unit that detects, from the images, mouth movement caused by voices produced by the interlocutor, wherein the moving unit moves the interactive robot itself in the direction of the sound source when the recognition-accuracy is equal to or larger than the threshold for the recognition accuracy, and the mouth movement of the interlocutor is not detected.

8.
The interactive robot according to claim 1, wherein the target voices are voices produced by an interlocutor communicating with the interactive robot, the interactive robot further comprises an image acquiring unit that acquires images including the interlocutor as the sound source of the target voices; and a mouth-movement detecting unit that detects, from the images acquired in the image acquiring unit, mouth movement caused by voices produced by the interlocutor, wherein the moving unit moves the interactive robot in the direction of the sound source when the mouth movement is detected and the target voices are not acquired.

9. The interactive robot according to claim 1, wherein the target voices are voices produced by an interlocutor communicating with the interactive robot, and the interactive robot further comprises an image acquiring unit that acquires images including the interlocutor as the sound source of the target voices; and a mouth-movement detecting unit that detects, from the images, mouth movement of the interlocutor, wherein the moving unit moves the interactive robot in the direction of the sound source when the mouth movement is not detected and the target voices are not acquired.

10. The interactive robot according to claim 1, further comprising a microphone array that has a plurality of microphones which pick up the target voices, wherein the direction of the sound source is estimated, based on differential arrival time between plane waves of the target voices picked up with corresponding voice microphones.

11. The interactive robot according to claim 1, further comprising a distance measuring sensor that measures a distance between the target voices and the interactive robot, wherein the sound-source-direction estimating unit estimates the direction of the sound source, based on measured results.

12.
The interactive robot according to claim 1, further comprising an image forming unit that forms an image of the sound source of the target voices, wherein the sound-source-direction estimating unit estimates the direction of the sound source, assuming that an image-forming direction is the direction of the sound source.

13. The interactive robot according to claim 1, further comprising: a signal-strength measurement unit that measures signal strength of the target voices at a position after the interactive robot is moved by the moving unit; and an amplification-gain-adjustment unit that, based on the value of the signal strength, adjusts a gain of amplification by which voice signal of the target voices is amplified, wherein the speech recognizing unit performs speech recognition of the target voices acquired after the gain of amplification is adjusted.

14. A computer-implemented method for an interactive robot capable of speech recognition, the method comprising: estimating a direction of the sound source of target voices which are required to undergo speech recognition; moving the interactive robot in the direction of the sound source; acquiring the target voices when the interactive robot is located at a position after moving; performing speech recognition of the target voices by pattern matching of voice patterns of the target voices, which are held in a target-voice holding unit, with the acquired target voices, where the target voices held in the target-voice holding unit include misrecognition-notification voices signifying that speech recognition is erroneous; calculating, as an accuracy of recognition results, an agreement accuracy between the acquired target voices and the voice patterns of the target voices held in the target-voice holding unit; and moving the interactive robot itself in the direction of the sound source when the recognition accuracy for results of speech recognition of the target voices is smaller than a predetermined
recognition-accuracy threshold and when the misrecognition-notification voices held in the target-voice holding unit are recognized.

15. A computer program product having a computer readable medium including programmed instructions for performing speech recognition processing on an interactive robot capable of speech recognition, wherein the instructions, when executed by a computer, cause the computer to perform: estimating a direction of the sound source of target voices which are required to undergo speech recognition; moving the interactive robot in the direction of the sound source; acquiring the target voices when the interactive robot is located at a position after moving; performing speech recognition of the target voices by pattern matching of voice patterns of the target voices, which are held in a target-voice holding unit, with the acquired target voices, where the target voices held in the target-voice holding unit include misrecognition-notification voices signifying that speech recognition is erroneous; calculating, as an accuracy of recognition result, an agreement accuracy between the acquired target voices and the voice patterns of the target voices held in the target-voice holding unit; and moving the interactive robot itself in the direction of the sound source when the recognition accuracy for results of speech recognition of the target voices is smaller than a predetermined recognition-accuracy threshold and when the misrecognition-notification voices held in the target-voice holding unit are recognized.
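Claim 1's movement trigger combines two conditions: the pattern-matching agreement accuracy falls below a threshold, and a dedicated misrecognition-notification phrase (the speaker objecting to a wrong result) is itself recognized. A minimal sketch of that decision, with illustrative threshold values not taken from the patent:

```python
def should_reapproach(agreement_accuracy, accuracy_threshold, misrecognition_notified):
    """Claim 1: the robot re-approaches the sound source only when the
    agreement accuracy between the acquired voice and the stored voice
    patterns is below the threshold AND a misrecognition-notification
    voice held in the target-voice holding unit was recognized."""
    return agreement_accuracy < accuracy_threshold and misrecognition_notified

print(should_reapproach(0.42, 0.60, True))   # True: poor match and the user objected
print(should_reapproach(0.80, 0.60, True))   # False: the match was good enough
print(should_reapproach(0.42, 0.60, False))  # False: no objection was heard
```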
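Claims 3 through 5 gate movement on the signal-to-noise ratio, optionally cross-checked against detected mouth movement. The sketch below uses one common dB formulation of SNR (the claims do not fix a particular definition) and combines the two mouth-movement variants:

```python
import math

def snr_db(signal_power, noise_power):
    """Signal-to-noise ratio in decibels (an assumed formulation)."""
    return 10.0 * math.log10(signal_power / noise_power)

def should_move_snr(snr, snr_threshold, mouth_moving):
    """Claims 4 and 5 combined: approach when the voice is too weak but
    the interlocutor is visibly speaking (claim 4), or when the signal is
    strong yet no mouth movement is seen, suggesting the sound is not the
    interlocutor's voice (claim 5)."""
    if snr < snr_threshold and mouth_moving:
        return True   # claim 4: weak voice, speaker confirmed by vision
    if snr >= snr_threshold and not mouth_moving:
        return True   # claim 5: strong sound, but not from the interlocutor
    return False

print(snr_db(100.0, 1.0))                  # 20.0 dB
print(should_move_snr(5.0, 10.0, True))    # True  (claim 4)
print(should_move_snr(15.0, 10.0, False))  # True  (claim 5)
print(should_move_snr(15.0, 10.0, True))   # False: strong voice from the speaker
```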
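Claim 10 estimates the source direction from the differential arrival time of plane waves at a microphone array. For a single two-microphone pair under a far-field plane-wave assumption, the bearing follows from the arcsine of the path-length difference over the spacing; the spacing value below is illustrative:

```python
import math

SPEED_OF_SOUND = 343.0  # m/s in air at roughly 20 °C

def bearing_from_tdoa(tdoa_s, mic_spacing_m):
    """Far-field bearing (degrees from the array's broadside direction)
    recovered from the time difference of arrival of a plane wave at a
    two-microphone pair, in the spirit of claim 10."""
    path_difference = SPEED_OF_SOUND * tdoa_s
    # Clamp for numerical safety before the arcsine.
    ratio = max(-1.0, min(1.0, path_difference / mic_spacing_m))
    return math.degrees(math.asin(ratio))

print(bearing_from_tdoa(0.0, 0.15))  # 0.0: simultaneous arrival, source straight ahead
print(round(bearing_from_tdoa(0.15 / SPEED_OF_SOUND, 0.15), 1))  # 90.0: source on the array axis
```

With more than two microphones, pairwise estimates of this kind can be combined to disambiguate and refine the direction.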
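Claim 13 measures the signal strength of the target voice after the robot has moved and adjusts the amplification gain before recognition. A sketch of one plausible control law; the RMS-based scaling and the gain cap are assumptions, not the patent's method:

```python
def adjust_gain(measured_rms, target_rms, current_gain, max_gain=8.0):
    """Claim 13 sketched: scale the amplifier gain so the voice signal
    reaches the recognizer near a target level, capped at an assumed
    maximum gain to avoid amplifying noise without bound."""
    if measured_rms <= 0.0:
        return current_gain  # nothing measured; keep the old gain
    return min(max_gain, current_gain * target_rms / measured_rms)

print(adjust_gain(0.05, 0.20, 1.0))  # 4.0: quiet voice, gain raised
print(adjust_gain(0.40, 0.20, 1.0))  # 0.5: loud voice, gain lowered
```

Recognition then runs on the voice acquired after this adjustment, which is why the measurement is taken at the post-movement position.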
Patents cited by this patent (4)
Chigier, Benjamin (Brookline, MA), Automatic speech recognition.
Petroni, Marco (CA); Peters, Steven Douglas (CA), Method and apparatus for providing an improved feature set in speech recognition by performing noise cancellation and background masking.
Patents citing this patent
Osterhout, Ralph F.; Haddick, John D.; Lohse, Robert Michael; Cella, Charles; Nortrup, Robert J.; Nortrup, Edward H., AR glasses with event and sensor triggered AR eyepiece interface to external devices.
Osterhout, Ralph F.; Haddick, John D.; Lohse, Robert Michael; Cella, Charles; Nortrup, Robert J.; Nortrup, Edward H., AR glasses with event and sensor triggered control of AR eyepiece applications.
Osterhout, Ralph F.; Haddick, John D.; Lohse, Robert Michael; Cella, Charles; Nortrup, Robert J.; Nortrup, Edward H., AR glasses with event and user action control of external applications.
Osterhout, Ralph F.; Haddick, John D.; Lohse, Robert Michael; Border, John N.; Miller, Gregory D.; Stovall, Ross W., Eyepiece with uniformly illuminated reflective display.
Miller, Gregory D.; Border, John N.; Osterhout, Ralph F., Grating in a light transmissive illumination system for see-through near-eye display glasses.
Miller, Gregory D.; Border, John N.; Osterhout, Ralph F., Optical imperfections in a light transmissive illumination system for see-through near-eye display glasses.
Border, John N.; Bietry, Joseph; Osterhout, Ralph F., See-through near-eye display glasses including a curved polarizing film in the image source, a partially reflective, partially transmitting optical element and an optically flat film.
Border, John N.; Haddick, John D.; Osterhout, Ralph F., See-through near-eye display glasses including a partially reflective, partially transmitting optical element.
Border, John N.; Osterhout, Ralph F., See-through near-eye display glasses including an auto-brightness control for the display brightness based on the brightness in the environment.
Border, John N.; Bietry, Joseph; Osterhout, Ralph F., See-through near-eye display glasses wherein image light is transmitted to and reflected from an optically flat film.
Border, John N.; Osterhout, Ralph F., See-through near-eye display glasses with a fast response photochromic film system for quick transition from dark to clear.
Border, John N.; Haddick, John D.; Osterhout, Ralph F., See-through near-eye display glasses with a light transmissive wedge shaped illumination system.
Border, John N.; Haddick, John D.; Lohse, Robert Michael; Osterhout, Ralph F., See-through near-eye display glasses with the optical assembly including absorptive polarizers or anti-reflective coatings to reduce stray light.