[특허]Active speaker location detection

[미국특허] Active speaker location detection 원문보기

IPC분류정보
국가/구분	United States(US) Patent 등록
국제특허분류(IPC7판)	H04N-007/15 H04N-005/232 H04R-003/00 H04R-029/00 G06T-007/00 H04N-007/14
출원번호	US-0991847 (2016-01-08)
등록번호	US-9621795 (2017-04-11)
발명자 / 주소	Whyte, Oliver Arthur Cutler, Ross Bhattacharjee, Avronil Kowdle, Adarsh Prakash Murthy Kirk, Adam Birchfield, Stanley T. Zhang, Cha
출원인 / 주소	MICROSOFT TECHNOLOGY LICENSING, LLC
대리인 / 주소	Alleman Hall McCoy Russell & Tuttle LLP
인용정보	피인용 횟수 : 1 인용 특허 : 8

초록 ▼

Various examples related to determining a location of an active speaker are provided. In one example, image data of a room from an image capture device is received and a three dimensional model is generated. First audio data from a first microphone array at the image capture device is received. Second audio data from a second microphone array laterally spaced from the image capture device is received. Using the three dimensional model, a location of the second microphone array with respect to the image capture device is determined. Using the audio data and the location and angular orientation of the second microphone array, an estimated location of the active speaker is determined. Using the estimated location, a setting for the image capture device is determined and outputted to highlight the active speaker.

대표청구항 ▼

1. A method for determining a location of an active speaker, the method comprising: from an image capture device, receiving image data of a room in which the active speaker and at least one inactive speaker are located;using the image data, generating a three dimensional model of at least a portion of the room;from a first microphone array at the image capture device, receiving first audio data from the room;from a second microphone array that is laterally spaced from the image capture device, receiving second audio data from the room;using the three dimensional model, determining a location of the second microphone array with respect to the image capture device;using at least the first audio data, the second audio data, the location of the second microphone array, and an angular orientation of the second microphone array, determining an estimated location in the three dimensional model of the active speaker;using the estimated location of the active speaker to compute a setting for the image capture device; andoutputting the setting to control the image capture device to highlight the active speaker. 2. The method of claim 1, wherein the image capture device comprises a color camera and the image data comprises color image data. 3. The method of claim 1, wherein the image capture device comprises a depth camera and the image data comprises depth data. 4. The method of claim 1, wherein the image data comprises signals corresponding to light emitted from a plurality of light sources of the second microphone array, and the method further comprises using the signals to determine the angular orientation of the second microphone array with respect to the image capture device. 5. The method of claim 4, wherein the plurality of light sources are illuminated in a spatially-recognizable manner. 6. The method of claim 1, further comprising: receiving a signal from a magnetometer in the second microphone array; andusing the magnetometer signal, determining the angular orientation of the second microphone array. 7. The method of claim 1, further comprising determining that at least one of the first microphone array and the second microphone array has moved; and based on determining that at least one of the first microphone array and the second microphone array has moved, recomputing one or more of the location and the angular orientation of the second microphone array. 8. The method of claim 7, wherein determining that at least one of the first microphone array and the second microphone array has moved comprises analyzing a signal received from one or more of an accelerometer in the first microphone array, a magnetometer in the first microphone array, an accelerometer in the second microphone array, and a magnetometer in the second microphone array. 9. The method of claim 1, further comprising: determining that the image data does not comprise image data of a plurality of light sources of the second microphone array; andoutputting a notification indicating that the second microphone array is occluded from view of the image capture device. 10. A video conferencing device, comprising: an image capture device for capturing image data of a room in which an active speaker and at least one inactive speaker are located;a first microphone array;a processor; andan active speaker location program executable by the processor, the active speaker location program configured to: using the image data, generate a three dimensional model of at least a portion of the room;receive first audio data of the room from the first microphone array;receive second audio data of the room from a second microphone array that is laterally spaced from the image capture device;using the three dimensional model, determine a location of the second microphone array with respect to the image capture device;using at least the first audio data, the second audio data, the location of the second microphone array, and an angular orientation of the second microphone array, determine an estimated three dimensional location of the active speaker;use the estimated location of the active speaker to compute a setting for the image capture device; andoutput the setting to control the image capture device to highlight the active speaker. 11. The video conferencing device of claim 10, wherein the image capture device comprises a color camera and the image data comprises color image data. 12. The video conferencing device of claim 10, wherein the image capture device comprises a depth camera and the image data comprises depth data. 13. The video conferencing device of claim 10, wherein the image data comprises signals corresponding to light emitted from a plurality of light sources of the second microphone array, and the active speaker location program is configured to determine the angular orientation of the second microphone array using the signals. 14. The video conferencing device of claim 13, wherein the plurality of light sources are illuminated in a spatially-recognizable manner. 15. The video conferencing device of claim 10, wherein the active speaker location program is configured to determine the angular orientation of the second microphone array using a signal received from a magnetometer in the second microphone array. 16. The video conferencing device of claim 10, wherein the active speaker location program is further configured to: determine that the second microphone array has moved from a first location to a second location; andbased on determining that that the second microphone array has moved, recompute one or more of the location and the angular orientation of the second microphone array. 17. The video conferencing device of claim 16, wherein determining that the second microphone array has moved comprises receiving a signal from an accelerometer in the second microphone array. 18. The video conferencing device of claim 10, wherein the active speaker location program is further configured to: determine that the image data does not comprise image data of a plurality of light sources of the second microphone array; andoutput a notification indicating that the second microphone array is occluded from view of the image capture device. 19. A method for determining a location of an active speaker, the method comprising: from an image capture device, receiving image data of a room in which the active speaker and at least one inactive speaker are located;using the image data, generating a three dimensional model of at least a portion of the room;from a first microphone array at the image capture device, receiving first audio data from the room;from a second microphone array that is laterally spaced from the image capture device, receiving second audio data from the room;using the three dimensional model, determining a location of the second microphone array with respect to the image capture device;determining an angular orientation of the second microphone array with respect to the image capture device by receiving light emitted from a plurality of light sources of the second microphone array;using at least the first audio data, the second audio data, the location of the second microphone array, and the angular orientation of the second microphone array, determining an estimated three dimensional location of the active speaker;using the estimated location of the active speaker to compute a setting for the image capture device; andoutputting the setting to control the image capture device to zoom into the active speaker. 20. The method of claim 19, further comprising: receiving a signal from an accelerometer in the second microphone array;using the signal, determining that the second microphone array has experienced an acceleration; andbased on determining that that the second microphone array has experienced an acceleration, recomputing the angular orientation of the second microphone array.

이 특허에 인용된 특허 (8) 인용/피인용 타임라인 분석

Taylor,Michael James; Rowe,Simon Michael, Image processing apparatus.
상세보기
Benesty, Jacob; Elko, Gary Wayne; Huang, Yiteng, Method and apparatus for passive acoustic source localization for video camera steering applications.
상세보기
Satoda Kozo,JPX ; Hiraike Ryuichi,JPX, Multi-site television conference system and central control apparatus and conference terminal for use with the system.
상세보기
Cutler, Ross G., Satellite microphone array for video conferencing.
상세보기
Cutler, Ross G., Satellite microphones for improved speaker detection and zoom.
상세보기
Cutler, Ross G., Satellite microphones for improved speaker detection and zoom.
상세보기
Addeo Eric J. (Long Valley NJ) Robbins John D. (Denville NJ) Shtirmer Gennady (Morris Plains NJ), Sound localization system for teleconferencing using self-steering microphone arrays.
상세보기
Basart, Edwin J.; Rucinski, David B., Speaker identification and representation for a phone.
상세보기

이 특허를 인용한 특허 (1) 인용/피인용 타임라인 분석

Aas, Rune Øistein; Tangeland, Kristian; Hellerud, Erik, Auto-calibration of relative positions of multiple speaker tracking systems.
상세보기

활용도 분석정보

상세보기

다운로드

내보내기

활용도 Top5 특허

해당 특허가 속한 카테고리에서 활용도가 높은 상위 5개 콘텐츠를 보여줍니다.
더보기 버튼을 클릭하시면 더 많은 관련자료를 살펴볼 수 있습니다.

IPC	Description
A	생활필수품
A62	인명구조; 소방(사다리 E06C)
A62B	인명구조용의 기구, 장치 또는 방법(특히 의료용에 사용되는 밸브 A61M 39/00; 특히 물에서 쓰이는 인명구조 장치 또는 방법 B63C 9/00; 잠수장비 B63C 11/00; 특히 항공기에 쓰는 것, 예. 낙하산, 투출좌석 B64D; 특히 광산에서 쓰이는 구조장치 E21F 11/00)
A62B-1/08	.. 윈치 또는 풀리에 제동기구가 있는 것

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 공고번호, 공고일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표출원인, 출원인국적, 출원인주소, 발명자, 발명자E, 발명자코드, 발명자주소, 발명자 우편번호, 발명자국적, 대표IPC, IPC코드, 요약, 미국특허분류, 대리인주소, 대리인코드, 대리인(한글), 대리인(영문), 국제공개일자, 국제공개번호, 국제출원일자, 국제출원번호, 우선권, 우선권주장일, 우선권국가, 우선권출원번호, 원출원일자, 원출원번호, 지정국, Citing Patents, Cited Patents
저장형식	Text(ASCII format) Excel format PIAS분석(.xls)
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

[미국특허] Active speaker location detection 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

이 특허에 인용된 특허 (8) 인용/피인용 타임라인 분석

이 특허를 인용한 특허 (1) 인용/피인용 타임라인 분석

활용도 분석정보

활용도 Top5 특허

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

[미국특허] Active speaker location detection 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

전체(0) 논문(0) 특허(0) 보고서(0)

전체(0) 논문(0) 특허(0) 보고서(0)

이 특허에 인용된 특허 (8) 인용/피인용 타임라인 분석

이 특허를 인용한 특허 (1) 인용/피인용 타임라인 분석

활용도 분석정보

활용도 Top5 특허 더보기

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

활용도 Top5 특허