Sound localization with artificial neural network
IPC Classification
Country / Type
United States (US) Patent
Granted
International Patent Classification (IPC, 7th edition)
G06F-015/18
G06E-001/00
G06E-003/00
G06G-007/00
G06N-003/08
Application Number
US-0216807
(2014-03-17)
Patent Number
US-9129223
(2015-09-08)
Inventors / Address
Velusamy, Kavitha
Crump, Edward Dietz
Applicant / Address
Amazon Technologies, Inc.
Agent / Address
Lee & Hayes, PLLC
Citation Information
Cited by: 3
Cited patents: 15
Abstract
The location of a sound within a given spatial volume may be used in applications such as augmented reality environments. An artificial neural network processes time-difference-of-arrival data (TDOA) from a known microphone array to determine a spatial location of the sound. The neural network may be located locally or available as a cloud service. The artificial neural network is trained with perturbed and non-perturbed TDOA data.
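The abstract describes a two-stage pipeline: estimate the time difference of arrival (TDOA) of a sound at a pair of microphones, then feed that difference to a neural network that outputs spatial coordinates. The sketch below illustrates only the first stage, estimating a TDOA by cross-correlating two microphone signals. It is not the patent's implementation; the sampling rate, the synthetic pulse, and the 20-sample delay are all illustrative assumptions.

```python
import numpy as np

def estimate_tdoa(sig_a, sig_b, fs):
    """Estimate the TDOA in seconds: positive when the sound
    reaches microphone A before microphone B."""
    # Peak of the full cross-correlation gives the sample lag
    # by which sig_a must be shifted to best match sig_b.
    corr = np.correlate(sig_a, sig_b, mode="full")
    lag = np.argmax(corr) - (len(sig_b) - 1)
    return -lag / fs

fs = 16_000
t = np.arange(fs) / fs
pulse = np.exp(-((t - 0.5) ** 2) / 1e-4)  # synthetic "clap" at 0.5 s
sig_a = pulse
sig_b = np.roll(pulse, 20)                # mic B hears it 20 samples later

tdoa = estimate_tdoa(sig_a, sig_b, fs)    # 20 / 16000 = 1.25 ms
```

In a real array, one such TDOA is computed per microphone pair, and the vector of TDOAs forms the input-node values of the network.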
Representative Claims
1. A system comprising: a first microphone; a second microphone; one or more processors; and one or more computer-readable media storing computer-executable instructions that, when executed, cause the one or more processors to perform acts comprising: determine a difference between a time-of-arrival of a first acoustic signal at the first microphone and a time-of-arrival of a second acoustic signal at the second microphone, the first acoustic signal and the second acoustic signal associated with an acoustic source; receive the determined difference at a plurality of input nodes of an artificial neural network; and generate, at a plurality of output nodes of the artificial neural network, spatial coordinates of the acoustic source.

2. The system as recited in claim 1, wherein the first microphone and the second microphone are positioned in a pre-determined arrangement within an environment.

3. The system as recited in claim 2, wherein the artificial neural network is pre-trained based at least in part on the pre-determined arrangement of the first microphone and the second microphone within the environment.

4. The system as recited in claim 1, wherein a greater number of the plurality of input nodes positively correlates with more precise spatial coordinates of the acoustic source generated at the plurality of output nodes of the artificial neural network.

5. The system as recited in claim 1, the acts further comprising: train the artificial neural network with backpropagation using an acoustic signal at known spatial coordinates.

6. The system as recited in claim 1, wherein the acoustic signal comprises human speech.

7. The system as recited in claim 1, wherein a user at least in part generates the acoustic signal.

8. The system as recited in claim 1, wherein the acoustic signal comprises an audible gesture within an augmented reality environment.

9. One or more computer-readable media storing computer-executable instructions that, when executed, cause one or more processors to perform acts comprising: receiving a first indication of a first acoustic signal generated from a source at a first microphone; receiving a second indication of a second acoustic signal generated from the source at a second microphone; determining a difference of time between the receiving of the first indication and the receiving of the second indication; receiving, at a plurality of input nodes of an artificial neural network, the determined difference; and generating, at a plurality of output nodes of the artificial neural network, spatial coordinates of the source.

10. The one or more computer-readable storage media as recited in claim 9, further comprising: varying the determined difference of time within a pre-determined range; and training the artificial neural network to associate the varied determined difference with the generated spatial coordinates.

11. The one or more computer-readable storage media as recited in claim 9, further comprising: training the artificial neural network to associate the determined difference with the generated spatial coordinates.

12. The one or more computer-readable storage media as recited in claim 9, wherein the generating comprises generating the spatial coordinates locally using a first artificial neural network, and further comprises generating spatial coordinates remotely using a second artificial neural network, the second artificial neural network comprising additional nodes compared to the first artificial neural network.

13. The one or more computer-readable storage media as recited in claim 12, wherein the second artificial neural network is configured to execute as a cloud compute resource accessible to a plurality of users.

14. The one or more computer-readable storage media as recited in claim 12, further comprising modifying the spatial coordinates of the source with combined results from the first artificial neural network and the second artificial neural network.

15. A system configured to estimate a physical location of an acoustic source within an environment, the system comprising: first and second microphones positioned in the environment; one or more processors; one or more computer-readable media storing computer-executable instructions that, when executed, cause the one or more processors to perform acts comprising: measure, at the first microphone, a time of arrival of an acoustic signal associated with the acoustic source; measure, at the second microphone, a time of arrival of the acoustic signal associated with the acoustic source; determine a difference between the measured time of arrival of the acoustic signal at the first microphone and the measured time of arrival of the acoustic signal at the second microphone; receive the determined difference at a plurality of input nodes of a trained artificial neural network; and estimate, at a plurality of output nodes of the trained artificial neural network, the physical location of the acoustic source within the environment.

16. The system as recited in claim 15, wherein the first and second microphones are positioned in the environment in a pre-determined arrangement, the first microphone having a known location relative to the second microphone.

17. The system as recited in claim 16, wherein the trained artificial neural network is pre-trained based on the pre-determined arrangement of the first and second microphones.

18. The system as recited in claim 15, the acts further comprising: vary the determined difference within a pre-determined range of time; and update the trained artificial neural network to associate the varied determined difference with the estimated physical location.

19. The system as recited in claim 15, the system further comprising: at least one camera to capture structured light within the environment; a ranging system to project structured light within the environment; and wherein the acts further comprise: update the trained artificial neural network with physical characteristics of the environment based at least on reflected structured light captured by the at least one camera.

20. The system as recited in claim 19, wherein the ranging system comprises one of a structured light module, a laser range finder, or an optical range finder.
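Claims 5, 10, and 18 describe training the network with backpropagation on TDOA data, including copies perturbed within a pre-determined range. The sketch below illustrates that training idea with a hypothetical 4-microphone planar array and a one-hidden-layer network; every constant (array geometry, noise scale, network size, learning rate) is an illustrative assumption, not taken from the patent.

```python
import numpy as np

rng = np.random.default_rng(0)
SPEED_OF_SOUND = 343.0  # m/s

# Hypothetical square microphone array, 1 m on a side (positions assumed).
mics = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])

def tdoas(src):
    """Exact TDOAs (seconds) of a 2-D source relative to microphone 0."""
    toa = np.linalg.norm(mics - src, axis=1) / SPEED_OF_SOUND
    return toa[1:] - toa[0]

# Training set: random source positions with exact TDOAs, plus jittered
# ("perturbed") copies, mirroring the perturbed/non-perturbed training data.
srcs = rng.uniform(0.0, 1.0, size=(1000, 2))
X = np.array([tdoas(s) for s in srcs])
X = np.vstack([X, X + rng.normal(0.0, 1e-5, X.shape)])  # perturbed copies
Y = np.vstack([srcs, srcs])
X = X / np.abs(X).max()                                 # scale inputs

# One-hidden-layer MLP trained with plain backpropagation (MSE loss).
W1 = rng.normal(0.0, 0.5, (3, 32)); b1 = np.zeros(32)
W2 = rng.normal(0.0, 0.5, (32, 2)); b2 = np.zeros(2)
lr = 0.05
for _ in range(3000):
    H = np.tanh(X @ W1 + b1)           # forward pass
    P = H @ W2 + b2                    # predicted coordinates
    G = 2.0 * (P - Y) / len(X)         # dLoss/dP
    gW2 = H.T @ G; gb2 = G.sum(0)
    GH = (G @ W2.T) * (1.0 - H**2)     # backprop through tanh
    gW1 = X.T @ GH; gb1 = GH.sum(0)
    W1 -= lr * gW1; b1 -= lr * gb1
    W2 -= lr * gW2; b2 -= lr * gb2

# Mean Euclidean localization error (metres) on the training data.
P = np.tanh(X @ W1 + b1) @ W2 + b2
err = np.linalg.norm(P - Y, axis=1).mean()
```

Training on both clean and jittered TDOAs is a standard noise-augmentation trick: it teaches the network to tolerate the measurement error a real array would produce.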
Patents cited by this patent (15)
Chhetri, Amit S.; Velusamy, Kavitha; Chu, Wai C.; Gopalan, Ramya, Acoustic echo cancellation using blind source separation.