Method and system for detecting an audio event for smart home devices
IPC Classification
Country / Type
United States (US) Patent
Granted
International Patent Classification (IPC, 7th edition)
G10L-025/51
G10L-025/03
G10L-025/72
G10L-025/78
G06K-009/00
H04R-003/00
G08B-021/02
G10L-025/27
G08B-001/08
Application Number
US-0737678
(2015-06-12)
Registration Number
US-9965685
(2018-05-08)
Inventors / Address
Matsuoka, Yoky
Nongpiur, Rajeev Conrad
Dixon, Michael
Applicant / Address
Google LLC
Attorney / Address
Morris & Kamlay LLP
Citation Information
Times cited: 0
Patents cited: 8
Abstract
This application discloses a method implemented by an electronic device to detect a signature event (e.g., a baby cry event) associated with an audio feature (e.g., baby sound). The electronic device obtains a classifier model from a remote server. The classifier model is determined according to predetermined capabilities of the electronic device and ambient sound characteristics of the electronic device, and distinguishes the audio feature from a plurality of alternative features and ambient noises. When the electronic device obtains audio data, it splits the audio data into a plurality of sound components each associated with a respective frequency or frequency band and including a series of time windows. The electronic device further extracts a feature vector from the sound components, classifies the extracted feature vector to obtain a probability value according to the classifier model, and detects the signature event based on the probability value.
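The pipeline the abstract describes (band-splitting, statistical feature extraction, classification, thresholded detection) can be sketched as follows. This is a minimal illustration, not the patented implementation: the sample rate, hop size, and statistics chosen are assumptions; only the 30 ms window length, the partial window overlap, and the example frequency bands come from the dependent claims.

```python
import numpy as np

SAMPLE_RATE = 16000          # Hz, assumed; the patent does not fix a rate
WINDOW_MS = 30               # claim 13: each time window lasts 30 msec
WINDOW = SAMPLE_RATE * WINDOW_MS // 1000
HOP = WINDOW // 2            # claim 12: consecutive windows partially overlap

# Claim 15's example bands: low, intermediate, and high frequency.
BANDS = [(0, 900), (1000, 5000), (6000, SAMPLE_RATE // 2)]

def split_into_band_components(audio):
    """Split audio into per-band energy series across overlapping windows."""
    freqs = np.fft.rfftfreq(WINDOW, d=1.0 / SAMPLE_RATE)
    energies = {i: [] for i in range(len(BANDS))}
    for start in range(0, len(audio) - WINDOW + 1, HOP):
        # Claim 10: an FFT per time window yields the band energies.
        spectrum = np.abs(np.fft.rfft(audio[start:start + WINDOW])) ** 2
        for i, (lo, hi) in enumerate(BANDS):
            mask = (freqs >= lo) & (freqs <= hi)
            energies[i].append(spectrum[mask].sum())
    return {i: np.array(v) for i, v in energies.items()}

def extract_feature_vector(components):
    """Claim 4-style statistics per band, concatenated in a fixed order."""
    feats = []
    for i in sorted(components):          # predetermined element order
        e = components[i]
        feats += [e.max(), e.min(), np.median(e), e.mean(),
                  e.max() - e.min()]
    return np.array(feats)

def detect_signature_event(audio, classify, threshold=0.5):
    """Detect the event when the classifier's probability exceeds a
    threshold; `classify` stands in for the server-provided model."""
    vec = extract_feature_vector(split_into_band_components(audio))
    return classify(vec) > threshold
```

A trained classifier (e.g., an SVM or small neural network, per claim 17) would be plugged in as `classify`; here any callable mapping a feature vector to a probability in [0, 1] works.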
Representative Claims
1. A method for detecting a signature event associated with an audio feature, comprising: on an electronic device having one or more processors and memory storing one or more programs for execution by the one or more processors, automatically and without user intervention: obtaining from a remote server a classifier model that distinguishes an audio feature from a plurality of alternative features and ambient noises, wherein the classifier model is determined by the remote server according to a number of false positives generated by the classifier model, predefined capabilities of the electronic device and ambient sound characteristics of the electronic device, the predefined capabilities of the electronic device comprising one or more of computational capabilities, storage capabilities, and caching capabilities; obtaining audio data associated with an audio signal; splitting the audio data into a plurality of sound components each associated with a respective frequency or frequency band and including a series of time windows; statistically analyzing each of the plurality of sound components across the series of time windows; extracting a feature vector from the plurality of sound components based on the statistical analysis, the feature vector including a plurality of elements that are arranged according to a predetermined order; classifying the extracted feature vector based on the classifier model to obtain a probability value indicating whether the audio signal includes the audio feature within the series of time windows; and detecting the signature event associated with the audio feature based on the probability value and issuing an alert indicating occurrence of the signature event.

2.
The method of claim 1, the feature vector including a first subset of elements associated with energy levels of a first subset of sound components, and a second subset of elements associated with harmonic characteristics of a second subset of sound components, wherein the first and second subsets of elements in the feature vector are arranged according to a predetermined order.

3. The method of claim 2, wherein the first subset of elements are associated with variations of the energy levels for each of the first subset of sound components with respect to the series of time windows.

4. The method of claim 2, wherein the first subset of elements includes one or more of a maximum energy level, a minimum energy level, a median energy level, a mean energy level and a difference between the maximum and minimum energy levels that each of the first subset of sound components has across the series of time windows.

5. The method of claim 2, wherein the first subset of elements includes one or more of a maximum energy variation, a minimum energy variation, a median energy variation, a mean energy variation and a difference between the maximum and minimum energy variations that each of the first subset of sound components has across the series of time windows.

6. The method of claim 2, wherein the harmonic characteristics of the second subset of sound components are associated with a respective harmonic peak for each sound component at each of the series of time windows, and include one or more of an intensity value, a harmonic frequency and a variation of the harmonic frequency of the respective harmonic peak.

7. The method of claim 6, wherein the second subset of elements includes one or more of a maximum value, a minimum value, a median value, a mean value and a difference between the maximum and minimum values of each harmonic characteristic.

8.
The method of claim 2, wherein statistically analyzing each of the plurality of sound components across the series of time windows includes: for each sound component at each of the series of time windows, statistically analyzing the energy levels for the first subset of sound components to obtain a respective energy level.

9. The method of claim 1, wherein statistically analyzing each of the plurality of sound components across the series of time windows further includes: for each sound component at each of the series of time windows: identifying a respective harmonic peak; obtaining the intensity and the frequency of the respective harmonic peak; and obtaining the variation of the frequency of the respective harmonic peak with respect to that of another time window preceding the respective time window.

10. The method of claim 1, wherein splitting the audio data into the plurality of sound components includes: for each respective time window: applying a Fast Fourier Transform (FFT) to obtain a plurality of FFT coefficients associated with the energy levels and the harmonic characteristics for the plurality of sound components each associated with the respective frequency or frequency band.

11. The method of claim 1, wherein the feature vector further includes a plurality of Cepstral coefficients obtained by an FFT.

12. The method of claim 1, wherein at least two of the time windows are consecutive time windows that partially overlap in time.

13. The method of claim 1, wherein each of the series of time windows lasts 30 msec.

14. The method of claim 1, wherein the plurality of sound components includes at least three sound components that are associated respectively with a low frequency band, an intermediate frequency band and a high frequency band.

15. The method of claim 1, wherein each of the plurality of sound components is associated with one or more of the following frequency bands: 0-900 Hz, 1000-5000 Hz, and 6000 Hz and higher.

16.
The method of claim 1, wherein the plurality of sound components includes at least one sound component that is associated with a frequency or frequency band related to a baby cry.

17. The method of claim 1, wherein the classifier model is selected from a group consisting of: a neural network, a linear support vector machine (SVM), a naïve Bayes classifier, and a Gaussian Mixture Model.

18. The method of claim 1, wherein the audio signal further includes an alternative series of time windows that is distinct from the series of time windows and is associated with at least one additional probability value indicating whether the audio signal includes the audio feature within the alternative series of time windows, wherein the signature event associated with the audio feature is detected based on both the probability value associated with the series of time windows and the at least one additional probability value.

19. The method of claim 18, wherein the signature event associated with the audio feature is detected when both the probability value associated with the series of time windows and the at least one additional probability value are larger than a predetermined probability threshold.

20. The method of claim 1, wherein the probability value that indicates whether the audio signal includes the audio feature has a magnitude between 0 and 1.

21. The method of claim 1, wherein the audio feature is associated with a baby sound, and the signature event is associated with an extended baby cry event.

22.
A method for detecting a signature event associated with an audio feature, comprising: on an electronic device having one or more processors and memory storing one or more programs for execution by the one or more processors, automatically and without user intervention: obtaining audio data associated with an audio signal; splitting the audio data into a plurality of sound components each associated with a respective frequency or frequency band and including a series of time windows; statistically analyzing each of the plurality of sound components across the series of time windows; extracting a feature vector from the plurality of sound components based on the statistical analysis, the feature vector including a first subset of elements associated with energy levels of a first subset of sound components, and a second subset of elements associated with harmonic characteristics of a second subset of sound components, wherein the first and second subsets of elements in the feature vector are arranged according to a predetermined order; classifying the extracted feature vector based on a classifier model provided by a remote server to obtain a probability value indicating whether the audio signal includes the audio feature within the series of time windows, wherein the classifier is configured to recognize the audio feature according to feature vectors that include elements arranged according to the predetermined order, wherein the classifier model is determined by the remote server based on a number of false positives generated by the classifier model and the predefined capabilities of the electronic device comprising one or more of computational capabilities, storage capabilities, and caching capabilities; and detecting the signature event associated with the audio feature based on the probability value and issuing an alert indicating occurrence of the signature event.

23.
An electronic device for detecting a signature event associated with an audio feature, the electronic device comprising: one or more processors; and memory storing one or more programs to be executed by the one or more processors, the one or more programs comprising instructions for: obtaining acoustic data associated with an audio signal; splitting the audio data into a plurality of sound components each associated with a respective frequency or frequency band and including a series of time windows; statistically analyzing each of the plurality of sound components across the series of consecutive time windows; extracting a feature vector from the plurality of sound components based on the statistical analysis, the feature vector including a first subset of elements associated with energy levels of a first subset of sound components, and a second subset of elements associated with harmonic characteristics of a second subset of sound components, wherein the first and second subsets of elements in the feature vector are arranged according to a predetermined order; classifying the extracted feature vector based on a classifier model provided by a remote server to obtain a probability value indicating whether the audio signal includes the audio feature within the series of consecutive time windows, wherein the classifier is configured to recognize the audio feature according to feature vectors that include elements arranged according to the predetermined order, wherein the classifier model is determined by the remote server based on a number of false positives generated by the classifier model and the predefined capabilities of the electronic device comprising one or more of computational capabilities, storage capabilities, and caching capabilities; and detecting the signature event associated with the audio feature based on the probability value and issuing an alert indicating occurrence of the signature event.
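Claims 18 and 19 combine probabilities from distinct series of time windows before declaring the event. A minimal sketch of that decision rule (the function name, argument names, and default threshold are illustrative, not taken from the patent):

```python
def event_detected(primary_prob, additional_probs, threshold=0.5):
    """Claim 19's rule: the signature event is detected only when the
    probability for the primary series of time windows AND every
    additional probability (each from a distinct, alternative series
    of time windows) exceed the predetermined threshold."""
    return primary_prob > threshold and all(
        p > threshold for p in additional_probs
    )
```

Requiring agreement across multiple window series is what lets the method distinguish an extended baby cry event (claim 21) from a single transient sound that happens to score highly in one window series.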
Patents cited by this patent (8)
Wen, Xue; Lee, Yongbeom; Lee, Jaewon, Apparatus, method, and medium for detecting and discriminating impact sound.
Besling, Stefan (DE); Thelen, Eric (DE), User model-improvement-data-driven selection and update of user-oriented recognition model of a given type for word recognition at network server.