[특허]Technologies for robust crying detection using temporal characteristics of acoustic features

Technologies for robust crying detection using temporal characteristics of acoustic features 원문보기

IPC분류정보
국가/구분	United States(US) Patent 등록
국제특허분류(IPC7판)	G10L-025/51 G10L-025/24 G10L-025/72 G10L-025/27
출원번호	US-0979108 (2015-12-22)
등록번호	US-9899034 (2018-02-20)
발명자 / 주소	Hofer, Joachim Bocklet, Tobias Stemmer, Georg Pearce, David Czyryba, Sebastian Bauer, Josef G.
출원인 / 주소	Intel IP Corporation
대리인 / 주소	Barnes & Thornburg LLP
인용정보	피인용 횟수 : 0 인용 특허 : 1

초록 ▼

Technologies for identifying sounds are disclosed. A sound identification device may capture sound data, and split the sound data into frames. The sound identification device may then determine an acoustic feature vector for each frame, and determine parameters based on how each acoustic feature varies over the duration of time corresponding to the frames. The sound identification device may then determine if the sound matches a pre-defined sound based on the parameters. In one embodiment, the sound identification device may be a baby monitor, and the pre-defined sound may be a baby crying.

대표청구항 ▼

1. A sound identification device for identifying sounds, the sound identification device comprising: a sound data capture module to acquire sound data;a sound frame determination module to determine a plurality of frames of sound data based on the sound data;a sound identification module to: determine an acoustic feature matrix having two dimensions and comprising a plurality of first-dimension vectors and a plurality of second-dimension vectors, wherein each second-dimension vector of the plurality of second-dimension vectors corresponds to a corresponding frame of the plurality of frames and each first-dimension vector of the plurality of first-dimension vectors comprises an acoustic feature vector associated with the corresponding frame, and wherein each first-dimension vector of the plurality of first-dimension vectors is associated with a different acoustic feature;determine a plurality of temporal parameters for each first-dimension vector of the plurality of first-dimension vectors;determine, based on the pluralities of temporal parameters, whether the sound data corresponds to a pre-defined sound, wherein each of the plurality of temporal parameters is based on how the acoustic feature associated with the corresponding first-dimension vector changes over the time associated with the plurality of frames. 2. The sound identification device of claim 1, wherein each acoustic feature vector of the acoustic feature matrix comprises mel-frequency cepstrum coefficients. 3. The sound identification device of claim 1, wherein the pre-defined sound is a cry of an infant. 4. The sound identification device of claim 1, wherein the pre-defined sound is a cough. 5. The sound identification device of claim 4, further comprising a communication module to provide, based on the sound data corresponding to the cough, an alert to a user of the sound identification device and to provide, based on the sound data corresponding to the cough, a suggestion to the user. 6. The sound identification device of claim 1, wherein to determine the plurality of temporal parameters for each first-dimension vector of the plurality of first-dimension vectors comprises to determine the plurality of temporal parameters for each first-dimension vector of the plurality of first-dimension vectors by performing a Fourier-related transform on the corresponding first-dimension vector. 7. The sound identification device of claim 6, wherein the sound identification module is to select a subset of the plurality of temporal parameters, and wherein to determine, based on the plurality of temporal parameters, whether the sound data corresponds to the pre-defined sound comprises to determine, based on the subset of the plurality of temporal parameters, whether the sound data corresponds to the pre-defined sound. 8. The sound identification device of claim 1, wherein to determine whether the sound data corresponds to the pre-defined sound comprises to determine a probability that the sound data corresponds to the pre-defined sound and to compare the probability to a threshold. 9. The sound identification device of claim 1, further comprising a communication module to transmit a notification to a mobile compute device. 10. One or more non-transitory machine-readable media comprising a plurality of instructions stored thereon that, when executed, cause a sound identification device to: acquire sound data;determine a plurality of frames of sound data based on the sound data;determine an acoustic feature matrix having two dimensions and comprising a plurality of first-dimension vectors and a plurality of second-dimension vectors, wherein each second-dimension vector of the plurality of second-dimension vectors corresponds to a corresponding frame of the plurality of frames and each first-dimension vector of the plurality of first-dimension vectors comprises an acoustic feature vector associated with the corresponding frame, andwherein each first-dimension vector of the plurality of first-dimension vectors is associated with a different acoustic feature;determine a plurality of temporal parameters for each first-dimension vector of the plurality of first-dimension vectors;determine based on the pluralities of temporal parameters, whether the sound data corresponds to a pre-defined sound. 11. The one or more non-transitory computer-readable media of claim 10, wherein each acoustic feature vector of the acoustic feature matrix comprises mel-frequency cepstrum coefficients. 12. The one or more non-transitory computer-readable media of claim 10, wherein the pre-defined sound is a cry of an infant. 13. The one or more non-transitory computer-readable media of claim 10, wherein the pre-defined sound is a cough. 14. The one or more non-transitory computer-readable media of claim 10, wherein to determine the plurality of temporal parameters for each first-dimension vector of the plurality of first-dimension vectors comprises to determine the plurality of temporal parameters for each first-dimension vector of the plurality of first-dimension vectors by performing a Fourier-related transform on the corresponding first-dimension vector. 15. The one or more non-transitory computer-readable media of claim 14, wherein the plurality of instructions further cause the sound identification device to select a subset of the plurality of temporal parameters, wherein to determine, based on the plurality of temporal parameters, whether the sound data corresponds to the pre-defined sound comprises to determine, based on the subset of the plurality of temporal parameters, whether the sound data corresponds to the pre-defined sound. 16. The one or more non-transitory computer-readable media of claim 10, wherein to determine, based on the plurality of temporal parameters, whether the sound data corresponds to the pre-defined sound comprises to determine a probability that the sound data corresponds to the pre-defined sound and to compare the probability to a threshold. 17. The one or more non-transitory computer-readable media of claim 10, wherein the plurality of instructions further cause the sound identification device to transmit a notification to a mobile compute device. 18. A method for sound identification by a sound identification device, the method comprising: acquiring, by the sound identification device, sound data;determining, by the sound identification device, a plurality of frames of sound data based on the sound data;determining, by the sound identification device, an acoustic feature matrix having two dimensions and comprising a plurality of first-dimension vectors and a plurality of second-dimension vectors, wherein each second-dimension vector of the plurality of second-dimension vectors corresponds to a corresponding frame of the plurality of frames and each first-dimension vector of the plurality of first-dimension vectors comprises an acoustic feature vector associated with the corresponding frame, andwherein each first-dimension vector of the plurality of first-dimension vectors is associated with a different acoustic feature;determining, by the sound identification device, a plurality of temporal parameters for each first-dimension vector of the plurality of first-dimension vectors;determining, by the sound identification device, based on the pluralities of temporal parameters, whether the sound data corresponds to a pre-defined sound. 19. The method of claim 18, wherein each acoustic feature vector of the acoustic feature matrix comprises mel-frequency cepstrum coefficients. 20. The method of claim 18, wherein the pre-defined sound is a cry of an infant. 21. The method of claim 18, wherein the pre-defined sound is a cough. 22. The method of claim 18, wherein determining the plurality of temporal parameters for each first-dimension vector of the plurality of first-dimension vectors comprises determining the plurality of temporal parameters for each first-dimension vector of the plurality of first-dimension vectors by performing a Fourier-related transform on the corresponding first-dimension vector. 23. The method of claim 22, further comprising selecting, by the sound identification device, a subset of the plurality of temporal parameters, wherein determining, based on the plurality of temporal parameters, whether the sound data corresponds to the pre-defined sound comprises determining, based on the subset of the plurality of temporal parameters, whether the sound data corresponds to the pre-defined sound. 24. The method of claim 18, wherein determining, based on the plurality of temporal parameters, whether the sound data corresponds to the pre-defined sound comprises determining a probability that the sound data corresponds to the pre-defined sound and comparing the probability to a threshold. 25. The method of claim 18, further comprising transmitting a notification to a mobile compute device.

이 특허에 인용된 특허 (1)

Hsieh Chau-Kai (Chiung Lin TWX), Baby cry recognizer.
상세보기

내보내기 메뉴

내보내기 구분

파일저장
인쇄
메일전송

구성항목

기본정보
상세정보

관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC

저장형식

Text(ASCII format)
Excel format
PIAS분석(.xls)

메일정보

받는사람 (필수): @
보내는사람 (선택): @
제목
내용: KISTI 검색결과 이메일 서비스

안내

총 건의 자료가 검색되었습니다.

다운받으실 자료의 인덱스를 입력하세요. (1-10,000)

검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다.

데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요)

다운로드 파일은 UTF-8 형태로 저장됩니다.
파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오.

Text(ASCII format)
Excel format

AI-Helper ※ AI-Helper는 을 사용합니다.

AI-Helper

안녕하세요, AI-Helper입니다. 좌측 "선택된 텍스트"에서 텍스트를 선택하여 요약, 번역, 용어설명을 실행하세요.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.

IPC	Description
A	생활필수품
A62	인명구조; 소방(사다리 E06C)
A62B	인명구조용의 기구, 장치 또는 방법(특히 의료용에 사용되는 밸브 A61M 39/00; 특히 물에서 쓰이는 인명구조 장치 또는 방법 B63C 9/00; 잠수장비 B63C 11/00; 특히 항공기에 쓰는 것, 예. 낙하산, 투출좌석 B64D; 특히 광산에서 쓰이는 구조장치 E21F 11/00)
A62B-1/08	.. 윈치 또는 풀리에 제동기구가 있는 것

연합인증

Technologies for robust crying detection using temporal characteristics of acoustic features 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

이 특허에 인용된 특허 (1)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 공고번호, 공고일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표출원인, 출원인국적, 출원인주소, 발명자, 발명자E, 발명자코드, 발명자주소, 발명자 우편번호, 발명자국적, 대표IPC, IPC코드, 요약, 미국특허분류, 대리인주소, 대리인코드, 대리인(한글), 대리인(영문), 국제공개일자, 국제공개번호, 국제출원일자, 국제출원번호, 우선권, 우선권주장일, 우선권국가, 우선권출원번호, 원출원일자, 원출원번호, 지정국, Citing Patents, Cited Patents
저장형식	Text(ASCII format) Excel format PIAS분석(.xls)
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

Technologies for robust crying detection using temporal characteristics of acoustic features 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

전체(0) 논문(0) 특허(0) 보고서(0)

전체(0) 논문(0) 특허(0) 보고서(0)

이 특허에 인용된 특허 (1)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트