[논문]Random Forest를 결정로직으로 활용한 로봇의 실시간 음향인식 시스템 개발

송주만; 김창민; 김민욱; 박용진; 이서영; 손정관

doi:10.7746/jkros.2022.17.3.273

Random Forest를 결정로직으로 활용한 로봇의 실시간 음향인식 시스템 개발
A Real-Time Sound Recognition System with a Decision Logic of Random Forest for Robots 원문보기

로봇학회논문지 = The journal of Korea Robotics Society, v.17 no.3, 2022년, pp.273 - 281

송주만 (LG Electronics) , 김창민 (LG Electronics) , 김민욱 (LG Electronics) , 박용진 (LG Electronics) , 이서영 (LG Electronics) , 손정관 (LG Electronics)

Abstract ▼ AI-Helper

In this paper, we propose a robot sound recognition system that detects various sound events. The proposed system is designed to detect various sound events in real-time by using a microphone on a robot. To get real-time performance, we use a VGG11 model which includes several convolutional neural networks with real-time normalization scheme. The VGG11 model is trained on augmented DB through 24 kinds of various environments (12 reverberation times and 2 signal to noise ratios). Additionally, based on random forest algorithm, a decision logic is also designed to generate event signals for robot applications. This logic can be used for specific classes of acoustic events with better performance than just using outputs of network model. With some experimental results, the performance of proposed sound recognition system is shown on real-time device for robots.

주제어

참고문헌 (17)

K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," Computer Vision and Pattern Recognition, 2015, DOI: 10.48550/arXiv.1409.1556.
S. Hershey, S. Chaudhuri, D. P. W. Ellis, J. F. Gemmeke, A. Jansen, R. C. Moore, M. Plakal, D. Platt, R. A. Saurous, B. Seybold, M. Slaney, R. J. Weiss, and K. Wilson, "CNN architectures for large-scale audio classification," 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 2017, DOI: 10.1109/ICASSP.2017.7952132.
S. Suh, S. Park, Y. Jeong, and T. Lee, "Designing Acoustic Scene Classification Models with CNN Variants," DCASE 2020 Challenge, 2020, [Online], https://dcase.community/documents/challenge2020/technical_reports/DCASE2020_Suh_101.pdf
H. Seo, J. Park, and Y. Park, "Acoustic scene classification using various pre-processed features and convolutional neural networks," DCASE 2019 Challenge, 2019, [Online], https://dcase.community/documents/challenge2019/technical_reports/DCASE2019_Seo_72.pdf
T. K. Ho, "Random decision forests," 3rd International Conference on Document Analysis and Recognition, Montreal, QC, Canada, 1995, DOI: 10.1109/ICDAR.1995.598994.
C.-Y. Yu, H. Liu, and Z.-M. Qi, "Sound Event Detection Using Deep Random Forest," DCASE 2017 Challenge, 2017, [Online], https://dcase.community/documents/challenge2017/technical_reports/DCASE2017_Yu_162.pdf
I. McLoughlin, H. Zhang, Z. Xie, Y. Song, and W. Xiao, "Robust sound event classification using deep neural networks," IEEE/ACM Transactions On Audio, Speech, And Language Processing, vol. 3, no. 3, March, 2015, DOI: 10.1109/TASLP.2015.2389618.

상세보기
I. Ozer, Z. Ozer, and O. Findik, "Noise robust sound event classification with convolutional neural network," Neurocomputing, vol. 272, no. 10, pp. 505-512, Jan., 2018, DOI: 10.1016/j.neucom.2017.07.021.

상세보기
K. Wang, J. Zhang, S. Sun, Y. Wang, F. Xiang, and L. Xie, "Investigating generative adversarial networks based speech dereverberation for robust speech recognition," Interspeech 2018, 2018, DOI: 10.21437/Interspeech.2018-1780.
J. Lee, D. Lee, H.-S. Choi, and K. Lee, "Room adaptive conditioning method for sound event classification in reverberant environments," 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 2021, DOI: 10.1109/ICASSP39728.2021.9413929.
NVIDIA, "Jetson AGX Xavier Developer Kit," [Online], https://developer.nvidia.com/embedded/jetson-agx-xavier-developer-kit, Accessed; May 27, 2022.
NVIDIA, "TensorRT," [Online], https://developer.nvidia.com/tensorrt, Accessed: May 27, 2019.
K. He, X. Zhang, S. Ren, and J. Sun, "Delving deep into rectifiers: Surpassing human-level performance on imagenet classification," 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 2015, DOI: 10.1109/ICCV.2015.123.
P. Harar, R. Bammer, A. Breger, M. Dorfler, and Z. Smekal, "Improving Machine Hearing on Limited Data Sets," 2019 11th International Congress On Ultra Modern Telecommunications And Control Systems And Workshops (ICUMT), Dublin, Ireland, 2019, DOI: 10.1109/ICUMT48472.2019.8970740.
D. Morawiec, "sklearn-porter," [Online], https://github.com/nok/sklearn-porter, Accessed: May 27, 2022.
D. P. Kingma and J. Ba, "Adam: A method for stochastic optimization," 3rd International Conference on Learning Representations, 2015, DOI: 10.48550/arXiv.1412.6980.
J. B. Allen and D. A. Berkley, "Image method for efficiently simulating small-room acoustics," Journal Acoustic Society of America, vol. 65, no. 4, 1979, DOI: 10.1121/1.382599.

상세보기

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 논문명, 저널/프로시딩명, 저자 , 발행년, 권, 호, 시작페이지, 끝페이지, 발행기관 관리번호, 논문명, 대등논문명, 저자 , 저널/프로시딩명, 발행기관, 발행년, 발행언어, 권, 호, 시작페이지, 끝페이지, ISBN, ISSN, 주제분야, 키워드, 초록(한글), 초록(영문), 저자(소속기관)
저장형식	Text(ASCII format) Excel format RefWorks Direct Export RIS format (for Reference Manager, ProCite, EndNote), Scholar's Aids, Mendeley
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

Random Forest를 결정로직으로 활용한 로봇의 실시간 음향인식 시스템 개발
A Real-Time Sound Recognition System with a Decision Logic of Random Forest for Robots 원문보기

Abstract ▼ AI-Helper

주제어

참고문헌 (17)

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

원문 URL 링크

오픈액세스(OA) 유형

연관된 기능

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

Random Forest를 결정로직으로 활용한 로봇의 실시간 음향인식 시스템 개발 A Real-Time Sound Recognition System with a Decision Logic of Random Forest for Robots 원문보기

Abstract ▼ AI-Helper

주제어

참고문헌 (17)

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

원문 URL 링크

오픈액세스(OA) 유형

연관된 기능

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

Random Forest를 결정로직으로 활용한 로봇의 실시간 음향인식 시스템 개발
A Real-Time Sound Recognition System with a Decision Logic of Random Forest for Robots 원문보기