[논문]임베디드 GPU에서의 딥러닝 기반 실시간 보행자 탐지 기법

비엔 지아 안; 이철

doi:10.5909/jbe.2019.24.2.357

임베디드 GPU에서의 딥러닝 기반 실시간 보행자 탐지 기법
Deep Learning-Based Real-Time Pedestrian Detection on Embedded GPUs 원문보기

방송공학회논문지 = Journal of broadcast engineering, v.24 no.2, 2019년, pp.357 - 360

비엔 지아 안 (부경대학교 컴퓨터공학과) , 이철 (부경대학교 컴퓨터공학과)

초록
AI-Helper

본 논문은 임베디드 GPU에서 실시간 동작하는 딥 컨볼루션 뉴럴 네트워크(CNN) 기반의 보행자 탐지 기법을 제안한다. 제안하는 기법에서는 먼저 영상 내 보행자 크기에 대한 통계적 분석을 통해서 최적의 컨볼루션 층의 개수를 결정한다. 또한, 본 논문에서는 다중 스케일 CNN 학습 기법을 적용하여 영상 내의 보행자 크기 변화에 강인한 탐지 기법을 개발한다. 컴퓨터 모의실험을 통해 제안하는 알고리즘이 임베디드 GPU에서 실시간 동작하면서도 기존의 기법과 비교하여 평균적으로 높은 정확도를 보임을 확인한다.

Abstract ▼ AI-Helper

We propose an efficient single convolutional neural network (CNN) for pedestrian detection on embedded GPUs. We first determine the optimal number of the convolutional layers and hyper-parameters for a lightweight CNN. Then, we employ a multi-scale approach to make the network robust to the sizes of the pedestrians in images. Experimental results demonstrate that the proposed algorithm is capable of real-time operation, while providing higher detection performance than conventional algorithms.

주제어

표/그림 (4)

표 표 1. 제안하는 네트워크의 파라미터 요약 Table 1. Summary of the proposed network layers
표 표 2. Caltech 데이터셋을 이용한 recall 및 IoU 성능 비교. Table 2. Comparison of the detection performance using recall and IoU on the Caltech test dataset.
표 표 3. YOLOv2, tiny YOLO 및 제안하는 기법의 속도 비교. Table 3. The computation speed in fps of YOLOv2, tiny YOLO, and the proposed algorithm
표 표 4. 모델 크기 및 네트워크 파라미터 수 비교 Table 4. Comparison of the model size and network parameters.

AI 본문요약
AI-Helper

* AI 자동 식별 결과로 적합하지 않은 문장이 있을 수 있으니, 이용에 유의하시기 바랍니다.

제안 방법

Based on the analysis, to increase the detection performance for pedestrians in small and medium scales, the proposed network consists of two stages, i.e., feature extraction and prediction. The feature extraction stage includes convolution layers with max pooling, batch norm [12], and leaky ReLU [13] after each convolution as an activation function.
표 2. Caltech 데이터셋을 이용한 recall 및 IoU 성능 비교.
We proposed a fast single CNN for pedestrian detection on embedded GPUs in this work. First, we determined the optimal number of convolution layers in the network based on the statistical analysis of pedestrians. Then, to process images of different sizes, we employ a multi-scale approach to train the network with different sizes of images.
Otherwise, the values are close to 0. Table 2 shows that, while tiny YOLO and YOLOv2 provide higher detection rates for large scale, the proposed algorithm outperforms tiny YOLO and YOLOv2 for medium and small scales. This is because the proposed network is designed to be shallow to improve the computational efficiency.

대상 데이터

We used the Darknet deep learning library [14] to implement the proposed network. The model is trained on the Caltech dataset [1] with stochastic gradient descent (SGD) optimizer on Nvidia Titan X GPU. We trained the network with the loss function in [9].

데이터처리

표 3. YOLOv2, tiny YOLO 및 제안하는 기법의 속도 비교.

참고문헌 (14)

P. Dollar, C. Wojek, B. Schiele, and P. Perona, “Pedestrian detection: An Evaluation of The State of The Art,” IEEE Transaction Pattern Analysis and Machine Intelligence, Vol. 34, No. 4, pp. 743-761, April 2012.

상세보기
P. Sermanet, K. Kavukcuoglu, S. Chintala, and Y. Lecun, "Pedestrian Detection with Unsupervised Multi-Stage Feature Learning," Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pp. 3626-3633, June 2013.
P. Dollar, R. Appel, S. Belongie, and P. Perona, “Fast Feature Pyramids for Object Detection,” IEEE Transaction Pattern Analysis and Machine Intelligence, Vol. 36, No. 8, pp. 1532-1545, August 2014.

상세보기
X. Wang, T. X. Han, and S. Yan, "An HOG-LBP Human Detector with Partila Occlusion Handling," Proceeding of IEEE Conference on Computer Vision, pp. 32-39, September 2009.
S. Zhang, C. Bauckhage, and A. B. Cremers, "Informed Haar-Like Features Improve Pedestrian Detection," Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pp. 947-954, June 2014.
K. Simonyan and A. Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition," Proceedings of International Conference on Learning and Representations, May 2015.
C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, "Going Deeper with Convolutions," Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-9, June 2015.
K. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, June 2016.
J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You Only Look Once: Unified, Real-Time Object Detection," Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, June 2016.
J. Redmon and A. Farhadi, "YOLO9000: Better, Faster, Stronger," Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263-7271, July 2017.
S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks," Proceeding International Conference on Neural Information Processing Systems, pp. 91-99, December 2015.
S. Ioffe and C. Szegedy, "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift," Proceeding of International Conference on Machine Learning, pp. 448-456, July 2015.
A. L. Maas, A. Y. Hannun, and A. Y. Ng, "Rectifier Nonlinearities Improve Neural Network Acoustic Models," International Conference on Machine Learning Workshop on Deep Learning for Audio, Speech, and Language, 2013.
J. Redmon, "Darknet: Open Source Neural Network in C," http://pjreddie.com/darknet/, 2013-2016.

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 논문명, 저널/프로시딩명, 저자 , 발행년, 권, 호, 시작페이지, 끝페이지, 발행기관 관리번호, 논문명, 대등논문명, 저자 , 저널/프로시딩명, 발행기관, 발행년, 발행언어, 권, 호, 시작페이지, 끝페이지, ISBN, ISSN, 주제분야, 키워드, 초록(한글), 초록(영문), 저자(소속기관)
저장형식	Text(ASCII format) Excel format RefWorks Direct Export RIS format (for Reference Manager, ProCite, EndNote), Scholar's Aids, Mendeley
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증