[논문]Enhanced Machine Learning Algorithms: Deep Learning, Reinforcement Learning, and Q-Learning

Park, Ji Su; Park, Jong Hyuk

doi:10.3745/jips.02.0139

Enhanced Machine Learning Algorithms: Deep Learning, Reinforcement Learning, and Q-Learning 원문보기

Journal of information processing systems, v.16 no.5, 2020년, pp.1001 - 1007

Park, Ji Su (Dept. of Computer Science and Engineering, Jeonju University) , Park, Jong Hyuk (Dept. of Computer Science and Engineering, Seoul National University of Science & Technology (SeoulTech))

Abstract ▼ AI-Helper

In recent years, machine learning algorithms are continuously being used and expanded in various fields, such as facial recognition, signal processing, personal authentication, and stock prediction. In particular, various algorithms, such as deep learning, reinforcement learning, and Q-learning, are continuously being improved. Among these algorithms, the expansion of deep learning is rapidly changing. Nevertheless, machine learning algorithms have not yet been applied in several fields, such as personal authentication technology. This technology is an essential tool in the digital information era, walking recognition technology as promising biometrics, and technology for solving state-space problems. Therefore, algorithm technologies of deep learning, reinforcement learning, and Q-learning, which are typical machine learning algorithms in various fields, such as agricultural technology, personal authentication, wireless network, game, biometric recognition, and image recognition, are being improved and expanded in this paper.

주제어

AI 본문요약
AI-Helper

* AI 자동 식별 결과로 적합하지 않은 문장이 있을 수 있으니, 이용에 유의하시기 바랍니다.

제안 방법

Wen [9] proposed an improved convolutional neural network (CNN) based on Gabor filter to achieve gait recognition. In this paper, a Gabor filter-based walking feature extraction layer was inserted into the existing CNN and used to extract the walking features from the walking silhouette image. A metric learning technique was used to calculate the distance between two walks and a k-nearest neighbor (KNN) classifier was used to classify the walks.
Thus, the input image is calculated as a threshold value for several Boolean maps, and the initial protrusion map is calculated as the sum of the weights of the generated Boolean maps. Morphological operations and Gaussian filters are used in this paper to further refine the initial map to produce the final high-quality extrusion map.
This paper considered the improvement and extension of algorithms in a variety of fields, including agriculture, personal authentication, wireless networks, games, biometrics, and image recognition. Therefore, the present paper comprehensively discusses the following technologies: improved defect detection algorithm, survey of deep learning in agriculture techniques, next-generation personal authentication scheme, rate adaptation with Q-learning, evaluation of operational security, reinforcement learning activation functions, fast leaf recognition and retrieval, intelligent management and service, gait recognition, intelligent on-demand routing protocol, thangka image inpainting algorithm, classroom rollcall system, localization algorithm, hybrid genetic ant colony optimization (ACO) algorithm, visual saliency detection, heuristic and statistical prediction algorithms, object augmentation scheme, and dynamic action space handling method. The present paper mainly aims to rapidly provide popular and trendy research to researchers.
[12] proposed a new face recognition model based on game theory for classroom rollover by using CNNs with outstanding performance in the field of face recognition. This model uses multiple face images as inputs and constructs a student identity list by identifying each face with a confidence score. The optimization goal is then determined by tracking the face with the same identity or low confidence.
[8] introduced an Internet-based intelligent gas valve management and service system design scheme. This paper added a sensor and GPRS (General Packet Radio Service) module to the existing gas valve, and added a networking function between the gas valve and the server while using wireless packet communication technology. The authors suggested that the method proposed in this paper is more convenient and efficient than the existing gas valve management and service projects.
This paper considered the improvement and extension of algorithms in a variety of fields, including agriculture, personal authentication, wireless networks, games, biometrics, and image recognition. Therefore, the present paper comprehensively discusses the following technologies: improved defect detection algorithm, survey of deep learning in agriculture techniques, next-generation personal authentication scheme, rate adaptation with Q-learning, evaluation of operational security, reinforcement learning activation functions, fast leaf recognition and retrieval, intelligent management and service, gait recognition, intelligent on-demand routing protocol, thangka image inpainting algorithm, classroom rollcall system, localization algorithm, hybrid genetic ant colony optimization (ACO) algorithm, visual saliency detection, heuristic and statistical prediction algorithms, object augmentation scheme, and dynamic action space handling method.
Lee [6] proposed an agent using a reinforcement learning algorithm and a neural network to evaluate which activation function can get the best result when an agent learns a game through reinforcement learning in a 2D racing game environment. This paper evaluated the activation functions in the network by switching them together and measured the reward, the output of the advantage function, and the output of the loss function while training and testing them. The experimental result showed that the best activation function for the agent to learn the game and the difference between the best and the worst was 35.
This paper features 18 high-quality articles following a rigorous review process. This paper reviewed the technologies developed in various research fields, such as improved defect detection algorithm, survey of deep learning in agriculture techniques, next-generation personal authentication scheme, rate adaptation with Q-learning, evaluation of operational security, reinforcement learning activation functions, fast leaf recognition and retrieval, intelligent management and service, gait recognition, intelligent on-demand routing protocol, Thangka image inpainting algorithm, classroom roll-call system, localization algorithm, hybrid genetic ACO algorithm, visual saliency detection, heuristic and statistical prediction algorithms, object augmentation scheme, and dynamic action space handling method.
Two algorithms are compared to verify the effectiveness of the proposed algorithm. This paper selected a common denim fabric defect sample image, which includes normal image, weft missing, hole breaking, and oil pollution. The experimental results showed a good average detection rate of common defects of denim is more than 91.
[2] surveyed the development of deep neural-based work efforts in the agriculture domain over the last 5 years and investigated 32 research contributions that apply deep learning techniques to the agriculture domain. This paper surveyed different types of deep neural network architectures in agriculture and the current state-of-the-art methods. They found that deep learning was better in performance than other technologies, which concluded that deep learning will receive more attention and broader applications in future research.
Cho [4] proposed a reinforcement learning agent based on Q-learning to control the data transmission rates of nodes in carrier sensing multiple access with collision avoidance (CSMA/CA)-based wireless networks. This paper used the ns3-gym framework to simulate reinforcement learning and investigated the effects of the parameters of Q-learning on the performance of the reinforcement learning agent. The experimental results showed that the proposed reinforcement learning agent adequately adjusts the modulation and coding scheme (MCS) levels according to the changes in the network, and achieves a high throughput comparable to those of the existing data transmission rate adaptation schemes.
In order to express the evaluation index in the evaluation process and to reduce the uncertainty and ambiguity of the evaluation element, the use of the interval number was suggested. Through this, the objectivity and reliability of the evaluation results were improved, and a new idea was provided for the evaluation of operational safety management of high-speed rail passenger stations.
Plant leaf species recognition using images is difficult because of the large inter-class and small distances between different species. Thus, the authors developed an angular description method of leaf contours by using a new scale generation rule to address such difficulty. The evaluation results show that the proposed method has a faster computation time than the existing method and has high recognition and search accuracy.
However, the technical limitations of the EEG signal interpretation by using dry electrodes were regarded as an obstacle to the use of the EEG-based encryption system. To overcome this problem, this paper used not only EEG signals but also deep learning techniques that is a multinomial classification with a one-hot encoding.

성능/효과

Thus, the authors developed an angular description method of leaf contours by using a new scale generation rule to address such difficulty. The evaluation results show that the proposed method has a faster computation time than the existing method and has high recognition and search accuracy.
This paper evaluated the activation functions in the network by switching them together and measured the reward, the output of the advantage function, and the output of the loss function while training and testing them. The experimental result showed that the best activation function for the agent to learn the game and the difference between the best and the worst was 35.4%.
Moreover, this method solves the state official problem by applying the reinforced deep learning. The experimental results reveal that the state space of the proposed method is reduced by approximately 0.33% of Qlearning. However, the existing Q-learning reduced the cost and time required for learning with similar results.
To compare the IAODV and AODV algorithms, three evaluation indices, packet loss rate, packet transfer rate, and routing overhead were adopted. The experimental results show that IAODV has better performance and better adaptability in Ad Hoc network with frequent node mobility.
This paper selected a common denim fabric defect sample image, which includes normal image, weft missing, hole breaking, and oil pollution. The experimental results showed a good average detection rate of common defects of denim is more than 91.25%.
This paper used the ns3-gym framework to simulate reinforcement learning and investigated the effects of the parameters of Q-learning on the performance of the reinforcement learning agent. The experimental results showed that the proposed reinforcement learning agent adequately adjusts the modulation and coding scheme (MCS) levels according to the changes in the network, and achieves a high throughput comparable to those of the existing data transmission rate adaptation schemes.

참고문헌 (18)

S. Ma, W. Liu, C. You, S. Jia, and Y. Wu, "An improved defect detection algorithm of jean fabric based on optimized Gabor filter," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1008-1014, 2020. https://doi.org/10.3745/JIPS.02.0140

원문보기 상세보기
C. Ren, D. K. Kim, and D. Jeong, "A survey of deep learning in agriculture: techniques and their applications," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1015-1033, 2020. https://doi.org/10.3745/JIPS.04.0187

원문보기 상세보기
G. C. Yang, "Next-generation personal authentication scheme based on EEG signal and deep learning," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1034-1047, 2020. https://doi.org/10.3745/JIPS.03.0147

원문보기 상세보기
S. Cho, "Rate adaptation with Q-learning in CSMA/CA wireless networks," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1048-1063, 2020. https://doi.org/10.3745/JIPS.03.0148

원문보기 상세보기
L. Wang, C. Jin, and C. Xu, "An evaluative study of the operational safety of high-speed railway stations based on IEM-Fuzzy comprehensive assessment theory," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1064-1073, 2020. https://doi.org/10.3745/JIPS.04.0188

원문보기 상세보기
D. Lee, "Comparison of reinforcement learning activation functions to improve the performance of the racing game learning agent," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1074-1082, 2020. https://doi.org/10.3745/JIPS.02.0141

원문보기 상세보기
G. Xu and S. Zhang, "Fast leaf recognition and retrieval using multi-scale angular description method," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1083-1094, 2020. https://doi.org/10.3745/JIPS.02.0142

원문보기 상세보기
X. Wang, F. Wang, Y. Song, G. Zhang, and S. Wang, "Design of intelligent management and service system for gas valve," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1095-1104, 2020. https://doi.org/10.3745/JIPS.04.0189

원문보기 상세보기
J. Wen, "Gait recognition based on GF-CNN and metric learning," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1105-1112, 2020. https://doi.org/10.3745/JIPS.02.0143

원문보기 상세보기
Y. Ye, X. Sun, M. Liu, J. Mi, T. Yan, and L. Ding, "Intelligent on-demand routing protocol for ad hoc network," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1113-1128, 2020. https://doi.org/10.3745/JIPS.03.0149

원문보기 상세보기
F. Yao, "Thangka image inpainting algorithm based on wavelet transform and structural constraints," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1129-1144, 2020. https://doi.org/10.3745/JIPS.02.0144

원문보기 상세보기
J. Zhu, F. Yu, G. Liu, M. Sun, D. Zhao, Q. Geng, and J. Su, "Classroom roll-call system based on ResNet networks," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1145-1157, 2020. https://doi.org/10.3745/JIPS.04.0190

원문보기 상세보기
L. Zhao and K. Zhang, "Localization algorithm for wireless sensor networks based on modified distance estimation," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1158-1168, 2020. https://doi.org/10.3745/JIPS.03.0150

원문보기 상세보기
P. Wang, J. Bai, and J. Meng, "A hybrid genetic ant colony optimization algorithm with an embedded cloud model for continuous optimization," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1169-1182, 2020. https://doi.org/10.3745/JIPS.01.0059

원문보기 상세보기
M. T. N. Truong and S. Kim, "A study on visual saliency detection in infrared images using Boolean map approach," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1183-1195, 2020. https://doi.org/10.3745/JIPS.02.0145

원문보기 상세보기
S. Malik, I. Ullah, D. Kim, and K. Lee, "Heuristic and statistical prediction algorithms survey for smart environments," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1196-1213, 2020. https://doi.org/10.3745/JIPS.04.0191

원문보기 상세보기
S. B. Jang and Y. W. Ko, "An efficient object augmentation scheme for supporting pervasiveness in a mobile augmented reality," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1214-1222, 2020. https://doi.org/10.3745/JIPS.04.0192

원문보기 상세보기
S. Woo and Y. Sung, "Dynamic action space handling method for reinforcement learning models," Journal of Information Processing Systems, vol. 16, no. 5, pp. 1223-1230, 2020. https://doi.org/10.3745/JIPS.02.0146

원문보기 상세보기

표제어: PCR

동의어: Packet Collision Rate

용어 설명 출처 목록 (6)

용어 설명: PCR은 세균 특이성이 있는 primer를 이용하여 적은 수의 세균이 있을지라도 쉽게 검출할 수 있는 유용한 방법이며, 이를 이용하여 구강 내 치면세균막이나 타액에서 직접 세균을 검출할 수 있게 되었다[8].

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 논문명, 저널/프로시딩명, 저자 , 발행년, 권, 호, 시작페이지, 끝페이지, 발행기관 관리번호, 논문명, 대등논문명, 저자 , 저널/프로시딩명, 발행기관, 발행년, 발행언어, 권, 호, 시작페이지, 끝페이지, ISBN, ISSN, 주제분야, 키워드, 초록(한글), 초록(영문), 저자(소속기관)
저장형식	Text(ASCII format) Excel format RefWorks Direct Export RIS format (for Reference Manager, ProCite, EndNote), Scholar's Aids, Mendeley
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증