[논문]Walking Control of a Biped Robot on Static and Rotating Platforms Based on Hybrid Reinforcement Learning

Xi, Ao; Chen, Chao

doi:10.1109/access.2020.3015506

Walking Control of a Biped Robot on Static and Rotating Platforms Based on Hybrid Reinforcement Learning 원문보기

IEEE access : practical research, open solutions, v.8, 2020년, pp.148411 - 148424

Xi, Ao (Laboratory of Motion Generation and Analysis, Faculty of Engineering, Monash University, Clayton, VIC, Australia) , Chen, Chao (Laboratory of Motion Generation and Analysis, Faculty of Engineering, Monash University, Clayton, VIC, Australia)

Abstract ▼ AI-Helper

In this paper, we proposed a novel Hybrid Reinforcement Learning framework to maintain the stability of a biped robot (NAO) while it is walking on static and dynamic platforms. The reinforcement learning framework consists of the Model-based off-line Estimator, the Actor Network Pre-training scheme, and the Mode-free on-line optimizer. We proposed the Hierarchical Gaussian Processes as the Mode-based Estimator to predict a rough model of the system and to obtain the initial control input. Then, the initial control input is employed to pre-train the Actor Network by using the initial control input. Finally, a model-free optimizer based on Deep Deterministic Policy Gradient framework is introduced to fine tune the Actor Network and to generate the best actions. The proposed reinforcement learning framework not only successfully avoids the distribution mismatch problem while combining model-based scheme with model-free structure, but also improves the sample efficiency for the on-line learning procedure. Simulation results show that the proposed Hybrid Reinforcement Learning mechanism enables the NAO robot to maintain balance while walking on static and dynamic platforms. The robustness of the learned controllers in adapting to platforms with different angles, different magnitudes, and different frequencies is tested.

참고문헌 (33)

Xi, Ao, Mudiyanselage, Thushal Wijekoon, Tao, Dacheng, Chen, Chao. Balance control of a biped robot on a rotating platform based on efficient reinforcement learning. IEEE/CAA journal of automatica sinica, vol.6, no.4, 938-951.

상세보기
arXiv 1707 06347 Proximal policy optimization algorithms schulman 2017
Proc 31th Int Conf Mach Learn Trust Region Policy Optimization schulman 2015 1889
arXiv 1612 07139 A survey of deep network solutions for learning control in robotics: From reinforcement to imitation tai 2016
Shouyi Wang, Chaovalitwongse, W., Babuska, R.. Machine Learning Algorithms in Bipedal Robot Control. IEEE transactions on systems, man and cybernetics. a publication of the IEEE Systems, Man, and Cybernetics Society. Part C, Applications and reviews, vol.42, no.5, 728-743.

상세보기
Reinforcement Learning An Introduction sutton 2018
10.1109/ICHR.2010.5686320
10.1109/ROBIO.2017.8324682
Navarro-Guerrero, N., Weber, C., Schroeter, P., Wermter, S.. Real-world reinforcement learning for autonomous humanoid robot docking. Robotics and autonomous systems, vol.60, no.11, 1400-1407.

상세보기
Gil, Cristyan R., Calvo, Hiram, Sossa, Humberto. Learning an Efficient Gait Cycle of a Biped Robot Based on Reinforcement Learning and Artificial Neural Networks. Applied sciences, vol.9, no.3, 502-.

상세보기
Lin, Jin-Ling, Hwang, Kao-Shing, Jiang, Wei-Cheng, Chen, Yu-Jen. Gait Balance and Acceleration of a Biped Robot Based on Q-Learning. IEEE access : practical research, open solutions, vol.4, 2439-2449.

상세보기
Hwang, Kao-Shing, Jiang, Wei-Cheng, Chen, Yu-Jen, Shi, Haobin. Motion Segmentation and Balancing for a Biped Robot's Imitation Learning. IEEE transactions on industrial informatics, vol.13, no.3, 1099-1108.

상세보기
10.1109/IRC.2019.00102
Polydoros, Athanasios S., Nalpantidis, Lazaros. Survey of Model-Based Reinforcement Learning: Applications on Robotics. Journal of intelligent & robotic systems, vol.86, no.2, 153-173.

상세보기
Reinforcement Learning An Introduction sutton 2018
10.1109/ROBOT.1998.680985
Alcaraz-Jiménez, J.J., Herrero-Pérez, D., Martínez-Barberá, H.. Robust feedback control of ZMP-based gait for the humanoid robot Nao. The International journal of robotics research, vol.32, no.9, 1074-1088.

상세보기
VUKOBRATOVIĆ, MIOMIR, BOROVAC, BRANISLAV. ZERO-MOMENT POINT - THIRTY FIVE YEARS OF ITS LIFE. International Journal of Humanoid Robotics : IJHR, vol.1, no.1, 157-173.

상세보기
Ohashi, E., Sato, T., Ohnishi, K.. A Walking Stabilization Method Based on Environmental Modes on Each Foot for Biped Robot. IEEE transactions on industrial electronics : a publication of the IEEE Industrial Electronics Society, vol.56, no.10, 3964-3974.

상세보기
arXiv 1509 02971 Continuous control with deep reinforcement learning lillicrap 2015
Huang, Qiang, Yokoi, K., Kajita, S., Kaneko, K., Arai, H., Koyachi, N., Tanie, K.. Planning walking patterns for a biped robot. IEEE transactions on robotics and automation : A publication of the IEEE Robotics and Automation Society, vol.17, no.3, 280-289.

상세보기
Kim, Jung-Yup, Park, Ill-Woo, Oh, Jun-Ho. Walking Control Algorithm of Biped Humanoid Robot on Uneven and Inclined Floor. Journal of intelligent & robotic systems, vol.48, no.4, 457-484.

상세보기
Yi, Jiang, Zhu, Qiuguo, Xiong, Rong, Wu, Jun. Walking Algorithm of Humanoid Robot on Uneven Terrain with Terrain Estimation. International journal of advanced robotic systems, vol.13, no.1, 35-.

상세보기
Mechanical Systems Design Handbook Modeling Measurement and Control hurmuzlu 2001
10.23919/ACC.2019.8814833
Bipedal Robots Modeling Design and Walking Synthesis chevallereau 2008
Proc 28th Int Conf Mach Learn PILCO: A model-based and data-efficient approach to policy search deisenroth 2011 465
arXiv 1603 00748 Continuous deep Q-Learning with model-based acceleration gu 2016
Deisenroth, Marc Peter, Fox, Dieter, Rasmussen, Carl Edward. Gaussian Processes for Data-Efficient Learning in Robotics and Control. IEEE transactions on pattern analysis and machine intelligence, vol.37, no.2, 408-423.

상세보기
10.1109/ICRA.2018.8463189
arXiv 1802 09081 Temporal difference models: Model-free deep RL for model-based control pong 2018
arXiv 1905 01718 Curious meta-controller: Adaptive alternation between model-based and model-free control in deep reinforcement learning burhan hafez 2019
arXiv 1803 00101 Model-based value estimation for efficient model-free reinforcement learning feinberg 2018

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 논문명, 저널/프로시딩명, 저자 , 발행년, 권, 호, 시작페이지, 끝페이지, 발행기관 관리번호, 논문명, 대등논문명, 저자 , 저널/프로시딩명, 발행기관, 발행년, 발행언어, 권, 호, 시작페이지, 끝페이지, ISBN, ISSN, 주제분야, 키워드, 초록(한글), 초록(영문), 저자(소속기관)
저장형식	Text(ASCII format) Excel format RefWorks Direct Export RIS format (for Reference Manager, ProCite, EndNote), Scholar's Aids, Mendeley
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

Walking Control of a Biped Robot on Static and Rotating Platforms Based on Hybrid Reinforcement Learning 원문보기

Abstract ▼ AI-Helper

참고문헌 (33)

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

원문 URL 링크

오픈액세스(OA) 유형

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트