[논문]심층강화학습 라이브러리 기술동향

신승재; 조충래; 전홍석; 윤승현; 김태연

doi:10.22648/etri.2019.j.340608

심층강화학습 라이브러리 기술동향
A Survey on Deep Reinforcement Learning Libraries 원문보기

전자통신동향분석 = Electronics and telecommunications trends, v.34 no.6, 2019년, pp.87 - 99

신승재 (지능네트워크연구실) , 조충래 (지능네트워크연구실) , 전홍석 (지능네트워크연구실) , 윤승현 (지능네트워크연구실) , 김태연 (지능네트워크연구실)

Abstract ▼ AI-Helper

Reinforcement learning is a type of machine learning paradigm that forces agents to repeat the observation-action-reward process to assess and predict the values of possible future action sequences. This allows the agents to incrementally reinforce the desired behavior for a given observation. Thanks to the recent advancements of deep learning, reinforcement learning has evolved into deep reinforcement learning that introduces promising results in various control and optimization domains, such as games, robotics, autonomous vehicles, computing, industrial control, and so on. In addition to this trend, a number of programming libraries have been developed for importing deep reinforcement learning into a variety of applications. In this article, we briefly review and summarize 10 representative deep reinforcement learning libraries and compare them from a development project perspective.

주제어

참고문헌 (72)

장수영 외, "심층 강화학습 기술 동향," 전자통신동향분석 34권 제4호, 2019. 8, pp. 1-14.

원문보기 상세보기
R.S. Sutton et al., Reinforcement Learning: An Introduction, 2nd edition, Cambridge, MA, USA: MIT Press, 2018.
Y. LeCun et al., "Deep Learning," Nature, vol. 521, May 2015. pp. 436-444.

상세보기
V. Mnih et al., "Playing Atari with Deep Reinforcement Learning," arXiv:1312.5602, Dec. 2013.
A.S. Polydoros et al., "Survey of Model-based Reinforcement Learning: Applications on Robotics," J. Intell. Robotic Syst., vol. 86, no. 2, Mar. 2017, pp. 153-173.

상세보기
J. Hwangbo et al., "Control of a Quadrotor with Reinforcement Learning," arXiv:1707.5110, July 2017.
J. Zhang et al., "Query-Efficient Imitation Learning for End-to-End Autonomous Driving," arXiv:1605.06450, May 2016.
H. Mao et al., "Resource Management with Deep Reinforcement Learning," in Proc. HotNets'16 , Atlanta, CA, USA, Nov. 2016, pp. 50-56.
H. Mao et al., "Neural Adaptive Video Streaming with Pensieve," in Proc. Conf. SIGCOMM'17 , Los Angeles, CA, USA, Aug. 2017, pp. 197-210.
H. Mao et al., "Learning Scheduling Algorithms for Data Processing Clusters," arXiv:1810.01963, Oct. 2018.
김근영 외, "기계학습을 활용한 5G 통신 동향," 전자통신동향분석 31권 제5호, 2016.10, pp. 1-10.

원문보기 상세보기
Y. Deng et al., "Deep Direct Reinforcement Learning for Financial Signal Representation and Trading," IEEE Trans. Neural Netw. Learning Syst., vol. 28, no. 3, March 2017, pp. 653-664.

상세보기
https://www.yna.co.kr/view/AKR20171018151400017?input1179m
V. Mnih et al., "Asynchronous Methods for Deep Reinforcement Learning," in Proc. Int Conf. Machine Learning, New York, USA, June 2016, pp. 1928-1937.
T.P. Lillicrap et al., "Continuous Control with Deep Reinforcement Learning," arXiv:1509:02971, Sept. 2015.
J. Schulman et al., "Trust Region Policy Optimization," in Proc. Int. Conf. Machin Learning, Lille, France, July 2015, pp. 1889-1897.
J. Schulman et al., "Proximal Policy Optimization Algorithms," arXiv:1707.06347, Jul. 2017.
T. Schaul et al., "Prioritized Experience Replay," arXiv: 1511.05952, Nov. 2015.
Z. Wang et al., "Dueling Network Architectures for Deep Reinforcement Learning," in Proc. Int Conf. Machine Learning, New York, USA, June 2016, pp. 1995-2003.
H. Hasselt et al., "Deep Reinforcement Learning with Double Q-Learning," in Proc. AAAI Conf. Artif. Intell., Fhoenix, AZ, USA, Feb. 2016, pp. 2094-2100.
오일석, 패턴인식, 교보문고, 2008년.
https://hunkim.github.io/ml/
I. Goodfellow et al., Deep Learning , MIT Press, 2016.
L. Espeholt et al., "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures," in Proc. Int. Conf. Machine Learning, Stockholm, Sweden, July 2018, pp. 1407-1416.
D. Horgan et al., "Distributed Prioritized Experienced Replay," arXiv:1803.00933, March 2018.
S. Kapturowski et al., "Recurrent Experience Replay in Distributed Reinforcement Learning," in Proc. Int. Conf. Machine Learning , Long Beach, CA, USA, May 2019.
R. Lowe et al., "Multi-Agent Actor Critic for Mixed Cooperative-Competitive Environments," arXiv:1706.02275, July 2017.
T. Rashid et al., "QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning," in Proc. Int. Conf. Machine Learning, Stockholm, Sweden, July 2018, pp. 4295-4304.
S. Li et al., "Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient," in Proc. AAAI Conf. Artif. Intell., Honolulu, HI, USA, Jan. 2019.
https://nervanasystems.github.io/coach/
https://github.com/NervanaSystems/coach
https://www.tensorflow.org/?hlko
https://mxnet.incubator.apache.org/
https://software.intel.com/en-us/frameworks/tensorflow
https://gym.openai.com/
https://github.com/openai/roboschool
https://github.com/Breakend/gym-extensions
https://github.com/bulletphysics/bullet3
http://vizdoom.cs.put.edu.pl/
http://carla.org/
https://github.com/deepmind/pysc2
https://github.com/deepmind/dm_control
https://opensource.google/projects/dopamine
P.S. Castro et al., "Dopamine: A Research Framework for Deep Reinforcement Learning," arXiv:1812.06110, Dec. 2018.
https://github.com/google/dopamine
https://keras.io/
https://github.com/keras-rl/keras-rl
https://github.com/openai/baselines
tps://www.open-mpi.org
https://spinningup.openai.com/en/latest/
https://github.com/openai/spinningup
https://gym.openai.com/envs/#mujoco
https://ray.readthedocs.io/en/latest/rllib.html
E. Liang et al., "RLlib: Abstractions for Distributed Reinforcement Learning," in Proc. Int. Conf. Machine Learning, Stockholm, Sweden, July 2018, pp. 3053-3062.
https://ray.readthedocs.io/en/latest/index.html#
https://github.com/ray-project/ray
https://pytorch.org/
https://stable-baselines.readthedocs.io/en/master/
https://github.com/hill-a/stable-baselines
https://github.com/araffin/rl-baselines-zoo
https://tensorforce.readthedocs.io/en/latest/
https://github.com/tensorforce/tensorforce
https://github.com/mgbellemare/Arcade-Learning-Environment
https://github.com/microsoft/MazeExplorer
https://github.com/openai/retro
https://opensim.stanford.edu
https://github.com/ntasfi/PyGame-Learning-Environment
https://github.com/tensorflow/agents
https://github.com/deepmind/trfl
https://winderresearch.com/a-comparison-of-reinforcementlearning-frameworks-dopamine-rllib-keras-rl-coach-trfltensorforce-coach-and-more/
https://medium.com/@vermashresth/a-primer-on-deepreinforcement-learning-frameworks-part-1-6c9ab6a0f555
https://mc.ai/choosing-a-deep-reinforcement-learning-library/

LOADING...

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 논문명, 저널/프로시딩명, 저자 , 발행년, 권, 호, 시작페이지, 끝페이지, 발행기관 관리번호, 논문명, 대등논문명, 저자 , 저널/프로시딩명, 발행기관, 발행년, 발행언어, 권, 호, 시작페이지, 끝페이지, ISBN, ISSN, 주제분야, 키워드, 초록(한글), 초록(영문), 저자(소속기관)
저장형식	Text(ASCII format) Excel format RefWorks Direct Export RIS format (for Reference Manager, ProCite, EndNote), Scholar's Aids, Mendeley
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

심층강화학습 라이브러리 기술동향
A Survey on Deep Reinforcement Learning Libraries 원문보기

Abstract ▼ AI-Helper

주제어

표/그림 (3)

표/그림 (3)

참고문헌 (72)

이 논문을 인용한 문헌

연구과제 타임라인

관련 콘텐츠

원문 보기

원문 URL 링크

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

심층강화학습 라이브러리 기술동향 A Survey on Deep Reinforcement Learning Libraries 원문보기

Abstract ▼ AI-Helper

주제어

표/그림 (3)

표/그림 (3)

참고문헌 (72)

이 논문을 인용한 문헌

연구과제 타임라인

전체(0) 논문(0) 특허(0) 보고서(0)

전체(0) 논문(0) 특허(0) 보고서(0)

관련 콘텐츠

원문 보기

원문 URL 링크

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

심층강화학습 라이브러리 기술동향
A Survey on Deep Reinforcement Learning Libraries 원문보기