최소 단어 이상 선택하여야 합니다.
최대 10 단어까지만 선택 가능합니다.
다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
NTIS 바로가기Computer methods and programs in biomedicine, v.116 no.2, 2014년, pp.81 - 96
Calinon, S. , Bruno, D. , Malekzadeh, M.S. , Nanayakkara, T. , Caldwell, D.G.
In minimally invasive surgery, tools go through narrow openings and manipulate soft organs to perform surgical tasks. There are limitations in current robot-assisted surgical systems due to the rigidity of robot tools. The aim of the STIFF-FLOP European project is to develop a soft robotic arm to pe...
Billard 1371 2008 Handbook of Robotics Robot programming by demonstration
Robotics and Autonomous Systems Argall 57 5 469 2009 10.1016/j.robot.2008.10.024 A survey of robot learning from demonstration
2007 Imitation and Social Learning in Robots, Humans, and Animals: Behavioural, Social and Communicative Dimensions
Abbeel 2004 Proc. Intl. Conf. on Machine Learning (ICML) Apprenticeship learning via inverse reinforcement learning
Ratliff 2009 Intl. Conf. on Artificial Intelligence and Statistics (AIStats) Inverse optimal heuristic control for imitation learning
Lopes 31 2009 Proc. European Conf. on Machine Learning and Knowledge Discovery in Databases Active learning for reward estimation in inverse reinforcement learning
Kalakrishnan 1331 2013 IEEE Intl. Conf. on Robotics and Automation (ICRA) Learning objective functions for manipulation
IEEE Transactions on Robotics Howard 29 4 2013 10.1109/TRO.2013.2256311 Transferring human impedance behaviour to heterogeneous variable impedance actuators
Journal of the American College of Surgeons Anderson 215 1 107 2012 10.1016/j.jamcollsurg.2012.02.005 The first national examination of outcomes and trends in robotic surgery in the United States
Allard 2007 Medicine Meets Virtual Reality (MMVR) SOFA - an open source framework for medical simulation
Journal of Behavioral Robotics Rueckstiess 1 1 14 2010 Exploring parameter space in reinforcement learning, Paladyn
Peters 262 2007 Proc. IEEE Intl. Symp. on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) Using reward-weighted regression for reinforcement learning of task space control
Stulp 1 2012 Proc. Intl. Conf. on Machine Learning (ICML) Path integral policy improvement with covariance matrix adaptation
Machine Learning Vamplew 84 1-2 51 2010 10.1007/s10994-010-5232-5 Empirical evaluation methods for multiobjective reinforcement learning algorithms
Barrett 41 2008 Proc. Intl. Conf. on Machine Learning (ICML) Learning all optimal policies with multiple criteria
Konidaris 2006 Proc. Intl. Conf. on Simulation of Adaptive Behavior, Animals to Animats 9 An adaptive robot motivational system
Trends in Neurosciences Gurney 27 8 453 2004 10.1016/j.tins.2004.06.003 Computational models of the basal ganglia: from robots to membranes
Ghahramani vol. 6 120 1994 Supervised learning from incomplete data via an EM approach
IEEE Transactions on Systems, Man and Cybernetics, Part B Calinon 37 2 286 2007 10.1109/TSMCB.2006.886952 On learning, representing and generalizing a task in a humanoid robot
Reiley 967 2010 Intl. Conf. of the IEEE Engineering in Medicine and Biology Society (EMBC) Motion generation of robotic surgical tasks: learning from expert demonstrations
Giataganas 2013 Proc. IEEE Intl. Conf. on Robotics and Automation (ICRA) Cooperative in situ microscopic scanning and simultaneous tissue surface reconstruction using a compliant robotic manipulator
McLachlan 2000 Finite Mixture Models
Journal of the Royal Statistical Society B Dempster 39 1 1 1977 Maximum likelihood from incomplete data via the EM algorithm
Annals of Statistics Wu 11 95 1983 10.1214/aos/1176346060 On the convergence properties of the EM algorithm
MacQueen 281 1967 Proc. of the 5th Berkeley Symp. on Mathematical Statistics and Probability Some methods for classification and analysis of multivariate observations
Neural Computation Schaal 10 8 2047 1998 10.1162/089976698300016963 Constructive incremental learning from only local information
Neural Computation Vijayakumar 17 12 2602 2005 10.1162/089976605774320557 Incremental online learning in high dimensions
Nguyen-Tuong 380 2008 IEEE/RSJ Intl. Conf. on Intelligent Robots and Systems (IROS) Local Gaussian process regression for real-time model-based robot control
Grimes 1 2006 Proc. Robotics: Science and Systems (RSS) Dynamic imitation in a humanoid robot through nonparametric probabilistic inference
Image and Vision Computing Tian 31 3 223 2013 10.1016/j.imavis.2012.06.009 Canonical locality preserving latent variable model for discriminative pose inference
Neural Computation Dayan 9 2 271 1997 10.1162/neco.1997.9.2.271 Using expectation-maximization for reinforcement learning
Journal of Machine Learning Research Theodorou 11 3137 2010 A generalized path integral control approach to reinforcement learning
IEEE Robotics and Automation Magazine Kober 17 2 55 2010 10.1109/MRA.2010.936952 Imitation and reinforcement learning: Practical algorithms for motor primitives in robotics
Kroese 2004 The Cross-Entropy Method: A Unified Approach to Combinatorial Optimization, Monte-Carlo Simulation and Machine Learning
Robotics and Autonomous Systems Calinon 61 4 369 2013 10.1016/j.robot.2012.09.012 Compliant skills acquisition and multi-optima policy search with EM-based reinforcement learning
Calinon 1 2012 Proc. Intl. Conf. on Development and Learning (ICDL-EpiRob) Multi-optima exploration with adaptive Gaussian mixture model
Bruno 1374 2013 Proc. AAAI Conference on Artificial Intelligence Bayesian nonparametric multi-optima policy search in reinforcement learning
Neural Computation Xu 8 1 129 1996 10.1162/neco.1996.8.1.129 On convergence properties of the EM algorithm for Gaussian mixtures
Cianchetti 3567 2013 IEEE/RSJ Intl. Conf. on Intelligent Robots and Systems (IROS) L.C.M.A. STIFF-FLOP surgical manipulator: mechanical design and experimental characterization of the single module
Annals of Statistics Schwarz 6 2 461 1978 10.1214/aos/1176344136 Estimating the dimension of a model
Malekzadeh 1746 2013 Proc. IEEE/RSJ Intl. Conf. on Intelligent Robots and Systems (IROS) Skills transfer across dissimilar robots by learning context-dependent rewards
*원문 PDF 파일 및 링크정보가 존재하지 않을 경우 KISTI DDS 시스템에서 제공하는 원문복사서비스를 사용할 수 있습니다.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.