[논문]Human-robot skills transfer interfaces for a flexible surgical robot

Calinon, S.; Bruno, D.; Malekzadeh, M.S.; Nanayakkara, T.; Caldwell, D.G.

doi:10.1016/j.cmpb.2013.12.015

Human-robot skills transfer interfaces for a flexible surgical robot

Computer methods and programs in biomedicine, v.116 no.2, 2014년, pp.81 - 96

Calinon, S. , Bruno, D. , Malekzadeh, M.S. , Nanayakkara, T. , Caldwell, D.G.

Abstract ▼ AI-Helper

In minimally invasive surgery, tools go through narrow openings and manipulate soft organs to perform surgical tasks. There are limitations in current robot-assisted surgical systems due to the rigidity of robot tools. The aim of the STIFF-FLOP European project is to develop a soft robotic arm to perform surgical tasks. The flexibility of the robot allows the surgeon to move within organs to reach remote areas inside the body and perform challenging procedures in laparoscopy. This article addresses the problem of designing learning interfaces enabling the transfer of skills from human demonstration. Robot programming by demonstration encompasses a wide range of learning strategies, from simple mimicking of the demonstrator's actions to the higher level imitation of the underlying intent extracted from the demonstrations. By focusing on this last form, we study the problem of extracting an objective function explaining the demonstrations from an over-specified set of candidate reward functions, and using this information for self-refinement of the skill. In contrast to inverse reinforcement learning strategies that attempt to explain the observations with reward functions defined for the entire task (or a set of pre-defined reward profiles active for different parts of the task), the proposed approach is based on context-dependent reward-weighted learning, where the robot can learn the relevance of candidate objective functions with respect to the current phase of the task or encountered situation. The robot then exploits this information for skills refinement in the policy parameters space. The proposed approach is tested in simulation with a cutting task performed by the STIFF-FLOP flexible robot, using kinesthetic demonstrations from a Barrett WAM manipulator.

주제어

참고문헌 (43)

Billard 1371 2008 Handbook of Robotics Robot programming by demonstration
Robotics and Autonomous Systems Argall 57 5 469 2009 10.1016/j.robot.2008.10.024 A survey of robot learning from demonstration

상세보기
2007 Imitation and Social Learning in Robots, Humans, and Animals: Behavioural, Social and Communicative Dimensions
Abbeel 2004 Proc. Intl. Conf. on Machine Learning (ICML) Apprenticeship learning via inverse reinforcement learning
Ratliff 2009 Intl. Conf. on Artificial Intelligence and Statistics (AIStats) Inverse optimal heuristic control for imitation learning
Lopes 31 2009 Proc. European Conf. on Machine Learning and Knowledge Discovery in Databases Active learning for reward estimation in inverse reinforcement learning
Mombaur 451 2011 10.1007/978-3-642-19457-3_27 Robotics Research, Vol. 70 of Springer Tracts in Advanced Robotics An inverse optimal control approach to human motion modeling
Kalakrishnan 1331 2013 IEEE Intl. Conf. on Robotics and Automation (ICRA) Learning objective functions for manipulation
IEEE Transactions on Robotics Howard 29 4 2013 10.1109/TRO.2013.2256311 Transferring human impedance behaviour to heterogeneous variable impedance actuators

상세보기
Journal of the American College of Surgeons Anderson 215 1 107 2012 10.1016/j.jamcollsurg.2012.02.005 The first national examination of outcomes and trends in robotic surgery in the United States

상세보기
Allard 2007 Medicine Meets Virtual Reality (MMVR) SOFA - an open source framework for medical simulation
Journal of Behavioral Robotics Rueckstiess 1 1 14 2010 Exploring parameter space in reinforcement learning, Paladyn
Peters 262 2007 Proc. IEEE Intl. Symp. on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) Using reward-weighted regression for reinforcement learning of task space control
Stulp 1 2012 Proc. Intl. Conf. on Machine Learning (ICML) Path integral policy improvement with covariance matrix adaptation
Machine Learning Vamplew 84 1-2 51 2010 10.1007/s10994-010-5232-5 Empirical evaluation methods for multiobjective reinforcement learning algorithms

상세보기
Barrett 41 2008 Proc. Intl. Conf. on Machine Learning (ICML) Learning all optimal policies with multiple criteria
Konidaris 2006 Proc. Intl. Conf. on Simulation of Adaptive Behavior, Animals to Animats 9 An adaptive robot motivational system
Trends in Neurosciences Gurney 27 8 453 2004 10.1016/j.tins.2004.06.003 Computational models of the basal ganglia: from robots to membranes

상세보기
Ghahramani vol. 6 120 1994 Supervised learning from incomplete data via an EM approach
IEEE Transactions on Systems, Man and Cybernetics, Part B Calinon 37 2 286 2007 10.1109/TSMCB.2006.886952 On learning, representing and generalizing a task in a humanoid robot

상세보기
Reiley 967 2010 Intl. Conf. of the IEEE Engineering in Medicine and Biology Society (EMBC) Motion generation of robotic surgical tasks: learning from expert demonstrations
Giataganas 2013 Proc. IEEE Intl. Conf. on Robotics and Automation (ICRA) Cooperative in situ microscopic scanning and simultaneous tissue surface reconstruction using a compliant robotic manipulator
McLachlan 2000 Finite Mixture Models
Journal of the Royal Statistical Society B Dempster 39 1 1 1977 Maximum likelihood from incomplete data via the EM algorithm
Annals of Statistics Wu 11 95 1983 10.1214/aos/1176346060 On the convergence properties of the EM algorithm

상세보기
MacQueen 281 1967 Proc. of the 5th Berkeley Symp. on Mathematical Statistics and Probability Some methods for classification and analysis of multivariate observations
Neural Computation Schaal 10 8 2047 1998 10.1162/089976698300016963 Constructive incremental learning from only local information

상세보기
Neural Computation Vijayakumar 17 12 2602 2005 10.1162/089976605774320557 Incremental online learning in high dimensions

상세보기
Nguyen-Tuong 380 2008 IEEE/RSJ Intl. Conf. on Intelligent Robots and Systems (IROS) Local Gaussian process regression for real-time model-based robot control
Grimes 1 2006 Proc. Robotics: Science and Systems (RSS) Dynamic imitation in a humanoid robot through nonparametric probabilistic inference
Image and Vision Computing Tian 31 3 223 2013 10.1016/j.imavis.2012.06.009 Canonical locality preserving latent variable model for discriminative pose inference

상세보기
Neural Computation Dayan 9 2 271 1997 10.1162/neco.1997.9.2.271 Using expectation-maximization for reinforcement learning

상세보기
Journal of Machine Learning Research Theodorou 11 3137 2010 A generalized path integral control approach to reinforcement learning

상세보기
IEEE Robotics and Automation Magazine Kober 17 2 55 2010 10.1109/MRA.2010.936952 Imitation and reinforcement learning: Practical algorithms for motor primitives in robotics

상세보기
Kroese 2004 The Cross-Entropy Method: A Unified Approach to Combinatorial Optimization, Monte-Carlo Simulation and Machine Learning
Hansen 75 2006 10.1007/3-540-32494-1_4 Towards a New Evolutionary Computation, Vol. 192 of Studies in Fuzziness and Soft Computing The CMA evolution strategy: a comparing review
Robotics and Autonomous Systems Calinon 61 4 369 2013 10.1016/j.robot.2012.09.012 Compliant skills acquisition and multi-optima policy search with EM-based reinforcement learning

상세보기
Calinon 1 2012 Proc. Intl. Conf. on Development and Learning (ICDL-EpiRob) Multi-optima exploration with adaptive Gaussian mixture model
Bruno 1374 2013 Proc. AAAI Conference on Artificial Intelligence Bayesian nonparametric multi-optima policy search in reinforcement learning
Neural Computation Xu 8 1 129 1996 10.1162/neco.1996.8.1.129 On convergence properties of the EM algorithm for Gaussian mixtures

상세보기
Cianchetti 3567 2013 IEEE/RSJ Intl. Conf. on Intelligent Robots and Systems (IROS) L.C.M.A. STIFF-FLOP surgical manipulator: mechanical design and experimental characterization of the single module
Annals of Statistics Schwarz 6 2 461 1978 10.1214/aos/1176344136 Estimating the dimension of a model

상세보기
Malekzadeh 1746 2013 Proc. IEEE/RSJ Intl. Conf. on Intelligent Robots and Systems (IROS) Skills transfer across dissimilar robots by learning context-dependent rewards

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 논문명, 저널/프로시딩명, 저자 , 발행년, 권, 호, 시작페이지, 끝페이지, 발행기관 관리번호, 논문명, 대등논문명, 저자 , 저널/프로시딩명, 발행기관, 발행년, 발행언어, 권, 호, 시작페이지, 끝페이지, ISBN, ISSN, 주제분야, 키워드, 초록(한글), 초록(영문), 저자(소속기관)
저장형식	Text(ASCII format) Excel format RefWorks Direct Export RIS format (for Reference Manager, ProCite, EndNote), Scholar's Aids, Mendeley
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

Human-robot skills transfer interfaces for a flexible surgical robot

Abstract ▼ AI-Helper

주제어

참고문헌 (43)

이 논문을 인용한 문헌

관련 콘텐츠

원문 URL 링크

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트