[Patent] Apparatus and methods for haptic training of robots

IPC classification information
Country / Type	United States (US) Patent, granted
International Patent Classification (IPC, 7th edition)	G05B-019/18 B25J-009/16 G05D-001/00 G05D-001/02 G06N-003/00 G06N-003/04 G06N-099/00
Application number	US-0102410 (2013-12-10)
Registration number	US-9597797 (2017-03-21)
Inventors / Address	Ponulak, Filip; Kazemi, Moslem; Laurent, Patryk; Sinyavskiy, Oleg; Izhikevich, Eugene
Applicant / Address	Brain Corporation
Agent / Address	Gazdzinski & Associates, PC
Citation information	Times cited: 1; Patents cited: 77

Abstract

Robotic devices may be trained by a trainer guiding the robot along a target trajectory using physical contact with the robot. The robot may comprise an adaptive controller configured to generate control commands based on one or more of the trainer input, sensory input, and/or performance measure. The trainer may observe task execution by the robot. Responsive to observing a discrepancy between the target behavior and the actual behavior, the trainer may provide a teaching input via a haptic action. The robot may execute the action based on a combination of the internal control signal produced by a learning process of the robot and the training input. The robot may infer the teaching input based on a comparison of a predicted state and actual state of the robot. The robot's learning process may be adjusted in accordance with the teaching input so as to reduce the discrepancy during a subsequent trial.
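
The training loop described in the abstract can be illustrated with a toy sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the scalar context, the linear controller with a single weight `w`, the trivial forward model in which the predicted state equals the command, and the delta-rule update. It only shows how a correction inferred from the gap between predicted and actual state could shrink over successive trials.

```python
def run_trials(target_gain=2.0, lr=0.5, n_trials=6):
    """Toy haptic-training loop; all names and dynamics are hypothetical."""
    w = 0.0          # learning parameter of the assumed linear controller
    context = 1.0    # sensory context, held fixed across trials
    corrections = [] # magnitude of the inferred teaching input per trial
    executed = []    # combined command actually carried out per trial

    for _ in range(n_trials):
        u_internal = w * context               # control signal from the learning process
        predicted_state = u_internal           # assumed forward model: state == command
        actual_state = target_gain * context   # trainer physically moves the robot onto the target
        correction = actual_state - predicted_state  # teaching input inferred from the discrepancy
        executed.append(u_internal + correction)     # internal signal combined with training input
        w += lr * correction * context         # delta-rule update (an assumed learning rule)
        corrections.append(abs(correction))

    return w, corrections, executed


if __name__ == "__main__":
    w, corrections, executed = run_trials()
    print("learned weight:", w)
    print("correction per trial:", corrections)
```

In this sketch the correction halves on every trial, so the trainer has to intervene less and less as the learned weight approaches the target gain.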

Representative claims

1. A processor-implemented method of operating a robot, the method being performed by one or more processors configured to execute computer program instructions, the method comprising: during a first trial, operating, using one or more processors, the robot to perform a task characterized by a target trajectory; and responsive to observing a discrepancy between an actual trajectory and the target trajectory, adjusting the actual trajectory with a robotic trainer with a physical contact with the robot; wherein: the performance of the task is configured based on a learning process configured to determine a first control signal at the first trial; the adjusting of the actual trajectory comprises modifying the learning process so as to determine a second control signal; and during a second trial subsequent to the first trial, the first and the second control signals cooperate to transition the actual trajectory towards the target trajectory.

2. The method of claim 1, wherein: the determination of the first control signal is configured based on a context conveying information about an environment of the robot; the determination of the second control signal is configured based on the context.

3. The method of claim 1, wherein: the learning process is configured based on a teaching signal; and the modifying of the learning process is configured based on the teaching signal being determined based on an evaluation of the adjusting of the actual trajectory.

4. The method of claim 3, wherein: the learning process comprises a supervised learning process characterized by an output; the determination of the first control signal is configured based on a context conveying information about an environment of the robot; and the teaching signal comprises a supervisory input into the learning process configured to convey a target output associated with the context.

5. The method of claim 4, wherein: the teaching signal is configured based on a combination of the first control signal and the second control signal.

6. A non-transitory computer readable medium comprising a plurality of instruction which, when executed by one or more processors, effectuate control of a robotic apparatus by: based on a context, determine a first control signal configured to transition the robotic apparatus to a first state; determine a discrepancy between a current trajectory associated with a current state, and a first trajectory associated with the first state, where the discrepancy between the trajectories comprises a measurable difference; and determine a second control signal based on the discrepancy, the second control signal configured to transition the robotic apparatus to the current state; wherein: the current state is configured based on the first control signal and a state modification, wherein the state modification is applied with a physical contact to the robotic apparatus.

7. The non-transitory computer readable medium of claim 6, wherein: the determination of the first control signal and the determination of the second control signal are configured in accordance with an online learning process; and the online learning process is configured to be updated at a plurality of first time intervals based on the context and a teaching signal.

8. The non-transitory computer readable medium of claim 7, wherein: for a given first interval of the plurality of first time intervals, a change in the context is configured to cause an adaptation of the learning process, the adaptation being configured to produce another version of a control signal; and the context is configured to convey information related to one or more of a sensory input, a robot state, and the teaching signal.

9. The non-transitory computer readable medium of claim 8, wherein: the determination of the first control signal is characterized by an update rate having a plurality of second intervals associated therewith; and a given second interval is configured to match the given first interval.

10. The non-transitory computer readable medium of claim 8, wherein: the context comprises a time history of one or more of the sensory input, the robot state, and the teaching signal determined over one or more of the plurality of first time intervals; the determination of the first control signal is characterized by an update rate having a plurality of second intervals associated therewith; and individual ones of the plurality of the second intervals comprise one or more of the plurality of first time intervals.

11. The non-transitory computer readable medium of claim 8, wherein: individual ones of the current state and the first state are characterized by a state parameter; and the determination of the discrepancy is configured based on an evaluation of a distance measure between the state parameter of the current state and the state parameter of the first state.

12. The non-transitory computer readable medium of claim 11, wherein: the state parameter comprises a vector comprising two or more components configured to characterize one or more of a position, a motion, an orientation, an energy use, and an available energy of the robotic apparatus.

13. The non-transitory computer readable medium of claim 11, wherein: the robotic apparatus comprises a manipulator comprising first and second actuators; the first control signal is configured to actuate at least one of the first and second actuators; the state parameter comprises a vector comprising two or more components configured to characterize a configuration of the manipulator, the configuration being characterized by one or more of an orientation, an actuator torque, a position, and a motion.

14. The non-transitory computer readable medium of claim 13, wherein: the discrepancy is configured based on an intervention from a user, the intervention configured to alter state parameters of the first and the second actuators substantially contemporaneously with one another.

15. The non-transitory computer readable medium of claim 8, wherein: the discrepancy is configured based on the physical contact; individual ones of the current state and the first state are characterized by a state parameter; and the determination of the discrepancy is configured based on a comparison of the first state and the current state.

16. The non-transitory computer readable medium of claim 15, wherein: the determination of the discrepancy is configured based on the measurable difference, comprising a difference measure between the first state and the current state; the first state comprises a predicted state determined in accordance with a forward model of the robotic apparatus, the forward model configured to predict the first state of the robotic apparatus based on the context.

17. The non-transitory computer readable medium of claim 16, wherein: the forward model is characterized by a model parameter; and the determination of the discrepancy at a given time is configured to modify the model parameter so as to enable a determination of a third control signal at a subsequent time, the third control signal being capable of a transition of the robotic apparatus to the current state responsive to an occurrence of the context at the subsequent time.

18. The non-transitory computer readable medium of claim 8, wherein: the determination of the first control signal and the determination of the second control signal are configured in accordance with an online learning process characterized by a learning parameter configured to be updated at a plurality of time intervals; the determination of the discrepancy is configured to be effectuated at a given interval of the plurality of time intervals; and for a subsequent interval of the plurality of time intervals, the learning parameter is configured based on the discrepancy determined for the given interval.

19. The non-transitory computer readable medium of claim 18, wherein: the first control signal is determined during the given interval having the context associated therewith; and an update of the learning parameter based on the discrepancy during the given interval is configured to give raise to the second control signal responsive to an occurrence of the context during the subsequent interval.

20. The non-transitory computer readable medium of claim 18, wherein: the determination of the discrepancy is configured to be effectuated based on an indication provided during the physical contact, the indication being configured to convey information related to an occurrence of the state modification.

21. The non-transitory computer readable medium of claim 20, wherein: the indication comprises one or more visual signal comprising information related to the physical contact.

22. The non-transitory computer readable medium of claim 20, wherein: the robotic apparatus comprises an actuator configured to be controlled using one or more of the first and the second control signals; and the indication comprises a current torque value of the actuator associated with the current state.

23. The non-transitory computer readable medium of claim 20, wherein: the robotic apparatus comprises an actuator configured to be controlled using one or more of the first and the second control signals; and the indication comprises a current position value of the actuator associated with the current state.

24. An adaptive robot apparatus, comprising: a manipulator comprising first and second joints characterized by first and second joint angles, respectively; a sensor module configured to convey information related to one or more of an environment of the adaptive robot apparatus and the manipulator; and an adaptive controller operable in accordance with a learning process configured to: guide the manipulator to a target state in accordance with the information; determine a discrepancy between a target trajectory that corresponds to the target state and a current trajectory that corresponds to a current state; and update the learning process based on the discrepancy; wherein: the discrepancy is configured based on an intervention by a user, the intervention by the user comprising modification of the first and the second joint angles with a physical contact with the manipulator; and the updated learning process comprises determination of a correction signal, the correction signal configured to guide the manipulator to the current state based on an occurrence of the information.

25. The apparatus of claim 24, wherein: the learning process is configured in accordance with a teaching signal; the guiding of the manipulator to the target state is configured based on a control signal determined by the learning process in accordance with the information; and the teaching signal is configured based on the correction signal.

26. The apparatus of claim 25, wherein: the learning process is configured to be updated at one or more time intervals; the information comprises a time history of one or more of a sensor module output, a configuration of the manipulator, the control signal, and the teaching signal determined over one or more time intervals.
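
Several of the claims above (for example claims 11, 12 and 16) describe the discrepancy as a distance measure between a predicted state vector and the current state vector. The sketch below is one possible reading, not the patented method: the Euclidean distance, the two-joint state vector, the `threshold` value, and the function names are all assumptions chosen for illustration.

```python
import math


def discrepancy(predicted_state, actual_state):
    """Euclidean distance between two state vectors (an assumed distance measure)."""
    if len(predicted_state) != len(actual_state):
        raise ValueError("state vectors must have the same number of components")
    return math.sqrt(sum((a - p) ** 2 for p, a in zip(predicted_state, actual_state)))


def intervention_detected(predicted_state, actual_state, threshold=0.05):
    """Flag a physical correction when the discrepancy exceeds a hypothetical threshold."""
    return discrepancy(predicted_state, actual_state) > threshold


if __name__ == "__main__":
    predicted = [0.30, 1.20]  # joint angles (rad) expected by an assumed forward model
    nudged = [0.60, 0.80]     # joint angles after a trainer moves both joints
    print(discrepancy(predicted, nudged))
    print(intervention_detected(predicted, nudged))
```

A threshold is used here only so that small sensor noise is not mistaken for a trainer's push; the claims themselves do not specify one.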

Patents cited by this patent (77)

Ito, Masato; Minamino, Katsuki; Yoshiike, Yukiko; Suzuki, Hirotaka; Kawamoto, Kenta, Apparatus and method for embedding recurrent neural networks into the nodes of a self-organizing map.
Szatmary, Botond; Richert, Micah, Apparatus and methods for activity-based plasticity in a spiking neuron network.
Fisher, Dimitry; Szatmary, Botond; Izhikevich, Eugene, Apparatus and methods for encoding of sensory data using artificial spiking neurons.
Ponulak, Filip; Passot, Jean-Baptiste; Izhikevich, Eugene; Coenen, Olivier, Apparatus and methods for reinforcement-guided supervised learning.
Ponulak, Filip; Sinyavskiy, Oleg, Apparatus and methods for state-dependent learning in spiking neuron networks.
Ulug Mehmet E. (1537 E. Hillsboro Blvd. ; #342 Deerfield FL 33441), Artificial neural network method and architecture.
Ulug Mehmet E. (1537 E. Hillsboro Blvd. #342 Deerfield Beach FL 33441), Artificial neural network method and architecture adaptive signal filtering.
DeYong Mark R. (Las Cruces NM) Findley Randall L. (Austin TX) Eskridge Thomas C. (Las Cruces NM) Fields Christopher A. (Rockville MD), Asynchronous temporal neural processing element.
Merfeld, Daniel M.; Rauch, Steven D.; Wall, III, Conrad; Weinberg, Marc, Balance prosthesis.
Tamayama, Ken; Tomitaka, Tadafusa; Koyanagi, Masakazu; Iijima, Toshiyuki; Hosonuma, Naoyasu, Camera controlling device and method for predicted viewing.
Sigel, Kirk; DeAngelis, Douglas; Ciholas, Mike, Camera with object recognition/data output.
Lee,Dong Seok, Cleaning robot and control method thereof.
Frank D. Francone ; Peter Nordin SE; Wolfgang Banzhaf DE, Computer implemented machine learning method and system including specifically defined introns.
Hattori Shinichi (Nagasaki JPX), Controller for movable robot for moving a work element through a continuous path.
Gienger, Michael, Controlling the trajectory of an effector.
Danko, George, Coordinated joint motion control system.
Bose Chinmoy B. (Green Brook NJ), Differential process controller using artificial neural networks.
Shen, Wei-Min; Salemi, Behnam; Will, Peter, Distributed control and coordination of autonomous agents in a dynamic, reconfigurable system.
Burt Peter J. (Princeton Township ; Mercer County NJ), Dynamic method for recognizing objects and image processing system therefor.
Jim-Shih Liaw ; Theodore W. Berger, Dynamic synapse for signal processing in neural networks.
Liaw, Jim-Shih; Berger, Theodore W., Dynamic synapse for signal processing in neural networks.
Sinyavskiy, Oleg; Polonichko, Vadim, Dynamically reconfigurable stochastic learning apparatus and methods.
Hoffberg Steven M. ; Hoffberg-Borghesani Linda I., Ergonomic man-machine interface incorporating adaptive pattern recognition based control system.
Petersen,Peter, Fuel mixer apparatus and method.
Watanabe,Atsushi; Kosaka,Tetsuya; Nagatsuka,Yoshiharu, Graphic display apparatus for robot system.
Rodriguez Guillermo (La Canada CA) Kreutz Kenneth K. (San Diego CA) Jain Abhinandan (Altadena CA), High level language-based robotic control system.
Herr, Hugh Miller; Casler, Rick; Han, Zhixiu, Hybrid terrain-adaptive lower-extremity systems.
Matsugu, Masakazu; Mori, Katsuhiko; Ishii, Mie; Mitarai, Yusuke, Information processing apparatus, information processing method, pattern recognition apparatus, and pattern recognition method.
Commons, Michael Lamport, Intelligent control with hierarchical stacked neural networks.
Izhikevich, Eugene M.; Szatmary, Botond; Petre, Csaba, Invariant pulse latency coding systems and methods systems and methods.
Ito,Masato, Legged mobile robot and its motion teaching method, and storage medium.
Spoerre Julie K. (Tallahassee FL) Lin Chang-Ching (Tallahassee FL) Wang Hsu-Pin (Tallahassee FL), Machine performance monitoring and fault classification using an exponentially weighted moving average scheme.
Brown Robert A. (8 Foster St. Mattapoisett MA 02739), Machine that learns what it actually does.
Grossberg Stephen (Newton Highlands MA) Kuperstein Michael (Brookline MA), Massively parellel real-time network architectures for robots capable of self-calibrating their operating parameters thr.
Sakaue Shiyuki (Yokohama JPX) Sugimoto Koichi (Hiratsuka JPX) Arai Shinichi (Yokohama JPX), Method and apparatus for controlling a robot hand along a predetermined path.
Rosenberg Louis B. ; Jackson Bernard G., Method and apparatus for controlling human-computer interface systems providing force feedback.
Jeffrey L. Hamilton ; Bret D. Schlussman, Method and apparatus for manipulating and displaying graphical objects in a computer display device.
Cooper David L., Method and apparatus for neural networking using semantic attractor architecture.
Ellingsworth, Martin E., Method and system for classifying documents.
Zhang, Emily-DanDan; Qi, Levy-LiWei; Murphy, Steve, Method and system for optimizing the layout of a robot work cell.
Peltola Tero (Helsinki FIX) Matakselka Jorma (Vantaa FIX) Harju Esa (Espoo FIX) Salovuori Heikki (Helsinki FIX) Keskinen Jukka (Vantaa FIX) Makinen Kari (Helsinki FIX) Roikonen Olli (Espoo FIX), Method for congestion management in a frame relay network and a node in a frame relay network.
Buckley Theresa M. (424 Homer Ave. Palo Alto CA 94301), Method for neural network control of motion using real-time environmental feedback.
Cheng, Tsz; Lei, Hui; Ye, Yiming, Method, system, and apparatus for remote interactions.
Francis, Jr., Anthony G., Methods and systems for autonomous robotic decision making.
Perreirra Noel D. (Allentown PA) Tucker Michael (Bethlehem PA), Methods for refining original robot command signals.
Lee, Haeyeon; Ota, Yasuhiro; Breazeal, Cynthia; Lee, Jun Ki, Methods of robot behavior generation and robots utilizing the same.
Nugent,Alex, Nanotechnology neural network methods and systems.
Linsker,Ralph, Neural networks for prediction and control.
Ahissar, Ehud, Neuronal phase-locked loops.
Thaler Stephen L., Non-algorithmically implemented artificial neural networks and components thereof.
Wilson Charles L. (Darnestown MD) Garris Michael D. (Gaithersburg MD) Wilkinson ; Jr. Robert A. (Hyattstown MD), Object/anti-object neural network segmentation.
Yokono, Jun; Sabe, Kohtaro; Costa, Gabriel; Ohashi, Takeshi, Operational control method, program, and recording media for robot device, and robot device.
Bock Otmar,CAX ; D'Eleuterio Gabriele,CAX ; Lipitkas John,CAX ; Grodski Julius,CAX, Parametric control device.
Eguchi, Toru; Yamada, Akihiro; Kusumi, Naohiro; Sekiai, Takaaki; Fukai, Masayuki; Shimizu, Satoru, Plant control system and thermal power generation plant control system.
Linnell, Jeffrey, Programming of a robotic arm using a motion capture system.
Galkowski Peggy J. ; Glickstein Ira S. ; Stiles Peter N. ; Szczerba Robert J., Real-time mission adaptable route planner.
Gregg Jørgen Suaning AU, Retinal stimulator.
Onoue Kazuhiko,JPX ; Koyama Masataka,JPX ; Abe Kazuhiro,JPX ; Kurosaki Yoshimitsu,JPX, Robot control unit.
Genco Genov ; Zlatko M. Sotirov ; Eugene Bonev, Robot motion compensation system.
Guiremand Harry A., Robotic interface.
Fisher, Dimitry; Izhikevich, Eugene, Robotic learning and evolution apparatus.
Pack, Robert T.; Vale, Marshall J.; Kearns, Justin H., Robotics systems.
Hickman, Ryan; Kuffner, Jr., James J.; Bruce, James R.; Gharpure, Chaitanya; Kohler, Damon; Poursohi, Arshan; Francis, Jr., Anthony G.; Lewis, Thor, Shared robot knowledge base for use with cloud computing system.
Chu, Lonny L., Sound data output and manipulation using haptic feedback.
Richert, Micah; Piekniewski, Filip; Izhikevich, Eugene; Sokol, Sach; Chan, Victor Hokkiu; Levin, Jeffrey Alexander, Spiking network apparatus and method with bimodal spike-timing dependent plasticity.
Miserocchi,Nathan P., System and method for managing graphical data.
Petre, Csaba; Szatmary, Botond; Izhikevich, Eugene M., Systems and methods for invariant pulse latency coding.
Richert, Micah, Temporal winner takes all spiking neuron network sensory processing apparatus and methods.
Yim Mark H. ; Lamping John O. ; Mao Eric W., Touchable user interface using self movable robotic modules.
Blumberg, Bruce; Brooks, Rodney; Buehler, Christopher J.; Deegan, Patrick A.; DiCicco, Matthew; Dye, Noelle; Ens, Gerry; Linder, Natan; Siracusa, Michael; Sussman, Michael; Williamson, Matthew M., Training and operating industrial robots.
Nugent,Alex, Training of a physical neural network.
Kaplan, Frédéric; Oudeyer, Pierre-Yves, Training of autonomous robots.
Ringwall Carl G. (Scotia NY), Velocity sensor and method of producing a velocity signal.
John R. Lapham, Versatile robot control system.
Mochizuki, Yoshiyuki; Naka, Toshiya; Asahara, Shigeo, Virtual space control data receiving apparatus,virtual space control data transmission and reception system, virtual space control data receiving method, and virtual space control data receiving prog.
Spitzer Robert (W. Bloomfield MI) Hassoun Mohamad (Dearborn MI), Waveform analysis apparatus and method using neural network techniques.
Braun, Scott D.; Steinweg, Calvin C.; Weingarth, Christine E.; Knutsen, Neil W., Wireless industrial control user interface.

Patents citing this patent (1)

Ur, Shmuel; Dabija, Vlad; Hirshberg, David, System, method and product for utilizing prediction models of an environment.
