Apparatus and methods for haptic training of robots
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G05B-019/18
B25J-009/16
G05D-001/00
G05D-001/02
출원번호
US-0464104
(2017-03-20)
등록번호
US-9844873
(2017-12-19)
발명자
/ 주소
Ponulak, Filip
Kazemi, Moslem
Laurent, Patryk
Sinyavskiy, Oleg
Izhikevich, Eugene
출원인 / 주소
Brain Corporation
대리인 / 주소
Gazdzinski & Associates, PC
인용정보
피인용 횟수 :
0인용 특허 :
105
초록▼
Robotic devices may be trained by a trainer guiding the robot along a target trajectory using physical contact with the robot. The robot may comprise an adaptive controller configured to generate control commands based on one or more of the trainer input, sensory input, and/or performance measure. T
Robotic devices may be trained by a trainer guiding the robot along a target trajectory using physical contact with the robot. The robot may comprise an adaptive controller configured to generate control commands based on one or more of the trainer input, sensory input, and/or performance measure. The trainer may observe task execution by the robot. Responsive to observing a discrepancy between the target behavior and the actual behavior, the trainer may provide a teaching input via a haptic action. The robot may execute the action based on a combination of the internal control signal produced by a learning process of the robot and the training input. The robot may infer the teaching input based on a comparison of a predicted state and actual state of the robot. The robot's learning process may be adjusted in accordance with the teaching input so as to reduce the discrepancy during a subsequent trial.
대표청구항▼
1. A method of operating a robot comprising: determining a first control signal associated with an environmental context of the robot;causing, based at least in part on the determined first control signal, the robot to perform a task characterized by a target trajectory; andresponsive to observing a
1. A method of operating a robot comprising: determining a first control signal associated with an environmental context of the robot;causing, based at least in part on the determined first control signal, the robot to perform a task characterized by a target trajectory; andresponsive to observing a discrepancy between an actual trajectory and the target trajectory, adjusting the actual trajectory based at least in part on a physical contact by an operator with the robot, wherein adjusting of the actual trajectory comprises determining a second control signal configured to cooperate with the first control signal to cause the robot to transition the actual trajectory towards the target trajectory in a subsequent performance of the task in the environmental context. 2. The method of claim 1, further comprising performing a learning process that associates the first control signal with the environmental context. 3. The method of claim 2, wherein determining the second control signal further comprises modifying the learning process. 4. The method of claim 2, wherein the learning process comprises a supervised learning process configured to adjust based at least in part on a teaching signal comprising a prior motor control output and a motor command correction. 5. The method of claim 2, wherein: the learning process is configured based on a teaching signal; andthe modifying of the learning process is configured based on the teaching signal being determined based on an evaluation of the adjusting of the actual trajectory. 6. The method of claim 1, wherein the physical contact comprises at least one of releasing, moving, manipulating, interacting with, and touching the robot. 7. The method of claim 1, further comprising: determining as an output of a learning process the first and second control signals based at least in part on the environmental context of the robot; andcausing, based on the determined first and second control signal, the robot to perform the task characterized by the target trajectory. 8. The method of claim 1, further comprising determining the environmental context of the robot based at least in part on a sensory input. 9. A robot apparatus, comprising: one or more actuators configured to maneuver the robot apparatus;a sensor module configured to convey information related to an environment of the robot apparatus; andan adaptive controller operable in accordance with a learning process configured to: guide the robot apparatus using the one or more actuators to a target state in accordance with the information;determine a discrepancy between a target trajectory that corresponds to the target state and a current trajectory that corresponds to a current state, the determination based at least in part on a physical contact by a user; andupdate the learning process based on the determined discrepancy, wherein the updated learning process comprises a determination of a correction signal to guide the robot apparatus using the one or more actuators to the target state based on a subsequent conveyance of information by the sensor module. 10. The apparatus of claim 9, wherein the learning process is configured in accordance with a teaching signal. 11. The apparatus of claim 10, wherein the guiding of the robot apparatus using the one or more actuators to the target state is configured based on a control signal determined by the learning process in accordance with the conveyed information, and the teaching signal is configured based on the correction signal. 12. The apparatus of claim 11, wherein the teaching signal is inferred based at least in part on a comparison between the current state and the target state. 13. The apparatus of claim 9, wherein the physical contact comprises at least one of releasing, moving, manipulating, interacting with, and touching the robot apparatus. 14. The apparatus of claim 9, wherein the target state is a first pose of the robot apparatus and the current state is a second pose of the robot apparatus. 15. The apparatus of claim 9, wherein the robot apparatus is an autonomous vehicle. 16. A non-transitory computer readable medium comprising a plurality of instruction which, when executed by one or more processors, effectuate control of a robotic apparatus by: based on a context, determine a first control signal configured to transition the robotic apparatus to a first state;determine a discrepancy between a current trajectory associated with a current state, and a first trajectory associated with the first state, where the discrepancy between the trajectories comprises a measurable difference; anddetermine a second control signal based on the discrepancy, the second control signal configured to transition the robotic apparatus to the current state. 17. The non-transitory computer readable medium of claim 16, wherein the determination of the first control signal and the determination of the second control signal are configured in accordance with a learning process. 18. The non-transitory computer readable medium of claim 17, wherein: a change in the context is configured to cause an adaptation of the learning process, the adaptation being configured to produce another version of a control signal; andthe context is configured to convey information related to one or more of a sensory input, a robot state, and the teaching signal. 19. The non-transitory computer readable medium of claim 16, wherein: individual ones of the current state and the first state are characterized by a state parameter; andthe determination of the discrepancy is configured based on an evaluation of a distance measure between the state parameter of the current state and the state parameter of the first state. 20. The non-transitory computer readable medium of claim 16, wherein the discrepancy is based at least in part on a physical contact by a user.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (105)
Werbos Paul J., 3-brain architecture for an intelligent decision and control system.
Ito, Masato; Minamino, Katsuki; Yoshiike, Yukiko; Suzuki, Hirotaka; Kawamoto, Kenta, Apparatus and method for embedding recurrent neural networks into the nodes of a self-organizing map.
DeYong Mark R. (Las Cruces NM) Findley Randall L. (Austin TX) Eskridge Thomas C. (Las Cruces NM) Fields Christopher A. (Rockville MD), Asynchronous temporal neural processing element.
Kerr Randal H. (Richford NY) Mesnard Robert M. (Endicott NY), Automatic generation of executable computer code which commands another program to perform a task and operator modificat.
Frank D. Francone ; Peter Nordin SE; Wolfgang Banzhaf DE, Computer implemented machine learning method and system including specifically defined introns.
Spoerre Julie K. (Tallahassee FL) Lin Chang-Ching (Tallahassee FL) Wang Hsu-Pin (Tallahassee FL), Machine performance monitoring and fault classification using an exponentially weighted moving average scheme.
Grossberg Stephen (Newton Highlands MA) Kuperstein Michael (Brookline MA), Massively parellel real-time network architectures for robots capable of self-calibrating their operating parameters thr.
Abdallah, Muhammad E; Platt, Robert; Wampler, II, Charles W.; Reiland, Matthew J; Sanders, Adam M, Method and apparatus for automatic control of a humanoid robot.
Sakaue Shiyuki (Yokohama JPX) Sugimoto Koichi (Hiratsuka JPX) Arai Shinichi (Yokohama JPX), Method and apparatus for controlling a robot hand along a predetermined path.
Peltola Tero (Helsinki FIX) Matakselka Jorma (Vantaa FIX) Harju Esa (Espoo FIX) Salovuori Heikki (Helsinki FIX) Keskinen Jukka (Vantaa FIX) Makinen Kari (Helsinki FIX) Roikonen Olli (Espoo FIX), Method for congestion management in a frame relay network and a node in a frame relay network.
Wilson Charles L. (Darnestown MD) Garris Michael D. (Gaithersburg MD) Wilkinson ; Jr. Robert A. (Hyattstown MD), Object/anti-object neural network segmentation.
Yokono, Jun; Sabe, Kohtaro; Costa, Gabriel; Ohashi, Takeshi, Operational control method, program, and recording media for robot device, and robot device.
Eguchi, Toru; Yamada, Akihiro; Kusumi, Naohiro; Sekiai, Takaaki; Fukai, Masayuki; Shimizu, Satoru, Plant control system and thermal power generation plant control system.
Coenen, Olivier, Proportional-integral-derivative controller effecting expansion kernels comprising a plurality of spiking neurons associated with a plurality of receptive fields.
Hickman, Ryan; Kuffner, Jr., James J.; Bruce, James R.; Gharpure, Chaitanya; Kohler, Damon; Poursohi, Arshan; Francis, Jr., Anthony G.; Lewis, Thor, Shared robot knowledge base for use with cloud computing system.
Shaffer Gary K. (Butler PA) Whittaker William L. (Pittsburgh PA) West Jay H. (Pittsburgh PA) Clow Richard G. (Phoenix AZ) Singh Sanjiv J. (Pittsburgh PA) Lay Norman K. (Peoria IL) Devier Lonnie J. (P, System and method for detecting obstacles in the path of a vehicle.
Blumberg, Bruce; Brooks, Rodney; Buehler, Christopher J.; Deegan, Patrick A.; DiCicco, Matthew; Dye, Noelle; Ens, Gerry; Linder, Natan; Siracusa, Michael; Sussman, Michael; Williamson, Matthew M., Training and operating industrial robots.
Mochizuki, Yoshiyuki; Naka, Toshiya; Asahara, Shigeo, Virtual space control data receiving apparatus,virtual space control data transmission and reception system, virtual space control data receiving method, and virtual space control data receiving prog.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.