Apparatus and methods for control of robot actions based on corrective user inputs
Country/Type: United States (US) Patent, Granted
International Patent Classification (IPC, 7th ed.): G05B-019/04; B25J-009/16; G06N-099/00; G06N-003/00; G05D-001/00
Application number: US-0174858 (2016-06-06)
Registration number: US-9789605 (2017-10-17)
Inventors: Meier, Philip; Passot, Jean-Baptiste; Ibarz Gabardos, Borja; Laurent, Patryk; Sinyavskiy, Oleg; O'Connor, Peter; Izhikevich, Eugene
Applicant: BRAIN CORPORATION
Agent: Gazdzinski & Associates, PC
Citation information: Cited by 4 patents; cites 101 patents
Abstract
Robots have the capacity to perform a broad range of useful tasks, such as factory automation, cleaning, delivery, assistive care, environmental monitoring and entertainment. Enabling a robot to perform a new task in a new environment typically requires a large amount of new software to be written, often by a team of experts. It would be valuable if future technology could empower people, who may have limited or no understanding of software coding, to train robots to perform custom tasks. Some implementations of the present invention provide methods and systems that respond to users' corrective commands to generate and refine a policy for determining appropriate actions based on sensor-data input. Upon completion of learning, the system can generate control commands by deriving them from the sensory data. Using the learned control policy, the robot can behave autonomously.
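The abstract describes a policy, parameterized over sensory-data inputs, that is refined whenever a user issues a corrective command. A minimal sketch of that loop follows, assuming a simple linear policy trained by gradient steps on the error between the performed action and the user-specified corrective action; all names, dimensions, and the learning rate are illustrative assumptions, not taken from the patent.

```python
import numpy as np

class LinearPolicy:
    """Policy: a plurality of parameters mapping sensory-data inputs to actions."""

    def __init__(self, n_inputs, n_actions, lr=0.05):
        self.W = np.zeros((n_actions, n_inputs))  # policy parameters
        self.lr = lr                              # assumed learning rate

    def act(self, x):
        # Determine a robot action from a sensory-data input.
        return self.W @ x

    def correct(self, x, action, target):
        # Modify the policy based on the corrective command (target)
        # and the sensory-data input: one gradient step on squared error.
        err = action - target
        self.W -= self.lr * np.outer(err, x)

policy = LinearPolicy(n_inputs=3, n_actions=2)
x = np.array([1.0, 0.5, -0.2])     # first sensory-data input
target = np.array([0.4, -0.1])     # corrective robot action from the user
for _ in range(200):               # repeated corrections refine the policy
    policy.correct(x, policy.act(x), target)
print(np.allclose(policy.act(x), target, atol=1e-3))  # True
```

After training, the policy reproduces the corrected action from the same sensory input without further user input, matching the abstract's claim that the robot can then behave autonomously under the learned control policy.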
Representative Claims
1. A method for performing robot actions by a robot, the method comprising: defining a policy comprising a plurality of parameters for determining robot actions based at least in part on sensory-data inputs, the defining of the policy comprising mapping the sensory-data inputs to robot actions; receiving a first sensory-data input from a sensor; performing a first robot action at a first action time, wherein the first robot action is determined based at least in part on the first sensory-data input and application of the policy; determining that a user input was received at an input time corresponding to the first action time, wherein a corrective command at least partially derived from the user input specifies a corrective robot action for physical performance, the user input being indicative of at least partial dissatisfaction with the first robot action; and modifying the policy based on the corrective command and the first sensory-data input.

2. The method of claim 1, further comprising determining a second robot action at a second action time, wherein the second robot action is based at least in part on the modified policy and a second sensory-data input from the sensor.

3. The method of claim 1, wherein the modifying of the policy further comprises using a learning model.

4. The method of claim 1, wherein the at least partial dissatisfaction includes a discrepancy between a target robot action and the first robot action.

5. The method of claim 1, wherein the modifying of the policy comprises changing parameters relating sensory-data inputs to actuator responses that correspond to robot actions.

6. The method of claim 3, wherein the learning model includes updating parameters based on a gradient of error determined at least in part by a difference between the first robot action and a second robot action specified by a combination of the corrective command and the policy.

7. The method of claim 1, further comprising determining a first context-variable value for a context variable, wherein the first context-variable value is determined from the first sensory-data input and the policy is further determined based at least in part on the context variable.

8. A robot, comprising: an actuator configured to perform robot actions for robotic tasks; a sensor configured to detect an environmental context of the robot and generate sensory-data inputs; and a processor apparatus configured to: define a policy comprising a plurality of parameters configured to determine robot actions based at least in part on sensory-data inputs; determine that a user input was received at an input time corresponding to a performance of a first robot action corresponding to a detection of a first sensory-data input; generate a corrective command at least partially derived from the user input, the user input being indicative of at least partial dissatisfaction with the first robot action; and modify the policy based on the corrective command and the first sensory-data input.

9. The robot of claim 8, further comprising a user interface configured to receive the user input.

10. The robot of claim 8, wherein the at least partial dissatisfaction includes a discrepancy between a target robot action and the first robot action.

11. The robot of claim 8, wherein the modification of the policy further comprises usage of a learning model.

12. The robot of claim 8, wherein the processor apparatus is further configured to determine a first context-variable value for a context variable, wherein the first context-variable value is determined from the first sensory-data input and the policy is further determined based at least in part on the context variable.

13. The robot of claim 8, wherein the sensor is at least one of a light sensor, a motion detector, an inertial measurement unit, and a global positioning system receiver.

14. A non-transitory computer-readable storage medium having a plurality of instructions stored thereon, the instructions being executable by a processing apparatus to operate a robot, the instructions configured to, when executed by the processing apparatus, cause the processing apparatus to: define a policy comprising a plurality of parameters configured to determine robot actions based at least in part on sensory-data inputs, wherein the policy maps the sensory-data inputs to robot actions; receive a first sensory-data input; perform a first robot action at a first action time, wherein the first action is determined based at least in part on the first sensory-data input and application of the policy; determine that a user input was received at an input time corresponding to the first action time, wherein a corrective command at least partially derived from the user input specifies a corrective robot action for physical performance, the user input being indicative of at least partial dissatisfaction with the first robot action; and modify the policy based on the corrective command and the first sensory-data input.

15. The non-transitory computer-readable storage medium of claim 14, wherein the instructions are further configured to, when executed by the processing apparatus, determine a second robot action at a second action time, wherein the second robot action is based at least in part on the modified policy and a second sensory-data input.

16. The non-transitory computer-readable storage medium of claim 14, wherein the modification of the policy further comprises usage of a learning model.

17. The non-transitory computer-readable storage medium of claim 14, wherein the instructions are further configured to, when executed by the processing apparatus, assess whether the modified policy comprises an improvement over the policy prior to modification, the improvement being determined by a threshold being exceeded.

18. The non-transitory computer-readable storage medium of claim 14, wherein the modification of the policy comprises changing parameters relating sensory-data inputs to actuator responses that correspond to robot actions.

19. The non-transitory computer-readable storage medium of claim 16, wherein the learning model includes updating parameters based on a gradient of error determined at least in part by a difference between the first robot action and a second robot action specified by a combination of the corrective command and the policy.

20. The non-transitory computer-readable storage medium of claim 14, wherein the instructions are further configured to, when executed by the processing apparatus, determine a first context-variable value for a context variable, wherein the first context-variable value is determined from the first sensory-data input and the policy is further determined based at least in part on the context variable.

21. The non-transitory computer-readable storage medium of claim 14, wherein the at least partial dissatisfaction includes a discrepancy between a target robot action and the first robot action.
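Claims 1, 8, and 14 each hinge on determining that a user input was received at an input time "corresponding to" an action time, so that the input can be interpreted as a correction of that particular action. A hedged sketch of one way such a check could be implemented follows; the window length and function names are illustrative assumptions, as the claims do not specify how correspondence is established.

```python
CORRECTION_WINDOW_S = 2.0  # assumed correspondence window, not given in the claims

def is_corrective(input_time, action_time, window=CORRECTION_WINDOW_S):
    """True if the user-input time corresponds to the action time,
    i.e. the input arrived within `window` seconds after the action."""
    return 0.0 <= input_time - action_time <= window

print(is_corrective(10.5, 10.0))  # True: arrives 0.5 s after the action
print(is_corrective(14.0, 10.0))  # False: too late to count as a correction
```

Under this reading, only inputs inside the window trigger the policy-modification step; later inputs would instead be associated with whatever action was most recently performed.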
Patents cited by this patent (101)
Werbos Paul J., 3-brain architecture for an intelligent decision and control system.
Ito, Masato; Minamino, Katsuki; Yoshiike, Yukiko; Suzuki, Hirotaka; Kawamoto, Kenta, Apparatus and method for embedding recurrent neural networks into the nodes of a self-organizing map.
DeYong Mark R. (Las Cruces NM) Findley Randall L. (Austin TX) Eskridge Thomas C. (Las Cruces NM) Fields Christopher A. (Rockville MD), Asynchronous temporal neural processing element.
Kerr Randal H. (Richford NY) Mesnard Robert M. (Endicott NY), Automatic generation of executable computer code which commands another program to perform a task and operator modificat.
Frank D. Francone ; Peter Nordin SE; Wolfgang Banzhaf DE, Computer implemented machine learning method and system including specifically defined introns.
Spoerre Julie K. (Tallahassee FL) Lin Chang-Ching (Tallahassee FL) Wang Hsu-Pin (Tallahassee FL), Machine performance monitoring and fault classification using an exponentially weighted moving average scheme.
Grossberg Stephen (Newton Highlands MA) Kuperstein Michael (Brookline MA), Massively parallel real-time network architectures for robots capable of self-calibrating their operating parameters thr.
Abdallah, Muhammad E; Platt, Robert; Wampler, II, Charles W.; Reiland, Matthew J; Sanders, Adam M, Method and apparatus for automatic control of a humanoid robot.
Sakaue Shiyuki (Yokohama JPX) Sugimoto Koichi (Hiratsuka JPX) Arai Shinichi (Yokohama JPX), Method and apparatus for controlling a robot hand along a predetermined path.
Peltola Tero (Helsinki FIX) Matakselka Jorma (Vantaa FIX) Harju Esa (Espoo FIX) Salovuori Heikki (Helsinki FIX) Keskinen Jukka (Vantaa FIX) Makinen Kari (Helsinki FIX) Roikonen Olli (Espoo FIX), Method for congestion management in a frame relay network and a node in a frame relay network.
Wilson Charles L. (Darnestown MD) Garris Michael D. (Gaithersburg MD) Wilkinson ; Jr. Robert A. (Hyattstown MD), Object/anti-object neural network segmentation.
Yokono, Jun; Sabe, Kohtaro; Costa, Gabriel; Ohashi, Takeshi, Operational control method, program, and recording media for robot device, and robot device.
Eguchi, Toru; Yamada, Akihiro; Kusumi, Naohiro; Sekiai, Takaaki; Fukai, Masayuki; Shimizu, Satoru, Plant control system and thermal power generation plant control system.
Coenen, Olivier, Proportional-integral-derivative controller effecting expansion kernels comprising a plurality of spiking neurons associated with a plurality of receptive fields.
Hickman, Ryan; Kuffner, Jr., James J.; Bruce, James R.; Gharpure, Chaitanya; Kohler, Damon; Poursohi, Arshan; Francis, Jr., Anthony G.; Lewis, Thor, Shared robot knowledge base for use with cloud computing system.
Shaffer Gary K. (Butler PA) Whittaker William L. (Pittsburgh PA) West Jay H. (Pittsburgh PA) Clow Richard G. (Phoenix AZ) Singh Sanjiv J. (Pittsburgh PA) Lay Norman K. (Peoria IL) Devier Lonnie J. (P, System and method for detecting obstacles in the path of a vehicle.
Blumberg, Bruce; Brooks, Rodney; Buehler, Christopher J.; Deegan, Patrick A.; DiCicco, Matthew; Dye, Noelle; Ens, Gerry; Linder, Natan; Siracusa, Michael; Sussman, Michael; Williamson, Matthew M., Training and operating industrial robots.
Mochizuki, Yoshiyuki; Naka, Toshiya; Asahara, Shigeo, Virtual space control data receiving apparatus,virtual space control data transmission and reception system, virtual space control data receiving method, and virtual space control data receiving prog.