Apparatus and methods for training and operating of robotic devices. Robotic controller may comprise a predictor apparatus configured to generate motor control output. The predictor may be operable in accordance with a learning process based on a teaching signal comprising the control output. An ada
Apparatus and methods for training and operating of robotic devices. Robotic controller may comprise a predictor apparatus configured to generate motor control output. The predictor may be operable in accordance with a learning process based on a teaching signal comprising the control output. An adaptive controller block may provide control output that may be combined with the predicted control output. The predictor learning process may be configured to learn the combined control signal. Predictor training may comprise a plurality of trials. During initial trial, the control output may be capable of causing a robot to perform a task. During intermediate trials, individual contributions from the controller block and the predictor may be inadequate for the task. Upon learning, the control knowledge may be transferred to the predictor so as to enable task execution in absence of subsequent inputs from the controller. Control output and/or predictor output may comprise multi-channel signals.
대표청구항▼
1. A method of predicting a plant control output by an adaptive computerized predictor apparatus, the method comprising: configuring the adaptive computerized predictor apparatus, using one or more processors, to operate in accordance with a learning process based on a teaching input;at a first time
1. A method of predicting a plant control output by an adaptive computerized predictor apparatus, the method comprising: configuring the adaptive computerized predictor apparatus, using one or more processors, to operate in accordance with a learning process based on a teaching input;at a first time instance, based on a sensory context, causing the adaptive computerized predictor apparatus to generate a first predicted plant control output;configuring the adaptive computerized predictor apparatus, using the one or more processors, to provide the first predicted plant control output as the teaching input into the learning process;at a second time instance subsequent to the first time instance, causing the adaptive computerized predictor apparatus to generate a second predicted plant control output based on the sensory context and the teaching input;adjusting the learning process based on a difference between the second predicted plant control output and the teaching input; andcausing a plant to perform an action consistent with the sensory context and the adjusted learning process, the action being in accordance with a defined target trajectory;wherein the teaching input comprises the first predicted plant control output. 2. The method of claim 1, wherein: the plant comprises a robotic platform;responsive to the sensory context comprising a representation of an obstacle, the action comprises executing an avoidance maneuver by the robotic platform; andresponsive to the sensory context comprising a representation of an target, the action comprises executing an approach maneuver by the robotic platform. 3. The method of claim 1, further comprising basing the sensory context on sensory input into the learning process, a portion of the sensory input comprising a video sensor data and another portion of the sensory input comprising the predicted plant control output. 4. The method of claim 1, wherein the learning process comprises adapting a network of computerized neurons in accordance with the sensory context and the teaching input. 5. The method of claim 4, further comprising: interconnecting multiple ones of the network of computerized neurons with connections each characterized by a connection efficacy; andthe adapting the network of computerized neurons comprises adapting the connection efficacy of individual connections based on the sensory context and the teaching input. 6. The method of claim 4, wherein the adapting the network of computerized neurons is based on an error measure between the predicted plant control output and the teaching input. 7. The method of claim 4, further comprising: communicatively coupling individual ones of the network of computerized neurons to connections characterized by a connection efficacy;wherein individual ones of the network of computerized neurons are configured to be operable in accordance with a dynamic process characterized by an excitability parameter;basing the sensory context on input spikes delivered to the adaptive computerized predictor apparatus via a portion of the connections, individual ones of the input spikes being capable of increasing the excitability parameter associated with individual ones of the network of computerized neurons; andwherein the teaching input comprises one or more teaching spikes configured to adjust an efficacy of the portion of the connections, an efficacy adjustment for a given connection providing a portion of the input spikes to a given computerized neuron being configured based on one or more events occurring within a plasticity window, the one or more events including one or more of: (i) a presence of one or more input spikes on the given connection, (ii) an output being generated by the given computerized neuron, or (iii) an occurrence of at least one of the one or more teaching spikes. 8. The method of claim 7, wherein, responsive to the sensory context being updated at 40 ms intervals, selecting a plasticity window duration from a range between 5 ms and 200 ms, inclusive. 9. The method of claim 4, wherein: a portion of the network of computerized neurons comprise spiking neurons, individual ones of the spiking neurons being characterized by a neuron excitability parameter configured to determine an output spike generation by a corresponding spiking neuron;multiple ones of the spiking neurons is interconnected by connections characterized by second connection efficacy, individual ones of the connections being configured to communicate one or more spikes from one or more pre-synaptic spiking neurons to one or more post-synaptic spiking neurons; anda portion of the sensory context is based on sensory input into the learning process comprising the one or more spikes. 10. The method of claim 9, wherein the causing the adaptive computerized predictor apparatus to generate the first or the second predicted plant control output comprises generating one or more other spikes based on spike outputs by individual ones of the spiking neurons. 11. The method of claim 9, further comprising communicating the sensory input via a portion of the connections via one or more other spikes. 12. The method of claim 9, wherein: the predicted plant control output comprises a continuous signal configured based on one or more spike outputs by the individual ones of the spiking neurons; andthe continuous signal includes one or more of an analog signal, a polyadic signal with arity greater than one, an n-bit long discrete signal with n-bits greater than one, a real-valued signal, or a digital representation of a real-valued signal. 13. The method of claim 9, wherein: the sensory input comprises a continuous signal; andthe continuous signal includes one or more of an analog signal, a polyadic signal with arity greater than 1, an n-bit long discrete signal with n-bits greater than 1, or a real-valued signal, or a digital representation of an analog signal. 14. The method of claim 9, wherein the sensory input comprises a binary signal characterized by a single bit. 15. The method of claim 1, further comprising: updating the learning process at regular time intervals; andadapting a network of computerized neurons based on an error measure between (i) the predicted plant control output generated at a given time instance and (ii) the teaching signal determined at another given time instance prior to the given time instance, the given time instance and the another time instance being separated by a duration equal to one of the regular time intervals. 16. The method of claim 1, wherein: the plant comprises at least one motor comprising a motor interface; andthe predicted plant control output comprises one or more instructions to the motor interface configured to actuate the at least one motor. 17. The method of claim 1, wherein the learning process comprises a supervised learning process. 18. The method of claim 1, wherein the predicted plant control output comprises a vector of outputs comprising two or more output components. 19. The method of claim 1, wherein the learning process is configured based on one or more of a look up table, a hash-table, a data base table configured to store a relationship between a given sensory context, a given teaching input associated with the given sensory context, and the predicted plant control output generated for the given sensory context during learning. 20. A non-transitory computer-readable medium comprising instructions stored thereon, the instructions being configured to, when executed by a processing apparatus, cause the processing apparatus to: initialize a learning process based on a teaching input;generate a first predicted plant control output at a first time instance, based on a sensory context;provide the first predicted plant control output as the teaching input into the learning process;generate a second predicted plant control output based on the sensory context and the teaching input at a second time instance subsequent to the first time instance; andadjust the learning process based on an error measure between the predicted plant control output and the teaching input;wherein the predicted plant control output is configured to cause the plant to perform an action consistent with the sensory context and the learning process;wherein the learning process is configured based on a network of computerized neurons configured to be adapted in accordance with the sensory context and the teaching input; andwherein the adaptation of the network of computerized neurons is based on the error measure between the predicted plant control output and the teaching input. 21. The non-transitory computer-readable medium of claim 20, wherein the instructions are further configured to, when executed, cause the processing apparatus to: receive the sensory context via one or more sensors; andprovide the teaching input via a controller.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (97)
Werbos Paul J., 3-brain architecture for an intelligent decision and control system.
Ito, Masato; Minamino, Katsuki; Yoshiike, Yukiko; Suzuki, Hirotaka; Kawamoto, Kenta, Apparatus and method for embedding recurrent neural networks into the nodes of a self-organizing map.
DeYong Mark R. (Las Cruces NM) Findley Randall L. (Austin TX) Eskridge Thomas C. (Las Cruces NM) Fields Christopher A. (Rockville MD), Asynchronous temporal neural processing element.
Kerr Randal H. (Richford NY) Mesnard Robert M. (Endicott NY), Automatic generation of executable computer code which commands another program to perform a task and operator modificat.
Frank D. Francone ; Peter Nordin SE; Wolfgang Banzhaf DE, Computer implemented machine learning method and system including specifically defined introns.
Spoerre Julie K. (Tallahassee FL) Lin Chang-Ching (Tallahassee FL) Wang Hsu-Pin (Tallahassee FL), Machine performance monitoring and fault classification using an exponentially weighted moving average scheme.
Grossberg Stephen (Newton Highlands MA) Kuperstein Michael (Brookline MA), Massively parellel real-time network architectures for robots capable of self-calibrating their operating parameters thr.
Abdallah, Muhammad E; Platt, Robert; Wampler, II, Charles W.; Reiland, Matthew J; Sanders, Adam M, Method and apparatus for automatic control of a humanoid robot.
Sakaue Shiyuki (Yokohama JPX) Sugimoto Koichi (Hiratsuka JPX) Arai Shinichi (Yokohama JPX), Method and apparatus for controlling a robot hand along a predetermined path.
Peltola Tero (Helsinki FIX) Matakselka Jorma (Vantaa FIX) Harju Esa (Espoo FIX) Salovuori Heikki (Helsinki FIX) Keskinen Jukka (Vantaa FIX) Makinen Kari (Helsinki FIX) Roikonen Olli (Espoo FIX), Method for congestion management in a frame relay network and a node in a frame relay network.
Wilson Charles L. (Darnestown MD) Garris Michael D. (Gaithersburg MD) Wilkinson ; Jr. Robert A. (Hyattstown MD), Object/anti-object neural network segmentation.
Yokono, Jun; Sabe, Kohtaro; Costa, Gabriel; Ohashi, Takeshi, Operational control method, program, and recording media for robot device, and robot device.
Eguchi, Toru; Yamada, Akihiro; Kusumi, Naohiro; Sekiai, Takaaki; Fukai, Masayuki; Shimizu, Satoru, Plant control system and thermal power generation plant control system.
Coenen, Olivier, Proportional-integral-derivative controller effecting expansion kernels comprising a plurality of spiking neurons associated with a plurality of receptive fields.
Hickman, Ryan; Kuffner, Jr., James J.; Bruce, James R.; Gharpure, Chaitanya; Kohler, Damon; Poursohi, Arshan; Francis, Jr., Anthony G.; Lewis, Thor, Shared robot knowledge base for use with cloud computing system.
Shaffer Gary K. (Butler PA) Whittaker William L. (Pittsburgh PA) West Jay H. (Pittsburgh PA) Clow Richard G. (Phoenix AZ) Singh Sanjiv J. (Pittsburgh PA) Lay Norman K. (Peoria IL) Devier Lonnie J. (P, System and method for detecting obstacles in the path of a vehicle.
Blumberg, Bruce; Brooks, Rodney; Buehler, Christopher J.; Deegan, Patrick A.; DiCicco, Matthew; Dye, Noelle; Ens, Gerry; Linder, Natan; Siracusa, Michael; Sussman, Michael; Williamson, Matthew M., Training and operating industrial robots.
Mochizuki, Yoshiyuki; Naka, Toshiya; Asahara, Shigeo, Virtual space control data receiving apparatus,virtual space control data transmission and reception system, virtual space control data receiving method, and virtual space control data receiving prog.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.