Robotic control arbitration apparatus and methods
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
B25J-009/16
G05D-001/00
G06N-003/00
G06N-003/04
출원번호
US-0040498
(2013-09-27)
등록번호
US-9296101
(2016-03-29)
발명자
/ 주소
Laurent, Patryk
Passot, Jean-Baptiste
Izhikevich, Eugene
출원인 / 주소
Brain Corporation
대리인 / 주소
Gazdzinski & Associates PC
인용정보
피인용 횟수 :
0인용 특허 :
45
초록▼
Apparatus and methods for arbitration of control signals for robotic devices. A robotic device may comprise an adaptive controller comprising a plurality of predictors configured to provide multiple predicted control signals based on one or more of the teaching input, sensory input, and/or performan
Apparatus and methods for arbitration of control signals for robotic devices. A robotic device may comprise an adaptive controller comprising a plurality of predictors configured to provide multiple predicted control signals based on one or more of the teaching input, sensory input, and/or performance. The predicted control signals may be configured to cause two or more actions that may be in conflict with one another and/or utilize a shared resource. An arbitrator may be employed to select one of the actions. The selection process may utilize a WTA, reinforcement, and/or supervisory mechanisms in order to inhibit one or more predicted signals. The arbitrator output may comprise target state information that may be provided to the predictor block. Prior to arbitration, the predicted control signals may be combined with inputs provided by an external control entity in order to reduce learning time.
대표청구항▼
1. A controller apparatus of a robot, comprising: one or more processors configured to execute computer program modules, the computer program modules comprising:a first predictor module operable in accordance with a first learning process configured to provide a first and a second predicted action i
1. A controller apparatus of a robot, comprising: one or more processors configured to execute computer program modules, the computer program modules comprising:a first predictor module operable in accordance with a first learning process configured to provide a first and a second predicted action indication configured based on a context, the first predicted action indication being configured to cause the robot to execute a first action, the second predicted action indication being configured to cause the robot to execute a second action;a second predictor module operable in accordance with a second learning process configured to provide a predicted control output configured based on the context and an input signal, the predicted control output being configured to cause the robot to execute one of the first action or the second action; andan arbitrator module configured to provide one of the first and the second predicted action indication as the input signal to the second predictor module;wherein, based on a determination by the arbitrator module that there is no conflict between the first action and the second action, execute the first action and the second action contemporaneously. 2. The apparatus of claim 1, wherein: the context comprises (i) a sensory signal conveying information associated with an object within an environment of the robot and (ii) the input signal;the first action is configured to reduce a distance measure between the robot and the object; andthe second action is configured to increase the distance measure. 3. The apparatus of claim 1, wherein the context comprises a sensory signal conveying information associated with an object within an environment of the robot;at least one of the first or the second learning process comprises a supervised learning process configured based on a teaching signal, the supervised learning process being updated at a plurality of time instances; andthe teaching signal is determined based on a measure between (i) a combination of a control signal and at least one of the first and the second predicted action indications determined at first time instance of the plurality of time instances; and (i) the at least one of the first and the second predicted action indications determined at second time instance of the plurality of time instances, the second time instance occurring prior to the first time instance; andthe control signal is configured in accordance with the information associated with the object within the environment of the robot, the control signal being provided to the controller apparatus so as to communicate data related to individual ones of the first and the second actions. 4. The apparatus of Claim 1, wherein the first learning process comprises a first supervised learning process configured based on a first teaching signal, the first supervised learning process being updated at a plurality of time instances; andthe first teaching signal is determined based on a first performance measure between (i) a combination of a control signal and at least one of the first or the second predicted action indications determined at first time instance of the plurality of time instances; and (ii) the at least one of the first or the second predicted action indications determined at second time instance of the plurality of time instances, the second time instance occurring prior to the first time instance;the second learning process comprises a second supervised learning process configured based on a second teaching signal, the second supervised learning process being updated at individual ones of the plurality of time instances; andthe first teaching signal is determined based on (i) the predicted control output determined by the second supervised learning process at the first time instance; and (ii) the predicted control output determined by the second supervised learning process at the second time instance. 5. The apparatus of claim 3, wherein: the information associated with the objection within the environment of the robot is configured to convey one or more of object size, shape, color, type, or location. 6. The apparatus of claim 3, wherein the provision of the first or the second action indication as the input signal to the second predictor module is configured in accordance with a winner-takes-all mechanism configured to preclude one of the first or the second action from entering the input signal. 7. The apparatus of claim 1, wherein: at least one of the first or the second learning process comprises a supervised learning process configured based on a teaching signal, the supervised learning process being updated at a plurality of time intervals; andthe supervised learning process update is based on an error measure between (i) at least one of the first or the second predicted action indications determined at a given time instance and (ii) the teaching signal determined at another time instance prior to the given time instance, the given time instance and the other time instance separated by one of the time intervals. 8. The apparatus of claim 1, wherein the computer program modules further comprise: a combiner module configured to provide (1) a first combined output based on a first combination of a control input and the first predicted action indication, and (2) a second combined output based on a second combination of the control input and the second predicted action indication;wherein the first combined output is configured to cause execution if the first action and the second combined output is configured to cause execution of the second action. 9. The apparatus of claim 8, wherein: the context comprises a sensory signal conveying information associated with an object within an environment of the robot; andthe first action comprises an object approach action;the second action comprises an object avoid action;the first combination includes a transform function configured to combine the control input and the first predicted action indication via one or more operations including an additive operation; andthe second combination includes a transform function configured to combine the control input and the second predicted action indication via one or more operations comprises an additive operation. 10. The apparatus of claim 8, wherein: the first predictor module is further configured to provide the first predicted action indication and the second predicted action indication based on a teaching signal and the context in accordance with an adaptive process configured to reduce an error measure between one or both of (1) the first combined output and the first predicted action indication or (2) the second combined output and the second predicted action indication;the teaching signal comprises at least one of the first predicted action indication or the second predicted action indication;the teaching signal is provided responsive to the first combination and/or the second combination being determined at first time instance;the first predictor module is further configured to provide the first predicted action indication and the second predicted action indication at a second time instance subsequent to the first time instance; andthe first predictor module is further configured to receive the context at the first time instance and at the second time instance. 11. The apparatus of claim 1, wherein: based on execution of the first action being in conflict with contemporaneous execution of the second action, the arbitrator module is configured to relay one and only one of the first predicted action indication or the second predicted action indication to the input signal. 12. The apparatus of claim 11, wherein: the relaying is configured based on gating at least one of the first predicted action indication or the second predicted action indication by the arbitrator module, the gating preventing execution of the respective action; andthe gating is configured based on the application of a zero gain to at least one of the first predicted action indication or the second predicted action indication. 13. An apparatus configured to arbitrate multiple predicted outputs, the apparatus comprising: one or more processors configured to: provide an inhibitory signal based on (i) a determination of an object characteristic within an input and (ii) a parameter indicative of an association between the object characteristic and one or more actions; andbased on the inhibitory signal, prevent a relay of all but one of the multiple predicted outputs to the input;wherein: individual ones of the multiple predicted outputs are configured based on the object characteristic and are adapted to cause contemporaneous execution of the one or more actions and a single action associated with the but one of the multiple predicted outputs unless the inhibitory signal is present; andthe parameter is determined prior to the arbitration of the multiple predicted outputs based on one or more training trials characterized by presence of the object characteristic and the but one of the multiple predicted outputs. 14. The apparatus of claim 13, further comprising: a first spiking neuron and a second spiking neuron, each of the first spiking neuron and the second spiking neuron configured to convey a spike from an input connection to an output connection; andan inhibitory connection configured to communicate inhibitory efficacy from the first spiking neuron to the second spiking neuron;wherein: the multiple predicted outputs comprise a first spiking signal configured to convey a first spike and a second spiking signal configured to convey a second spike;the inhibitory signal comprises an inhibitory spike communicated from the first spiking neuron to the second spiking neuron via the inhibitory connection; andthe inhibitory spike is configured to provide the inhibitory efficacy to the second spiking neuron thereby reducing probability of the conveyance of spikes within the second spiking signal from an input connection to the output connection of the second spiking neuron. 15. The apparatus of claim 14, wherein the first spiking neuron is adapted to provide the inhibitory spike using a winner-takes-all mechanism configured based on an occurrence of the first spike within the first spiking signal prior to the second spike within the second spiking signal. 16. The apparatus of claim 14, wherein the first spiking neuron is adapted to provide the inhibitory spike using a winner-takes-all mechanism configured based on an occurrence of the first spike within the first spiking signal prior to the second spike within the second spiking signal. 17. The apparatus of claim 14, wherein the first spiking neuron is configured to provide the inhibitory spike using a winner-takes-all mechanism configured based on an efficacy associated with the first spike within the first spiking signal exceeding an efficacy associated with the second spike within the second spiking signal. 18. The apparatus of claim 14, wherein: the second spiking neuron is configured to communicate another inhibitory spike to the first spiking neuron based on an occurrence of another spike within the second spiking signal prior to another spike within the first spiking signal; andthe communication of the another inhibitory spike is configured to provide another inhibitory efficacy to the first spiking neuron thereby reducing probability of the conveyance of spikes within the first spiking signal from the input connection to the output connection of the first spiking neuron. 19. The apparatus of claim 14, wherein: the first spiking signal is configured to cause execution of the single action;the second spiking signal is configured to cause execution of an action of the one or more actions;the one or more actions comprise an object approach action and an object avoid action;the apparatus is configured to be embodied in a mobile robotic platform;the object characteristic comprises a position measure of the object relative the robotic platform; andindividual ones of the multiple predicted outputs are configured based on a learning process configured to be adapted based on the input and a teaching signal indicative of a target action. 20. A controller apparatus of a robot to perform target approach and obstacle avoidance based on visual sensory input, comprising: a first predictor module configured to effectuate target approach of a target by the robot via: reception of sensory input of a first representation of the target;provision of a target approach predicted action to an arbitrator module;a second predictor module configured to effectuate obstacle avoidance of an obstacle by the robot via: reception of sensory input of a second representation of the obstacle;provision of an obstacle avoidance predicted action to the arbitrator module; andthe arbitrator module configured to: receive the target approach predicted action from the first predictor module;receive the obstacle avoidance predicted action from the second predictor module;provision of a preferred target state position within a view frame to the first predictor module;provision of a preferred obstacle state position within the view frame to the second predictor modules;determine whether the target approach predicted action and the obstacle avoidance predicted action are in conflict;when the target approach predicted action and the obstacle avoidance predicted action are in conflict, suppress provision of one of the preferred target state position and the preferred obstacle state position; andwhen the target approach predicted action and the obstacle avoidance predicted action are not in conflict, allow contemporaneous execution of the target approach predicted action and the obstacle avoidance predicted action.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (45)
Ito, Masato; Minamino, Katsuki; Yoshiike, Yukiko; Suzuki, Hirotaka; Kawamoto, Kenta, Apparatus and method for embedding recurrent neural networks into the nodes of a self-organizing map.
DeYong Mark R. (Las Cruces NM) Findley Randall L. (Austin TX) Eskridge Thomas C. (Las Cruces NM) Fields Christopher A. (Rockville MD), Asynchronous temporal neural processing element.
Grossberg Stephen (Newton Highlands MA) Kuperstein Michael (Brookline MA), Massively parellel real-time network architectures for robots capable of self-calibrating their operating parameters thr.
Peltola Tero (Helsinki FIX) Matakselka Jorma (Vantaa FIX) Harju Esa (Espoo FIX) Salovuori Heikki (Helsinki FIX) Keskinen Jukka (Vantaa FIX) Makinen Kari (Helsinki FIX) Roikonen Olli (Espoo FIX), Method for congestion management in a frame relay network and a node in a frame relay network.
Wilson Charles L. (Darnestown MD) Garris Michael D. (Gaithersburg MD) Wilkinson ; Jr. Robert A. (Hyattstown MD), Object/anti-object neural network segmentation.
Yokono, Jun; Sabe, Kohtaro; Costa, Gabriel; Ohashi, Takeshi, Operational control method, program, and recording media for robot device, and robot device.
Hickman, Ryan; Kuffner, Jr., James J.; Bruce, James R.; Gharpure, Chaitanya; Kohler, Damon; Poursohi, Arshan; Francis, Jr., Anthony G.; Lewis, Thor, Shared robot knowledge base for use with cloud computing system.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.