IPIQ

G05B2219/33033

DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

20220388159 · 2022-12-08 ·

Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

Deep reinforcement learning for robotic manipulation

11400587 · 2022-08-02 ·

Google Llc

MODEL-FREE CONTROL OF DYNAMICAL SYSTEMS WITH DEEP RESERVOIR COMPUTING

20220100153 · 2022-03-31 ·

A technique is provided for control of a nonlinear dynamical system to an arbitrary trajectory. The technique does not require any knowledge of the dynamical system, and thus is completely model-free. When applied to a chaotic system, it is capable of stabilizing unstable periodic orbits (UPOs) and unstable steady states (USSs), controlling orbits that require non-vanishing control signal, synchronization to other chaotic systems, and so on. It is based on a type of recurrent neural network (RNN) known as a reservoir computer (RC), which, as shown, is capable of directly learning how to control an unknown system. Precise control to a desired trajectory is obtained by iteratively adding layers to the controller, forming a deep recurrent neural network.

Deep reinforcement learning for robotic manipulation

11897133 · 2024-02-13 ·

Google Llc

DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

20240131695 · 2024-04-25 ·

DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

20190232488 · 2019-08-01 ·

Deep reinforcement learning for robotic manipulation

12240113 · 2025-03-04 ·

Google Llc

Training DNN by updating an array using a chopper

12293281 · 2025-05-06 ·

International Business Machines Corporation

Tayfun Gokmen

Embodiments disclosed herein include a method of training a DNN. A processor initializes an element of an A matrix. The element may include a resistive processing unit. A processor determines incremental weight updates by updating the element with activation values and error values from a weight matrix multiplied by a chopper value. A processor reads an update voltage from the element. A processor determines a chopper product by multiplying the update voltage by the chopper value. A processor directs storage of an element of a hidden matrix. The element of the hidden matrix may include a summation of continuous iterations of the chopper product. A processor updates a corresponding element of a weight matrix based on the element of the hidden matrix reaching a threshold state.

DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

20250153352 · 2025-05-15 ·

Patent classifications

G05B2219/33033