G05B2219/33056

CONTROLLER AND CONTROL METHOD
20190317472 · 2019-10-17 · ·

A controller that performs, for one or more axes of a machine, position control by taking friction into consideration includes a data acquisition unit acquiring at least a position command and a position feedback and a compensation torque estimation unit estimating coefficients of a friction model used when the position control is performed, on the basis of a position deviation which is a difference between the position command and the position feedback.

Method and apparatus for reinforcement machine learning

A method and an apparatus for exclusive reinforcement learning are provided, comprising: collecting information of states of an environment through the communication interface and performing a statistical analysis on the states using the collected information; determining a first state value of a first state among the states in a training phase and a second state value of a second state among the states in an inference phase based on analysis results of the statistical analysis; performing reinforcement learning by using one reinforcement learning unit of a plurality of reinforcement learning unit which performs reinforcement learnings from different perspectives according to the first state value; and selecting one of actions determined by the plurality of reinforcement learning unit based on the second state value and applying selected action to the environment.

ROBOTIC DEMONSTRATION LEARNING

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using simulated local demonstration data for robotic demonstration learning. One of the methods includes receiving perceptual data of a workcell of a robot to be configured to execute a task according to a skill template, wherein the skill template specifies one or more subtasks required to perform the skill, wherein at least one of the subtasks is a demonstration subtask that relies on learning visual characteristics of the workcell. A virtual model is generated of a portion of the workcell. A training system generates simulated local demonstration data from the virtual model of the portion of the workcell and tunes a base control policy for the demonstration subtask using the simulated local demonstration data generated from the virtual model of the portion of the workcell.

METHOD FOR PERSONALIZED SOCIAL ROBOT INTERACTION

A system and method for personalization of an interaction between a social robot and a user. The method includes: collecting, by at least one of a plurality of sensors, a first set of sensory data indicating a current state of the user; determining, based on the first set of sensory data, whether at least one predetermined goal to be achieved by the user has not yet been achieved; if not, selecting a first operational schema from a plurality of operational schemas having a highest priority score; performing the first operational schema; collecting, by one or more of the plurality of sensors, a second set of sensory data from the user, wherein the second set of sensory data is indicative of a user's response to the first operational schema; and determining an achievement status of the at least one predetermined goal based on the user's response.

APPARATUS AND METHODS FOR ONLINE TRAINING OF ROBOTS
20190184556 · 2019-06-20 ·

Robotic devices may be trained by a user guiding the robot along a target trajectory using a correction signal. A robotic device may comprise an adaptive controller configured to generate control commands based on one or more of the trainer input, sensory input, and/or performance measure. Training may comprise a plurality of trials. During an initial portion of a trial, the trainer may observe robot's operation and refrain from providing the training input to the robot. Upon observing a discrepancy between the target behavior and the actual behavior during the initial trial portion, the trainer may provide a teaching input (e.g., a correction signal) configured to affect robot's trajectory during subsequent trials. Upon completing a sufficient number of trials, the robot may be capable of navigating the trajectory in absence of the training input.

Machine tool for generating optimum acceleration/deceleration
10261497 · 2019-04-16 · ·

A machine tool includes an operation evaluation section that evaluates an operation thereof and a machine learning device that performs the machine learning of a movement amount of an axis thereof. The machine learning device calculates a reward based on state data including the output of the operation evaluation section, performs the machine learning of the determination of the movement amount of the axis, and determines the movement amount of the axis based on a machine learning result and outputs the determined movement amount. The machine learning device performs the machine learning of the determination of the movement amount of the axis based on the determined movement amount of the axis, the acquired state data, and the calculated reward.

Robot control device, robot system, and robot control method

A robot control device includes: a trained model built by being trained on work data; a control data acquisition section which acquires control data of the robot based on data from the trained model; base trained models built for each of a plurality of simple operations by being trained on work data; an operation label storage section which stores operation labels corresponding to the base trained models; a base trained model combination information acquisition section which acquires combination information when the trained model is represented by a combination of a plurality of the base trained models, by acquiring a similarity between the trained model and the respective base trained models; and an information output section which outputs the operation label corresponding to each of the base trained models which represent the trained model.

MACHINING EQUIPMENT SYSTEM AND MANUFACTURING SYSTEM
20190033839 · 2019-01-31 · ·

Provided is a machining equipment system including machining equipment that performs machining of a workpiece; a control device that controls the machining equipment on the basis of a machining condition; a state obtaining device that obtains a state of the machining equipment during the machining; an inspection device that inspects the workpiece after the machining; and a machine learning device that performs machine learning on the basis of a result of inspection by the inspection device and the state of the machining equipment, obtained by the state obtaining device, wherein the machine learning device modifies the machining condition on the basis of a result of the machine learning so as to improve the machining accuracy of the workpiece or so as to minimize the defect rate of the workpiece.

Controller-equipped machining apparatus having machining time measurement function and on-machine measurement function

A machining apparatus is provided with a machine learning device that performs machine learning. The machine learning device performs the machine learning by receiving the input of machining accuracy between a machining shape of a workpiece measured on-machine and design data on the workpiece and machining time of the workpiece measured by a measurement device. Based on a result of the machine learning, the machining apparatus changes machining conditions such that the machining accuracy increases and the machining time becomes as short as possible.

System and method for managing load-modifying demand response of energy consumption

A system for providing location specific load-modifying demand response (DR) and methods for making and using same. The system advantageously can manage load on individual distribution level power circuits. Information regarding forecasted and real-time loading of individual distribution level power circuits is provided to a DR Locational Application. The DR Locational Application leverages the ability of smart devices containing real-time locational functionality, such as global positioning system, to either indicate to the consumer when it is appropriate to consume power, or automatically control the energy consumption, of the smart device along with any other loads that are verified by the smart device to be in the same location. The DR Locational Application can inform the consumer, the DR Aggregator or Crowd-Source Organization, and the distribution operator of the energy consumption sum of the response for each of its distribution level circuits.