IPIQ

G06F17/18

COMBINING MATH-PROGRAMMING AND REINFORCEMENT LEARNING FOR PROBLEMS WITH KNOWN TRANSITION DYNAMICS

20230041035 · 2023-02-09 ·

A computer implemented method of improving parameters of a critic approximator module includes receiving, by a mixed integer program (MIP) actor, (i) a current state and (ii) a predicted performance of an environment from the critic approximator module. The MIP actor solves a mixed integer mathematical problem based on the received current state and the predicted performance of the environment. The MIP actor selects an action a and applies the action to the environment based on the solved mixed integer mathematical problem. A long-term reward is determined and compared to the predicted performance of the environment by the critic approximator module. The parameters of the critic approximator module are iteratively updated based on an error between the determined long-term reward and the predicted performance.

SYSTEMS AND METHODS FOR REINFORCEMENT LEARNING WITH SUPPLEMENTED STATE DATA

20230038434 · 2023-02-09 ·

Systems are methods are provided for training an automated agent. The automated agent maintains a reinforcement learning neural network and generates, according to outputs of the reinforcement learning neural network, signals for communicating resource task requests. The system includes a communication interface, a processor, memory, and software code stored in the memory. The software code, when executed, causes the system to: instantiate an automated agent for communicating resource task requests; receive a current feature data structure related to a resource of the resource task requests; maintain a plurality of historical feature data structures related to said resource for a plurality of prior time steps; compute normalized feature data using the current feature data structure and the plurality of historical feature data structures; compute supplemented state data appended with the normalized feature data; and transmit said supplemented state data to the reinforcement learning neural network to train said automated agent.

SYSTEMS AND METHODS FOR REINFORCEMENT LEARNING WITH SUPPLEMENTED STATE DATA

20230038434 · 2023-02-09 ·

Method and apparatus for determining output token

11574190 · 2023-02-07 ·

Samsung Electronics Co., Ltd.

Min-Joong Lee

A method for determining an output token includes predicting a first probability of each of candidate output tokens of a first model, predicting a second probability of each of the candidate output tokens of a second model interworking with the first model, adjusting the second probability of each of the candidate output tokens based on the first probability, and determining the output token among the candidate output tokens based on the first probability and the adjusted second probability.

Method and apparatus for determining output token

11574190 · 2023-02-07 ·

Samsung Electronics Co., Ltd.

Min-Joong Lee

Methods and systems of industrial processes with self organizing data collectors and neural networks

11573557 · 2023-02-07 ·

Strong Force IoT Portfolio 2016, LLC

Systems and methods for data collection for an industrial heating process are disclosed. The system according to one embodiment can include a plurality of data collectors, including a swarm of self-organized data collector members, wherein the swarm of self-organized data collector members organize to enhance data collection based on at least one of capabilities and conditions of the data collector members of the swarm, and wherein the plurality of data collectors is coupled to a plurality of input channels for acquiring collected data relating to the industrial heating process, and a data acquisition and analysis circuit for receiving the collected data via the plurality of input channels and structured to analyze the received collected data using a neural network to monitor a plurality of conditions relating to the industrial heating process.

System and method for quality assessment of product description

11574351 · 2023-02-07 ·

A system for assessing text content of a product. The system includes a computing device having a processor and a storage device storing computer executable code. The computer executable code, when executed at the processor, is configured to: provide text contents and confounding features of products; train a first regression model using the text content and the confounding features of the products; train the second regression model using the confounding features; operate the first regression model using the text contents and the confounding features to obtain a total loss; operate the second regression model using the confounding features of to obtain a partial loss; subtract the total loss from the partial loss to obtain a residual loss; use the residual loss to evaluate models and parameters for the regression models; and use the first regression model to obtain log odds of the words indicating importance of the words.

System and method for quality assessment of product description

11574351 · 2023-02-07 ·

Methods and systems for predicting keystrokes using a unified neural network

11573697 · 2023-02-07 ·

Samsung Electronics Co., Ltd.

Methods and systems for predicting keystrokes using a neural network analyzing cumulative effects of a plurality of factors impacting the typing behavior of a user. The factors may include typing pattern, previous keystrokes, specifics of keyboard used for typing, and contextual parameters pertaining to a device displaying the keyboard and the user. A plurality of features may be extracted and fused to obtain a plurality of feature vectors. The plurality of feature vectors can be optimized and processed by the neural network to identify known features and learn unknown features that are impacting the typing behavior. Thereby, the neural network predicts keystrokes using the known and unknown features.

Methods and systems for predicting keystrokes using a unified neural network

11573697 · 2023-02-07 ·

Samsung Electronics Co., Ltd.

Patent classifications

G06F17/18