G06N7/00

Generating hyper-parameters for machine learning models using modified Bayesian optimization based on accuracy and training efficiency
11556826 · 2023-01-17 · ·

The present disclosure relates to systems, methods, and non-transitory computer readable media for selecting hyper-parameter sets by utilizing a modified Bayesian optimization approach based on a combination of accuracy and training efficiency metrics of a machine learning model. For example, the disclosed systems can fit accuracy regression and efficiency regression models to observed metrics associated with hyper-parameter sets of a machine learning model. The disclosed systems can also implement a trade-off acquisition function that implements an accuracy-training efficiency balance metric to explore the hyper-parameter feature space and select hyper-parameters for training the machine learning model considering a balance between accuracy and training efficiency.

Method for identifying misallocated historical production data using machine learning to improve a predictive ability of a reservoir simulation
11555943 · 2023-01-17 · ·

A method for training a predictive reservoir simulation in which high-confidence reservoir sample data is used to identify misallocated historical production data used in the simulation. A neural network algorithm is trained with high-confidence reservoir historical production data. High-confidence reservoir sample data is obtained by at least one sensor at a reservoir location over a time interval, after which the reservoir historical production data is parametrically varied over the time interval to determine a time-indexed discrepancy between the reservoir historical production data and the high-confidence reservoir sample data over the time interval. The time-indexed discrepancy and a defined threshold discrepancy are then used as inputs to a machine learning process to further train the neural network algorithm to identify reservoir historical production data whose discrepancy exceeds the threshold discrepancy and thereby constitutes misallocated historical production data. The misallocated data is later back allocated to respective wells by back propagation algorithm.

Method and system for dynamically predicting vehicle arrival time using a temporal difference learning technique

This disclosure relates generally to a method and system for dynamically predicting vehicle arrival time using a temporal difference learning technique. Due to varying uncertainties predicting vehicle arrival time and travel time are crucial elements to make the public transport travel more attractive and reliable with increased traffic volumes. The method includes receiving a plurality of inputs in real time and then extracting a plurality of temporal events from a closest candidate trip pattern using a historical database. Further, a trained temporal difference predictor model (TTDPM) is utilized for dynamically predicting the arrival time from the current location of the vehicle to the target destination based on the plurality of nonlinear features. The non-linear features and linear approximator formulation of TTDPM provides fast gradient computation improves training time. Additionally, updating the revised state information at every iteration provides better accuracy of arrival time prediction in real time.

Image tagging based upon cross domain context

A method described herein includes receiving a digital image, wherein the digital image includes a first element that corresponds to a first domain and a second element that corresponds to a second domain. The method also includes automatically assigning a label to the first element in the digital image based at least in part upon a computed probability that the label corresponds to the first element, wherein the probability is computed through utilization of a first model that is configured to infer labels for elements in the first domain and a second model that is configured to infer labels for elements in the second domain. The first model receives data that identifies learned relationships between elements in the first domain and elements in the second domain, and the probability is computed by the first model based at least in part upon the learned relationships.

Descriptor uniqueness for entity clustering

A mechanism is provided in a data processing system to implement a cognitive natural language processing (NLP) system with descriptor uniqueness identification to support named entity mention clustering. The mechanism annotates a set of documents from a corpus of documents for entity types and mentions, collects descriptor usages from all documents in the corpus of documents, analyzes the descriptor usages to classify the descriptors as base terms or modifier terms, generates compatibility scores for the descriptors, and performs entity merging of entity clusters based on the compatibility scores.

Apparatus and method for recommending a destination

A destination recommending apparatus includes: a navigation device configured to collect driving pattern information of a user; and a controller configured to calculate a visit probability of a destination at a current location or the visit probability of the destination at a current time, based on the driving pattern information. The controller is configured to predict the destination based on the visit probability.

Systems and methods for probabilistic semantic sensing in a sensory network

Systems and methods for probabilistic semantic sensing in a sensory network are disclosed. The system receives raw sensor data from a plurality of sensors and generates semantic data including sensed events. The system correlates the semantic data based on classifiers to generate aggregations of semantic data. Further, the system analyzes the aggregations of semantic data with a probabilistic engine to produce a corresponding plurality of derived events each of which includes a derived probability. The system generates a first derived event, including a first derived probability, that is generated based on a plurality of probabilities that respectively represent a confidence of an associated semantic datum to enable at least one application to perform a service based on the plurality of derived events.

Method and apparatus for refining an automated coding model

A method, apparatus and computer program product refine an automated coding model, such as for a medical chart. For each respective candidate code from a set of candidate codes, the method predicts a probability of the respective code being contained in a medical chart. The method also selects one of the candidate codes as being contained in the medical chart based upon the probability and removes the selected candidate code from the set of candidate codes. The method then repeatedly predicts the probability of a respective code being contained in the medical chart, selects one of the candidate codes based upon the predicted probability and removes the selected candidate code from the set of candidate codes. The method further determines a categorical crossentropy loss as to permit adjustment of one or more parameters of the automated coding model.

System for generating topic inference information of lyrics

A system for generating topic inference information of lyrics that can provide more useful for topic interpretation of lyrics. A device for learning topic numbers performs an operation of updating and learning topic numbers, which performs an operation of updating topic numbers on all of a plurality of lyrics data of each of a plurality of artists, for a predetermined number of times. The operation of updating topic numbers updates the topic number assigned to a given lyrics data of a given artist using a random number generator having a deviation of appearance probability corresponding to a probability distribution over topic numbers. An outputting device outputs the topic numbers of the plurality of lyrics data for each of the plurality artists, and a probability distribution over words for each of the topic numbers.

Correlation of bio-impedance measurements and a physiological parameter for a wearable device

An apparatus device may include a bio-impedance sensor configured to take a bio-impedance measurement from a body of an individual, an optical sensor configured to take an optical measurement from the body of the individual, and a processing device configured to receive a first bio-impedance measurement from the bio-impedance sensor taken during a first period of time and a first optical measurement from the optical sensor taken during the first period of time, receive first location information of the individual during the first period of time, determine a first correlation between a physiological parameter and at least one of the first location, the first bio-impedance measurement, or the first optical measurement, and determine a first level of the physiological parameter based on the first correlation.