Patent classifications
G06F18/2178
RECOMMENDING THE MOST RELEVANT CHARITY FOR A NEWS ARTICLE
The disclosure relates to AI-based machine-learning and natural language modeling to identify semantic similarities between sets of content having natural language text. For example, a system may generate a relevance classification that indicates whether content such as articles are non-specifically relevant to charities without identifying a particular charity. If the content is non-specifically relevant to charities, the system may apply a natural language model to generate sentence embeddings based on the content and determine a level similarity between the sentence embeddings and a query embedding generated from a charity query. The charity query may itself be generated from a full description of the charity through an encoder-decoder architecture with reinforcement learning.
ASSIGNMENT OF CLINICAL IMAGE STUDIES USING ONLINE LEARNING
Methods and systems for training a model using machine learning for automatically distributing medical imaging studies to radiologists. One method includes receiving one or more medical images included in a medical study, each of the one or more medical images including image metadata defining characteristics of the corresponding medical image. The method further includes receiving radiologist metadata for each one of the plurality of radiologists, generating a state representation of the image metadata and the radiologist metadata, and providing the state representation to the model. The method further includes assigning, with the model, at least one of the one or more medical images to one of the plurality of radiologists, calculating feedback based on a change in the state representation after the at least one of the one or more medical images is assigned to one of the plurality of radiologists, and adjusting the model based on the feedback.
Systems and methods for extracting specific data from documents using machine learning
Computer implemented systems and methods are disclosed for extracting specific data using machine learning algorithms. In accordance with some embodiments, a memory device that stores at least a set of computer executable instructions for a machine learning algorithm and a pre-fill engine; and at least one processor that executes the instructions that cause the pre-fill engine to perform functions that include: receiving electronic documents, seed dataset documents, and pre-fill questions; determining output data that enable navigation through the electronic documents using the machine learning algorithm; determining output questions that enable navigation through the electronic documents using the machine learning algorithm; determining output documents to enable navigation through the electronic documents using the machine learning algorithm; and presenting one or more answers for one or more of the output questions using a graphical user interface.
Self-supervised document-to-document similarity system
Examples provide a self-supervised language model for document-to-document similarity scoring and ranking long documents of arbitrary length in an absence of similarity labels. In a first stage of a two-staged hierarchical scoring, a sentence similarity matrix is created for each paragraph in the candidate document. A sentence similarity score is calculated based on the sentence similarity matrix. In the second stage, a paragraph similarity matrix is constructed based on aggregated sentence similarity scores associated with the first candidate document. A total similarity score for the document is calculated based on the normalize the paragraph similarity matrix for each candidate document in a collection of documents. The model is trained using a masked language model and intra-and-inter document sampling. The documents are ranked based on the similarity scores for the documents.
Diagnostic systems and methods for deep learning models configured for semiconductor applications
Methods and systems for performing diagnostic functions for a deep learning model are provided. One system includes one or more components executed by one or more computer subsystems. The one or more components include a deep learning model configured for determining information from an image generated for a specimen by an imaging tool. The one or more components also include a diagnostic component configured for determining one or more causal portions of the image that resulted in the information being determined and for performing one or more functions based on the determined one or more causal portions of the image.
Machine-learning training service for synthetic data
Various embodiments, methods and systems for implementing a distributed computing system machine-learning training service are provided. Initially a machine learning model is accessed. A plurality of synthetic data assets are accessed, where a synthetic data asset is associated with asset-variation parameters that are programmable for machine-learning. The machine learning model is retrained using the plurality of synthetic data assets. The machine-learning training service is further configured for executing real-time calls to generate an on-the-fly-generated synthetic data asset such that the on-the-fly-generated synthetic data asset is rendered in real-time to preclude pre-rendering and storing the on-the-fly-generated synthetic data asset. The machine-learning training service further supports hybrid-based machine learning training, where the machine learning model is trained based on a combination of the plurality of synthetic data assets, a plurality of non-synthetic data assets, and synthetic data asset metadata associated with the plurality of synthetic data assets.
TRUST RELATED MANAGEMENT OF ARTIFICIAL INTELLIGENCE OR MACHINE LEARNING PIPELINES IN RELATION TO THE TRUSTWORTHINESS FACTOR EXPLAINABILITY
There are provided measures for trust related management of artificial intelligence or machine learning pipelines in relation to the trustworthiness factor “explainability”. Such measures exemplarily comprise, at a first network entity managing artificial intelligence or machine learning trustworthiness in a network, transmitting a first artificial intelligence or machine learning trustworthiness related message towards a second network entity managing artificial intelligence or machine learning trustworthiness in an artificial intelligence or machine learning pipeline in said network, and receiving a second artificial intelligence or machine learning trustworthiness related message from said second network entity.
VARIABLE DENSITY-BASED CLUSTERING ON DATA STREAMS
In some implementations, a device may receive, from a data stream, a set of data points arranged in a dimensional data space. The device may compare the set of data points to identify one or more clusters using values of a distance parameter for data points included in the set of data points, wherein the values of distance parameter includes different values of the distance parameter for different data points. The device may transmit an indication of the one or more clusters to cause a device to display information associated with the one or more clusters. The device may receive, from the device, feedback information associated with at least one data point, wherein the feedback information indicates that at least one data point is associated with an error. The device may modify a value of the distance parameter associated with the at least one data point to a modified value.
Computing device for training artificial neural network model, method of training the artificial neural network model, and memory system for storing the same
A computing device for training an artificial neural network model includes: a model analyzer configured to receive a first artificial neural network model and split the first artificial neural network model into a plurality of layers; a training logic configured to calculate first sensitivity data varying as the first artificial neural network model is pruned, calculate a target sensitivity corresponding to a target pruning rate based on the first sensitivity data, calculate second sensitivity data varying as each of the plurality of layers is pruned, and output, based on the second sensitivity data, an optimal pruning rate of each of the plurality of layers, the optimal pruning rate corresponding to the target pruning rate; and a model updater configured to prune the first artificial neural network model based on the optimal pruning rate to obtain a second artificial neural network model, and output the second artificial neural network model.
Machine learning verification procedure
Systems, methods, and techniques to efficiently and effectively verifying and calibrating a machine learning model. The method can include training a machine learning model by at least processing training data with the machine learning model. The method can further include manipulating a first data set of the training data and applying the manipulated first data set to the machine learning model to thereby determine a first matching rate. In addition, the method can include applying the manipulated first data set to a rule engine to thereby determine a second matching rate and determining a difference between the first matching rate and the second matching rate. The method can further include determining whether the difference is within a predefined threshold range and providing an error indication if the determined difference is outside of the predefined threshold range.