Patent classifications
G06K9/62
DEEP LEARNING SYSTEM FOR DETERMINING AUDIO RECOMMENDATIONS BASED ON VIDEO CONTENT
Embodiments are disclosed for determining an answer to a query associated with a graphical representation of data. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving an input including an unprocessed audio sequence and a request to perform an audio signal processing effect on the unprocessed audio sequence. The one or more embodiments further include analyzing, by a deep encoder, the unprocessed audio sequence to determine parameters for processing the unprocessed audio sequence. The one or more embodiments further include sending the unprocessed audio sequence and the parameters to one or more audio signal processing effects plugins to perform the requested audio signal processing effect using the parameters and outputting a processed audio sequence after processing of the unprocessed audio sequence using the parameters of the one or more audio signal processing effects plugins.
INVENTORY MANAGEMENT SYSTEM IN A REFRIGERATOR APPLIANCE
A refrigerator appliance is provided including a cabinet defining a chilled chamber, a door rotatably hinged to the cabinet to provide selective access to the chilled chamber, and an inventory management system mounted within the chilled chamber for monitoring objects positioned within the chilled chamber. The inventory management system includes a camera assembly that obtains a plurality of images of food items as they are being added to or removed from the chilled chamber. A controller of the appliance analyzes the images using a machine learning image recognition process to identify an object and monitor the object between different images to determine a motion vector associated with its movement.
DATA CLASSIFICATION BASED ON RECURSIVE CLUSTERING
Methods and systems are presented for providing a machine learning model framework configured to perform complex data classifications. Upon receiving a request for classifying data, the data is recursively assigned to one or more clusters. During each iteration of clustering assignment, a set of clusters is selected based on a previously assigned cluster for the data, and the data is then assigned to a particular cluster from the selected set of clusters. The machine learning model framework also includes a plurality of machine learning models configured to perform simple data classifications. A particular machine learning model is selected from the plurality of machine learning model based on the one or more clusters to which the document is assigned. The particular machine learning model is then used to classify the document.
CHARACTERIZING LIQUID REFLECTIVE SURFACES IN 3D LIQUID METAL PRINTING
A method includes defining a model for a liquid while the liquid is positioned at least partially within a nozzle of a printer. The method also includes synthesizing video frames of the liquid using the model to produce synthetic video frames. The method also includes generating a labeled dataset that includes the synthetic video frames and corresponding model values. The method also includes receiving real video frames of the liquid while the liquid is positioned at least partially within the nozzle of the printer. The method also includes generating an inverse mapping from the real video frames to predicted model values using the labeled dataset. The method also includes reconstructing the liquid in the real video frames based at least partially upon the predicted model values.
VISUAL REPRESENTATION GENERATION FOR BIAS CORRECTION
In some implementations, a device may determine an interaction profile for a provider that is to engage with a user in a communication session. The interaction profile may be based on interaction data relating to interpersonal interactions involving the provider during one or more previous communication sessions. The interaction profile may indicate a bias of the provider in connection with one or more categories of users. The device may generate, based on the interaction profile, a visual representation that depicts at least a face of a person for presentation to the provider during the communication session. One or more characteristics associated with the one or more categories of users may be absent from the face of the person. The device may cause presentation of the visual representation to the provider during the communication session.
Multi-Modal Regression to Predict Customer Intent to Contact a Merchant
Provided herein are systems and methods for using multi-modal regression to predict customer intent to contact a merchant. Multi-modal data including numerical data and unstructured data are extracted from customer interactions with the merchant. Features of the numerical data and the unstructured data are separately extracted and classified using techniques specific to the data types. The features for each type are then separately used to predict probabilities of customer intent. A neural network is used to combine the predictions into a single set of estimates of customer intent. This set of estimates of customer intents is used to estimate a probability that the customer will contact the merchant. The customer is then contacted based on the estimate.
CONFIDENCE-BASED ASSISTED LEARNING
Techniques are disclosed for assisted learning with enhanced privacy. A method comprises: sending first statistical information from a first agent to a second agent in an architecture having at least two agents, wherein a first set of sample weights correspond to training the first machine learning model, wherein the first statistical information comprises the second set of sample weights determined from a first model weight; receiving, from the second agent, second statistical information comprising the second model weight and updated first set of sample weights or, from a third agent of the architecture, third statistical information comprising a third model weight and a next iteration of the first set of sample weights; and updating the first machine learning model using the second statistical information or the third statistical information.
MODULAR ADAPTATION FOR CROSS-DOMAIN FEW-SHOT LEARNING
A method, apparatus and system for adapting a pre-trained network for application to a different dataset includes arranging at least two different types of active adaptation modules in a pipeline configuration, wherein an output of a previous active adaptation module produces an input for a next active adaptation module in the pipeline in the form of adapted network data until a last active adaptation module, and wherein each of the at least two different types of adaptation modules can be switched on or off, determining at least one respective hyperparameter for each of the at least two different types of active adaptation modules, and applying the at least one respective determined hyperparameter to each of the at least two different types of active adaptation modules for processing received data from the pretrained network to determine an adapted network.
IMPLICIT CURRICULUM LEARNING
Systems and techniques for facilitating implicit curriculum learning are provided. These allow for improved machine learning systems that can automatically execute curriculum learning without drawbacks such as pre-sorting data into different epochs which may have varying degrees of difficulty (e.g. easiest first, then harder epochs). Applicant's techniques can be executed more efficiently by automatically iterating over a data set, which need not be manually separated into different epochs. Thus, a system can access a neural network and a set of labeled data candidates. In various aspects, the system can perform a plurality of training epochs on the neural network based on the set of labeled data candidates. In various instances, the system can iteratively update the set of labeled data candidates as the plurality of training epochs are performed, by removing, after each training epoch, a dropout percentage of those labeled data candidates which the neural network correctly classified during the training epoch.
ARTIFICIAL INTELLIGENCE ARCHITECTURES FOR DETERMINING IMAGE AUTHENTICITY
The present disclosure generally relates to systems that include an artificial intelligence (AI) architecture for determining whether an image is manipulated. The architecture can include a constrained convolutional layer, separable convolutional layers, maximum-pooling layers, a global average-pooling layer, and a fully connected layer. In one specific example, the constrained convolutional layer can detect one or more image-manipulation fingerprints with respect to an image and can generate feature maps corresponding to the image. The global average-pooling layer can generate a vector of feature values by averaging the feature maps. The fully connected layer can then generate, based on the vector of feature values, an indication of whether the image was manipulated or not manipulated.