G06N3/047

Transferring large datasets by using data generalization

A computer-implemented method for transferring data is provided. In an illustrative embodiment, the method includes retrieving, by a computer, an original dataset to be sent from a sender to a receiver. The method also includes generating, by the computer, a model based on at least a subset of the original dataset. The model generates a predicted dataset. The model is selected from a plurality of model types based on data complexity of the original dataset and a desired level of approximation of the predicted dataset to the original dataset. The method also includes transferring, by the computer, the model to the receiver. The receiver uses the model to generate the predicted dataset, wherein the predicted dataset matches the original dataset to a selected degree of approximation. Transfer of the model is quicker than transfer of the original dataset.

Collaborative multi-parties/multi-sources machine learning for affinity assessment, performance scoring, and recommendation making

Provided is a process that includes sharing information among two or more parties or systems for modeling and decision-making purposes, while limiting the exposure of details either too sensitive to share, or whose sharing is controlled by laws, regulations, or business needs.

Decipherable deep belief network method of feature importance analysis for road safety status prediction

A method for visualizing and analyzing contributions of various input features for traffic safety status prediction is provided. The method includes initializing a deep belief network (DBN) with input features; performing unsupervised learning/training by observing changes of weights of the input features during the unsupervised learning/training; when the unsupervised learning/training process is complete, performing supervised learning/training process by generating a reconstructed input layer based on results of each hidden layer; and continually running the supervised learning/training and generating a weight diagram based on both visualization and numerical analysis that calculates contributions of the input features. The input features may include one or more of annual average daily commercial traffic (AADCT), median width, left shoulder width, right shoulder width, curve deflection, and exposure for traffic safety status prediction.

Predictive resolutions for tickets using semi-supervised machine learning

Aspects of the subject disclosure may include, for example, a method in which a processing system collects information associated with trouble tickets each including a problem abstract and a log text. The method includes analyzing the log text to obtain a problem resolution for that ticket; defining ticket clusters according to the problem abstracts, and labeling the clusters. The processing system creates a library of the labeled clusters, each entry including a cluster label, a problem abstract for that cluster, and a resolution summary for that problem abstract, indicating a mapping of the problem abstract to the resolution summary for that cluster. The method includes training, based on the mapping, machine-learning applications for a predicted resolution summary for each cluster and for classifying a new ticket. The method includes assigning the new ticket to a cluster according to the classifying. Other embodiments are disclosed.

Training multiple neural networks with different accuracy
11556793 · 2023-01-17 · ·

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a deep neural network. One of the methods includes generating a plurality of feature vectors that each model a different portion of an audio waveform, generating a first posterior probability vector for a first feature vector using a first neural network, determining whether one of the scores in the first posterior probability vector satisfies a first threshold value, generating a second posterior probability vector for each subsequent feature vector using a second neural network, wherein the second neural network is trained to identify the same key words and key phrases and includes more inner layer nodes than the first neural network, and determining whether one of the scores in the second posterior probability vector satisfies a second threshold value.

Architecture exploration and compiler optimization using neural networks
11556684 · 2023-01-17 · ·

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for optimizing integrated circuit architectures or compiler designs using an optimization engine. The optimization engine includes an auto-encoder and one or more regressors. Once trained, the optimization engine can encode initial, discrete input values of a set of input characteristics into a continuous domain and use continuous optimization techniques to identify final input values of the set of input characteristics that optimize one or more output characteristics.

Variational autoencoding for anomaly detection
11556855 · 2023-01-17 · ·

A machine learning model including an autoencoder may be trained based on training data that includes sequences of non-anomalous performance metrics from an information technology system but excludes sequences of anomalous performance metrics. The trained machine learning model may process a sequence of performance metrics from the information technology system by generating an encoded representation of the sequence of performance metrics and generating, based on the encoded representation, a reconstruction of the sequence of performance metrics. An occurrence of the anomaly at the information technology system may be detected based on a reconstruction error present in reconstruction of the sequence of performance metrics. Related systems, methods, and articles of manufacture are provided.

Neural network processing for multi-object 3D modeling

Embodiments are directed to neural network processing for multi-object three-dimensional (3D) modeling. An embodiment of a computer-readable storage medium includes executable computer program instructions for obtaining data from multiple cameras, the data including multiple images, and generating a 3D model for 3D imaging based at least in part on the data from the cameras, wherein generating the 3D model includes one or more of performing processing with a first neural network to determine temporal direction based at least in part on motion of one or more objects identified in an image of the multiple images or performing processing with a second neural network to determine semantic content information for an image of the multiple images.

Training Speech Synthesis to Generate Distinct Speech Sounds

A method (800) of training a text-to-speech (TTS) model (108) includes obtaining training data (150) including reference input text (104) that includes a sequence of characters, a sequence of reference audio features (402) representative of the sequence of characters, and a sequence of reference phone labels (502) representative of distinct speech sounds of the reference audio features. For each of a plurality of time steps, the method includes generating a corresponding predicted audio feature (120) based on a respective portion of the reference input text for the time step and generating, using a phone label mapping network (510), a corresponding predicted phone label (520) associated with the predicted audio feature. The method also includes aligning the predicted phone label with the reference phone label to determine a corresponding predicted phone label loss (622) and updating the TTS model based on the corresponding predicted phone label loss.

Transaction-enabling systems and methods for customer notification regarding facility provisioning and allocation of resources

The present disclosure describes transaction-enabling systems and methods. A system can include a facility including a core task including a customer relevant output and a controller. The controller may include a facility description circuit to interpret a plurality of historical facility parameter values and corresponding facility outcome values and a facility prediction circuit to operate an adaptive learning system, wherein the adaptive learning system is configured to train a facility production predictor in response to the historical facility parameter values and the corresponding outcome values. The facility description circuit also interprets a plurality of present state facility parameter values, wherein the trained facility production predictor determines a customer contact indicator in response to the plurality of present state facility parameter values and a customer notification circuit provides a notification to a customer in response.