Patent classifications
G06N3/096
DATA PROCESSING METHOD AND APPARATUS
Relating to the field of artificial intelligence, and specifically relating to the field of natural language processing, a data processing method includes and an apparatus performs: determining original text samples, where masking processing is not performed on the original text samples; and performing mask processing on the original text samples to obtain mask training samples, where the mask processing makes mask proportions of the mask training samples unfixed, and the mask training samples each are used to train a pretrained language model PLM. Training the PLM by using the mask training samples whose mask proportions are unfixed can enhance mode diversity of the training samples of the PLM. Therefore, features learned by the PLM are also diversified, a generalization capability of the PLM can be improved, and a natural language understanding capability of the PLM obtained through training can be improved.
PATHOLOGICAL DIAGNOSIS ASSISTING METHOD USING AI, AND ASSISTING DEVICE
Diagnosis is assisted by acquiring microscopical observation image data while specifying the position, classifying the image data into histological types with the use of AI, and reconstructing the classification result in a whole lesion. There is provided a pathological diagnosis assisting method that can provide an assistance technology which performs a pathological diagnosis efficiently with satisfactory accuracy by HE staining which is usually used by pathologists. Furthermore, there are provided a pathological diagnosis assisting system, a pathological diagnosis assisting program, and a pre-trained model.
METHOD OF TRAINING DEEP LEARNING MODEL AND METHOD OF PROCESSING NATURAL LANGUAGE
A method of training a deep learning model, a method of processing a natural language, an electronic device, and a storage medium are provided, which relate to a field of artificial intelligence, in particular to deep learning technology and natural language processing technology. The method includes: inputting first sample data into a first deep learning model to obtain a first output result; training the first deep learning model according to the first output result and a first target output result, the first target output result is obtained by processing the first sample data using a reference deep learning model; inputting second sample data into a second deep learning model to obtain a second output result; and training the second deep learning model according to the second output result and a second target output result, to obtain a trained second deep learning model.
DATA RETRIEVAL USING REINFORCED CO-LEARNING FOR SEMI-SUPERVISED RANKING
A computer-implement method comprises: training a classifier with labeled data from a dataset; classifying, by the trained classifier, unlabeled data from the dataset; providing, by the classifier to a policy gradient, a reward signal for each data/query pair; transferring, by the classifier to a ranker, learning; training, by the policy gradient, the ranker; ranking data from the dataset based on a query; and retrieving data from the ranked data in response to the query.
COMMUNICATION SYSTEM BASED ON NEURAL NETWORK MODEL, AND CONFIGURATION METHOD THEREFOR
The present disclosure relates to a communication system based on a neural network model, and a configuration method therefor. The communication system includes at least one master node and multiple child nodes that are in communication connection with the master node, and a child node neural network model is configured in each of the multiple child nodes. The configuration method for the communication system includes: obtaining feature information of the multiple child nodes; and dynamically configuring the child node neural network models on the basis of the obtained feature information.
Performance of Complex Optimization Tasks with Improved Efficiency Via Neural Meta-Optimization of Experts
Example systems perform complex optimization tasks with improved efficiency via neural meta-optimization of experts. In particular, provided is a machine learning framework in which a meta-optimization neural network can learn to fuse a collection of experts to provide a predicted solution. Specifically, the meta-optimization neural network can learn to predict the output of a complex optimization process which optimizes over outputs from the collection of experts to produce an optimized output. In such fashion, the meta-optimization neural network can, after training, be used in place of the complex optimization process to produce a synthesized solution from the experts, leading to orders of magnitude faster and computationally more efficient prediction or problem solution.
System And Method For Training A Self-Supervised Ego Vehicle
A system for training a machine learning framework to estimate depths of objects captured in 2-D images includes a first trained machine learning network and a second untrained or minimally trained machine learning framework. The first trained machine learning network is configured to analyze 2-D images of target spaces including target objects and to provide output indicative of 3-D positions of the target objects in the target spaces. The second machine learning network can be configured to provide an output responsive to receiving a 2-D input image. A comparator receives the outputs from the first and second machine learning networks based on a particular 2-D image. The comparator compares the output of the first trained machine learning network with the output of the second machine learning network. A feedback mechanism is operative to alter the second machine learning network based at least in part on the output of the comparator.
METHOD AND APPARATUS FOR TRAINING ARTIFICIAL INTELLIGENCE BASED ON EPISODE MEMORY
The present disclosure relates to a method and apparatus for training artificial intelligence based on an episodic memory. According to an embodiment of the present disclosure, a method for training artificial intelligence based on an episodic memory may include: constructing an episodic memory by using a feature vector of a training dataset stored in a full memory; obtaining output data by inputting query data into an artificial intelligence model; deriving a similarity between the output data and a feature vector in the constructed episodic memory; and deriving an episode loss function based on the similarity.
TRANSFER/FEDERATED LEARNING APPROACHES TO MITIGATE BLOCKAGE IN MILLIMETER WAVE SYSTEMS
A UE may train a NN, based on a blockage of a beam transmission, to indicate one or more beam weights in association with the blockage of the beam transmission. The UE may store, in an ML database, information indicative of at least one of the trained NN or the one or more beam weights indicated via the trained NN, such that the UE may communicate, to an ML server, the information via the trained NN. The ML server may train the NN, based on a TL/FL procedure for the one or more beam weights associated with the at least one blockage, to indicate one or more TL/FL beam weights in association with the at least one blockage, and communicate, to at least one UE, information indicative of at least one of the trained NN or the one or more TL/FL beam weights indicated via the trained NN.
SYSTEMS AND METHODS FOR ANOMALY RECOGNITION AND DETECTION USING LIFELONG DEEP NEURAL NETWORKS
Industrial quality control is challenging for artificial neural networks (ANNs) and deep neural networks (DNNs) because of the nature of the processed data: there is an abundance of consistent data representing good products, but little data representing bad products. In quality control, the task is changed from conventional DNN task of “recognize what I learned best” to “recognize what I have never seen before.” Lifelong DNN (L-DNN) technology is a hybrid semi-supervised neural architecture that combines the ability of DNNs to be trained, with high precision, on known classes, while being sensitive to any number of unknown classes or class variations. When used for industrial inspection, L-DNN exploits its ability to learn with little and highly unbalanced data. L-DNN's real-time learning capability takes advantage of rare cases of poor-quality products that L-DNN encounters after deployment. L-DNN can be applied to industrial inspections and manufacturing quality control.