Patent classifications
G06F18/214
Ontology matching based on weak supervision
A method is for matching a set of first classes assigned to a first data set with a set of second classes assigned to a second data set. The method includes constructing, via a set of pre-processing functions, a plurality of alignment profiles such that at least one alignment profile is assigned to each of the first classes and each of the second classes. The method includes generating a comparison matrix for each group of the alignment profiles, such that each group includes at least one of the first classes and at least one of the second classes. The method includes training a first machine learning model, through supervised training, based on the generated comparison matrices and based on probabilistic labels generated by a second machine learning model.
Storage system and storage control method
A storage system that performs irreversible compression on time-series data using a compressor/decompressor based on machine learning calculates a statistical amount value of each of one or more kinds of statistical amounts based on one or more parameters in relation to original data (time-series data input to a compressor/decompressor) and calculates a statistical amount value of each of the one or more kinds of statistical amounts based on the one or more kinds of parameters in relation to decompressed data (time-series data output from the compressor/decompressor) corresponding to the original data. The machine learning of the compressor/decompressor is performed based on the statistical amount value calculated for each of the one or more kinds of statistical amounts in relation to the original data and the statistical amount value calculated for each of the one or more kinds of statistical amounts in relation to the decompressed data.
Internal thermal fault diagnosis method of oil-immersed transformer based on deep convolutional neural network and image segmentation
The disclosure provides an internal thermal fault diagnosing method for an oil-immersed transformer based on DCNN and image segmentation, including: 1) dividing an internal area of a transformer, and using fault areas and normal status as labels of DCNN; 2) through lattice Boltzmann simulation, randomly obtaining multiple feature images of the internal temperature field distribution of the transformer under normal and various fault state modes, and the fault area serves as a label to form the underlying training sample set; 3) obtaining historical monitoring information of the infrared camera or temperature sensor, and forming its corresponding fault diagnosis results into labels; 4) combining all monitoring information contained in each sample into one image, and then extracting the same monitoring information from the samples in the sample set to form a new image; 5) segmenting image sample and then inputting the same into DCNN for training to obtain diagnosis results.
Representative document hierarchy generation
In some aspects, a method includes performing optical character recognition (OCR) based on data corresponding to a document to generate text data, detecting one or more bounded regions from the data based on a predetermined boundary rule set, and matching one or more portions of the text data to the one or more bounded regions to generate matched text data. Each bounded region of the one or more bounded regions encloses a corresponding block of text. The method also includes extracting features from the matched text data to generate a plurality of feature vectors and providing the plurality of feature vectors to a trained machine-learning classifier to generate one or more labels associated with the one or more bounded regions. The method further includes outputting metadata indicating a hierarchical layout associated with the document based on the one or more labels and the matched text data.
Training a card type classifier with simulated card images
A computer model to identify a type of physical card is trained using simulated card images. The physical card may exist with various subtypes, some of which may not exist or be unavailable when the model is trained. To more robustly identify these subtypes, the training data set for the computer model includes simulated card images that are generated for the card type. The simulated card images are generated based on a semi-randomized background that varies in appearance, onto which an identifying marking of the card type is superimposed, such that the training data for the computer model includes additional randomized sample card images and ensure the model is robust to further variations in subtypes.
Systems and methods for improved operations of ski lifts
Systems and methods for improved operations of ski lifts increase skier safety at on-boarding and off-boarding locations by providing an always-on, always-alert system that “watches” these locations, identifies developing problem situations, and initiates mitigation actions. One or more video cameras feed live video to a video processing module. The video processing module feeds resulting sequences of images to an artificial intelligence (AI) engine. The AI engine makes an inference regarding existence of a potential problem situation based on the sequence of images. This inference is fed to an inference processing module, which determines if the inference processing module should send an alert or interact with the lift motor controller to slow or stop the lift.
Automation system and method
A computer-implemented method, computer program product and computing system for receiving a complex task; processing the complex task to define a plurality of discrete tasks each having a discrete goal; executing the plurality of discrete tasks on a plurality of machine-accessible public computing platforms; determining if any of the plurality of discrete tasks failed to achieve its discrete goal; and if a specific discrete task failed to achieve its discrete goal, defining a substitute discrete task having a substitute discrete goal.
Method, apparatus, and system for determining polyline homogeneity
An approach is provided for an asymmetric evaluation of polygon similarity. The approach, for instance, involves receiving a first polygon representing an object depicted in an image. The approach also involves generating a transformation of the image comprising image elements whose values are based on a respective distance that each image element is from a nearest image element located on a first boundary of the first polygon. The approach further involves determining a subset of the plurality of image elements of the transformation that intersect with a second boundary of a second polygon. The approach further involves calculating a polygon similarity of the second polygon with respect the first polygon based on the values of the subset of image elements normalized to a length of the second boundary of the second polygon.
CLASSIFICATION MODEL TRAINING METHOD, SYSTEM, ELECTRONIC DEVICE AND STRORAGE MEDIUM
Provided are a classification model training method, system, electronic device, and storage medium. The method includes: determining sampling rates of first-class samples and second-class samples in a data set, and setting the samples with a sampling rate less than a preset value as target samples (S101); determining data distribution feature information of the target samples based on Euclidean distances between all the samples in the data set (S102); wherein the data distribution feature information is information describing the number of same-class samples in nearest neighbor samples, and the nearest neighbor samples are two samples at a Euclidean distance less than a preset distance; generating new samples corresponding to the target samples based on the data distribution feature information (S103); and training the classification model using the first-class samples, the second-class samples and the new samples (S104).
CLASSIFICATION MODEL TRAINING METHOD, SYSTEM, ELECTRONIC DEVICE AND STRORAGE MEDIUM
Provided are a classification model training method, system, electronic device, and storage medium. The method includes: determining sampling rates of first-class samples and second-class samples in a data set, and setting the samples with a sampling rate less than a preset value as target samples (S101); determining data distribution feature information of the target samples based on Euclidean distances between all the samples in the data set (S102); wherein the data distribution feature information is information describing the number of same-class samples in nearest neighbor samples, and the nearest neighbor samples are two samples at a Euclidean distance less than a preset distance; generating new samples corresponding to the target samples based on the data distribution feature information (S103); and training the classification model using the first-class samples, the second-class samples and the new samples (S104).