Patent classifications
G06V30/19127
DARK WEB CONTENT ANALYSIS AND IDENTIFICATION
In some examples, dark web content analysis and identification may include ascertaining data that includes text and images, and analyzing the data by performing deep learning based text and image processing to extract text embedded in the images, and deep embedded clustering to generate clusters. Clusters that are to be monitored may be ascertained from the generated clusters. A determination may be made as to whether the ascertained data is sufficient for classification. If so, a deep convolutional generative adversarial networks (DCGAN) based detector may be utilized to analyze further data with respect to the ascertained clusters, and alternatively, a convolutional neural network (CNN) based detector may be utilized to analyze the further data with respect to the ascertained clusters. Based on the analysis of the further data, an operation associated with a website related to the further data may be controlled.
Binary feature compression for autonomous devices
Systems, methods, tangible non-transitory computer-readable media, and devices associated with object localization and generation of compressed feature representations are provided. For example, a computing system can access training data including a source feature representation and a target feature representation. An encoded target feature representation can be generated based on the target feature representation and a machine-learned encoding model. A binarized target feature representation can be generated based on the encoded target feature representation and lossless binarization operations. A reconstructed target feature representation can be generated based on the binarized target feature representation and a machine-learned decoding model. A matching score for the source feature representation and the reconstructed target feature representation can be determined. A loss associated with the matching score can be determined. Parameters of the machine-learned encoding model and the machine-learned decoding model can be adjusted based on the loss.
TYRE SIDEWALL IMAGING METHOD
A computer implemented method is proposed for classifying one or more embossed and/or engraved markings on a sidewall of a tyre into one or more classes comprising digital image data of the sidewall of the tyre. The method comprises generating a first image channel from a first portion of the digital image data relating to a corresponding first portion of the sidewall of the tyre. Generating the first image channel comprises performing histogram equalisation on the first portion of the digital image data to generate the first image channel. The method further comprises generating a first feature map using the first image channel and applying a first classifier to the first feature map to classify said embossed and/or engraved markings into one or more first classes.
IMAGE RECOGNITION METHOD, APPARATUS, TERMINAL, AND STORAGE MEDIUM
An image recognition method, apparatus, terminal, and storage medium are disclosed in embodiments of the present disclosure. A target image may be acquired, the target image being an image of a certificate to be recognized; text area recognition is performed on the target image to obtain a text area image of a target text corresponding to the certificate to be recognized; a text direction of the target text is determined according to the text area image; direction adjustment is performed on the text area image according to the text direction to obtain an adjusted text area image; and text recognition is performed on the adjusted text area image to obtain a text content of the target text.
Dark web content analysis and identification
In some examples, dark web content analysis and identification may include ascertaining data that includes text and images, and analyzing the data by performing deep learning based text and image processing to extract text embedded in the images, and deep embedded clustering to generate clusters. Clusters that are to be monitored may be ascertained from the generated clusters. A determination may be made as to whether the ascertained data is sufficient for classification. If so, a deep convolutional generative adversarial networks (DCGAN) based detector may be utilized to analyze further data with respect to the ascertained clusters, and alternatively, a convolutional neural network (CNN) based detector may be utilized to analyze the further data with respect to the ascertained clusters. Based on the analysis of the further data, an operation associated with a website related to the further data may be controlled.
DYNAMIC DETECTION AND RECOGNITION OF MEDIA SUBJECTS
A system for indexing animated content receives detections extracted from a media file, where each one of the detections includes an image extracted from a corresponding frame of the media file that corresponds to a detected instance of an animated character. The system determines, for each of the received detections, an embedding defining a set of characteristics for the detected instance. The embedding associated with each detection is provided to a grouping engine that is configured to dynamically configure at least one grouping parameter based on a total number of the detections received. The grouping engine is also configured to sort the detections into groups using the grouping parameter and the embedding for each detection. A character ID is assigned to each one of the groups of detections, and the system indexes the groups of detections in a database in association with the character ID assigned to each group.
Dynamic detection and recognition of media subjects
A system for indexing animated content receives detections extracted from a media file, where each one of the detections includes an image extracted from a corresponding frame of the media file that corresponds to a detected instance of an animated character. The system determines, for each of the received detections, an embedding defining a set of characteristics for the detected instance. The embedding associated with each detection is provided to a grouping engine that is configured to dynamically configure at least one grouping parameter based on a total number of the detections received. The grouping engine is also configured to sort the detections into groups using the grouping parameter and the embedding for each detection. A character ID is assigned to each one of the groups of detections, and the system indexes the groups of detections in a database in association with the character ID assigned to each group.
SELF-IMPROVING DOCUMENT CLASSIFICATION AND SPLITTING FOR DOCUMENT PROCESSING IN ROBOTIC PROCESS AUTOMATION
Systems and methods for classifying and splitting an electronic file into a plurality of extracted documents are provided. The electronic file is received. An initial portion of the electronic file is classified using a trained classifier and extracted from the electronic file as an extracted document associated with the classification. It is iteratively determined whether each respective next portion of the electronic file should be added to the extracted document until it is determined that the respective next portion should not be added to the extracted document. In response to determining that the respective next portion should be added to the extracted document, the respective next portion is extracted from the electronic file and added to the extracted document. In response to determining that the respective next portion should not be added to the extracted document, the classifying and the iteratively determining are repeated using the respective next portion as the initial portion. The extracted documents are output. The trained classifier can be trained to learn sets of word vectors and other relevant information associated with document classifications, in order to improve accuracy.
Mapper component for a neuro-linguistic behavior recognition system
Techniques are disclosed for generating a sequence of symbols based on input data for a neuro-linguistic model. The model may be used by a behavior recognition system to analyze the input data. A mapper component of a neuro-linguistic module in the behavior recognition system receives one or more normalized vectors generated from the input data. The mapper component generates one or more clusters based on a statistical distribution of the normalized vectors. The mapper component evaluates statistics and identifies statistically relevant clusters. The mapper component assigns a distinct symbol to each of the identified clusters.
END TO END TRAINABLE DOCUMENT EXTRACTION
A processor may receive an image and identify a plurality of characters in the image using a machine learning (ML) model. The processor may generate at least one word-level bounding box indicating one or more words including at least a subset of the plurality of characters and/or may generate at least one field-level bounding box indicating at least one field including at least a subset of the one or more words. The processor may overlay the at least one word-level bounding box and the at least one field-level bounding box on the image to form a masked image including a plurality of optically-recognized characters and one or more predicted fields for at least a subset of the plurality of optically-recognized characters.