G06V30/268

Method and apparatus for building text classification model, and text classification method and apparatus

The present disclosure provides a method and apparatus for building a text classification model, and a text classification method and apparatus. The method of building a text classification model comprises: obtaining a training sample; obtaining a vector matrix corresponding to the text, after performing word segmentation for the text based on an entity dictionary; using the vector matrix corresponding to the text and a class of the text to train a first classification model and a second classification model respectively; during the training process, using a loss function of the first classification model and a loss function of the second classification model to obtain a loss function of the text classification model, and using the loss function of the text classification model to adjust parameters for the first classification model and the second classification model, to obtain the text classification model formed by the first classification model and the second classification model. The text classification method comprises: obtaining a to-be-classified text; obtaining a vector matrix corresponding to the text, after performing word segmentation for the text based on an entity dictionary; inputting the vector matrix into a text classification model, and obtaining a classification result of the text according to output of the text classification model. The text classification effect can be improved through the technical solutions of the present disclosure.

Method and device for verifying recognition result in character recognition

A method and a device for verifying a recognition result in character recognition are provided. The device constructs a hidden Markov chain for a character string to be recognized, using recognition result output of a character recognition process. The recognition result includes candidate characters of each character in the character string. The device solves for an optimal path forming a candidate character string according to the hidden Markov chain and a pre-trained state transition matrix. The device recognizes non-Chinese characters in the character string according to state transition probabilities in the optimal path. The device verifies the recognition result according to the non-Chinese characters. The device feeds back a verification result to the character recognition process, wherein the character recognition process applied to the character string to be recognized is modified by the verification result.

Handwriting recognition systems and methods
10748031 · 2020-08-18 · ·

The present disclosure includes systems and methods for handwriting recognition. Handwriting data is received. Geometric data of text in handwriting data is determined. Sub-characters of the text are determined. Sub-characters of text are matched to a model. Most probable characters of the text is determined based on the matching.

Electronic handwriting analysis through adaptive machine-learning
10740601 · 2020-08-11 · ·

An improved machine learning system is provided. For example, a content management server may provide a digital assessment of a user's handwriting to assess the user's knowledge of a language. The assessment may comprise adaptive technology to help determine initial questions to provide to the user as well as follow-up questions to clarify appropriate remediation content in a particular context. The content management server may also provide real-time analysis, including assessing multiple users at the same time in adjusting the assessment based on the digital input from each of these users. In some examples, the content management server may incorporate handwriting analysis methods to perform object detection and score handwriting input.

CONTENT-AWARE SELECTION
20200250453 · 2020-08-06 ·

An image editing program can include a content-aware selection system. The content-aware selection system can enable a user to select an area of an image using a label or a tag that identifies object in the image, rather than having to make a selection area based on coordinates and/or pixel values. The program can receive a digital image and metadata that describes an object in the image. The program can further receive a label, and can determine from the metadata that the label is associated with the object. The program can then select a bounding box for the object, and identify in the bounding box, pixels that represent the object. The program can then output a selection area that surrounds the pixels.

Method and apparatus for improving recognition accuracy for the handwritten input of alphanumeric characters and gestures
10726250 · 2020-07-28 · ·

A method for automatically selecting one of a plurality of recognition algorithms for a handwritten input of alphanumeric characters and/or gestures into a selected input field displayed on a screen using a touch-sensitive input apparatus comprises carrying out optical character recognition in a region of the screen which comprises at least the input field and the immediate environment of the input field, or carrying out voice recognition for a voice instruction acoustically output after the selected input field has been displayed. Terms describing field types are searched for in the result of the optical character recognition or the voice recognition, and a recognition algorithm which is adapted to a field type found in the result of the optical character recognition or the voice recognition is selected.

CATALOGING DATABASE METADATA USING A PROBABILISTIC SIGNATURE MATCHING PROCESS

A system and computer implemented method for cataloging database metadata using a probabilistic signature matching process are provided. The method includes receiving an input name to be matched to keys in a data corpus; dividing the received input name into a plurality of text segments; identifying a set of matching keys by matching each of the plurality text segments against keys in the data corpus; analyzing the set of matching keys to construct a tag; and cataloging the metadata with the matching key as the construct tag.

Generating digital document content from a digital image
10671799 · 2020-06-02 · ·

One or more embodiments of systems and methods for a digital content management system for creating a digital document from handwritten content are described herein. For example, the digital content management system receives a digital image of handwritten content and analyzes the digital image to identify handwritten content as well as to identify specific command indicators. In response to identifying a command indicator associated with a command to create a digital document, the digital content management system creates a new digital document and adds digital content portions to the digital document that correspond to the identified content portions identified within the handwritten content depicted within the digital image.

MAPPER COMPONENT FOR A NEURO-LINGUISTIC BEHAVIOR RECOGNITION SYSTEM
20200167679 · 2020-05-28 · ·

Techniques are disclosed for generating a sequence of symbols based on input data for a neuro-linguistic model. The model may be used by a behavior recognition system to analyze the input data. A mapper component of a neuro-linguistic module in the behavior recognition system receives one or more normalized vectors generated from the input data. The mapper component generates one or more clusters based on a statistical distribution of the normalized vectors. The mapper component evaluates statistics and identifies statistically relevant clusters. The mapper component assigns a distinct symbol to each of the identified clusters.

OBJECT RECOGNITION DEVICE
20200151463 · 2020-05-14 ·

An object recognition device according to an embodiment includes a camera that captures an image of an imaging area. A storage device stores, for each of a plurality of registered objects, dictionary feature information for identifying the corresponding object and dictionary boundary information for identifying an actual boundary area of the corresponding object. A processor receives the captured image from the camera, and determines an object area in the captured image. The processor extracts feature information from the object area, and, based on the extracted feature information compared to the dictionary feature information, identifies each object included in the object area. The processor also extracts boundary information corresponding to each identified object included in the object area, and, based on the extracted boundary information compared to the dictionary boundary information with respect to each identified object, determines an overlap state of each identified object in the object area.