G06V30/18057

Text classification
11551461 · 2023-01-10 · ·

A text classifying apparatus (100), an optical character recognition unit (1), a text classifying method (S220) and a program are provided for performing the classification of text. A segmentation unit (110) segments an image into a plurality of lines of text (401-412; 451-457; 501-504; 701-705) (S221). A selection unit (120) selects a line of text from the plurality of lines of text (S222-S223). An identification unit (130) identifies a sequence of classes corresponding to the selected line of text (S224). A recording unit (140) records, for the selected line of text, a global class corresponding to a class of the sequence of classes (S225-S226). A classification unit (150) classifies the image according to the global class, based on a confidence level of the global class (S227-S228).

TRAINING OBJECT DETECTION SYSTEMS WITH GENERATED IMAGES

Apparatuses, systems, and techniques to identify objects within an image using self-supervised machine learning. In at least one embodiment, a machine learning system is trained to recognize objects by training a first network to recognize objects within images that are generated by a second network. In at least one embodiment, the second network is a controllable network.

METHODS, SYSTEMS, ARTICLES OF MANUFACTURE, AND APPARATUS FOR DECODING PURCHASE DATA USING AN IMAGE

Methods, apparatus, systems, and articles of manufacture are disclosed that decode purchase data using an image. An example apparatus includes a dictionary including associated product descriptions and barcodes, interface circuitry, and processing circuitry to execute machine readable instructions to obtain purchase details and barcodes corresponding to a receipt, the purchase details including receipt product descriptions, generate a search query that includes a first receipt product description of the receipt product descriptions, a list of barcodes corresponding to the barcodes, and a store identifier associated with the receipt, execute a search against the dictionary using the search query to identify a barcode from the list of barcodes that corresponds to the first receipt product description, and in response to identifying the barcode that corresponds to the first receipt product description, associating the barcode and the first receipt product description and adding the association to the dictionary.

METHOD AND APPARATUS FOR INTELLIGENT PHARMACOVIGILANCE PLATFORM
20220415467 · 2022-12-29 ·

Disclosed herein are a method and apparatus for providing a pharmacovigilance (PV) platform, wherein a method for operating a server may include: receiving input data from a user device; generating at least one command set from the input data by using a first artificial intelligence model that is selected to process the input data; determining whether or not a user that provides the input data has an authority to execute the at least one command set; generating a result of a task, when the user has the authority, by using a second artificial intelligence model that is selected to perform the task according to the at least one command set; generating output data that displays the result of the task by using a visualization module that is selected to visualize the result of the task; and transmitting the output data to the user device.

Systems and methods of instant-messaging bot for robotic process automation and robotic textual-content extraction from images

Systems and methods of instant-messaging bot for robotic process automation (RPA) and robotic textual-content extraction from digital images include a chatbot application, a software RPA manager, and an instant-messaging (IM) platform, all built for an enterprise. The enterprise IM platform is connected to one or more public IM platforms over the Internet. The RPA manager contains multiple modules of enterprise workflows and receives instructions from the enterprise chatbot for executing individual workflows. The system allows enterprise users connected to the enterprise IM platform, and external users connected to the public IM platforms, to use instant messaging to initiate enterprise workflows that are automated with the help of the enterprise chatbot and delivered via instant messaging. Furthermore, textual-content extraction from digital images is incorporated in the RPA manager as an enterprise workflow, and provides improved convolutional neural network (CNN) methods for textual-content extraction.

COMPUTATIONALLY EFFICIENT AND ROBUST EAR SADDLE POINT DETECTION

A computer-implemented method includes receiving a two-dimensional (2-D) side view face image of a person, identifying a bounded portion or area of the 2-D side view face image of the person as an ear region-of-interest (ROI) area showing at least a portion of an ear of the person, and processing the identified ear ROI area of the 2-D side view face image, pixel-by-pixel, through a trained fully convolutional neural network model (FCNN model) to predict a 2-D ear saddle point (ESP) location for the ear shown in the ear ROI area. The FCNN model has an image segmentation architecture.

Calculation practicing method, system, electronic device and computer readable storage medium

The disclosure provides a calculation practicing method, a system, an electronic device and a computer readable storage medium, the calculation practicing method includes: providing a calculation question; identifying the type and content of the calculation question; generating an answer area according to the type and content of the calculation question; receiving an answering operation in which the user inputs the answer string for the calculation question in the answer area; identifying the answer string inputted by the user; and determining whether each of the answer characters in the answer string is correct, if there is an incorrect answer character, it will be marked, so that the calculation practice can be realized through the electronic device, which is convenient for students to carry out training.

Utilizing machine learning models, position based extraction, and automated data labeling to process image-based documents

A device may receive image data that includes an image of a document and lexicon data identifying a lexicon, and may perform an extraction technique on the image data to identify at least one field in the document. The device may utilize form segmentation to automatically generate label data identifying labels for the image data, and may process the image data, the label data, and data identifying the at least one field, with a first model, to identify visual features. The device may process the image data and the visual features, with a second model, to identify sequences of characters, and may process the image data and the sequences of characters, with a third model, to identify strings of characters. The device may compare the lexicon data and the strings of characters to generate verified strings of characters that may be utilized to generate a digitized document.

Deep learning based on image encoding and decoding
11593632 · 2023-02-28 · ·

A deep learning based compression (DLBC) system trains multiple models that, when deployed, generates a compressed binary encoding of an input image that achieves a reconstruction quality and a target compression ratio. The applied models effectively identifies structures of an input image, quantizes the input image to a target bit precision, and compresses the binary code of the input image via adaptive arithmetic coding to a target codelength. During training, the DLBC system reconstructs the input image from the compressed binary encoding and determines the loss in quality from the encoding process. Thus, the models can be continually trained to, when applied to an input image, minimize the loss in reconstruction quality that arises due to the encoding process while also achieving the target compression ratio.

Reservoir computing neural networks based on synaptic connectivity graphs

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing a reservoir computing neural network. In one aspect there is provided a reservoir computing neural network comprising: (i) a brain emulation sub-network, and (ii) a prediction sub-network. The brain emulation sub-network is configured to process the network input in accordance with values of a plurality of brain emulation sub-network parameters to generate an alternative representation of the network input. The prediction sub-network is configured to process the alternative representation of the network input in accordance with values of a plurality of prediction sub-network parameters to generate the network output. The values of the brain emulation sub-network parameters are determined before the reservoir computing neural network is trained and are not adjusting during training of the reservoir computing neural network.