Patent classifications
G06V30/19
VISUALIZATION OF THE IMPACT OF TRAINING DATA
An example operation may include one or more of generating a plurality of bounding boxes at a plurality of content areas in an image corresponding to a plurality of pieces of text within the image, converting the plurality of bounding boxes into a plurality of bounding box vectors based on attributes of the plurality of bounding boxes, training a machine learning model to transform a bounding box into a location in vector space based on the plurality of bounding box vectors, and storing the trained machine learning model in memory.
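The conversion of bounding boxes into vectors can be illustrated with a minimal sketch. The abstract does not say which attributes are used, so the choice here (normalized center, normalized size, aspect ratio) is an assumption, as are the names `BoundingBox` and `box_to_vector`. The model training step is not shown.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class BoundingBox:
    """Hypothetical box around a piece of text, in pixel coordinates."""
    x: float       # left edge
    y: float       # top edge
    width: float
    height: float


def box_to_vector(box: BoundingBox, image_width: float, image_height: float) -> List[float]:
    """Turn box attributes into a fixed-length vector.

    Assumed attributes: center and size normalized by the image
    dimensions, plus the box's aspect ratio.
    """
    cx = (box.x + box.width / 2) / image_width
    cy = (box.y + box.height / 2) / image_height
    w = box.width / image_width
    h = box.height / image_height
    aspect = box.width / box.height
    return [cx, cy, w, h, aspect]


# Two illustrative boxes in a 400 x 200 pixel image.
boxes = [BoundingBox(10, 20, 100, 25), BoundingBox(10, 60, 200, 50)]
vectors = [box_to_vector(b, 400, 200) for b in boxes]
```

Normalizing by the image size keeps the vectors comparable across images of different resolutions, which is one plausible reason to prefer relative over absolute coordinates.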
SYSTEM AND METHOD FOR GENERATING BEST POTENTIAL RECTIFIED DATA BASED ON PAST RECORDINGS OF DATA
Various methods, apparatuses/systems, and media for data processing are disclosed. A processor receives a digital document; applies an optical character recognition (OCR) algorithm to the received digital document using an OCR tool; identifies defective data extracted by the OCR tool as a result of the relatively inferior image quality of the received digital document; implements an auto-rectification algorithm on the identified defective data; automatically generates, in response to implementing the auto-rectification algorithm, corresponding auto-rectified data for each item of identified defective data; records the defective data and the corresponding auto-rectified data at a field level; receives user input data on the recorded auto-rectified data; determines whether the auto-rectified data is correct; and populates, based on determining that the auto-rectified data is correct, a machine learning model with the received user input data for use with subsequently received digital documents.
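The field-level recording and review steps can be sketched as follows. Everything here is hypothetical: the abstract does not describe the rectification algorithm or how correctness is determined, so `rectify` uses a toy character-substitution rule, and a record counts as correct only when the user's input matches the auto-rectified value.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Toy rule (an assumption): common OCR confusions in numeric fields.
_OCR_FIXES = str.maketrans({"O": "0", "l": "1", "S": "5"})


def rectify(value: str) -> str:
    """Hypothetical auto-rectification: replace look-alike characters."""
    return value.translate(_OCR_FIXES)


@dataclass
class FieldRecord:
    """One field-level record of defective and auto-rectified data."""
    field_name: str
    defective_value: str
    rectified_value: str
    user_value: Optional[str] = None  # filled in once a user reviews the field

    def is_correct(self) -> bool:
        # Assumption: the rectified value is correct if the user confirms it.
        return self.user_value is not None and self.user_value == self.rectified_value


def collect_training_examples(records: List[FieldRecord]) -> List[Tuple[str, str]]:
    """Keep (defective, user-confirmed) pairs only for correct rectifications."""
    return [(r.defective_value, r.user_value) for r in records if r.is_correct()]


records = [
    FieldRecord("invoice_total", "1O5.S0", rectify("1O5.S0")),
    FieldRecord("account_no", "l2B4", rectify("l2B4")),
]
records[0].user_value = "105.50"  # user confirms the rectified value
records[1].user_value = "1284"    # user disagrees: rectified value was "12B4"

training_examples = collect_training_examples(records)
```

Only the first record reaches `training_examples`, mirroring the abstract's condition that the model is populated based on determining that the auto-rectified data is correct.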
DOCUMENT PROCESSING
A method of document processing is provided. An implementation includes: obtaining target text information and target layout information of a target document, where the target text information includes target text contained in the target document and character position information of the target text, and the target layout information characterizes the region where text in the target document is located; fusing the target text information and the target layout information to obtain first multimodal information of the target document; and inputting the first multimodal information into an intelligent document comprehension model to obtain, as output of the model, at least one target word in the target document and at least one feature vector corresponding to the at least one target word, where each target word is related to the semantics of the target document.
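One simple reading of the fusion step is to attach to each piece of text the layout region that contains it. The abstract does not define how fusion is performed, so the sketch below is an assumption throughout: `Token`, `Region`, `fuse`, and the center-point containment rule are illustrative names and choices, and the comprehension model itself is not shown.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

Box = Tuple[float, float, float, float]  # (x0, y0, x1, y1)


@dataclass
class Token:
    """Text plus its position (text information)."""
    text: str
    box: Box


@dataclass
class Region:
    """A labeled area of the page where text is located (layout information)."""
    label: str
    box: Box


def center_inside(outer: Box, inner: Box) -> bool:
    """True if the center of `inner` lies within `outer`."""
    cx = (inner[0] + inner[2]) / 2
    cy = (inner[1] + inner[3]) / 2
    return outer[0] <= cx <= outer[2] and outer[1] <= cy <= outer[3]


def fuse(tokens: List[Token], regions: List[Region]) -> List[Dict[str, object]]:
    """Combine each token with the first region containing its center."""
    fused = []
    for token in tokens:
        label: Optional[str] = next(
            (r.label for r in regions if center_inside(r.box, token.box)), None
        )
        fused.append({"text": token.text, "box": token.box, "region": label})
    return fused


tokens = [Token("Invoice", (10, 10, 80, 30)), Token("Total", (10, 200, 50, 220))]
regions = [Region("title", (0, 0, 300, 50)), Region("table", (0, 150, 300, 400))]
multimodal = fuse(tokens, regions)
```

Each entry of `multimodal` carries both the text and its layout context, which is the kind of combined input a downstream model could consume.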
RECOGNIZING HANDWRITTEN TEXT BY COMBINING NEURAL NETWORKS
A method for recognizing handwritten text is disclosed. The method comprises receiving data comprising a sequence of ink points; applying the received data to a neural network-based sequence classifier trained with a Connectionist Temporal Classification (CTC) output layer using forced alignment to generate an output; generating a character hypothesis as a portion of the sequence of ink points; applying the character hypothesis to a character classifier to obtain a first probability corresponding to the probability that the character hypothesis includes the given character; processing the output of the CTC output layer to determine a second probability corresponding to the probability that the given character is observed within the character hypothesis; and combining the first probability and the second probability to obtain a combined probability corresponding to the probability that the character hypothesis includes the given character.
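The final step combines two probability estimates for the same character hypothesis. The abstract does not state the combination rule, so the sketch below assumes a weighted geometric mean (a log-linear interpolation); the function name `combine_probabilities` and the `weight` parameter are hypothetical.

```python
def combine_probabilities(p_classifier: float, p_ctc: float, weight: float = 0.5) -> float:
    """Weighted geometric mean of two probabilities (an assumed rule).

    p_classifier: first probability, from the character classifier.
    p_ctc:        second probability, derived from the CTC output layer.
    weight:       share given to the classifier; 1 - weight goes to CTC.
    """
    for name, value in (("p_classifier", p_classifier), ("p_ctc", p_ctc), ("weight", weight)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must lie in [0, 1], got {value}")
    return (p_classifier ** weight) * (p_ctc ** (1.0 - weight))


# Equal weighting of a confident classifier and a less confident CTC estimate.
combined = combine_probabilities(0.9, 0.4)
```

With equal weights, the result is the square root of the product of the two inputs, so a low estimate from either source pulls the combined value down more strongly than an arithmetic mean would.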
SYSTEM AND METHOD FOR GENERATING ACCESSIBLE USER EXPERIENCE DESIGN GUIDANCE MATERIALS
A system and method for generating accessible user experience (UX) design guidance materials for software products uses page elements that are optically extracted from an input UX prototype page image and automatically classified into predefined element types to find accessibility rules for at least some of the extracted page elements. At least one accessible UX design guidance material is generated for the input UX prototype page image that indicates the extracted page elements and the accessibility rules corresponding to at least some of the extracted page elements.
STORAGE SPACE OPTIMIZATION FOR EMAILS
In some implementations, a storage optimization system may receive a plurality of emails. Accordingly, the system may identify at least one email associated with a limited capacity in the plurality of emails. The system may further scan, from the at least one email, one or more hyperlinks to determine a website associated with the at least one email and an identifier associated with an event. The system may determine, using a database, a traversal path and at least one application programming interface (API) call associated with the website. Accordingly, the system may traverse the website using the traversal path and the at least one API using the identifier to determine that the limited capacity is filled. The system may delete the at least one email associated with the limited capacity based on determining that the limited capacity is filled.
USER INTERFACES FOR MANAGING VISUAL CONTENT IN MEDIA
The present disclosure generally relates to methods and user interfaces for managing visual content at a computer system. In some embodiments, methods and user interfaces for managing visual content in media are described. In some embodiments, methods and user interfaces for managing visual indicators for visual content in media are described. In some embodiments, methods and user interfaces for inserting visual content in media are described. In some embodiments, methods and user interfaces for identifying visual content in media are described. In some embodiments, methods and user interfaces for translating visual content in media are described. In some embodiments, methods and user interfaces for managing user interface objects for visual content in media are described.