Patent classifications
G06V30/147
Semantic page segmentation of vector graphics documents
Disclosed systems and methods categorize text regions of an electronic document into document object types based on a combination of semantic information and appearance information from the electronic document. A page segmentation application executing on a computing device provides a textual feature representation and a visual feature representation to a neural network. The application identifies a correspondence between a location of the set of pixels in the electronic document and a location of a particular document object type in an output page segmentation. The application further outputs a classification of the set of pixels as being the particular document object type based on the identified correspondence.
Image processing apparatus, control method for image processing apparatus, and storage medium
An image processing apparatus includes a reading unit configured to read a document, a processing unit configured to perform character recognition processing of recognizing characters included in an image of the document read by the reading unit, an identification unit configured to identify transfer destination information regarding a transfer destination and a transfer amount from a result obtained by the character recognition processing performed by the processing unit, a transmission unit configured to transmit the transfer destination information regarding the transfer destination and the transfer amount which are identified by the identification unit, a storage unit configured to store the transfer destination information regarding the transfer destination, and an object display unit configured to display an object to call the transfer destination information regarding the transfer destination stored in the storage unit.
Method, system, medium, and smart device for cutting video using video content
The present invention discloses a method and system for cutting video using video content. The method comprises: acquiring recorded video produced by user's recording operation; extracting features of recorded audio in the recorded video and judging whether the recorded audio is damaged; and if not, extracting human voice data from the recorded audio which has been filtered out background sound, intercepting video segment corresponding to effective human voice, and displaying the video segment as clip video; and if yes, extracting image feature data of person's mouth shape and human movements in the recorded video after image processing, fitting the image feature data and the human voice data which has been filtered out background sound, and displaying the video segment with the highest fitting degree as clip video.
Systems and methods for capturing data from a medical device
A method for transferring data from a medical device to a server comprises receiving a video stream from the medical device, capturing an image from the video stream, transmitting the image to the server via a data network, and extracting the data from the image. The image may illustrate and/or represent data over a period of time. The method may also comprise transmitting, from a data module receiving the video stream from the medical device, a signal to a router that indicates that the data module is connected to the network. The method may also comprise transmitting a command to the data module to start capturing the image, transferring the image to the router, broadcasting a signal indicating that the data module has captured the image, receiving the broadcasted signal at the server, and storing the image at the server.
Artificial intelligence assisted warranty verification
A system for performing remote artificial intelligence-assisted electronic warranty verification including at least one processor configured to transmit an instruction to capture at least one product image of a specific product, receive and perform product image analysis on the product image to identify at least one product-distinguishing characteristic, transmit an instruction to capture an image of a purchase receipt, receive and perform receipt image analysis on the purchase receipt image to identify product purchase information, access a universal data structure containing data on products offered by suppliers, use the at least one product-distinguishing characteristic and product purchase information to identify in the universal data structure the specific product, identify in the universal data structure a link to a warranty data structure of the supplier, access the link to lookup the specific product and receive a warranty coverage indication from the supplier data structure, and transmit an indication of warranty coverage.
GEOGRAPHIC OBJECT DETECTION APPARATUS AND GEOGRAPHIC OBJECT DETECTION METHOD
A geographic object recognition unit (120) recognizes, using image data (192) obtained by photographing in a measurement region where a geographic object exists, a type of the geographic object from an image that the image data (192) represents. A position specification unit (130) specifies, using three-dimensional point cloud data (191) indicating a three-dimensional coordinate value of each of a plurality of points in the measurement region, a position of the geographic object.
Multi-dimensional table reproduction from image
Embodiments facilitate selection and assignment of a known user model, based upon input comprising table images of original data. A table engine receives the image and performs pre-processing (e.g., rasterization, Optical Character Recognition, coordinate representation) thereupon to identify image entities. After filtering original numerical data, a similarity (e.g., a distance) is calculated between an image entity and a dimension member of the known user model. Based upon this similarity, the table engine selects and assigns the known user model to the incoming tables images, generating a file representing table columns and rows. This file is received at the UI of an analytics platform, which in turn populates the model with data of the user (rather than the original data) via an API. Embodiments may be particularly valuable in allowing a user to rapidly generate multi-dimensional tables comprising their own data, based upon raw table images received from an external party.
PRODUCT/SERVICE ORDERING SYSTEM, PRODUCT/SERVICE ORDERING METHOD, AND PROGRAM FOR THE SAME
To provide a product/service ordering system and the like that search for a similar product/image based on a product/service image shot and stored by an orderer and instantly transmit product/service-related information to the orderer, to enable an order to be placed quickly. A product/service ordering system enables an order to be placed based on a product/service image, and includes product/service master information storing means, which is for accumulating information on a product/service in a server; image determining means, which is for performing distinguishing between the product/service image and a product/service similar image extracted from product/service master information accumulated in the server or from external big data that is not accumulated in the server and includes a web article on the Internet; and product/service-related information searching means, which is for searching the product/service master information or the external big data for the information on the product/service.
METHOD AND SYSTEM TO DETECT A TEXT FROM MULTIMEDIA CONTENT CAPTURED AT A SCENE
Detection of textual phrases in a non-horizontal orientation at a scene is a target problem. This disclosure relates to a processor implemented method to detect a text from multimedia content captured at a scene. An input original image is processed by a trained model to obtain individual character with bounding box on the original image. The original image is positioned by a gradient to obtain a rotated image if number of detected characters is not equal to number of expected characters on the original image. At least one missing character bounding box on the original image and on the rotated image are estimated to construct a horizontal text image if number of detected characters is not equal to number of expected characters on the rotated image. At least one missing character in the estimated bounding box is detected by at least one text returned from an optical character reader.
METHODS AND SYSTEMS FOR AUTHENTICATION OF A PHYSICAL DOCUMENT
Described herein are computerized methods and systems for authentication of a physical document. An image capture device coupled to a mobile device captures a sequence of images of a physical document as at least one of the physical document or the image capture device is rotated, during which the mobile device tracks the physical document throughout the sequence of images, and adjusts operational parameters of the image capture device based upon imaging conditions associated with the physical document. The mobile device selects images from the sequence of images and classifies the physical document using the selected images. The mobile device identifies a region of interest in the physical document using the selected images and the classification. The mobile device reconstructs the region of interest, generates an authentication score for the document using the reconstructed region of interest, and determines whether the physical document is authentic based upon the authentication score.