Patent classifications
G06F16/5846
Suggested actions for images
- Juan Carlos Anorga ,
- David Lieb ,
- Madhur Khandelwal ,
- Evan Millar ,
- Timothy Novikoff ,
- Mugdha Kulkarni ,
- Leslie Ikemoto ,
- Jorge Verdu ,
- Jingyu Cui ,
- Sharadh Ramaswamy ,
- Raja Ratna Murthy Ayyagari ,
- Marc Cannon ,
- Alexander Roe ,
- Shaun Tungseth ,
- Songbo Jin ,
- Matthew Bridges ,
- Ruirui Jiang ,
- Jeremy Selier ,
- Austin Suszek ,
- Gang Song
Implementations relate to causing a command to be executed based on an image. In some implementations, a computer-implemented method includes obtaining and programmatically analyzing an image to determine suggested actions. The method causes a user interface to be displayed that includes user interface elements corresponding to default actions, and to suggested actions that are determined based on analyzing the image. The method receives user input indicative of selection of a particular action from the default actions and the suggested actions. The method causes a command to be executed by a computing device for the particular action that was selected.
Visual image search using text-based search engines
The present technology analyzes the content of images to create complex representations of the images and then reduces the complexity of these representations into a size that is both suitable for comparison but also contains critical image descriptive aspects. These reduced complexity representations can then be used to efficiently search for similar images. Moreover, the reduced complexity representations are formatted such that they can take advantage of existing text search engines, which are well suited to efficiently searching through a large number of unique results.
Image search method, apparatus, and device
Embodiments of the specification provide an image search method, an apparatus, and a device. The method includes: obtaining an input image associated with an image search, wherein the input image includes a plurality of first text blocks; selecting a to-be-processed image from a target database, wherein the to-be-processed image includes a plurality of second text blocks; and generating a first graph structural feature based on the plurality of first text blocks; generating a second graph structural feature based on the plurality of second text blocks; determining that the first graph structural feature and the second graph structural feature satisfy a condition; and in response to determining that the first graph structural feature and the second graph structural feature satisfy the condition, outputting the to-be-processed image as a search result.
System and method of identifying visual objects
A system and method of identifying objects is provided. In one aspect, the system and method includes a hand-held device with a display, camera and processor. As the camera captures images and displays them on the display, the processor compares the information retrieved in connection with one image with information retrieved in connection with subsequent images. The processor uses the result of such comparison to determine the object that is likely to be of greatest interest to the user. The display simultaneously displays the images the images as they are captured, the location of the object in an image, and information retrieved for the object.
Generating accurate and natural captions for figures
Techniques of captioning for figures includes generating a caption unit for a figure by defining a finite set of caption types. From each caption type, additional input for that caption type, as well as figure image data and figure metadata, an automated system may generate a respective caption unit, each caption unit including a sequence of words. Further, the generated caption for a figure includes a combination of the generated caption units.
Artificial intelligence device for providing search service and method thereof
Disclosed herein are an artificial intelligence device including a memory configured to store user interest data, a processor configured to generate a keyword combination including at least one of a time keyword, a place keyword, an object keyword or an application type keyword based on the user interest data, and a display configured to display at least one of a time keyword, a place keyword, an object keyword or an application type keyword included in the keyword combination.
IDENTIFYING PRODUCT METADATA FROM AN ITEM IMAGE
A metadata extraction machine accesses an image that depicts an item. The item depicted in the image may have an attribute that describes a characteristic of the item and an attribute descriptor that corresponds to the attribute of the item and specifies a value of the attribute. The metadata extraction machine performs an analysis of the image. The analysis may include identifying the attribute descriptor corresponding to the attribute based on image segmentation of the image. The metadata extraction machine transmits a communication to a device of a user based on the identifying of the attribute descriptor corresponding to the attribute of the item depicted in the image.
Systems and methods for improved optical character recognition of health records
Systems and methods to improve the optical character recognition of records, and in particular health records, are provided. An image of a medical record is received, and an initial optical image recognition (OCR) on the image is performed to identify text information. The OCR signal quality may be measured, and areas of insufficient OCR signal quality may be isolated. The signal quality is determined by a weighted average of semantic analysis of the resulting text, and/or OCR accuracy measures. The OCR process may be repeated on the isolated regions of lower signal quality, each time using a different OCR transform, until all regions are completed with a desired degree of signal quality (accuracy). All the regions of the document may then be recompiled into a single document for outputting.
Propensity model based optimization
Apparatuses, systems, methods, and computer program products are presented for a propensity module based optimization. An apparatus comprises a processor and a memory that stores code executable by the processor to receive an electronic submission for a pass/fail interface, identify information from the electronic submission to suggest to a user for entering into an input field for the pass/fail interface prior to submitting the electronic submission to the pass/fail interface to reduce a likelihood that the electronic submission will be rejected at the pass/fail interface, determine the likelihood that the electronic submission will be accepted by the pass/fail interface, and submit the electronic submission to the pass/fail interface in response to the likelihood satisfying a threshold.
Logo picture processing method, apparatus, device and medium
The present disclosure provides a logo picture processing method, apparatus, device and medium, and relates to technical field of image processing, and specifically to the technical field of artificial intelligence such as deep learning and computer vision. The logo picture processing method includes: obtaining a logo picture including: a current logo graph and current text information; performing text recognition on the logo picture to obtain the current text information; searching for a picture that matches both the current logo graph and the current text information, to obtain a matched picture. The present disclosure may improve the accuracy of the matched picture of the logo picture and thereby improve the logo picture recognition accuracy.