G06F16/58

Automated image retrieval with graph neural network

A content retrieval system uses a graph neural network architecture to determine images relevant to an image designated in a query. The graph neural network learns a new descriptor space that can be used to map images in the repository to image descriptors and the query image to a query descriptor. The image descriptors characterize the images in the repository as vectors in the descriptor space, and the query descriptor characterizes the query image as a vector in the descriptor space. The content retrieval system obtains the query result by identifying a set of relevant images associated with image descriptors having above a similarity threshold with the query descriptor.

Automated image retrieval with graph neural network

A content retrieval system uses a graph neural network architecture to determine images relevant to an image designated in a query. The graph neural network learns a new descriptor space that can be used to map images in the repository to image descriptors and the query image to a query descriptor. The image descriptors characterize the images in the repository as vectors in the descriptor space, and the query descriptor characterizes the query image as a vector in the descriptor space. The content retrieval system obtains the query result by identifying a set of relevant images associated with image descriptors having above a similarity threshold with the query descriptor.

Localizing relevant objects in multi-object images

Solutions for localizing relevant objects in multi-object images include receiving a multi-object image; detecting a plurality of detected objects within the multi-object image; generating a primary heatmap for the multi-object image, the primary heatmap having at least one region of interest; determining a relevant detected object corresponding to a region of interest in the primary heatmap; determining an irrelevant detected object not corresponding to a region of interest in the primary heatmap; and indicating the relevant detected object as an output result but not indicating the irrelevant detected object as an output result. Some examples identify a plurality of objects that are visually similar to the relevant object and displaying the visually similar objects to a user, for example as recommendations of alternative catalog items on an e-commerce website. Some examples are able to identify a plurality of relevant objects and display multiple sets of visually similar objects.

MEDIA UNIT RETRIEVAL AND RELATED PROCESSES
20230124063 · 2023-04-20 ·

Media unit retrieval methods, systems and computer program products are provided that allow a user to search for an item by iteratively presenting media units such as images representing items to the user and receiving user input consisting of selections of the presented media units (including possibly the empty selection). Features, or attributes, a user is interested in, for example semantic features, are inferred from the interaction and media units are retrieved for presentation based on similarity with user-selected media units, through sampling of a probability distribution describing the intent or interests, or combinations of approaches. Accordingly, the user-experience is akin to a conversation about what the user is looking for. Retrieval may be based on both selected and unselected media units and the selection may comprise making a selection with a single action. Further, a database of media units can capture similarity relationships for efficient media unit retrieval.

MEDIA UNIT RETRIEVAL AND RELATED PROCESSES
20230124063 · 2023-04-20 ·

Media unit retrieval methods, systems and computer program products are provided that allow a user to search for an item by iteratively presenting media units such as images representing items to the user and receiving user input consisting of selections of the presented media units (including possibly the empty selection). Features, or attributes, a user is interested in, for example semantic features, are inferred from the interaction and media units are retrieved for presentation based on similarity with user-selected media units, through sampling of a probability distribution describing the intent or interests, or combinations of approaches. Accordingly, the user-experience is akin to a conversation about what the user is looking for. Retrieval may be based on both selected and unselected media units and the selection may comprise making a selection with a single action. Further, a database of media units can capture similarity relationships for efficient media unit retrieval.

IMAGE PROCESSING APPARATUS, IMAGE PROCESSING METHOD, IMAGE PROCESSING PROGRAM, AND RECORDING MEDIUM STORING PROGRAM
20230121182 · 2023-04-20 · ·

An image processing apparatus, an image processing method, a program, and a recording medium storing the program capable of extracting, from an input first image group, an image similar to an image extracted from a reference image group are provided. A first image group including a plurality of images of a user is transmitted to an order reception server. A second image group similar to the first image group is searched from a plurality of reference image groups stored in the order reception server. An image similar to an image previously extracted from the second image group is extracted from the first image group. An album is generated by arranging the image extracted from the first image group in a layout similar to a layout of an album generated from the second image group.

SYSTEMS AND METHODS FOR SCREENSHOT LINKING

Systems and methods of the present disclosure are directed to analyzing screenshots A system can include a computing device including a processor coupled to a memory and a display screen configured to display content. The system can include an application stored on the memory and executable by the processor. The application can include a screenshot receiver configured to access, from storage to which a screenshot of the content displayed on the display screen captured using a screenshot function of the computing device is stored, the screenshot including an image and a predetermined marker. The application can include a marker detector configured to detect the predetermined marker included in the screenshot. The application can include a link identifier configured to identify, using the predetermined marker, a link to a resource mapped to the image included in the screenshot, the resource accessible by the computing device via the link.

Display Apparatus with Intelligent User Interface

A display apparatus includes presence detection circuitry for detecting an individual in proximity to the display apparatus; a display for displaying video content and a user interface; a processor in communication with the user input circuitry, the display, and the search history database; and non-transitory computer readable media in communication with the processor that stores instruction code. The instruction code is executed by the processor and causes the processor to: a) determine, from the presence detection circuitry, a user in proximity of the display apparatus; b) determine one or more program types associated with the user; c) determine available programs that match the predicted one or more programs; and d) update the user interface to depict a listing of one or more of the available programs that match the predicted one or more programs.

IMAGE GENERATION SYSTEM AND METHOD

Embodiments of this application provide an image generation system and method. In an exemplary manufacturing industry scenario, a style requirement of a product category in a manufacturing industry is automatically captured according to user behavior data and product description information associated with the product category. Based on these data, a style description text may be generated and converted to product images by using a text prediction-based image generation model. The product images are further screened by using an image-text matching model, to obtain a product image with high quality. This process covers from style description text mining to text-to-image prediction to image quality evaluation. It provides an automation product image generation capability for the manufacturing industry, shorten a cycle of designing and producing the product image in the manufacturing industry, and improve production efficiency of the product image.

Systems and methods for providing media content listings according to points of interest
11632593 · 2023-04-18 · ·

Systems and methods are provided for allowing a user to obtain a listing of points of interest and associated media content listings based on the user's current geographic location. The user's current geographic location may be determined using, for example, a GPS transceiver incorporated in the user's user access device. Information may then be communicated from a remote server to the user access device that identifies points of interest associated with the geographic location as well as media content listings associated with the points of interest.