G06F16/56

METHOD AND SYSTEM FOR PERSONALIZED SUBSTITUTE PRODUCT RECOMMENDATION

Product recommendation is a very important aspect of e-commerce applications. Traditional product recommendation systems recommend products similar to a query image provided by a user and allows minimum or no personalization. It is challenging to incorporate personalization due to the presence of overlapping fine-grained attributes, variations in attribute style and visual appearance, small inter-class variation and class imbalance in the images of products. Embodiments of present disclosure address these challenges by a method of personalized substitute product recommendation using Personalized Attribute Search Networks (PAtSNets) comprising neural network layers interleaved with Attentive Style Embedding (ASE) modules to generate attribute-aware feature representation vector of a query image provided by the user and conforming to the personalization instructions specified by the user. This feature representation vector is then used to recommend substitute products to the user. Thus, embodiments of present disclosure enable accurate substitute product recommendation suiting user requirements.

METHOD AND SYSTEM FOR PERSONALIZED SUBSTITUTE PRODUCT RECOMMENDATION

Product recommendation is a very important aspect of e-commerce applications. Traditional product recommendation systems recommend products similar to a query image provided by a user and allows minimum or no personalization. It is challenging to incorporate personalization due to the presence of overlapping fine-grained attributes, variations in attribute style and visual appearance, small inter-class variation and class imbalance in the images of products. Embodiments of present disclosure address these challenges by a method of personalized substitute product recommendation using Personalized Attribute Search Networks (PAtSNets) comprising neural network layers interleaved with Attentive Style Embedding (ASE) modules to generate attribute-aware feature representation vector of a query image provided by the user and conforming to the personalization instructions specified by the user. This feature representation vector is then used to recommend substitute products to the user. Thus, embodiments of present disclosure enable accurate substitute product recommendation suiting user requirements.

SYSTEMS AND METHODS FOR VISION-LANGUAGE DISTRIBUTION ALIGNMENT
20230162490 · 2023-05-25 ·

Embodiments described herein a CROss-Modal Distribution Alignment (CROMDA) model for vision-language pretraining, which can be used for retrieval downstream tasks. In the CROMDA mode, global cross-modal representations are aligned on each unimodality. Specifically, a uni-modal global similarity between an image/text and the image/text feature queue are computed. A softmax-normalized distribution is then generated based on the computed similarity. The distribution thus takes advantage of property of the global structure of the queue. CROMDA then aligns the two distributions and learns a modal invariant global representation. In this way, CROMDA is able to obtain invariant property in each modality, where images with similar text representations should be similar and vice versa.

SYSTEMS AND METHODS FOR VISION-LANGUAGE DISTRIBUTION ALIGNMENT
20230162490 · 2023-05-25 ·

Embodiments described herein a CROss-Modal Distribution Alignment (CROMDA) model for vision-language pretraining, which can be used for retrieval downstream tasks. In the CROMDA mode, global cross-modal representations are aligned on each unimodality. Specifically, a uni-modal global similarity between an image/text and the image/text feature queue are computed. A softmax-normalized distribution is then generated based on the computed similarity. The distribution thus takes advantage of property of the global structure of the queue. CROMDA then aligns the two distributions and learns a modal invariant global representation. In this way, CROMDA is able to obtain invariant property in each modality, where images with similar text representations should be similar and vice versa.

Technique for controlling virtual image generation system using emotional states of user

A method of operating a virtual image generation system comprises allowing an end user to interact with a three-dimensional environment comprising at least one virtual object, presenting a stimulus to the end user in the context of the three-dimensional environment, sensing at least one biometric parameter of the end user in response to the presentation of the stimulus to the end user, generating biometric data for each of the sensed biometric parameter(s), determining if the end user is in at least one specific emotional state based on the biometric data for the each of the sensed biometric parameter(s), and performing an action discernible to the end user to facilitate a current objective at least partially based on if it is determined that the end user is in the specific emotional state(s).

Technique for controlling virtual image generation system using emotional states of user

A method of operating a virtual image generation system comprises allowing an end user to interact with a three-dimensional environment comprising at least one virtual object, presenting a stimulus to the end user in the context of the three-dimensional environment, sensing at least one biometric parameter of the end user in response to the presentation of the stimulus to the end user, generating biometric data for each of the sensed biometric parameter(s), determining if the end user is in at least one specific emotional state based on the biometric data for the each of the sensed biometric parameter(s), and performing an action discernible to the end user to facilitate a current objective at least partially based on if it is determined that the end user is in the specific emotional state(s).

INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD AND STORAGE MEDIUM
20230063244 · 2023-03-02 ·

The present disclosure has an object to provide an information processing apparatus utilizing an image transmitted from a server and displayed on an operation screen of the apparatus's main body. An information processing apparatus includes: reception unit configured to receive notification information from a server system formed by one or more server apparatuses; obtainment unit configured to obtain an image according to identification information, being included in the notification information, for indicating a storage location of the image in the server system; display unit configured to display the image upon triggered by a lapse of a predetermined period or by a particular event; and execution unit configured to execute a function associated with an object in response to an input made by a user with respect to the object included in the image displayed by the display unit.

TRAINING OF DIFFERENTIABLE RENDERER AND NEURAL NETWORK FOR QUERY OF 3D MODEL DATABASE
20230111048 · 2023-04-13 ·

System and method for differentiable networks trainable to learn an optimized query of a 3D model database used for object recognition includes training a first differentiable network configured as a differentiable renderer by generating 2D images from 3D models of a first object of a dissimilar second object while optimizing rendering parameters for producing 2D images by gradient descent of a first triple loss function. Visual variation among the images is maximized. A second differentiable network configured as a convolutional neural network defined by a regression function is trained by generating searchable feature vectors of the 2D images. The feature vectors are determined using optimized neural network parameters determined by gradient descent of a second triple loss function to achieve high correlation to an input image of the first object and low correlation to images of the second object.

TRAINING OF DIFFERENTIABLE RENDERER AND NEURAL NETWORK FOR QUERY OF 3D MODEL DATABASE
20230111048 · 2023-04-13 ·

System and method for differentiable networks trainable to learn an optimized query of a 3D model database used for object recognition includes training a first differentiable network configured as a differentiable renderer by generating 2D images from 3D models of a first object of a dissimilar second object while optimizing rendering parameters for producing 2D images by gradient descent of a first triple loss function. Visual variation among the images is maximized. A second differentiable network configured as a convolutional neural network defined by a regression function is trained by generating searchable feature vectors of the 2D images. The feature vectors are determined using optimized neural network parameters determined by gradient descent of a second triple loss function to achieve high correlation to an input image of the first object and low correlation to images of the second object.

Method and apparatus for out-of-distribution detection

Methods and systems for out-of-distribution (OOD) detection in autonomous driving systems are described. A method for use in an autonomous driving system may include filtering feature vectors. The feature vectors may be filtered using a first filter to obtain clusters of feature vectors. The method may include assigning one or more images to a respective cluster based on a feature vector of the image. The method may include filtering a subset of the images using a second filter to determine a classification model. The method may include storing the classification model on a vehicle control system of a vehicle. The method may include detecting an image using a vehicle sensor. The method may include classifying the detected image based on the classification model. The method may include performing a vehicle action based on the classified detected image.