G06V10/7715

Automatic graph scoring for neuropsychological assessments
11580636 · 2023-02-14 · ·

Systems and methods of the present invention provide for: receiving a digital image data; modifying the digital image data to reduce a width of a feature within the digital image data; executing a dimension reduction process on the feature; storing a feature vector comprising: at least one feature for each of the received digital image data, and a correct or incorrect label associated with each feature vector; selecting the feature vector from a data store; training a classification software engine to classify each feature vector according to the label; classifying the image data as correct or incorrect according to a classification software engine; and generating an output labeling a second digital image data as correct or incorrect.

Method for generating web code for UI based on a generative adversarial network and a convolutional neural network
11579850 · 2023-02-14 · ·

Provided is a method for generating web codes for a user interface (UI) based on a generative adversarial network (GAN) and a convolutional neural network (CNN). The method includes steps described below. A mapping relationship between display effects of a HyperText Markup Language (HTML) element and source codes of the HTML element is constructed. A location of an HTML element in an image I is recognized. Complete HTML codes of the image I are generated. The similarity between manually-written HTML codes and the generated complete HTML codes and the similarity between the image I and an image I.sub.1 generated by the generated complete HTML codes are obtained. After training, an image-to-HTML-code generation model M is obtained. A to-be-processed UI image is input into the model M so as to obtain corresponding HTML codes. According to the method of the present disclosure, an image-to-HTML-code generation model M can be obtained.

BEHAVIOR RECOGNITION METHOD AND SYSTEM, ELECTRONIC DEVICE AND COMPUTER-READABLE STORAGE MEDIUM
20230042187 · 2023-02-09 ·

A behavior recognition method and system, including: dividing video data into a plurality of video clips, performing frame extraction processing on each video clip to obtain frame images, and performing optical flow extraction on the frame images to obtain optical flow images; performing feature extraction on the frame images and the optical flow images to obtain feature maps of the frame images and the optical flow images; performing spatio-temporal convolution processing on the feature maps of the frame images and the optical flow images, and determining a spatial prediction result and a temporal prediction result; fusing the spatial prediction results of all the video clips to obtain a spatial fusion result, and fusing the temporal prediction results of all the video clips to obtain a temporal fusion result; and performing two-stream fusion on the spatial fusion result and the temporal fusion result to obtain a behavior recognition result.

Performance of Complex Optimization Tasks with Improved Efficiency Via Neural Meta-Optimization of Experts
20230040793 · 2023-02-09 ·

Example systems perform complex optimization tasks with improved efficiency via neural meta-optimization of experts. In particular, provided is a machine learning framework in which a meta-optimization neural network can learn to fuse a collection of experts to provide a predicted solution. Specifically, the meta-optimization neural network can learn to predict the output of a complex optimization process which optimizes over outputs from the collection of experts to produce an optimized output. In such fashion, the meta-optimization neural network can, after training, be used in place of the complex optimization process to produce a synthesized solution from the experts, leading to orders of magnitude faster and computationally more efficient prediction or problem solution.

TRAINING A NEURAL NETWORK USING A DATA SET WITH LABELS OF MULTIPLE GRANULARITIES
20230042450 · 2023-02-09 ·

This disclosure describes systems and methods for training a neural network with a training data set including data items labeled at different granularities. During training, each item within the training data set can be fed through the neural network. For items with labels of a higher granularity, weights of the network can be adjusted based on a comparison between the output of the network and the label of the item. For items with labels of a lower granularity, an output of the network can be fed through a conversion function that convers the output from the higher granularity to the lower granularity. The weights of the network can then be adjusted based on a comparison between the converted output and the label of the item.

SYSTEMS AND METHODS FOR OBJECT DETECTION

A computing system including a processing circuit in communication with a camera having a field of view. The processing circuit is configured to perform operations related to detecting, identifying, and retrieving objects disposed amongst a plurality of objects. The processing circuit may be configured to perform operations related to object recognition template generation, feature generation, hypothesis generation, hypothesis refinement, and hypothesis validation.

AUTOMATED HAPTICS GENERATION AND DISTRIBUTION

Embodiments provide systems and techniques for automated haptics generation and distribution. An example technique includes receiving media content from a computing device. The media content includes at least one of audio content or video content. One or more features of the media content is determined. A set of haptic data is generated for the media content, based on evaluating the one or more features of the media content with at least one machine learning model. Another example technique includes obtaining a set of haptic data associated with media content. The set of haptic data, metadata, and the media content is transmitted to a computing device.

Arrangement for producing head related transfer function filters
11557055 · 2023-01-17 · ·

When three-dimensional audio is produced by using headphones, particular HRTF-filters are used to modify sound for the left and right channels of the headphone. As the morphology of every ear is different, it is beneficial to have HRTF-filters particularly designed for the user of headphones. Such filters may be produced by deriving ear geometry from a plurality of images taken with an ordinary camera, detecting necessary features from images and fitting said features to a model that has been produced from accurately scanned ears comprising representative values for different sizes and shapes. Taken images are sent to a server (52) that performs the necessary computations and submits the data further or produces the requested filter.

IMAGE PROCESSING METHOD AND DEVICE, ELECTRONIC APPARATUS AND READABLE STORAGE MEDIUM
20230009202 · 2023-01-12 ·

The present disclosure provides an image processing method, an image processing device, an electronic apparatus and a readable storage medium. The image processing method includes: obtaining feature map data of an input image; extracting a feature region in the feature map data in accordance with a size of a convolution kernel; performing windowing processing on the feature region; and obtaining a windowed feature map of the input image in accordance with the feature region obtained after the windowing processing.

FOREGROUND EXTRACTION APPARATUS, FOREGROUND EXTRACTION METHOD, AND RECORDING MEDIUM

In a foreground extraction apparatus, an extraction result generation unit performs a foreground extraction using a plurality of foreground extraction models for an input image, and generates foreground extraction results. A selection unit selects one or more foreground extraction models among the plurality of foreground extraction models using respective foreground results acquired by the plurality of foreground extraction models. A foreground region generation unit extracts each foreground region based on the input image using the selected one or more foreground extraction models.