IPIQ

G06V10/806

Automated extraction of echocardiograph measurements from medical images

10987013 · 2021-04-27 ·

International Business Machines Corporation

Mechanisms are provided to implement an automated echocardiograph measurement extraction system. The automated echocardiograph measurement extraction system receives medical imaging data comprising one or more medical images and inputs the one or more medical images into a deep learning network. The deep learning network automatically processes the one or more medical images to generate an extracted echocardiograph measurement vector output comprising one or more values for echocardiograph measurements extracted from the one or more medical images. The deep learning network outputs the extracted echocardiograph measurement vector output to a medical image viewer.

MULTI-MODAL DETECTION ENGINE OF SENTIMENT AND DEMOGRAPHIC CHARACTERISTICS FOR SOCIAL MEDIA VIDEOS

20210103762 · 2021-04-08 ·

King Fahd University Of Petroleum And Minerals

A system and method for determining a sentiment, a gender and an age group of a subject in a video while the video is being played back. The video is separated into visual data and audio data, the video data is passed to a video processing pipeline and the audio data is passed to both an acoustic processing pipeline and a textual processing pipeline. The system and method performs, in parallel, a video feature extraction process in the video processing pipeline, an acoustic feature extraction process in the acoustic processing pipeline, and a textual feature extraction process in the textual processing pipeline. The system and method combines a resulting visual feature vector, acoustic feature vector, and a textual feature vector into a single feature vector, and determines the sentiment, the gender and the age group of the subject by applying the single feature vector to a machine learning model.

FINE-GRAINED IMAGE RECOGNITION METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM

20210133479 · 2021-05-06 ·

The present disclosure provides a fine-grained image recognition method, an electronic device and a computer readable storage medium. The method comprises the steps of feature extraction, calculation of feature discriminant loss function, calculation of feature diversity loss function and calculation of model optimization loss function. The present disclosure comprehensively considers influences of factors such as a large intra-class difference, a small inter-class difference, and a great influence of background noise of the fine-grained image, and makes constrains such that the feature maps belonging to each class are discriminative and have the features of corresponding class, thus reducing the intra-class difference, decreasing the learning difficulty and learning better discriminative features. The constraints make the feature maps belonging to each class have a diversity, which increases the inter-class difference, achieves a good result, and is easy for practical deployment, thereby obviously improving the effect of multiple fine-grained image classification tasks.

Multi-Modal Dense Correspondence Imaging System

20210103770 · 2021-04-08 ·

Mitsubishi Electric Research Laboratories, Inc.

Jeroen van Baar

A multi-modal dense correspondence image processing system submit the multi-modal images to a neural network to produce multi-modal features for each pixel of each of the multi-modal image. Each multi-modal image includes an image of a first modality and a corresponding image of a second modality different from the first modality. The neural network includes a first subnetwork trained to extract first features from pixels of the first modality, a second subnetwork trained to extract second features from pixels of the second modality, and a combiner configured to combine the first features and the second features to produce multi-modal features of a multi-modal image. The system compares the multi-modal features of a pair of multi-modal images to estimate a dense correspondence between pixels of the multi-modal images of the pair and outputs the dense correspondence between pixels of the multi-modal images in the pair.

METHOD AND APPARATUS FOR GENERATING INTEGRATED FEATURE VECTOR

20210124992 · 2021-04-29 ·

A method of generating an integrated feature vector according to an embodiment is a method performed in a computing device including one or more processors and a memory for storing one or more programs executed by the one or more processors. The method includes receiving a plurality of images of an object; and generating the integrated feature vector including a feature vector of each of the plurality of images, wherein the plurality of images is generated in a plurality of environments different from each other.

Interactive artificial intelligence analytical system

11010645 · 2021-05-18 ·

TalkMeUp

A method and system for an AI-based communication training system for individuals and organizations is disclosed. A video analyzer is used to convert a video signal into a plurality of human morphology features with an accompanying audio analyzer converting an audio signal into a plurality of human speech features. A transformation module transforms the morphology features and the speech features into a current multi-dimensional performance vector and combinatorial logic generates an integration of the current multi-dimensional performance vector and one or more prior multi-dimensional performance vectors to generate a multi-session rubric. Backpropagation logic applies a current multi-dimensional performance vector from the combinatorial logic to the video analyzer and the audio analyzer.

METHOD FOR VIDEO RECOGNITION CAPABLE OF ENCODING SPATIAL AND TEMPORAL RELATIONSHIPS OF CONCEPTS USING CONTEXTUAL FEATURES

20210125100 · 2021-04-29 ·

The proposed invention aims at encoding contextual information for video analysis and understanding, by encoding spatial and temporal relationships of objects and the main agent in a scene. The main target application of the invention is human activity recognition. The encoding of such spatial and temporal relationships may be crucial to distinguish different categories of human activities and may be important to help in the discrimination of different video categories, aiming at video classification, retrieval, categorization and other video analysis applications.

Landmark-free face attribute prediction

11003892 · 2021-05-11 ·

Sap Se

Implementations include receiving an input image including a face, processing the input image through a global transformation network to provide a set of global transformation parameters, applying the set of global transformation parameters to the input image to provide a globally transformed image, processing the globally transformed image through a global representation learning network to provide a set of global features, processing the set of global features through a part localization network to provide a set of part localization parameters, applying the set of part localization parameters to the globally transformed image to provide a locally transformed image, processing the locally transformed image through a part representation learning network to provide a set of local features, and outputting a label representing at least one attribute depicted in the input image based on fusing global feature(s) from the set of global features, and local feature(s) from the set of local features.

METHOD OF TRAINING A MACHINE LEARNING SYSTEM FOR AN OBJECT RECOGNITION DEVICE

20210142216 · 2021-05-13 ·

Marlon Ramon Ewert

A method of training a machine learning system for an object recognition device. The method includes: providing sensing element data; and training a machine learning system, using the provided sensing element data; at least one object being recognized from the sensing element data; and signal intensities of the sensing element data being used together with a reflection and/or absorption factor associated with the object.

METHOD AND SYSTEM FOR ITEM IDENTIFICATION

20210124906 · 2021-04-29 ·

The method for item identification preferably includes determining visual information for an item; calculating a first encoding using the visual information; calculating a second encoding using the first encoding; determining an item identifier for the item using the second encoding; optionally presenting information associated with the item to a user; and optionally registering a new item.

Patent classifications

G06V10/806