G06N3/0455

MULTIMODAL DATA PROCESSING
20230010160 · 2023-01-12 ·

Disclosed are a method for processing multimodal data using a neural network, a device, and a medium, and relates to the field of artificial intelligence and, in particular to multimodal data processing, video classification, and deep learning. The neural network includes: an input subnetwork configured to receive the multimodal data to output respective first features of a plurality of modalities; a plurality of cross-modal feature subnetworks, each of which is configured to receive respective first features of two corresponding modalities to output a cross-modal feature corresponding to the two modalities; a plurality of cross-modal fusion subnetworks, each of which is configured to receive at least one cross-modal feature corresponding to a corresponding target modality and other modalities to output a second feature of the target modality; and an output subnetwork configured to receive respective second features of the plurality of modalities to output a processing result of the multimodal data.

METHODS AND SYSTEMS FOR GENERATING THREE DIMENSIONAL (3D) MODELS OF OBJECTS

A method for generating a three-dimensional (3D) model of an object includes receiving a two-dimensional (2D) view of at least one object as an input, measuring geometrical shape coordinates of the at least one object from the input, identifying texture parameters of the at least one object from the input, predicting geometrical shape coordinates and texture parameters of occluded portions of the at least one object in the 2D view by processing the measured geometrical shape coordinates of the at least one object, the identified texture parameters of the at least one object, and the occluded portions of the at least one object, and generating a 3D model of the at least one object by mapping the measured geometrical shape coordinates and the identified texture parameters to the predicted geometrical shape coordinates and the predicted texture parameters of the occluded portions of the at least one object.

SYSTEM AND METHOD FOR ACTIVITY CLASSIFICATION
20230011394 · 2023-01-12 ·

One or more computing devices, systems, and/or methods are provided. In an example, a method comprises receiving, by a device, incoming motion data from a motion sensor, generating, by the device, an incoming embedding vector based on the incoming motion data, generating, by the device, a predicted embedding vector based on the incoming embedding vector, assigning, by the device, an activity classification based on the predicted embedding vector, and modifying an operating parameter of the device based on the activity classification.

HANDWRITING RECOGNITION PIPELINES FOR GENEALOGICAL RECORDS

Disclosed herein relates to example embodiments for recognizing handwritten information in a genealogical record. A computing server may receive a genealogical record. The genealogical record may take the form of an image of a physical form having a structured layout, fields, and handwritten information. The computing server may divide the genealogical record into a plurality of areas based on the structured layout. The computing server may identify, for a particular area, a type of field that is included within the particular area. The computing server may select a handwriting recognition model for identifying the handwritten information in the particular area. The handwriting recognition model may be selected based on the type of the field. The computing server may input an image of the particular area to the handwriting recognition model to generate text of the handwritten information. The computing server may store the text of the handwritten information.

NON-FACTOID QUESTION ANSWERING ACROSS TASKS AND DOMAINS

An approach for a non-factoid question answering framework across tasks and domains may be provided. The approach may include training a multi-task joint learning model in a general domain. The approach may also include initializing the multi-task joint learning model in a specific target domain. The approach may include tuning the joint learning model in the target domain. The approach may include determining which task of the multiple tasks is more difficult for the multi-task joint learning model to learn. The approach may also include dynamically adjusting the weights of the multi-task joint learning model, allowing the model to concentrate on learning the more difficult learning task.

GENERATIVE RELATION LINKING FOR QUESTION ANSWERING

Systems, devices, computer-implemented methods, and/or computer program products that facilitate generative relation linking for question answering over knowledge bases. In one example, a system can comprise a processor that executes computer executable components stored in memory. The computer executable components can comprise a relation linking component. The relation linking component can map relations identified in a natural language question to corresponding relations of a knowledge base using a generative model.

APPARATUS, METHOD, AND NON-TRANSITORY COMPUTER-READABLE STORAGE MEDIUM FOR IMPROVING IMAGE QUALITY OF A MEDICAL IMAGE VOLUME

An apparatus, method, and computer-readable medium for improving image quality of a medical volume. In an embodiment, the apparatus includes processing circuitry configured to receive a reconstructed input image volume from X-ray projection data corresponding to a three-dimensional region of an object to be examined, apply a pseudo-three-dimensional neural network (P3DNN) to the reconstructed input image volume, the application of the pseudo-three-dimensional neural network including generating, for the reconstructed input image volume, a plurality of three-dimensional image datasets representing a different anatomical plane of the reconstructed input image volume, applying at least one convolutional filter to each of a sagittal plane dataset, a transverse plane dataset, and a coronal plane dataset, and concatenating results of the applied at least one convolutional filter to generate an intermediate output image volume, and generate, based on the application of the P3DNN, an output image volume corresponding to the three-dimensional region of the object.

DOG COLLAR
20230210091 · 2023-07-06 · ·

An intelligent dog collar for monitoring physiological parameters of a dog, comprising: a movement sensor unit comprising an accelerometer and/or a gyrometer, wherein the movement sensor unit is configured to detect raw movement signals of the dog collar, a storage module storing a trained neural network, the neural network being configured to determine a physiologic information into raw movement signals detected by the movement sensor unit, a processing unit connected to the movement sensor unit and configured to operate the trained neural network, a memory configured to store the identified physiologic information, an interface for transmitting to a communication device the identified physiologic information.

LEARNING APPARATUS, LEARNING METHOD, IMAGE PROCESSING APPARATUS, ENDOSCOPE SYSTEM, AND PROGRAM
20230215003 · 2023-07-06 · ·

There are provided a learning apparatus, a learning method, an image processing apparatus, an endoscope system, and a program that enable generation of training data on the basis of output data from a learning model for which learning is performed by using normality data. A first learning model (500) is generated by performing first learning using normality data (502) as learning data or by performing first learning using as learning data, normality mask data (504) that is generated by making a part of normality data be lost, and second training data to be applied to a second learning model that identifies identification target data is generated by using output data output from the first learning model in response to input of abnormality data to the first learning model.

LEARNING APPARATUS, LEARNING METHOD, IMAGE PROCESSING APPARATUS, ENDOSCOPE SYSTEM, AND PROGRAM
20230215003 · 2023-07-06 · ·

There are provided a learning apparatus, a learning method, an image processing apparatus, an endoscope system, and a program that enable generation of training data on the basis of output data from a learning model for which learning is performed by using normality data. A first learning model (500) is generated by performing first learning using normality data (502) as learning data or by performing first learning using as learning data, normality mask data (504) that is generated by making a part of normality data be lost, and second training data to be applied to a second learning model that identifies identification target data is generated by using output data output from the first learning model in response to input of abnormality data to the first learning model.