Patent classifications
G06N3/088
Complex-valued neural network with learnable non-linearities in medical imaging
For machine training and application of a trained complex-valued machine learning model, an activation function of the machine learning model, such as a neural network, includes a learnable parameter that is complex or defined in a complex domain with two dimensions, such as real and imaginary or magnitude and phase dimensions. The complex learnable parameter is trained for any of various applications, such as MR fingerprinting, other medical imaging, or non-medical uses.
System and method for providing unsupervised domain adaptation for spatio-temporal action localization
A system and method for providing unsupervised domain adaption for spatio-temporal action localization that includes receiving video data associated with a source domain and a target domain that are associated with a surrounding environment of a vehicle. The system and method also include analyzing the video data associated with the source domain and the target domain and determining a key frame of the source domain and a key frame of the target domain. The system and method additionally include completing an action localization model to model a temporal context of actions occurring within the key frame of the source domain and the key frame of the target domain and completing an action adaption model to localize individuals and their actions and to classify the actions based on the video data. The system and method further include combining losses to complete spatio-temporal action localization of individuals and actions.
Refining training sets and parsers for large and dynamic text environments
Briefly stated, the invention is directed to retrieving a semantically matched knowledge structure. A question and answer pair is received, wherein the answer is received from a query of a search engine. A question is constraint-matched with the answer based on maximizing a plurality of constraints, wherein at least one of the plurality of the constraints is a similarity score between question and answer, wherein the constraint matching generates a matched sequence. For one or more answer sequences, a subsequence is found that are not parsed as answer slots. Query results are obtained from another search engine based on a combination of the answer or question, and the non-answer subsequence. And a KB based is refined on the query results and the constraint matching and based on a neural network training, for a further subsequent semantic matching, wherein the KB includes a dense semantic vector indication of concepts.
Medical image segmentation method based on U-Net
A medical image segmentation method based on a U-Net, including: sending real segmentation image and original image to a generative adversarial network for data enhancement to generate a composite image with a label; then putting the composite image into original data set to obtain an expanded data set, and sending the expanded data set to improved multi-feature fusion segmentation network for training. A Dilated Convolution Module is added between the shallow and deep feature skip connections of the segmentation network to obtain receptive fields with different sizes, which enhances the fusion of detail information and deep semantics, improves the adaptability to the size of the segmentation target, and improves the medical image segmentation accuracy. The over-fitting problem that occurs when training the segmentation network is alleviated by using the expanded data set of the generative adversarial network.
Medical image segmentation method based on U-Net
A medical image segmentation method based on a U-Net, including: sending real segmentation image and original image to a generative adversarial network for data enhancement to generate a composite image with a label; then putting the composite image into original data set to obtain an expanded data set, and sending the expanded data set to improved multi-feature fusion segmentation network for training. A Dilated Convolution Module is added between the shallow and deep feature skip connections of the segmentation network to obtain receptive fields with different sizes, which enhances the fusion of detail information and deep semantics, improves the adaptability to the size of the segmentation target, and improves the medical image segmentation accuracy. The over-fitting problem that occurs when training the segmentation network is alleviated by using the expanded data set of the generative adversarial network.
Unsupervised learning of metric representations from slow features
A method of unsupervised learning of a metric representation and a corresponding system for a mobile device determines a metric position information for a mobile device from an environmental representation. The mobile device comprises at least one sensor for acquiring sensor data and an odometer system configured to acquire displacement data of the mobile device. An environmental representation is generated based on the acquired sensor data by applying an unsupervised learning algorithm. The mobile device moves along a trajectory and the displacement data and the sensor data are acquired while the mobile device is moving along the trajectory. A set of mapping parameters is calculated based on the environmental representation and the displacement data. A metric position estimation is determined based on a further environmental representation and the calculated set of mapping parameters.
Unsupervised learning of metric representations from slow features
A method of unsupervised learning of a metric representation and a corresponding system for a mobile device determines a metric position information for a mobile device from an environmental representation. The mobile device comprises at least one sensor for acquiring sensor data and an odometer system configured to acquire displacement data of the mobile device. An environmental representation is generated based on the acquired sensor data by applying an unsupervised learning algorithm. The mobile device moves along a trajectory and the displacement data and the sensor data are acquired while the mobile device is moving along the trajectory. A set of mapping parameters is calculated based on the environmental representation and the displacement data. A metric position estimation is determined based on a further environmental representation and the calculated set of mapping parameters.
Dynamic engine for a cognitive reservoir system
Implementations described and claimed herein provide systems and methods for developing a reservoir. In one implementation, a static model of the reservoir is received. The static model has one or more clusters of rock types. A reservoir graph is generated from the static model. The reservoir graph represents each of the one or more clusters as a vertex. A graph connectivity of the reservoir graph is defined through a nodal connectivity of neighboring vertices. Pressure values are propagated across three-dimensional space of the reservoir graph using the connectivity. A dynamic model of the reservoir is generated using the pressure values and fluid saturation values.
Dynamic engine for a cognitive reservoir system
Implementations described and claimed herein provide systems and methods for developing a reservoir. In one implementation, a static model of the reservoir is received. The static model has one or more clusters of rock types. A reservoir graph is generated from the static model. The reservoir graph represents each of the one or more clusters as a vertex. A graph connectivity of the reservoir graph is defined through a nodal connectivity of neighboring vertices. Pressure values are propagated across three-dimensional space of the reservoir graph using the connectivity. A dynamic model of the reservoir is generated using the pressure values and fluid saturation values.
Generative adversarial neural network assisted video reconstruction
A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.