G06N3/044

DIALOG AGENTS WITH TWO-SIDED MODELING
20230050655 · 2023-02-16 ·

A central learning model is deployed as a user model and as an assistant model. Sensitive information utterances from a corpus of previously stored conversation language corresponding to user queries and chat agent responses thereto are used to train the user model to become an updated user model and to train the assistant model to become an updated assistant model, respectively. The user model provides user contexts corresponding to user queries to the assistant model and the assistant model provides assistant contexts corresponding to chat agent responses to the user model. During training, the user model does not provide plain-text queries to the assistant model and the assistant model does not provide plain-text responses to the user model. The updated assistant model may facilitate a federated training process produce an updated central model. An updated central model may be used to provide real-time chat agent responses to live user queries.

Query rephrasing using encoder neural network and decoder neural network

A method comprising receiving first data representative of a query. A representation of the query is generated using an encoder neural network and the first data. Words for a rephrased version of the query are selected from a set of words comprising a first subset of words comprising words of the query and a second subset of words comprising words absent from the query. Second data representative of the rephrased version of the query is generated.

Reinforcement learning using a relational network for generating data encoding relationships between entities in an environment

A neural network system is proposed, including an input network for extracting, from state data, respective entity data for each a plurality of entities which are present, or at least potentially present, in the environment. The entity data describes the entity. The neural network contains a relational network for parsing this data, which includes one or more attention blocks which may be stacked to perform successive actions on the entity data. The attention blocks each include a respective transform network for each of the entities. The transform network for each entity is able to transform data which the transform network receives for the entity into modified entity data for the entity, based on data for a plurality of the other entities. An output network is arranged to receive data output by the relational network, and use the received data to select a respective action.

Automated personalized classification of journey data captured by one or more movement-sensing devices

A technique is described herein for automatically logging journeys taken by a user, and then automatically classifying the purposes of the journeys. In one implementation, the technique obtains journey data from one or more movement-sensing devices as a user travels from a starting location to an ending location in a vehicle. The technique generates a set of features based on the journey data, and then uses a machine-trainable model (such as a neural network) to make its classification based on the features. The machine-trainable model accepts at least one feature that is based on statistical information regarding at least one aspect of prior journeys that the user has taken. Overall, the technique provides a resource-efficient solution that rapidly provides personalized results to individual respective users. In some implementations, the technique performs its personalization without sharing journey data with a remote server.

Reinforcement learning for concurrent actions

A computer-implemented method comprises instantiating a policy function approximator. The policy function approximator is configured to calculate a plurality of estimated action probabilities in dependence on a given state of the environment. Each of the plurality of estimated action probabilities corresponds to a respective one of a plurality of discrete actions performable by the reinforcement learning agent within the environment. An initial plurality of estimated action probabilities in dependence on a first state of the environment are calculated. Two or more of the plurality of discrete actions are concurrently performed within the environment when the environment is in the first state. In response to the concurrent performance, a reward value is received. In response to the received reward value being greater than a baseline reward value, the policy function approximator is updated, such that it is configured to calculate an updated plurality of estimated action probabilities.

Systems and methods for detecting documentation drop-offs in clinical documentation

In clinical documentation, mere documentation of a condition in a patient's records may not be enough. To be considered sufficiently documented, the patient's record needs to show that no documentation drop-offs (DDOs) have occurred over the course of the patient's stay. However, DDOs can be extremely difficult to detect. To solve this problem, the invention trains time-sensitive deep learning (DL) models on a per condition basis using actual and/or synthetic patient data. Utilizing an ontology, grouped concepts can be generated on the fly from real-time hospital data and used to generate time-series data that can then be analyzed by trained time-sensitive DL models to determine whether a DDO for a condition has occurred during the stay. Non-time-sensitive models can be used to detect all the conditions documented during the stay. Outcomes from the models can be compared to determine whether to notify a user that a DDO has occurred.

Systems for real-time intelligent haptic correction to typing errors and methods thereof

Systems and methods of the present disclosure enable context-aware haptic error notifications. The systems and methods include a processor to receive input segments into a software application from a character input component and determine a destination. A context identification model predicts a context classification of the input segments based at least in part on the software application and the destination. Potential errors are determined in the input segments based on the context classification. An error characterization machine learning model determines an error type classification and an error severity score associated with each potential error and a haptic feedback pattern is determined for each potential error based on the error type classification and the error severity score of each potential error of the one or more potential errors. And a haptic event latency is determined based on the error type classification and the error severity score of each potential error.

Intra-aortic pressure forecasting

Aspects of the present disclosure describe systems and methods for predicting an intra-aortic pressure of a patient receiving hemodynamic support from a transvalvular micro-axial heart pump. In some implementations, an intra-aortic pressure time series is derived from measurements of a pressure sensor of the transvalvular micro-axial heart pump and a motor speed time series is derived from a measured back electromotive force of a motor of the transvalvular micro-axial heart pump. Furthermore, in some implementations, machine learning algorithms, such as deep learning, are applied to the intra-aortic pressure and motor speed time series to accurately predict an intra-aortic pressure of the patient. In some implementations, the prediction is short-term (e.g., approximately 5 minutes in advance).

Systems and methods for artificial intelligence discovered codes

Systems and methods for artificial intelligence discovered codes are described herein. A method includes obtaining received samples from a receive decoder, obtaining decoded bits from the receive decoder based on the receiver samples, training an encoder neural network of a transmit encoder, the encoder neural network receiving parameters that comprise the information bits, the received samples, and the decoded bits. The encoder neural network is optimized using a loss function applied to the decoded bits and the information bits to calculate a forward error correcting code.

Systems and methods for assessing item compatibility

A compatibility score generator implementing a neural network is trained for assessing compatibility of items. Elements of a feature vector representing each item and of a compatibility data structure indicating items considered compatible are retrieved. The neural network is trained using training data corresponding to the items and indicating compatibility between pairs of items. The compatibility data structure is modified by removing indications that items of a pair of items are compatible. An encoding function generating encoded representations for the items based on the compatibility data structure is evaluated. Encoded representations are provided to a decoder that learns a likelihood that the indication had been removed when modified. The neural network and the decoder are optimized based on a loss function that reflects the decoder's ability to correctly determine whether the indication had been removed. The encoded representations generate a compatibility score for at least two items of interest.