G06F40/126

ENCODING VARIABLE LENGTH CHARACTERS USING SIMULTANEOUS PROCESSING
20220405460 · 2022-12-22 ·

Embodiments are directed to managing character encoding. A plurality characters that are each encoded as code units based on a character code may be provided such that the code units for each character represents a code point of a character encoding scheme. An encoding model may be determined based on the character code, one or more processor features, and a target character code. Process features may be employed to transform the code units into target code units based on the encoding model such that the target code units are based on the target character code and such that the target code units encode the code point for each character. The plurality of target characters may be provided to a target stream such that each target character may be encoded as the target code units.

TEXT EXTRACTION METHOD AND DEVICE, COMPUTER READABLE STORAGE MEDIUM AND ELECTRONIC DEVICE
20220398384 · 2022-12-15 ·

A text extraction method and device, computer-readable storage medium, and electronic device are described that relate to the technical field of machine learning. The method includes: acquiring to-be-extracted data and extracting a current trigger word in the to-be-extracted data using a target trigger word extraction model included in a target event extraction model; generating a current query sentence according to the current trigger word; and extracting a current event argument corresponding to the current trigger word according to the current query sentence and a target argument extraction model included in the target event extraction model, wherein the target trigger word extraction model and the target argument extraction model have a same model structure and parameter, and are connected in a cascading manner.

SYSTEMS AND METHODS FOR GAUGING DIFFERENCES BETWEEN NETWORK CONFIGURATIONS

Presented herein are embodiments that use a language model to embed or encode configuration elements (e.g., commands, prompts, etc.) into dense, latent representations that incorporate semantic and contextual information. Using a trained language model, a configuration for a network device may be converted into a set of configuration path sentences. Given a first set of encoded configuration path sentences for a first configuration and a second set of encoded configuration path sentences for a second configuration, these two sets may be compared to gauge a degree of difference between the two sets. In one or more embodiments, an Optimal Transport method with Wasserstein distance metric may be used to obtain a comparison value that gauges difference between the two configurations. In one or more embodiments, the comparison valuation may be labeled or classified by comparing the comparison value to one or more pre-defined thresholds.

SYSTEMS AND METHODS FOR GAUGING DIFFERENCES BETWEEN NETWORK CONFIGURATIONS

Presented herein are embodiments that use a language model to embed or encode configuration elements (e.g., commands, prompts, etc.) into dense, latent representations that incorporate semantic and contextual information. Using a trained language model, a configuration for a network device may be converted into a set of configuration path sentences. Given a first set of encoded configuration path sentences for a first configuration and a second set of encoded configuration path sentences for a second configuration, these two sets may be compared to gauge a degree of difference between the two sets. In one or more embodiments, an Optimal Transport method with Wasserstein distance metric may be used to obtain a comparison value that gauges difference between the two configurations. In one or more embodiments, the comparison valuation may be labeled or classified by comparing the comparison value to one or more pre-defined thresholds.

EMBEDDING-BASED GENERATIVE MODEL FOR PROTEIN DESIGN
20220375538 · 2022-11-24 ·

A system and method for designing protein sequences conditioned on a specific target fold. The system is a transformer-based generative framework for modeling a complex sequence-structure relationship. To mitigate the heterogeneity between the sequence domain and the fold domain, a Fold-to-Sequence model jointly learns a sequence embedding using a transformer and a fold embedding from the density of secondary structural elements in 3D voxels. The joint sequence-fold representation through novel intra-domain and cross-domain losses with an intra-domain loss forcing two semantically similar (where the proteins should have the same fold(s)) samples from the same domain to be close to each other in a latent space, while a cross-domain loss forces two semantically similar samples in different domains to be closer. In an embodiment, the Fold-to-Sequence model performs design tasks that include low resolution structures, structures with region of missing residues, and NMR structural ensembles.

EMBEDDING-BASED GENERATIVE MODEL FOR PROTEIN DESIGN
20220375538 · 2022-11-24 ·

A system and method for designing protein sequences conditioned on a specific target fold. The system is a transformer-based generative framework for modeling a complex sequence-structure relationship. To mitigate the heterogeneity between the sequence domain and the fold domain, a Fold-to-Sequence model jointly learns a sequence embedding using a transformer and a fold embedding from the density of secondary structural elements in 3D voxels. The joint sequence-fold representation through novel intra-domain and cross-domain losses with an intra-domain loss forcing two semantically similar (where the proteins should have the same fold(s)) samples from the same domain to be close to each other in a latent space, while a cross-domain loss forces two semantically similar samples in different domains to be closer. In an embodiment, the Fold-to-Sequence model performs design tasks that include low resolution structures, structures with region of missing residues, and NMR structural ensembles.

SYSTEM AND METHOD FOR TRANSMITTING COMMANDS AND DATA VIA NATURAL LANGUAGE-BASED FORMATS
20220374214 · 2022-11-24 ·

Disclosed is a method for communication between two devices or between an application and the device on which the application is running. The method uses a text or natural language based syntax in the creation of a library of text language elements. An approval computing device can approve the associations of the library. The library may be transmitted to a second computing device with a message. The text language elements are converted to computer executable code by a text language module or text language application running on the device.

SYSTEM AND METHOD FOR TRANSMITTING COMMANDS AND DATA VIA NATURAL LANGUAGE-BASED FORMATS
20220374214 · 2022-11-24 ·

Disclosed is a method for communication between two devices or between an application and the device on which the application is running. The method uses a text or natural language based syntax in the creation of a library of text language elements. An approval computing device can approve the associations of the library. The library may be transmitted to a second computing device with a message. The text language elements are converted to computer executable code by a text language module or text language application running on the device.

NATURAL LANGUAGE ANALYSIS OF USER SENTIMENT BASED ON DATA OBTAINED DURING USER WORKFLOW

A text-based real-time communication interface, such as a chatbot, is presented to a user for the exchange of customer support information. A user's freeform text input is analyzed using machine learning algorithms to derive the meaning of the input text as well as to determine the user sentiment expressed therein. These determinations may be further supported by signals extracted from session-based activity, which signals can be used to infer the intended workflow of the user and whether or not that workflow was achieved. The expressed user sentiment is considered along with other historical or session-based user data to generate tailored questions and responses to be delivered in real-time to the user. The responses are displayed to the user along with information that routes the user to a workflow resolution.

NATURAL LANGUAGE ANALYSIS OF USER SENTIMENT BASED ON DATA OBTAINED DURING USER WORKFLOW

A text-based real-time communication interface, such as a chatbot, is presented to a user for the exchange of customer support information. A user's freeform text input is analyzed using machine learning algorithms to derive the meaning of the input text as well as to determine the user sentiment expressed therein. These determinations may be further supported by signals extracted from session-based activity, which signals can be used to infer the intended workflow of the user and whether or not that workflow was achieved. The expressed user sentiment is considered along with other historical or session-based user data to generate tailored questions and responses to be delivered in real-time to the user. The responses are displayed to the user along with information that routes the user to a workflow resolution.