G06F40/16

Multimodal based punctuation and/or casing prediction

Techniques for predicting punctuation and casing using multimodal fusion are described. An exemplary method includes processing generated text by: tokenizing the generated text into sub-words, and generating a sequence of lexical features for the sub-words using a pre-trained lexical encoder; processing audio of the audio by: generating a sequence of frame level acoustic embeddings using a pre-trained acoustic encoder on the audio, and generating task specific embeddings from the frame level acoustic embeddings; performing multimodal fusion of the sub-word level acoustic embeddings and the sequence of lexical features by: aligning the task specific embeddings to the sequence of lexical features, and combining the sequence of lexical features and aligned acoustic sequence; predicting punctuation and casing from the combined sequence of lexical features and aligned acoustic sequence; concatenating the sub-words of the text, and applying the predicted punctuation and casing; and outputting text having the predicted punctuation and casing.

METHOD AND SYSTEM FOR AUTOMATIC FORMATTING OF PRESENTATION SLIDES
20230041867 · 2023-02-09 ·

Various embodiments provided herein relate to a method and system for automatic formatting of presentation slides. In at least one embodiment, the method comprises receiving an input slide, the input slide comprising one or more objects having a first slide layout configuration; determining metadata associated with the input slide, the determined metadata corresponding to property features of the one or more objects; analyzing the metadata to classify the one or more objects; identifying one or more slide layout functional objectives; based on the one or more slide layout functional objectives, applying one or more transformations to the detected objects, wherein each transformation comprises adjusting the metadata corresponding to the one or more detected objects to generate one or more adjusted objects; and generating a modified slide, the modified slide comprising one or more adjusted objects having a second slide layout configuration.

Text-based response environment action selection

In an approach, a processor trains a model, via a reinforcement learning process, to produce a first action function for relating states of a natural language based response environment to actions applicable to the natural language based response environment. A processor retrains the model, via the reinforcement learning process, to produce a second action function, including iterations of: applying the first action function to a current state representation of the natural language based response environment to obtain a ground-truth action representation, emphasizing a word of the current state representation based on relevancy to the ground-truth action representation to obtain a modified state representation, applying a model to the modified state representation to obtain an untrained action representation, and submitting the untrained action representation to a natural language based response environment to obtain a subsequent state representation, where the subsequent state representation becomes the current state representation for a subsequent iteration.

Text-based response environment action selection

In an approach, a processor trains a model, via a reinforcement learning process, to produce a first action function for relating states of a natural language based response environment to actions applicable to the natural language based response environment. A processor retrains the model, via the reinforcement learning process, to produce a second action function, including iterations of: applying the first action function to a current state representation of the natural language based response environment to obtain a ground-truth action representation, emphasizing a word of the current state representation based on relevancy to the ground-truth action representation to obtain a modified state representation, applying a model to the modified state representation to obtain an untrained action representation, and submitting the untrained action representation to a natural language based response environment to obtain a subsequent state representation, where the subsequent state representation becomes the current state representation for a subsequent iteration.

Machine learning based abbreviation expansion
11544457 · 2023-01-03 · ·

Techniques are described herein for determining a long-form of an abbreviation using a machine learning based approach that takes into consideration both sequential context and structural context, where the long-form corresponds to a meaning of the abbreviation as used in a sequence of words that form a sentence. In some embodiments, word representations are generated for different words in the sequence of words, and a combined representation is generated for the abbreviation based on a word representation corresponding to the abbreviation, a sequential context representation, and a structural context representation. The sequential context representation can be generated based on word representations for words positioned near the abbreviation. The structural context representation can be generated based on word representations for words that are syntactically related to the abbreviation. The combined representation can be input to a classification neural network trained to output a label representing the long-form of the abbreviation.

METHOD AND SYSTEM FOR IMPLEMENTING AN OPERATING SYSTEM HOOK IN A LOG ANALYTICS SYSTEM

Disclosed is a system, method, and computer program product for implementing a log analytics method and system that can configure, collect, and analyze log records in an efficient manner. An improved approach is provided for identifying log files that have undergone a change in status that would require retrieve of its log data, by including a module directly into the operating system that allows the log collection component to be reactively notified of any changes to pertinent log files.

SMART TABULAR PASTE FROM A CLIPBOARD BUFFER

Pasting content from a clipboard buffer as structured tabular data. A computer system determines a data type of content within a clipboard buffer. Based on the data type of the content, the computer system identifies a tabular pattern analysis technique to apply to the content. Based on applying the tabular pattern analysis technique to the content, the computer system identifies a portion of tabular content within the content. Using a clipboard application programming interface, the computer system presents the portion of tabular content to an application as paste data that is structured as a set of rows and a set of columns.

Rules/model-based data processing system for intelligent event prediction in an electronic data interchange system
11699025 · 2023-07-11 · ·

A system for electronic data interchange (EDI) management includes a memory for storing the EDI document data and a machine learning model representing a set of features of EDI documents and a corresponding status. The system further includes a processor and a non-transitory computer readable medium storing instructions for: accessing an EDI file, the EDI file comprising envelope metadata for an envelope and a first EDI document; and translating the EDI file into a first translated EDI document containing the envelope metadata and a set of EDI document data extracted from the first EDI document, the first translated EDI document formatted according to a hierarchical structure comprising attributes translatable into features processable by the machine learning model to determine a status of the first EDI document.

Rules/model-based data processing system for intelligent event prediction in an electronic data interchange system
11699025 · 2023-07-11 · ·

A system for electronic data interchange (EDI) management includes a memory for storing the EDI document data and a machine learning model representing a set of features of EDI documents and a corresponding status. The system further includes a processor and a non-transitory computer readable medium storing instructions for: accessing an EDI file, the EDI file comprising envelope metadata for an envelope and a first EDI document; and translating the EDI file into a first translated EDI document containing the envelope metadata and a set of EDI document data extracted from the first EDI document, the first translated EDI document formatted according to a hierarchical structure comprising attributes translatable into features processable by the machine learning model to determine a status of the first EDI document.

Correcting errors in copied text

A non-transitory computer-readable storage medium may include instructions stored thereon for propagating changes to copied text. When executed by at least one processor, the instructions may be configured to cause a computing system to at least present copied text within a user interface of the computing system, monitor the user interface for changes to the copied text, receive a change to the copied text, the change including replacing a first instance of a first word, within the copied text, with a first instance of a second word, and in response to receiving the change to the copied text, present a prompt to replace, within the copied text, a second instance of the first word with a second instance of the second word.