Patent classifications
G06F16/322
COMPUTER IMPLEMENTED METHOD FOR THE AUTOMATED ANALYSIS OR USE OF DATA
A computer implemented method for the automated analysis or use of data is implemented by a voice assistant. The method comprises the steps of: (a) storing in a memory a structured, machine-readable representation of data that conforms to a machine-readable language (‘machine representation’); the machine representation including representations of user speech or text input to a human/machine interface; and (b) automatically processing the machine representations to analyse the user speech or text input.
Inference Methods For Word Or Wordpiece Tokenization
Systems and methods for performing inference for word or wordpiece tokenization are disclosed using a left-to-right longest-match-first greedy process. In some examples, the vocabulary may be organized into a trie structure in which each node includes a precomputed token or token ID and a fail link, so that the tokenizer can parse the trie in a single pass to generate a list of only those tokens or token IDs that correspond to the longest matching vocabulary entries in the sample string, without the need for backtracking. In some examples, the vocabulary may be organized into a trie in which each node has a fail link, and any node that would share token(s) or token_ID(s) of a preceding node is instead given a prev_match link that points back to a chain of nodes with those token(s) or token_ID(s).
AUTOMATIC IDENTIFICATION OF DOCUMENT SECTIONS TO GENERATE A SEARCHABLE DATA STRUCTURE
Methods and apparatuses are described for automatically identifying text sections of a document to generate a searchable hierarchical data structure. A computing device receives a document comprising text entities and converts the document from a first format to a second format, including generating metadata associated with text alignment, text position, text spacing, or fonts. The computing device extracts the text blocks, including determining coordinates associated with each text block using the metadata. The computing device determines document sections using the document metadata by identifying strings in the extracted text blocks that indicate a presence of a bullet point in the document, assigns a hierarchical category to each identified document section, and inserts text of each document section into a hierarchical data structure based upon the assigned hierarchical category. The computing device traverses the hierarchical data structure using search request data to identify document sections relating to the search request data.
MACHINE LEARNING-BASED DIALOGUE AUTHORING ENVIRONMENT
Aspects of the present disclosure relate to a machine learning-based dialogue authoring environment. In examples, a developer or creator of a virtual environment may use a generative multimodal machine learning (ML) model to create or otherwise update aspects of a dialogue tree for one or more computer-controlled agents and/or players of the virtual environment. For example, the developer may provide an indication of context associated with the dialogue for use by the ML model, such that the ML model may generate a set of candidate interactions accordingly. The developer may select a subset of the candidate interactions for inclusion in the dialogue tree, which may then be used to generate associated nodes within the tree accordingly. Thus, nodes in the dialogue tree may be iteratively defined based on model output of the ML model, thereby assisting the developer with dialogue authoring for the virtual environment.
TECHNIQUES TO GENERATE AND STORE GRAPH MODELS FROM STRUCTURED AND UNSTRUCTURED DATA IN A CLOUD-BASED GRAPH DATABASE SYSTEM
Embodiments include systems, methods, articles of manufacture, and computer-readable media configured process data in a structured format and an unstructured format and applying one or more algorithms to detect elements and links between the elements in the data. Embodiments are further configured to generate a graph model comprising nodes comprising the elements and edges comprising the links.
Linear late-fusion semantic structural retrieval
Systems and methods for generating a fusion score between electronic documents. The method includes receiving a first electronic document by a document management system. The method further includes extracting a first set of features from the first electronic document including at least one feature type indicating the hierarchical structure of the first electronic document. The method also includes receiving a second electronic document by the document management server. The method further includes extracting a second set of features from the second electronic document including at least one feature type indicating the hierarchical structure of the second electronic document. The method further includes generating a fusion score based on a comparison of the first set of features and the second set of features.
Comment information processing method and apparatus, and storage medium and electronic device
A comment information processing method includes: acquiring a plurality of pieces of comment information, and generating a comment tree comprising a plurality of nodes according to an association relationship between the plurality of pieces of comment information, wherein the plurality of nodes correspond to the plurality of pieces of comment information on a one-to-one basis; generating a comment container for each target node in the plurality of nodes, the comment container being used for presenting the comment information corresponding to a parent node and immediate child nodes thereof; and changing, once a first touch event for the comment information corresponding to an immediate child node is detected, the immediate child node into the parent node of the comment container where the immediate child node is located, and presenting in the comment container the comment information corresponding to the changed parent node and the immediate child nodes thereof.
DETECTING HYPOCRISY IN TEXT
Techniques are disclosed for identifying hypocrisy in text. A computer system creates, from fragments of text, a syntactic tree that represents syntactic relationships between words in the fragments. The system identifies, in the syntactic tree, a first entity and a second entity. The system further determines that the first entity is opposite to the second entity. The system further determines a first sentiment score for a first fragment comprising the first entity and a second sentiment score for a second fragment comprising the second entity. The system, responsive to determining that the first sentiment score and the second sentiment score indicate opposite emotions, identifies the text as comprising hypocrisy and providing the text to an external device.
Container software discovery and cataloging
In an approach to discovering software in a container, one or more computer processors identify one or more sets of filesystem structure information for an active container. The one or more computer processors create a virtual filesystem based on the one or more identified sets of filesystem structure information. The one or more computer processors discover one or more sets of software by comparing a set of catalog entries to the created virtual filesystem. The one or more computer processors report the one or more sets of discovered software.
Automatic identification of document sections to generate a searchable data structure
Methods and apparatuses are described for automatically identifying text sections of a document to generate a searchable hierarchical data structure. A computing device receives a document comprising text entities and converts the document from a first format to a second format, including generating metadata associated with text alignment, text position, text spacing, or fonts. The computing device extracts the text blocks, including determining coordinates associated with each text block using the metadata. The computing device determines document sections using the document metadata by identifying strings in the extracted text blocks that indicate a presence of a bullet point in the document, assigns a hierarchical category to each identified document section, and inserts text of each document section into a hierarchical data structure based upon the assigned hierarchical category. The computing device traverses the hierarchical data structure using search request data to identify document sections relating to the search request data.