Patent classifications
G06F16/367
Methods and systems for modeling complex taxonomies with natural language understanding
Systems and methods are presented for the automatic placement of rules applied to topics in a logical hierarchy when conducting natural language processing. In some embodiments, a method includes: accessing, at a child node in a logical hierarchy, at least one rule associated with the child node; identifying a percolation criterion associated with a parent node to the child node, said percolation criterion indicating that the at least one rule associated with the child node is to be associated also with the parent node; associating the at least one rule with the parent node such that the at least one rule defines a second factor for determining whether the document is to also be classified into the parent node; accessing the document for natural language processing; and determining whether the document is to be classified into the parent node or the child node based on the at least one rule.
Ontology-augmented interface
A process including obtaining a set of natural-language text documents that discuss a topic, the set of documents containing different states of knowledge about the topic at different times. The process includes selecting an ontology from among a plurality of ontologies that correspond to different domains of knowledge, the selection being based on the ontology corresponding to a domain of knowledge including the topic. The process includes identifying concepts discussed in the documents using the ontology and detecting changes in at least some of the concepts over time based on differences between discussion of the concepts in documents authored at different times. The process includes updating natural language instructions on the topic based on the detected changes in the concepts and storing the updated natural language instructions in memory.
Data analysis using natural language processing to obtain insights relevant to an organization
Methods and apparatuses for generating insights for improving an organization from unstructured and structured data. Natural language processing is employed to process the aggregated data from various data sources to create topics and the features that impact the topics. These topics are then used to generate recommendations to improve customer satisfaction with the organization.
Differentially private dataset generation and modeling for knowledge graphs
A device may generate a synthetic knowledge graph based on a true knowledge graph, may partition the synthetic knowledge graph into a set of synthetic data partitions, and may determine, using a plurality of teacher models, an aggregated prediction. The aggregated prediction may be based on individual predictions from corresponding individual teacher models included in the plurality of teacher models. The device may determine, using a student model and based on the synthetic knowledge graph and noise, a student prediction. The student model may be trained based on historical synthetic knowledge graphs and historical aggregated predictions associated with the plurality of teacher models. The device may determine an error metric based on the aggregated prediction and the student prediction, and may perform an action associated with the synthetic knowledge graph based on the error metric.
SYSTEMS AND METHODS FOR UPDATING A KNOWLEDGE GRAPH THROUGH USER INPUT
Methods and systems are disclosed herein for updating a knowledge graph based on a user confirmation. A media guidance application receives a user communication and isolates a term of the user communication. The media guidance application identifies a candidate component of a knowledge graph associated with the term. The media guidance application requests user input directed to confirming whether the term is associated with the candidate component. In response to receiving the user input, the media guidance application modifies a strength of association between the term and the component.
METHOD AND SYSTEM FOR NEURAL DOCUMENT EMBEDDING BASED ONTOLOGY MAPPING
The present disclosure provides a neural document embedding based ontology mapping. Conventional methods that map ontology concepts across domains/species extensively take help of bridging ontologies. Initially the system receives a Human Phenotype (HP) Identification number (ID) pertaining to a phenotype. A first HP ID vector is computed from the HP ID using a trained word2vec model. A second HP ID vector is computed from the HP ID using a trained Doc2vec model. An average HP ID vector is computed based on the first HP ID vector and the second HP ID vector. A plurality of cosine similarity scores are computed based on a comparison between the average HP ID vector and a plurality of average MP ID vectors. The plurality of MP IDs are sorted based on the plurality of cosine similarity scores. The plurality of MP IDs corresponding to the HP ID are selected based on a selection threshold.
GENERATING A PRODUCT ONTOLOGY BASED UPON QUERIES IN A SEARCH ENGINE LOG
Technologies relating to construction of a product ontology based upon queries in a search engine log are described. Candidate phrases are extracted from queries in the search engine log. The candidate phrases are partitioned to generate candidate product classes, where the candidate phrases are partitioned by sequentially expanding the ending phrases of the candidate product phrases. Using such an ending phrase-based approach naturally forms a hierarchy of candidate product classes. Embeddings are generated for the candidate product classes, and the candidate product classes are clustered based upon the embeddings such that semantically equivalent product classes are merged.
BROADCAST STYLE DETERMINATION METHOD AND APPARATUS, DEVICE AND COMPUTER STORAGE MEDIUM
The present disclosure discloses a broadcast style determination method and apparatus, a device and a computer storage medium, and relates to voice and deep learning technologies in the field of artificial intelligence technologies. A specific implementation solution involves: performing named entity recognition on broadcast text to obtain at least one named entity; acquiring domain knowledge corresponding to the at least one named entity; and performing sentiment analysis by using the broadcast text and the domain knowledge, to determine a broadcast style of the broadcast text.
COMPUTERIZED NATURAL LANGUAGE PROCESSING WITH INSIGHTS EXTRACTION USING SEMANTIC SEARCH
A computerized method for extracting domain specific insights from a corpus of files containing large documents comprising: breaking down large chunks of text into smaller sentences/short paragraphs in a domain specific way, identifying and removing domain noise; identifying the sentence intents of the non-noise sentences; tagging the sentences with other domain specific attributes; defining a semantic ontology using a graph database based on the sentence intents, a multitude of mini-dictionaries and domain attributes; applying a pre-defined ontology to tag documents with domain specific hashtags; and combining the hashtags using machine learning techniques into insights.
Semantic search and rule methods for a distributed data system
Methods and systems are provided for searching and/or acting on information in a distributed data processing system. A method of processing a semantic rules engine may comprise receiving a rule data set, identifying a set of connected elements based on the rule data set, evaluating conditions related to the set of connected elements and the rule data set, determining a command set based on the evaluated conditions of the set of connected elements, and executing the command set.