G06F16/316

Automatic new concept definition

According to an aspect, automatically adding new concepts to a concept graph includes receiving a string of text, searching a corpus of data to locate additional text related to the string of text, and extracting concepts from the additional text. The extracted concepts include a subset of concepts in the concept graph. The adding new concepts also includes determining whether the string of text should be linked to an existing concept in the concept graph, performing the linking based on determining that the string of text should be linked to the existing concept in the concept graph and, based on determining that the string of text should not be linked to an existing concept in the concept graph, adding a new concept to the concept graph. The new concept is associated with the string of text.

Automatic new concept definition

According to an aspect, automatically adding new concepts to a concept graph includes receiving a string of text, searching a corpus of data to locate additional text related to the string of text, and extracting concepts from the additional text. The extracted concepts include a subset of concepts in the concept graph. The adding new concepts also includes determining whether the string of text should be linked to an existing concept in the concept graph, performing the linking based on determining that the string of text should be linked to the existing concept in the concept graph and, based on determining that the string of text should not be linked to an existing concept in the concept graph, adding a new concept to the concept graph. The new concept is associated with the string of text.

Semantic reverse search indexing of publication corpus
10430446 · 2019-10-01 · ·

Embodiments of the present disclosure relate generally to semantic indexing to improve search results of a large corpus. Some embodiments, with at least one of the keywords of the search query encoded by a semantic vector in a semantic vector space, identify a plurality of candidate publications in the publication corpus, the plurality of candidate publications encoded by a cluster of a plurality of semantic vectors in the semantic vector space, the identifying based on proximity in the semantic vector space between the at least one of the keywords of the search query and keywords in the plurality of candidate publications, the proximity based on a first machine-learned model that projects the at least one keyword in the search query and the keywords in the plurality of candidate publications into the semantic vector space.

METHODS AND SYSTEMS FOR A COMPLIANCE FRAMEWORK DATABASE SCHEMA
20190286642 · 2019-09-19 ·

Generating a compliance framework. The compliance framework facilitates an organization's compliance with multiple authority documents by providing efficient methodologies and refinements to existing technologies, such as providing hierarchical fidelity to the original authority document; separating auditable citations from their context (e.g., prepositions and or informational citations); asset focused citations; SNED and Live values, among others.

METHODS AND SYSTEMS FOR A COMPLIANCE FRAMEWORK DATABASE SCHEMA
20190286643 · 2019-09-19 ·

Generating a compliance framework. The compliance framework facilitates an organization's compliance with multiple authority documents by providing efficient methodologies and refinements to existing technologies, such as providing hierarchical fidelity to the original authority document; separating auditable citations from their context (e.g., prepositions and or informational citations); asset focused citations; SNED and Live values, among others.

NON-TRANSITORY COMPUTER READABLE RECORDING MEDIUM, METHOD FOR GENERATING, INFORMATION PROCESSING DEVICE, AND INFORMATION PROCESSING SYSTEM
20190278791 · 2019-09-12 · ·

An information processing device receives a plurality of pieces of code information corresponding to a plurality of words included in text data, and specifies a plurality of pieces of code information the appearance frequency of which exceeds a reference among the pieces of code information being received, based on the pieces of code information. The information processing device acquires a plurality of vectors associated with the pieces of code information being specified, by referring to a storage that stores therein a vector corresponding to a word in association with code information corresponding to the word, and generates a representative vector representing the vectors.

Undeliverable response handling in electronic mail systems

Systems, methods, apparatuses, and software for electronic mail systems and service in computing environments are provided herein. In one example, an electronic mail (email) messaging service is provided that identifies inbound email messages that include inactive sender addresses, processes the inactive sender addresses against suggestion information compiled based at least in part on monitored email replies related to the inactive sender addresses, and surfaces one or more suggested reply addresses for use in composing reply email messages in response to the inbound email messages.

METHODS, SYSTEMS, AND COMPUTER-READABLE MEDIA FOR SEMANTICALLY ENRICHING CONTENT AND FOR SEMANTIC NAVIGATION

Content of different formats may be sourced from various data sources such as content servers and ingested into a data integration server by an ingestion broker embodied on a non-transitory computer readable medium. The ingestion broker may normalize the content of different formats into a uniform representation that can be indexed and delivered across multiple digital channels for a variety of applications. The normalized content may be analyzed and semantic metadata may be determined from the normalized content. The normalized content can be semantically enriched by associating the semantic metadata and the like with the content. The semantic metadata can be stored in a semantic index that can be used for searching via the data integration server. During search, the semantic metadata can be instantiated as facets for user navigation and refinement of search criteria and additional semantic relationships can be assigned to the words in the normalized content.

Zero knowledge search engine

A document manager facilitates indexing of a plurality of documents stored in a document repository by obtaining a document of the plurality of documents stored in the document repository, where the document comprises a plurality of morphemes. The document manager encodes a morpheme of the plurality of morphemes using an encryption passphrase associated with the client device to generate an encoded morpheme, encodes a location array using the encryption passphrase to generate an encoded location array, where the location array comprises each location of the morpheme within the document, and encodes a unique identifier associated with a location of the document in the document repository using the encryption passphrase to generate an encoded document identifier. The document manager then sends the encoded morpheme, the encoded location array, and the encoded document identifier to a server device to be stored in a search index.

Indexing of large scale patient set

Systems and methods for indexing data include formulating an objective function to index a dataset, a portion of the dataset including supervision information. A data property component of the objective function is determined, which utilizes a property of the dataset to group data of the dataset. A supervised component of the objective function is determined, which utilizes the supervision information to group data of the dataset. The objective function is optimized using a processor based upon the data property component and the supervised component to partition a node into a plurality of child nodes.