G06F16/316

METHOD AND APPARATUS FOR GENERATING VECTOR REPRESENTATION OF KNOWLEDGE GRAPH

The present disclosure discloses a method and an apparatus for generating a vector representation of a knowledge graph, and relates to the field of artificial intelligence technologies. The implementation includes: obtaining a knowledge graph, the knowledge graph including a plurality of entity nodes; obtaining a context type and context data corresponding to the knowledge graph; and generating vector representations corresponding to the plurality of entity nodes by a context model based on the context data and the context type.

APPARATUS AND METHOD FOR PROVIDING INDEXING AND SEARCH SERVICE BASED ON IMPORTANT SENTENCE

Disclosed herein are an apparatus and method for providing a search service based on important sentences. The apparatus for providing a search service based on important sentences includes memory in which at least one program and a previously trained word importance measurement model are recorded and a processor for executing the program. The program may include a word importance measurement unit for measuring the importance of each of multiple words included in input text in the corresponding input text based on the word importance measurement model and a sentence importance measurement unit for measuring the importance of each of at least one sentence included in the text based on the measured importance of each of the multiple words.
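
The two-stage scoring described above (word importance first, then sentence importance derived from it) can be sketched as follows. This is a minimal illustration, not the disclosed implementation: the lookup table `WORD_IMPORTANCE` is a hypothetical stand-in for the previously trained word importance measurement model, `DEFAULT_SCORE` is an assumed fallback, and averaging word scores per sentence is one assumed way to aggregate them.

```python
import re

# Hypothetical stand-in for the trained word importance model:
# a fixed lookup table of per-word scores (values are not from the source).
WORD_IMPORTANCE = {"search": 0.9, "index": 0.8, "sentence": 0.6}
DEFAULT_SCORE = 0.1  # assumed score for words missing from the table


def word_scores(text):
    """Return (word, score) pairs for every word in the text."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return [(w, WORD_IMPORTANCE.get(w, DEFAULT_SCORE)) for w in words]


def sentence_scores(text):
    """Score each sentence as the mean importance of its words (assumed aggregation)."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    results = []
    for sentence in sentences:
        scores = [score for _, score in word_scores(sentence)]
        mean = sum(scores) / len(scores) if scores else 0.0
        results.append((sentence, mean))
    return results


def important_sentences(text, top_k=1):
    """Return the top_k sentences ranked by descending sentence score."""
    ranked = sorted(sentence_scores(text), key=lambda pair: pair[1], reverse=True)
    return [sentence for sentence, _ in ranked[:top_k]]
```

A search service built on this idea would index only the sentences returned by `important_sentences` rather than the full text.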

Providing responses to queries of transcripts using multiple indexes

The disclosure herein describes providing responses to natural language queries associated with transcripts at least by searching multiple indexes. A transcript associated with a communication among a plurality of speakers is obtained, wherein sets of artifact sections are identified in the transcript. A set of section indexes is generated from the transcript based on artifact type definitions. A natural language query associated with the transcript is analyzed using a natural language model and query metadata of the analyzed natural language query is obtained. At least one section index of the set of section indexes is selected based on the obtained query metadata and that selected section index is searched. A response to the natural language query is provided including result data from the searched at least one section index, wherein the result data includes a reference to an artifact section referenced by the searched section index(es).
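
The flow above (one index per artifact type, query analysis, index selection, search) can be sketched as follows. This is a minimal sketch under stated assumptions: the artifact types and trigger words in `ARTIFACT_KEYWORDS` are hypothetical, and the keyword matcher in `query_metadata` merely stands in for the natural language model, which the abstract does not specify.

```python
from collections import defaultdict

# Hypothetical artifact types and trigger words; the source does not list them.
ARTIFACT_KEYWORDS = {
    "action_item": {"todo", "action", "assigned"},
    "decision": {"decided", "decision", "agreed"},
}


def tokenize(text):
    """Lowercase the text and strip basic punctuation from each token."""
    return {t.strip(".,?!") for t in text.lower().split()}


def build_section_indexes(sections):
    """Build one inverted index per artifact type from (type, section_id, text) tuples."""
    indexes = defaultdict(lambda: defaultdict(set))
    for artifact_type, section_id, text in sections:
        for token in tokenize(text):
            indexes[artifact_type][token].add(section_id)
    return indexes


def query_metadata(query):
    """Stand-in for the natural language model: keyword-based artifact type detection."""
    tokens = tokenize(query)
    types = [t for t, words in ARTIFACT_KEYWORDS.items() if tokens & words]
    return {"artifact_types": types, "tokens": tokens}


def answer(query, indexes):
    """Search only the section indexes selected by the query metadata."""
    meta = query_metadata(query)
    hits = {}
    for artifact_type in meta["artifact_types"]:
        index = indexes.get(artifact_type, {})
        matched = set()
        for token in meta["tokens"]:
            matched |= index.get(token, set())
        if matched:
            hits[artifact_type] = sorted(matched)
    return hits
```

The returned section ids play the role of the references to artifact sections in the response; sections of unselected artifact types are never searched, even if they share words with the query.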

Orchestrated supervision of a cognitive pipeline

A method, a computer system, and a computer program product for coordinating supervision of at least one document processing pipeline are provided. The present invention may include receiving one or more documents. The present invention may then include parsing the received one or more documents to identify one or more performance indicators associated with the received one or more documents. The present invention may also include processing the parsed one or more documents based on a series of processor nodes. The present invention may further include identifying one or more deviations associated with the identified one or more performance indicators. The present invention may also include transferring the identified one or more deviations to a supervisor component. The present invention may then include generating at least one deviation escalation. The present invention may then further include reprocessing the generated at least one deviation escalation after a human response.

DATA PROCESSING SYSTEM FOR PROCESSING GENE SEQUENCING DATA

A data processing system can be operated in one of a preprocessing mode, a short-read mapping mode, a sequence assembly mode or a variant calling mode that are related to a to-be-tested DNA sequence. The data processing system includes a sorting engine that supports high-speed processing of sorting in the preprocessing mode and the sequence assembly mode, and a dynamic processing engine that supports dynamic programming calculations in the short-read mapping mode and the variant calling mode. The data processing system may be implemented on a system-on-chip (SoC) for performing accelerated processing of gene sequencing data with reduced memory requirements.

Automatic identification of document sections to generate a searchable data structure

Methods and apparatuses are described for automatically identifying text sections of a document to generate a searchable hierarchical data structure. A computing device receives a document comprising text entities and converts the document from a first format to a second format, including generating metadata associated with text alignment, text position, text spacing, or fonts. The computing device extracts the text blocks, including determining coordinates associated with each text block using the metadata. The computing device determines document sections using the document metadata by identifying strings in the extracted text blocks that indicate a presence of a bullet point in the document, assigns a hierarchical category to each identified document section, and inserts text of each document section into a hierarchical data structure based upon the assigned hierarchical category. The computing device traverses the hierarchical data structure using search request data to identify document sections relating to the search request data.
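
The bullet detection and hierarchy construction described above can be sketched as follows. This is a minimal sketch, assuming each extracted text block arrives as an `(x_coordinate, text)` pair and that deeper indentation means a deeper hierarchical category; the `BULLET` pattern, the nested-dict tree, and the substring search are illustrative choices, not details from the abstract.

```python
import re

# Assumed bullet markers: "-", "*", "•", or a number followed by "." or ")".
BULLET = re.compile(r"^\s*(?:[-*•]|\d+[.)])\s+(.*)$")


def build_tree(blocks):
    """Build a nested tree from (x_coordinate, text) blocks; x is assumed non-negative."""
    root = {"text": "ROOT", "children": []}
    stack = [(-1, root)]  # (x of the bullet that opened the node, node)
    for x, text in blocks:
        match = BULLET.match(text)
        if not match:
            if len(stack) > 1:  # continuation text of the most recent bullet
                stack[-1][1]["text"] += " " + text.strip()
            continue
        node = {"text": match.group(1).strip(), "children": []}
        # Pop back to the nearest section that is indented less than this one.
        while stack[-1][0] >= x:
            stack.pop()
        stack[-1][1]["children"].append(node)
        stack.append((x, node))
    return root


def search(node, term, path=()):
    """Depth-first traversal returning the section path of every match."""
    hits = []
    for child in node["children"]:
        child_path = path + (child["text"],)
        if term.lower() in child["text"].lower():
            hits.append(child_path)
        hits.extend(search(child, term, child_path))
    return hits
```

Returning the full path for each hit keeps the parent sections visible, which is the practical benefit of storing the text hierarchically instead of as a flat list.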

SEGMENTING MACHINE DATA INTO EVENTS

Methods and apparatus consistent with the invention provide the ability to organize and build understandings of machine data generated by a variety of information-processing environments. Machine data is a product of information-processing systems (e.g., activity logs, configuration files, messages, database records) and represents the evidence of particular events that have taken place and been recorded in raw data format. In one embodiment, machine data is turned into a machine data web by organizing machine data into events and then linking events together.
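
Organizing machine data into events and then linking the events can be sketched as follows. This is a minimal sketch under assumptions the abstract does not state: an event boundary is taken to be a line that starts with a timestamp in the `TIMESTAMP` format, and events are linked when they share a hypothetical `session=` key.

```python
import re
from collections import defaultdict

# Assumed timestamp format marking the start of an event.
TIMESTAMP = re.compile(r"^\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}")
# Hypothetical linking key; real machine data may use other identifiers.
SESSION = re.compile(r"session=(\w+)")


def segment_events(raw):
    """Start a new event at each line that begins with a timestamp."""
    events = []
    for line in raw.splitlines():
        if not line.strip():
            continue  # skip blank lines
        if TIMESTAMP.match(line) or not events:
            events.append(line)
        else:
            events[-1] += "\n" + line  # continuation line, e.g. a stack trace
    return events


def link_events(events):
    """Map each session id to the indexes of the events that mention it."""
    links = defaultdict(list)
    for i, event in enumerate(events):
        for session_id in set(SESSION.findall(event)):
            links[session_id].append(i)
    return dict(links)
```

The mapping returned by `link_events` is a very small version of a "machine data web": each key connects the events that belong together.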

DIRECT STORAGE LOADING FOR ADDING DATA TO A DATABASE

Direct storage loading may be used to add data to a database. New data may be added to a database using nodes, different from the database engine, to access the database. The addition of the new data may be assigned to different nodes. The nodes may obtain the data and store the data to storage locations according to space allocated in the database by the database engine. The new data can then be made available for access at the database engine.

RELATION EXTRACTION ACROSS SENTENCE BOUNDARIES
20170351749 · 2017-12-07

Systems, methods, and computer-readable media are described for providing entity relation extraction across sentences in a document using distant supervision. In some examples, a computing device can receive an input, such as a document comprising a plurality of sentences. The computing device can identify syntactic and/or semantic links between words in a sentence and/or between words in different sentences, and extract relationships between entities throughout the document. Techniques and technologies described herein populate a knowledge base (e.g., a table, chart, database, etc.) of entity relations based on the extracted relationships. An output of the populated knowledge base can be used by a classifier to identify additional relationships between entities in various documents. Example techniques described herein can apply machine learning to train the classifier to predict relations between entities. The classifier can be trained using known entity relations, syntactic links and/or semantic links.

System and Method for Organizing and Indexing Citations for Research Papers
20230185833 · 2023-06-15

The present invention relates to a reference organizing and indexing system for research papers. The system can be used by students, universities, researchers, and companies to save time when including citations and references in their research papers. The system offers a software application that is configured to identify the source and metadata, including author, page numbers, and dates, of references to be used as citations. The system eliminates the need for a researcher to manually search for and collect the citation information. A plurality of citations is indexed, which can then be retrieved in a proper format for inclusion in research papers. The citations can be added as footnotes, bibliographies, and/or in-text citations.