Patent classifications
G06F40/268
Method and apparatus for expressing time in an output text
Methods, apparatuses, and computer program products are described herein that are configured to express a time in an output text. In some example embodiments, a method is provided that comprises identifying a time period to be described linguistically in an output text. The method of this embodiment may also include identifying a communicative context for the output text. The method of this embodiment may also include determining one or more temporal reference frames that are applicable to the time period and a domain defined by the communicative context. The method of this embodiment may also include generating a phrase specification that linguistically describes the time period based on the descriptor that is defined by a temporal reference frame of the one or more temporal reference frames. In some examples, the descriptor specifies a time window that is inclusive of at least a portion of the time period to be described linguistically.
Method and apparatus for expressing time in an output text
Methods, apparatuses, and computer program products are described herein that are configured to express a time in an output text. In some example embodiments, a method is provided that comprises identifying a time period to be described linguistically in an output text. The method of this embodiment may also include identifying a communicative context for the output text. The method of this embodiment may also include determining one or more temporal reference frames that are applicable to the time period and a domain defined by the communicative context. The method of this embodiment may also include generating a phrase specification that linguistically describes the time period based on the descriptor that is defined by a temporal reference frame of the one or more temporal reference frames. In some examples, the descriptor specifies a time window that is inclusive of at least a portion of the time period to be described linguistically.
Refining training sets and parsers for large and dynamic text environments
Briefly stated, the invention is directed to retrieving a semantically matched knowledge structure. A question and answer pair is received, wherein the answer is received from a query of a search engine. A question is constraint-matched with the answer based on maximizing a plurality of constraints, wherein at least one of the plurality of the constraints is a similarity score between question and answer, wherein the constraint matching generates a matched sequence. For one or more answer sequences, a subsequence is found that are not parsed as answer slots. Query results are obtained from another search engine based on a combination of the answer or question, and the non-answer subsequence. And a KB based is refined on the query results and the constraint matching and based on a neural network training, for a further subsequent semantic matching, wherein the KB includes a dense semantic vector indication of concepts.
LEARNING DATA GENERATION DEVICE, METHOD, AND RECORD MEDIUM FOR STORING PROGRAM
A learning data generation device includes processing circuitry to extract a cause expression and a result expression from an input text, and to generate a modified text by at least one of a method of interchanging the cause expression and the result expression and a method of specifying one of the cause expression and the result expression as a modification target sentence and replacing the modification target sentence with a replacement candidate sentence dissimilar to the modification target sentence.
LEARNING DATA GENERATION DEVICE, METHOD, AND RECORD MEDIUM FOR STORING PROGRAM
A learning data generation device includes processing circuitry to extract a cause expression and a result expression from an input text, and to generate a modified text by at least one of a method of interchanging the cause expression and the result expression and a method of specifying one of the cause expression and the result expression as a modification target sentence and replacing the modification target sentence with a replacement candidate sentence dissimilar to the modification target sentence.
Effective retrieval of text data based on semantic attributes between morphemes
An apparatus generates an index including positions of morphemes included in a target text data and semantic attributes between the morphemes corresponding to the positions. The apparatus gives information including positions of morphemes included in an input query and semantic attributes between the morphemes corresponding to the positions to the query, and executes a retrieval on the target text data, based on the information given to the query and the index.
Effective retrieval of text data based on semantic attributes between morphemes
An apparatus generates an index including positions of morphemes included in a target text data and semantic attributes between the morphemes corresponding to the positions. The apparatus gives information including positions of morphemes included in an input query and semantic attributes between the morphemes corresponding to the positions to the query, and executes a retrieval on the target text data, based on the information given to the query and the index.
Machine learning based abbreviation expansion
Techniques are described herein for determining a long-form of an abbreviation using a machine learning based approach that takes into consideration both sequential context and structural context, where the long-form corresponds to a meaning of the abbreviation as used in a sequence of words that form a sentence. In some embodiments, word representations are generated for different words in the sequence of words, and a combined representation is generated for the abbreviation based on a word representation corresponding to the abbreviation, a sequential context representation, and a structural context representation. The sequential context representation can be generated based on word representations for words positioned near the abbreviation. The structural context representation can be generated based on word representations for words that are syntactically related to the abbreviation. The combined representation can be input to a classification neural network trained to output a label representing the long-form of the abbreviation.
Determining topics and action items from conversations
Embodiments are directed to organizing conversation information. Two or more machine learning (ML) models and a plurality of sentences provided from a conversation may be employed to generate insight scores for each sentence such that each insight score correlates to a probability that its sentence includes one or more of an action or a question. In response to one or more sentences having insight scores that exceed a threshold value an information score and a definiteness score may be determined for the one or more sentences. And one or more insights associated with the conversation may be generated based on the one or more sentences. A report may be generated that associates the one or more insights with one or more portions of the conversation that include the one or more sentences that are associated with the insights.
Determining topics and action items from conversations
Embodiments are directed to organizing conversation information. Two or more machine learning (ML) models and a plurality of sentences provided from a conversation may be employed to generate insight scores for each sentence such that each insight score correlates to a probability that its sentence includes one or more of an action or a question. In response to one or more sentences having insight scores that exceed a threshold value an information score and a definiteness score may be determined for the one or more sentences. And one or more insights associated with the conversation may be generated based on the one or more sentences. A report may be generated that associates the one or more insights with one or more portions of the conversation that include the one or more sentences that are associated with the insights.