Patent classifications
G06F17/27
MEDIA EVENT STRUCTURE AND CONTEXT IDENTIFICATION USING SHORT MESSAGES
The present disclosure is descriptive of discovering structure, content, and context of a media event, e.g., a live media event, using real-time discussions that unfold through short messaging services. Generally, a sampling of short messages of a plurality of users is obtained. The sampling of short messages corresponds to a media event. A segment in the media event is identified using the sampling of short messages, and at least one term taken from the sampling of short messages is identified. The at least one term is indicative of a context of the identified segment.
RULE-BASED SYSTEM AND METHOD TO ASSOCIATE ATTRIBUTES TO TEXT STRINGS
A method implemented in a data processing system includes receiving a plurality of text strings. A plurality of rules are applied to the text strings. If a condition specified in one of the rules exists in a given text string, one or more attributes are associated to that text string as metadata. One or more of the text strings are selected, using the metadata, as a potential title for the content. A final title is prepared based on the potential title, and the content is published online under the final title.
CATEGORIZING KEYWORDS
A keyword to be categorized is received. A category dictionary including categories having associated registered keywords, and a text corpus are received. Registered keywords are identified in the category dictionary having a degree of similarity to the keyword to be categorized that is equal to or greater than a predetermined value, and the categories associated with the identified registered keywords are extracted. Registered keywords are identified that are co-occurring in the text corpus with the keyword to be categorized, and the categories associated with the identified co-occurring registered keywords are extracted. A degree of importance is determined for each extracted category based on a function of the identified registered keywords in the category dictionary and/or a function of the identified co-occurring registered keywords. The extracted categories are outputted, with at least an indication of each category's relative importance, as category candidates for categorizing the keyword to be categorized.
EMOTION TYPE CLASSIFICATION FOR INTERACTIVE DIALOG SYSTEM
Techniques for selecting an emotion type code associated with semantic content in an interactive dialog system. In an aspect, fact or profile inputs are provided to an emotion classification algorithm, which selects an emotion type based on the specific combination of fact or profile inputs. The emotion classification algorithm may be rules-based or derived from machine learning. A previous user input may be further specified as input to the emotion classification algorithm. The techniques are especially applicable in mobile communications devices such as smartphones, wherein the fact or profile inputs may be derived from usage of the diverse function set of the device, including online access, text or voice communications, scheduling functions, etc.
Corpus Quality Analysis
A mechanism is provided in a data processing system for corpus quality analysis. The mechanism applies at least one filter to a candidate corpus to determine a degree to which the candidate corpus supplements existing corpora for performing a natural language processing (NLP) operation. Responsive to a determination to add the candidate corpus to the existing corpora based on a result of applying the at least one filter, the mechanism adds the candidate corpus to the existing corpora to form modified corpora. The mechanism performs the NLP operation using the modified corpora.
SYSTEM AND METHOD FOR SEMANTIC PROCESSING OF NATURAL LANGUAGE COMMANDS
A system, method and computer-readable storage devices are for processing natural language commands, such as commands to a robotic arm, using a Tag & Parse approach to semantic parsing. The system first assigns semantic tags to each word in a sentence and then parses the tag sequence into a semantic tree. The system can use statistical approach for tagging, parsing, and reference resolution. Each stage can produce multiple hypotheses, which are re-ranked using spatial validation. Then the system selects a most likely hypothesis after spatial validation, and generates or outputs a command. In the case of a robotic arm, the command is output in Robot Control Language (RCL).
READING DIFFICULTY LEVEL BASED RESOURCE RECOMMENDATION
Examples associated with reading difficulty level based resource recommendation are disclosed. One example may involve instructions stored on a computer readable medium. The instructions, when executed on a computer, may cause the computer to obtain a set of candidate resources related to a source document. The candidate resources may be obtained based on content extracted from the source document. The instructions may also cause the computer to identify reading difficulty levels of members of the set of candidate resources. The instructions may also cause the computer to recommend a selected candidate resource to a user. The selected candidate resource may be recommended based on subject matter similarity between the selected candidate resource and the source document. The selected candidate resource may also be recommended based on reading difficulty level similarity between the selected candidate resource and the source document.
OBTAINING TRANSLATIONS UTILIZING TEST STEP AND SUBJECT APPLICATION DISPLAYS
In one example of the disclosure, a machine-translation for each of a plurality of strings is determined, the strings for display upon execution of a subject application. A first display of a test step to be performed by a test application during execution of the subject application is caused. A second display of a state for the subject application that includes the plurality of strings is caused concurrent with the first display. A user-translation for each of the strings is obtained, the user-translations provided via a GUI included within the second display. A translation property file associated with the subject application is amended to include the user-translations.
SYSTEM PERFORMANCE LOGGING OF COMPLEX REMOTE QUERY PROCESSOR QUERY OPERATIONS
Described are methods, systems and computer readable media for performance logging of complex query operations.
STATE MACHINE BASED CONTEXT-SENSITIVE SYSTEM FOR MANAGING MULTI-ROUND DIALOG
The present invention discloses a state machine based context-sensitive multi-round dialog management system, comprising: an input module, for receiving multi-modal input information from a user; an intention identification engine module, for identifying intention information in the multi-modal input information; an intention module, for bringing multiple intention information identified by the intention identification engine module into one-to-one correspondence with multiple intention sub-modules at back ends; a state machine module, comprising a plurality of state machines for managing a relevant context in the dialog management system and providing support for an output result; an instruction parsing engine module, comprising a plurality of instruction parsing engine sub-modules for parsing corresponding intention information and acquiring the parsed multiple intention information; and an output module, for acquiring policy information according to the results from the parsing engine module and the intention identification module, and transmitting the policy information to the state machine module.