Patent classifications
G06F16/316
AN IMPROVED PSYCHOMETRIC TESTING SYSTEM
The present invention provides a method of categorising words and/or text wherein the following steps are performed: a) compiling a catalogue of selected words of a language which are identified and selected from at least one dictionary and which are descriptive of intrapersonal behaviours and/or interpersonal interactions, and the selected words being of one of, or combinations of two or more of, or all of, the following types: verbs, adjectives, nouns and idioms (nouns may be descriptors of behaviour, personality or emotion); b) identifying synonyms for each one of the selected words from at least one thesaurus; c) identifying archetypal words from the respective groups of one selected word and its respective synonyms; d) rating the archetypal words with scores relating to affiliation and dominance thereby producing a matrix; e) applying ratings to all of the selected words and the synonyms.
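As a rough illustration of steps (c) to (e), the sketch below rates each archetypal word on affiliation and dominance and then copies that rating to every word in its synonym group. The abstract does not say how ratings are applied to the synonyms, so the inheritance rule is an assumption, and the function name, the words, and the scores are all hypothetical.

```python
def apply_ratings(groups, archetype_ratings):
    """Give every word in a synonym group its archetype's rating.

    groups: archetype word -> list of synonyms (hypothetical data layout).
    archetype_ratings: archetype word -> (affiliation, dominance).
    Assumption: synonyms simply inherit the archetype's scores.
    """
    ratings = {}
    for archetype, synonyms in groups.items():
        score = archetype_ratings[archetype]
        for word in [archetype, *synonyms]:
            # A word already rated via an earlier group keeps its first rating.
            ratings.setdefault(word, score)
    return ratings


# Illustrative values only; not taken from the source.
groups = {"command": ["order", "direct"], "comfort": ["console", "soothe"]}
archetype_ratings = {"command": (-1, 3), "comfort": (3, 0)}
ratings = apply_ratings(groups, archetype_ratings)
```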
Topic bridging determination using topical graphs
One embodiment provides a method that includes generating, by a processor, a topic graph by converting information to topic nodes and edges in the topic graph. Bridges from a given topic node are provided to a neighbor node based on a weight assignment. The bridges are converted to an electronic form of communication. The processor orders suggested paths based on a determined distance from at least one other topic node to a topic goal node via an earliest topic node on each suggested path. The determined distance is found by a first search of the topic graph.
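The ordering step can be sketched as follows. This is a minimal illustration, assuming an unweighted, undirected topic graph stored as an adjacency dict and assuming a breadth-first search is used to measure distance to the goal node; the abstract names neither. The function names and the sample topics are hypothetical.

```python
from collections import deque


def distances_to_goal(graph, goal):
    """Breadth-first search outward from the goal; returns node -> hop count."""
    dist = {goal: 0}
    queue = deque([goal])
    while queue:
        node = queue.popleft()
        for neighbor in graph.get(node, ()):
            if neighbor not in dist:
                dist[neighbor] = dist[node] + 1
                queue.append(neighbor)
    return dist


def order_bridges(graph, current, goal):
    """Order the neighbors of `current` by their distance to the goal topic."""
    dist = distances_to_goal(graph, goal)
    reachable = [n for n in graph.get(current, ()) if n in dist]
    # Ties are broken alphabetically so the ordering is deterministic.
    return sorted(reachable, key=lambda n: (dist[n], n))


graph = {
    "weather": ["sports", "travel"],
    "sports": ["weather", "tickets"],
    "travel": ["weather", "hotels"],
    "hotels": ["travel", "tickets"],
    "tickets": ["sports", "hotels"],
}
suggested = order_bridges(graph, "weather", "tickets")
```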
METHOD OF GENERATING BOOK DATABASE FOR READING EVALUATION
Provided is a method of generating a book database for reading evaluation of a user, a method of determining a type of a user utterance using a book database, and a method of generating a conversation sentence using a book database, the method of generating a book database for reading evaluation of a user including the steps of: dividing the entire text included in the received book data into units of sentences; assigning the sentences with sentence identifiers (IDs); assigning the words with word IDs; generating and storing a table in which the word ID and the sentence ID match with each other, the table referred to as a word-sentence matching table; and calculating a weight of each of the words using the number of the sentences including the word and storing the weights of the respective words.
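The database-generation steps can be sketched as below. This is a minimal illustration, not the claimed method: the sentence splitter is a naive regular expression, and the weight formula (an inverse-sentence-frequency score) is an assumption, since the abstract only says the weight uses the number of sentences that include the word. All names are hypothetical.

```python
import math
import re
from collections import defaultdict


def build_book_index(text):
    """Return (sentences, word_ids, word-sentence table, word weights)."""
    # Naive split after ., ! or ? followed by whitespace (simplifying assumption).
    parts = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    sentences = dict(enumerate(parts))  # sentence ID -> sentence text

    word_ids = {}                       # word -> word ID
    table = defaultdict(set)            # word ID -> set of sentence IDs
    for sentence_id, sentence in sentences.items():
        for word in re.findall(r"[a-z']+", sentence.lower()):
            word_id = word_ids.setdefault(word, len(word_ids))
            table[word_id].add(sentence_id)

    # Assumed weighting: words that occur in fewer sentences weigh more.
    total = len(sentences)
    weights = {wid: math.log(total / len(sids)) + 1.0 for wid, sids in table.items()}
    return sentences, word_ids, dict(table), weights


sentences, word_ids, table, weights = build_book_index(
    "The cat sat. The dog ran. A cat ran."
)
```

Here `table` plays the role of the word-sentence matching table: looking up a word ID yields the IDs of every sentence that contains the word.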
Methods, systems, and computer-readable media for semantically enriching content and for semantic navigation
Content of different formats may be sourced from various data sources such as content servers and ingested into a data integration server by an ingestion broker embodied on a non-transitory computer readable medium. The ingestion broker may normalize the content of different formats into a uniform representation that can be indexed and delivered across multiple digital channels for a variety of applications. The normalized content may be analyzed and semantic metadata may be determined from the normalized content. The normalized content can be semantically enriched by associating the semantic metadata and the like with the content. The semantic metadata can be stored in a semantic index that can be used for searching via the data integration server. During search, the semantic metadata can be instantiated as facets for user navigation and refinement of search criteria and additional semantic relationships can be assigned to the words in the normalized content.
Techniques for database entries de-duplication
A system and method for data entries deduplication are provided. The method includes indexing an input data set, wherein the input data set is in a tabular format and the indexing includes providing a unique Row identifier (RowID), wherein rows are the data entries; computing attribute similarity for each column across each pair of rows; computing, for each pair of rows, row-to-row similarity as a weighted sum of attribute similarities; clustering pairs of rows based on their row-to-row similarities; and providing an output data set including at least the clustered pairs of rows.
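The pipeline above can be sketched in a few lines. This is an illustration under stated assumptions: attribute similarity is taken from `difflib.SequenceMatcher`, the column weights and the 0.85 threshold are arbitrary, and clustering is done by merging any pair above the threshold with a union-find structure. The abstract specifies none of these choices, and the function names are hypothetical.

```python
from difflib import SequenceMatcher
from itertools import combinations


def attribute_similarity(a, b):
    """String similarity in [0, 1] for one column (assumed measure)."""
    return SequenceMatcher(None, str(a).lower(), str(b).lower()).ratio()


def deduplicate(rows, weights, threshold=0.85):
    """Cluster RowIDs whose weighted row-to-row similarity meets the threshold."""
    indexed = dict(enumerate(rows))          # RowID -> row
    parent = {row_id: row_id for row_id in indexed}

    def find(x):
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in combinations(indexed, 2):
        # Row-to-row similarity: weighted sum of per-column similarities.
        similarity = sum(
            w * attribute_similarity(indexed[i][col], indexed[j][col])
            for col, w in weights.items()
        )
        if similarity >= threshold:
            parent[find(i)] = find(j)

    clusters = {}
    for row_id in indexed:
        clusters.setdefault(find(row_id), []).append(row_id)
    return sorted(clusters.values())


rows = [
    {"name": "John Smith", "city": "Boston"},
    {"name": "Jon Smith", "city": "Boston"},
    {"name": "Alice Wong", "city": "Denver"},
]
clusters = deduplicate(rows, weights={"name": 0.7, "city": 0.3})
```

Comparing every pair of rows is quadratic in the number of rows, which is acceptable for a sketch but would need blocking or indexing on a large table.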
Methods and systems for topic detection in natural language communications
Techniques are disclosed for processing natural language communications. A computing device receives a set of natural language communications. The computing device generates a word index from the communications and generates, from the word index, a set of topics, each topic including two or more words. For each topic, the computing device generates a score indicative of an amount of semantic information represented by the topic. The computing device then discards topics that are supersets or subsets of other topics. The computing device presents the remaining topics based on the score of each topic.
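The discard-and-present step can be sketched as below. The abstract does not say which topic survives when one is a subset or superset of another, so this sketch assumes a greedy rule: visit topics from highest to lowest score and drop any topic that overlaps in that way with one already kept. The function name, the topics, and the scores are hypothetical.

```python
def prune_topics(scored_topics):
    """Keep topics in descending score order, skipping subsets/supersets of kept ones.

    scored_topics: iterable of (set_of_words, score) pairs.
    """
    kept = []
    for words, score in sorted(scored_topics, key=lambda item: -item[1]):
        topic = frozenset(words)
        # <= tests "is a subset of", >= tests "is a superset of".
        if any(topic <= other or topic >= other for other, _ in kept):
            continue
        kept.append((topic, score))
    return kept


topics = [
    ({"refund", "order"}, 0.9),
    ({"refund", "order", "late"}, 0.7),
    ({"login", "password"}, 0.8),
]
remaining = prune_topics(topics)
```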
Post-speech recognition request surplus detection and prevention
Systems and methods for determining that artificial commands, in excess of a threshold value, are detected by multiple voice activated electronic devices are described herein. In some embodiments, numerous voice activated electronic devices may send audio data representing a phrase to a backend system at substantially the same time. Text data representing the phrase, and counts for instances of that text data, may be generated. If the number of counts exceeds a predefined threshold, the backend system may cause any remaining response generation functionality for that particular command that is in excess of the predefined threshold to be stopped, and those devices returned to a sleep state. In some embodiments, a sound profile unique to the phrase that caused the excess of the predefined threshold may be generated such that future instances of the same phrase may be recognized prior to text data being generated, conserving the backend system's resources.
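The counting step can be sketched as follows. This is a minimal illustration, assuming the requests passed in have already been grouped into one short time window and that phrases match after trimming and lowercasing; the function name, device IDs, and threshold value are hypothetical.

```python
from collections import Counter


def find_surplus_phrases(requests, threshold):
    """Return phrases whose request count in this window exceeds the threshold.

    requests: list of (device_id, recognized_text) pairs from one time window.
    """
    counts = Counter(text.strip().lower() for _device_id, text in requests)
    return {phrase for phrase, count in counts.items() if count > threshold}


requests = [
    ("device-1", "Order pizza"),
    ("device-2", "order pizza "),
    ("device-3", "order pizza"),
    ("device-4", "what time is it"),
]
surplus = find_surplus_phrases(requests, threshold=2)
```

A backend following the abstract would then stop response generation for the devices that sent a surplus phrase and return them to a sleep state; that part is outside this sketch.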
EDOC utility using non-structured-query-language databases
A database management system for processing large volumes of data in a key-value store database is provided. The system may be configured to receive a plurality of filled fillable request forms where each request form may include a request including a plurality of field labels and a plurality of fillable text fields corresponding to each of the plurality of the field labels. The system may be configured to extract each set of inputted data from each fillable text field. The system may be configured to store, in the key-value store database, for each request form, each of the plurality of field labels and the corresponding set of inputted data as a combination key-value pair. The combination key may be equal to a WIP ID number, form ID number and field ID number. The corresponding value may be equal to the set of data of the corresponding field ID number.
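The composite-key layout can be sketched with a plain dictionary standing in for the key-value store. This is an illustration only: the tuple key, the separate label map, the function name, and the sample IDs are assumptions, and a real key-value database would need the key serialized in whatever form it accepts.

```python
def store_form(store, labels, wip_id, form_id, fields):
    """Write one filled form into the store.

    store:  dict standing in for the key-value database.
    labels: dict mapping (form_id, field_id) -> field label (assumed layout).
    fields: list of (field_id, label, inputted_data) tuples.
    """
    for field_id, label, data in fields:
        # Composite key: WIP ID, form ID and field ID together identify one value.
        store[(wip_id, form_id, field_id)] = data
        labels[(form_id, field_id)] = label


store, labels = {}, {}
store_form(
    store,
    labels,
    wip_id=1001,
    form_id=7,
    fields=[(1, "Customer name", "A. Example"), (2, "Account type", "Savings")],
)
```

With this layout, reading one field of one request is a single lookup such as `store[(1001, 7, 2)]`.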
INTERACTIVE SYSTEMS AND METHODS
A method of producing an avatar video, the method comprising the steps of: providing a reference image of a person's face; providing a plurality of characteristic features representative of a facial model X0 of the person's face, the characteristic features defining a facial pose dependent on the person speaking; providing a target phrase to be rendered over a predetermined time period during the avatar video and providing a plurality of time intervals t within the predetermined time period; generating, for each of said time intervals t, speech features from the target phrase, to provide a sequence of speech features; and generating, using the plurality of characteristic features and sequence of speech features, a sequence of facial models Xt for each of said time intervals t.
System and method for multivariate testing of messages to subgroup in a one-to-many messaging platform
A system and method for multivariate testing of messages to a subgroup in a one-to-many messaging platform. A client text message is generated for transmission to a number of users via one or more messaging services. A subgroup of users is defined according to one or more attributes of the text message or the users, and the client text message is transmitted only to users in the subgroup. The transmission is analyzed for performance metrics, such as actions or reactions by users in the subgroup, and based on the performance metrics, the message is optimized for transmission to the larger group of users.