Patent classifications
G06F17/27
Process for improving pronunciation of proper nouns foreign to a target language text-to-speech system
A system and method configured for use in a text-to-speech (TTS) system are provided. Embodiments may include identifying, using one or more processors, a word or phrase as a named entity and identifying a language of origin associated with the named entity. Embodiments may further include transliterating the named entity to a script associated with the language of origin. If the TTS system is operating in the language of origin, embodiments may include passing the transliterated script to the TTS system. If the TTS system is not operating in the language of origin, embodiments may include generating a phoneme sequence in the language of origin using a grapheme-to-phoneme (G2P) converter.
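The routing described in this abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function `pronounce_named_entity`, the three callables it takes, and the toy lookup tables are all hypothetical, and the phoneme symbols are placeholders rather than a real phoneme inventory.

```python
from typing import Callable, Optional, Union

Result = tuple[str, Union[str, list[str]]]


def pronounce_named_entity(
    token: str,
    tts_language: str,
    detect_origin: Callable[[str], Optional[str]],
    transliterate: Callable[[str, str], str],
    g2p: Callable[[str, str], list[str]],
) -> Result:
    """Return ("text", ...) or ("phonemes", ...) for a single token."""
    origin = detect_origin(token)
    if origin is None:
        # Not recognized as a named entity: pass the token through unchanged.
        return ("text", token)
    # Rewrite the entity in the script of its language of origin.
    native = transliterate(token, origin)
    if tts_language == origin:
        # The TTS system already speaks this language: hand it the native script.
        return ("text", native)
    # Otherwise build a phoneme sequence in the language of origin.
    return ("phonemes", g2p(native, origin))


# Toy stand-ins for the real components (assumed data, illustration only).
ORIGINS = {"Tokyo": "ja"}
NATIVE_SCRIPT = {("Tokyo", "ja"): "東京"}
PHONEMES = {("東京", "ja"): ["t", "o:", "k", "j", "o:"]}


def toy_detect(token: str) -> Optional[str]:
    return ORIGINS.get(token)


def toy_transliterate(token: str, language: str) -> str:
    return NATIVE_SCRIPT[(token, language)]


def toy_g2p(text: str, language: str) -> list[str]:
    return PHONEMES[(text, language)]
```

Passing the components in as callables keeps the branch logic separate from any particular named-entity recognizer, transliterator, or G2P converter.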
Semiotic class normalization
A language processing system for text normalization of an input string of a semiotic class is described. In an aspect, a method includes receiving an input string; accessing, for a semiotic class of non-standard words, a language universal covering grammar for a plurality of languages that generates, for each language of the plurality of languages, one or more sequences of word-level components for each instance of the semiotic class in the language; for each of the plurality of languages, accessing a lexical map that is specific to the language and that maps each sequence of word-level components for each instance of the semiotic class in the language to verbalizations in the language; generating, from the language universal covering grammar and the lexical maps, a lattice of possible verbalizations of the input string; and selecting one of the possible verbalizations as a selected verbalization for the input string.
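A toy sketch of the covering-grammar idea, restricted to two-digit numbers, might look like the following. Everything here is an assumption made for illustration: the function names, the `"+"` linker component, the tiny lexical maps, and the use of a flat list of paths in place of a real lattice. The selection step takes a caller-supplied scoring function because the abstract does not say how the verbalization is chosen.

```python
from typing import Callable

Components = tuple[str, ...]


def covering_grammar(n: int) -> list[Components]:
    """Language-independent candidate component sequences.

    Only handles two-digit numbers with a nonzero units digit.
    """
    tens, units = divmod(n, 10)
    decade, unit = str(tens * 10), str(units)
    return [
        (decade, unit),       # decade then unit, e.g. 90 7
        (unit, "+", decade),  # unit, linker, decade, e.g. 7 + 90
    ]


# Hypothetical per-language lexical maps from components to words.
LEXICAL_MAPS = {
    "en": {"90": "ninety", "7": "seven"},
    "de": {"90": "neunzig", "7": "sieben", "+": "und"},
}


def lattice(n: int, language: str) -> list[str]:
    """All candidate sequences that the language's lexical map can verbalize."""
    lexicon = LEXICAL_MAPS[language]
    paths = []
    for components in covering_grammar(n):
        if all(c in lexicon for c in components):
            paths.append(" ".join(lexicon[c] for c in components))
    return paths


def select(paths: list[str], score: Callable[[str], float]) -> str:
    """Pick the highest-scoring verbalization."""
    return max(paths, key=score)


# Assumed toy frequency table standing in for a real scoring model.
TOY_COUNTS = {"sieben und neunzig": 5}
best_german = select(lattice(97, "de"), lambda p: TOY_COUNTS.get(p, 0))
```

In this sketch English admits only one path because its lexical map has no entry for the linker, while German admits two and relies on the scorer to choose between them.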
A Method For Suggesting Candidate Words As Replacements For An Input String Received At An Electronic Device
Provided is a computer implemented method for suggesting one or more candidate words from a list of words based on an input string received at an electronic device. Preferably, the list of words comprises an ordered list of words. The method comprises a first step of providing an array in memory for receiving edit distance values between initial substrings of the input string and initial substrings of each word in a set of some or all of the words in the list of words. The method includes applying an edit distance algorithm comparing a selected word to a received input string in order to obtain the aforementioned edit distance values. The selected word may comprise an empty string of no characters. The method includes determining for one or more words of the set of words a number of characters K by which it overlaps with a prefix of a previously considered word and retaining edit distance values in said memory array for said word being considered using some or all of the edit distance values for the K prefix characters of the previously considered word. From the edit distance values, it is possible to identify one or more candidate words from the word list as a replacement for the received input string in a fast and efficient manner.
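One way to read the prefix-reuse step is as a Levenshtein table whose rows are kept across consecutive words of a sorted list. The sketch below is a minimal illustration under that reading, not the claimed method itself: the names `suggest` and `common_prefix_len` and the `max_distance` cutoff are assumptions introduced here.

```python
def common_prefix_len(a: str, b: str) -> int:
    """Number of leading characters shared by a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def suggest(input_string: str, words: list[str], max_distance: int = 2) -> list[tuple[str, int]]:
    """Return (word, distance) pairs within max_distance of input_string."""
    ordered = sorted(words)
    # rows[i] holds the edit distances between the first i characters of the
    # current word and every initial substring of input_string.
    # Row 0 corresponds to the empty string.
    rows = [list(range(len(input_string) + 1))]
    previous = ""
    results = []
    for word in ordered:
        k = common_prefix_len(word, previous)
        # Keep the rows for the K shared prefix characters, drop the rest.
        del rows[k + 1:]
        # Compute rows only for the characters beyond the shared prefix.
        for i in range(k, len(word)):
            prev_row = rows[i]
            row = [i + 1]
            for j, ch in enumerate(input_string, start=1):
                cost = 0 if word[i] == ch else 1
                row.append(min(prev_row[j] + 1,          # deletion
                               row[j - 1] + 1,           # insertion
                               prev_row[j - 1] + cost))  # substitution or match
            rows.append(row)
        distance = rows[len(word)][-1]
        if distance <= max_distance:
            results.append((word, distance))
        previous = word
    return sorted(results, key=lambda pair: (pair[1], pair[0]))
```

Sorting the list first means neighbouring words tend to share long prefixes, so most of the table is reused rather than recomputed for each word.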
AUTOMATED DERIVATION OF QUALITY ASSURANCE RULES
A system and method generate a rule set. The method, performed by a rule generating device, includes receiving a plurality of previously generated reports, where each of the previously generated reports includes respective analysis content of a respective image. The method includes generating a candidate rule based upon the analysis content, where the candidate rule is configured to improve the quality assurance of future reports. The method includes generating a respective score for each candidate rule based upon the candidate rule and the previously generated reports. The method includes adding the candidate rule to the rule set when the score is above a predetermined threshold.
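The score-and-threshold loop can be illustrated with a simple co-occurrence rule. This is a sketch under assumptions the abstract does not state: each report is reduced to a set of terms, a candidate rule has the form "reports mentioning A should also mention B", and the score is the fraction of reports containing A that also contain B. The `min_support` parameter is likewise hypothetical.

```python
from itertools import permutations


def derive_rules(
    reports: list[set[str]],
    threshold: float = 0.8,
    min_support: int = 2,
) -> list[tuple[str, str, float]]:
    """Return (antecedent, consequent, score) rules whose score exceeds threshold."""
    terms = sorted(set().union(*reports))
    rules = []
    for antecedent, consequent in permutations(terms, 2):
        with_antecedent = [r for r in reports if antecedent in r]
        if len(with_antecedent) < min_support:
            # Too few prior reports to judge this candidate rule.
            continue
        # Score the candidate against the previously generated reports.
        score = sum(consequent in r for r in with_antecedent) / len(with_antecedent)
        if score > threshold:
            rules.append((antecedent, consequent, score))
    return rules


# Assumed toy reports, each reduced to the terms it mentions.
toy_reports = [
    {"nodule", "size"},
    {"nodule", "size"},
    {"nodule", "size", "follow-up"},
    {"fracture"},
]
toy_rules = derive_rules(toy_reports)
```

With the toy data, the only surviving rules link "nodule" and "size" in both directions, since every report mentioning one also mentions the other.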
GRAPH-DRIVEN AUTHORING IN PRODUCTIVITY TOOLS
Systems and techniques for graph-driven authoring in productivity applications can include using context of a user of a productivity application, the existing content of the user's document or an expression input with a command for finding relevant documents, and a date/time to obtain a set of results of relevant documents. The relevant documents include documents associated with people known to the user that contain content relevant to the topic(s) and/or entities from the context and that are current.
SYSTEM FOR AUTOMATED CODE CALCULATION AND AN AUTOMATED CODE CALCULATION METHOD
A method of configuring an automated code calculation system comprises: retrieving information relating to a group of two or more functions; updating a weighting value for each combination of an item from a first set and an item from a second set, where each item in the first set and each item in the second set corresponds to one or more functions, and where the weighting value is higher if the items are both contained in the group of functions than if one of the items is not contained in the group of functions; and storing the weighting values together with the corresponding combination of items.
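The weighting update can be sketched as a table keyed by item pairs. This is a simplified illustration: items are treated directly as function identifiers, and the increments `both_bonus` and `other_bonus` are assumed parameters, chosen only so that pairs fully contained in the group end up with the higher weight.

```python
from collections import defaultdict
from itertools import product


def update_weights(
    weights: dict,
    first_set: set[str],
    second_set: set[str],
    group: set[str],
    both_bonus: float = 1.0,
    other_bonus: float = 0.0,
) -> dict:
    """Update one weight per (first item, second item) combination."""
    for a, b in product(sorted(first_set), sorted(second_set)):
        both_in_group = a in group and b in group
        # Pairs whose two items both appear in the group gain more weight.
        weights[(a, b)] += both_bonus if both_in_group else other_bonus
    return weights


# The weights are stored together with their item combination as the key.
toy_weights = defaultdict(float)
update_weights(toy_weights, {"A", "B"}, {"X", "Y"}, group={"A", "X"})
update_weights(toy_weights, {"A", "B"}, {"X", "Y"}, group={"A", "X", "Y"})
```

Calling the function once per retrieved group accumulates evidence, so combinations that recur across groups rise above those that appear together only occasionally.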
INTERNET CLOUD-HOSTED NATURAL LANGUAGE INTERACTIVE MESSAGING SYSTEM SESSIONIZER
Provided are methods, systems, and computer-program products for providing a bot server to communicate with users using messaging applications. In some implementations, a method, system, and computer-program product are provided for associating, with a session, a message received by a bot server using a messaging application. For example, a method can include receiving a hypertext transfer protocol (HTTP) post call message from a messaging application server. The method can further include parsing content of the message to identify a keyword. The method can further include identifying an existing session from a plurality of sessions based on a context of the message and the keyword. The method can further include associating information from the HTTP post call message with the existing session. The method can further include generating and sending a response to the HTTP post call message based on the existing session.
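The session-matching steps can be sketched without any HTTP machinery by treating the parsed POST body as a dictionary. The payload fields (`user_id`, `text`), the keyword vocabulary, the `Session` class, and the rule that a session matches on user plus topic are all assumptions introduced for this illustration.

```python
from dataclasses import dataclass, field
from typing import Optional

KEYWORDS = {"order", "refund", "delivery"}  # hypothetical vocabulary


@dataclass
class Session:
    user_id: str
    topic: str
    messages: list[str] = field(default_factory=list)


def find_keyword(text: str) -> Optional[str]:
    """Return the first known keyword in the message text, if any."""
    for word in text.lower().split():
        word = word.strip(".,!?")
        if word in KEYWORDS:
            return word
    return None


def handle_post(payload: dict, sessions: list[Session]) -> str:
    """Attach the message to a matching session and build a reply."""
    keyword = find_keyword(payload["text"])
    for session in sessions:
        # Context here is just the sender; a real system would use more.
        if session.user_id == payload["user_id"] and session.topic == keyword:
            session.messages.append(payload["text"])
            return f"[{session.topic}] message {len(session.messages)} received"
    return "no matching session"
```

A message such as `{"user_id": "u1", "text": "Where is my refund?"}` would be appended to the `u1` session whose topic is `refund`, and the reply is generated from that session's state.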
INTERNET CLOUD-HOSTED NATURAL LANGUAGE INTERACTIVE MESSAGING SYSTEM USER RESOLVER
Provided are methods, systems, and computer-program products for associating a plurality of messages with a user. For example, a method can include receiving two hypertext transfer protocol (HTTP) post call messages. The method can further include parsing content of the post call messages to identify keywords, identifying existing users from a plurality of users based on context of the post call messages and the keywords, and associating information from the post call messages with the existing users. In some examples, the existing users can be the same between the messages. The method can further include responding to a second HTTP post call message based on information from one or more of a first HTTP post call message, the second HTTP post call message, and an existing user.
INTERNET CLOUD-HOSTED NATURAL LANGUAGE INTERACTIVE MESSAGING SYSTEM SERVER COLLABORATION
Provided are methods, systems, and computer-program products for server collaboration. For example, a method can include receiving a hypertext transfer protocol (HTTP) post call message. The HTTP post call message can be received by a first bot server. In some implementations, the first bot server can include a first uniform resource identifier (URI). The HTTP post call message can be associated with a user and be directed to the URI from a first messaging application server. Information from the HTTP post call message can be associated with a session. The method can further include determining to transfer the session to a second bot server based on the HTTP post message. The method can further include sending a transfer request by the first bot server. The method can further include receiving a return request by the first bot server.
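The transfer-and-return flow between two bot servers can be sketched in-process, with method calls standing in for the network requests. The `BotServer` class, its method names, the example URIs, and the "billing" trigger for a transfer are all hypothetical; the abstract does not specify how the transfer decision is made.

```python
from dataclasses import dataclass, field


@dataclass
class BotServer:
    uri: str
    sessions: dict[str, list[str]] = field(default_factory=dict)
    log: list[str] = field(default_factory=list)

    def receive_post(self, user_id: str, text: str, peer: "BotServer") -> str:
        """Associate the message with the user's session, then maybe transfer."""
        self.sessions.setdefault(user_id, []).append(text)
        if "billing" in text.lower():  # assumed transfer trigger
            self.log.append(f"transfer request {user_id} -> {peer.uri}")
            peer.accept_transfer(user_id, self.sessions.pop(user_id), origin=self)
            return "transferred"
        return "handled"

    def accept_transfer(self, user_id: str, history: list[str], origin: "BotServer") -> None:
        """Second server takes over the session."""
        self.sessions[user_id] = history
        self.log.append(f"accepted {user_id} from {origin.uri}")

    def return_session(self, user_id: str, origin: "BotServer") -> None:
        """Second server hands the session back to the first."""
        origin.receive_return(user_id, self.sessions.pop(user_id))

    def receive_return(self, user_id: str, history: list[str]) -> None:
        """First server receives the return request."""
        self.sessions[user_id] = history
        self.log.append(f"return request {user_id}")


first = BotServer("https://bots.example/first")
second = BotServer("https://bots.example/second")
```

Moving the whole message history with the session is a design choice made here so that whichever server holds the session can answer with full context.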
MULTI-STAGE RECOGNITION OF NAMED ENTITIES IN NATURAL LANGUAGE TEXT BASED ON MORPHOLOGICAL AND SEMANTIC FEATURES
Systems and methods for multi-stage recognition of named entities based on morphological and semantic features of natural language texts. An example method comprises: performing a lexico-morphological analysis of a natural language text comprising a plurality of tokens, each token comprising at least one natural language word; determining, based on the lexico-morphological analysis, one or more lexical meanings and grammatical meanings associated with each token of the plurality of tokens; for each token of the plurality of tokens, evaluating one or more classifier functions using the lexical and grammatical meanings associated with the tokens, wherein a value of each classifier function is indicative of a degree of association of the token with a category of named entities; performing a syntactico-semantic analysis of at least part of the natural language text to produce a plurality of semantic structures representing the part of the natural language text; and interpreting the semantic structures using a set of production rules to determine, for one or more tokens comprised by the part of the natural language text, a degree of association of the token with a category of named entities.
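The two stages can be sketched with heavily simplified stand-ins. In this illustration, which rests entirely on assumptions, each token is a dictionary carrying one lexical and one grammatical meaning, a classifier function returns a score between 0 and 1, a semantic structure is reduced to a `(predicate, role, token_index)` triple, and a production rule maps a `(predicate, role)` pair to a category and a degree of association.

```python
def person_score(token: dict) -> float:
    """Hypothetical classifier function over lexical and grammatical meanings."""
    score = 0.0
    if token["grammatical"] == "proper_noun":
        score += 0.5
    if token["lexical"] == "given_name":
        score += 0.25
    return score


def stage_one(tokens: list[dict], classifiers: dict) -> list[dict]:
    """Evaluate every classifier function on every token."""
    return [{cat: fn(tok) for cat, fn in classifiers.items()} for tok in tokens]


def stage_two(scores: list[dict], structures: list[tuple], production_rules: dict) -> list[dict]:
    """Refine stage-one scores by interpreting the semantic structures."""
    refined = [dict(s) for s in scores]
    for predicate, role, index in structures:
        outcome = production_rules.get((predicate, role))
        if outcome is not None:
            category, degree = outcome
            # Keep the stronger of the existing and rule-derived degrees.
            refined[index][category] = max(refined[index].get(category, 0.0), degree)
    return refined


# Assumed toy analysis of the sentence "Alice founded Acme".
tokens = [
    {"text": "Alice", "lexical": "given_name", "grammatical": "proper_noun"},
    {"text": "founded", "lexical": "action", "grammatical": "verb"},
    {"text": "Acme", "lexical": "unknown", "grammatical": "proper_noun"},
]
structures = [("found", "agent", 0), ("found", "object", 2)]
rules = {("found", "object"): ("ORGANIZATION", 0.875)}

first_pass = stage_one(tokens, {"PERSON": person_score})
second_pass = stage_two(first_pass, structures, rules)
```

In the toy run, the first stage can only say that "Acme" looks like a proper noun, while the second stage uses its role as the object of "found" to associate it with the organization category.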