Patent classifications
G06V30/246
Language identification for text strings
Aspects of the present disclosure include a system comprising a machine-readable storage medium storing at least one program and computer-implemented methods for detecting a language of a text string. Consistent with some embodiments, the method may include applying multiple language identification models to a text string. Each language identification model provides a predicted language of the text string and a confidence score associated with the predicted language. The method may further include weighting each associated confidence score based on historical performance of the corresponding language identification model in predicting languages of other text strings. The method may further include selecting a predicted language of the text string from among the multiple predicted languages provided by the multiple language identification models based on a result of the weighting of the confidence score associated with the particular predicted language.
Method and device for realizing chinese character input based on uncertainty information
The present invention provides a method and device for realizing Chinese character input based on uncertainty information, wherein the method comprises: receiving input information from a user; extracting at least two types of uncertainty information of Chinese characters to be input, from the input information; and, determining the matched Chinese characters according to the at least two types of uncertainty information and outputting the matched Chinese character(s). The device comprises a receiving module, an extracting module and a matching module. The method and device as provided by the present invention allow a user who has incomplete memory of pronunciation or glyph information of Chinese characters to be input to realize correct input of the Chinese characters by defining a certain range for candidate characters corresponding to the Chinese characters to be input, in combination with at least two types of the extracted uncertainty information of the Chinese characters to be input.
INFORMATION PROCESSING SYSTEM, INFORMATION PROCESSING APPARATUS, AND INFORMATION PROCESSING METHOD
An information processing system is communicable with a translation server through a network, and includes a receiver, circuitry, and a transmitter. The receiver receives content data indicating contents expressed in a first language and destination information indicating a destination to which the content data is to be transmitted. The circuitry determines, based on the destination information received by the receiver, a second language as a target language into which the contents expressed in the first language is to be translated. The transmitter transmits, to the destination indicated by the destination information, translated content data indicating contents that is translated by the translation server from the first language to the second language.
INFORMATION PROCESSING SYSTEM, INFORMATION PROCESSING APPARATUS, AND INFORMATION PROCESSING METHOD
An information processing system is communicable with a translation server through a network, and includes a receiver, circuitry, and a transmitter. The receiver receives content data indicating contents expressed in a first language and destination information indicating a destination to which the content data is to be transmitted. The circuitry determines, based on the destination information received by the receiver, a second language as a target language into which the contents expressed in the first language is to be translated. The transmitter transmits, to the destination indicated by the destination information, translated content data indicating contents that is translated by the translation server from the first language to the second language.
AUDIENCE-BASED OPTIMIZATION OF COMMUNICATION MEDIA
Introduced here are communication optimization platforms configured to improve comprehension, persuasion, or clarity of communications. Initially, a communication optimization platform can acquire input sample(s) that are associated with a source audience. The communication optimization platform can then create a linguistic profile for the source audience by examining the content of the input sample(s). Additionally or alternatively, the communication optimization platform may produce a psychographic profile that specifies various characteristics of the source audience, such as personality, opinions, attitudes, interests, etc. The communication optimization platform can then generate, based on the linguistic profile and/or the psychographic profile, affinity language for communicating with a target audience. By incorporating the affinity language into communications, the communication optimization platform can increase appeal to the target audience.
LANGUAGE IDENTIFICATION FOR TEXT STRINGS
Aspects of the present disclosure include a system comprising a machine-readable storage medium storing at least one program and computer-implemented methods for detecting a language of a text string. Consistent with some embodiments, the method may include applying multiple language identification models to a text string. Each language identification model provides a predicted language of the text string and a confidence score associated with the predicted language. The method may further include weighting each associated confidence score based on historical performance of the corresponding language identification model in predicting languages of other text strings. The method may further include selecting a predicted language of the text string from among the multiple predicted languages provided by the multiple language identification models based on a result of the weighting of the confidence score associated with the particular predicted language.
INFORMATION PROCESSING SYSTEM, INFORMATION PROCESSING APPARATUS, AND INFORMATION PROCESSING METHOD
An information processing system is communicable with a translation server through a network, and includes a receiver, circuitry, and a transmitter. The receiver receives content data indicating contents expressed in a first language and destination information indicating a destination to which the content data is to be transmitted. The circuitry determines, based on the destination information received by the receiver, a second language as a target language into which the contents expressed in the first language is to be translated. The transmitter transmits, to the destination indicated by the destination information, translated content data indicating contents that is translated by the translation server from the first language to the second language.
INFORMATION PROCESSING SYSTEM, INFORMATION PROCESSING APPARATUS, AND INFORMATION PROCESSING METHOD
An information processing system is communicable with a translation server through a network, and includes a receiver, circuitry, and a transmitter. The receiver receives content data indicating contents expressed in a first language and destination information indicating a destination to which the content data is to be transmitted. The circuitry determines, based on the destination information received by the receiver, a second language as a target language into which the contents expressed in the first language is to be translated. The transmitter transmits, to the destination indicated by the destination information, translated content data indicating contents that is translated by the translation server from the first language to the second language.
Multi-script handwriting recognition using a universal recognizer
Methods, systems, and computer-readable media related to a technique for providing handwriting input functionality on a user device. A handwriting recognition module is trained to have a repertoire comprising multiple non-overlapping scripts and capable of recognizing tens of thousands of characters using a single handwriting recognition model. The handwriting input module provides real-time, stroke-order and stroke-direction independent handwriting recognition. User interfaces for providing the handwriting input functionality are also disclosed.
Methods and systems that use hierarchically organized data structure containing standard feature symbols in order to convert document images to electronic documents
The current application is directed to methods and systems that convert document images, which contain Arabic text and text in other languages in which symbols are joined together to produce continuous words and portions of words, into corresponding electronic documents. In one implementation, a document-image-processing method and system to which the current application is directed employs numerous techniques and features that render efficiently computable an otherwise intractable or impractical document-image-to-electronic-document conversion. These techniques and features include transformation of text-image morphemes and words into feature symbols with associated parameters, efficiently identifying similar morphemes and words in an electronic store of standard-feature-symbol-encoded morphemes and words, and identifying candidate inter-character division points and corresponding traversal paths using the similar morphemes and words identified in the word store.