Patent classifications
G06F40/47
Multilingual intent matching engine
A server accesses a natural language query corresponding to one of a plurality of natural languages. The server maps, using a query-to-vector engine configured to leverage word embeddings in each of the plurality of natural languages to map natural language queries in the plurality of natural languages to vectors corresponding to meanings of the natural language queries, the natural language query to a vector. The server matches the vector to an intent representing a prediction associated with the natural language query. The server provides a response to the natural language query based on the intent.
Multilingual intent matching engine
A server accesses a natural language query corresponding to one of a plurality of natural languages. The server maps, using a query-to-vector engine configured to leverage word embeddings in each of the plurality of natural languages to map natural language queries in the plurality of natural languages to vectors corresponding to meanings of the natural language queries, the natural language query to a vector. The server matches the vector to an intent representing a prediction associated with the natural language query. The server provides a response to the natural language query based on the intent.
TRANSLATION APPARATUS, TRANSLATION SYSTEM, AND NON-TRANSITORY COMPUTER READABLE MEDIUM
A translation apparatus includes a translation unit which translates content of a document into a different language, a history creating unit which, in translation of the content from a first language into a second language, creates history information including a correspondence between original text in the first language and translated text in the second language, an extraction unit which, in translation of the content from the second language into another language, if content (present content) of the document in the second language is present in the history information, extracts content (absent content) that is not present in the history information, and a combining unit which combines a translation result obtained by translating the present content from the second language into the other language, with a replacement result obtained by replacing the absent content from the second language to the other language based on the history information.
METHOD AND APPARATUS FOR MANAGING INTERFACE, DEVICE AND READABLE STORAGE MEDIUM
A method and apparatus for managing an interface, a device and a non-transitory computer readable storage medium. The method includes: acquiring a first interface image of a target application, determining a first text contained in the first interface image, the first text being a text in a first language corresponding to the target application, acquiring a second text obtained by translating the first text, the second text being a text in a second language, and replacing the first text in the first interface image with the second text to obtain a second interface image, and displaying, based on the target application, the second interface image.
METHOD AND APPARATUS FOR MANAGING INTERFACE, DEVICE AND READABLE STORAGE MEDIUM
A method and apparatus for managing an interface, a device and a non-transitory computer readable storage medium. The method includes: acquiring a first interface image of a target application, determining a first text contained in the first interface image, the first text being a text in a first language corresponding to the target application, acquiring a second text obtained by translating the first text, the second text being a text in a second language, and replacing the first text in the first interface image with the second text to obtain a second interface image, and displaying, based on the target application, the second interface image.
Method and system of translating a source phrase in a first language into a target phrase in a second language
There is disclosed a method and system for translating a source phrase in a first language into a second language. The method being executable by a device configured to access an index comprising a set of source sentences in the first language, and a set of target sentences in the second language, each of the target sentence corresponding to a translation of a given source sentence. The method comprises: acquiring the source phrase; generating by a translation algorithm, one or more target phrases, each of the one or more target phrases having a different semantic meaning within the second language; retrieving, from the index, a respective target sentence for each of the one or more target phrases, the respective target sentence comprising one of the one or more target phrases; and selecting each of the one or more target phrase and the respective target sentences for display.
Method and system of translating a source phrase in a first language into a target phrase in a second language
There is disclosed a method and system for translating a source phrase in a first language into a second language. The method being executable by a device configured to access an index comprising a set of source sentences in the first language, and a set of target sentences in the second language, each of the target sentence corresponding to a translation of a given source sentence. The method comprises: acquiring the source phrase; generating by a translation algorithm, one or more target phrases, each of the one or more target phrases having a different semantic meaning within the second language; retrieving, from the index, a respective target sentence for each of the one or more target phrases, the respective target sentence comprising one of the one or more target phrases; and selecting each of the one or more target phrase and the respective target sentences for display.
METHOD FOR IDENTIFYING NOISE SAMPLES, ELECTRONIC DEVICE, AND STORAGE MEDIUM
The method for identifying noise samples, includes: obtaining an original sample set; obtaining a target sample set by adding masks to original training corpora in the original sample set using a preset adjustment rule; performing mask prediction on a plurality of target training corpora in the target sample set using a pre-trained language model to obtain a first mask prediction character corresponding to each target training corpus; matching the first mask prediction character corresponding to each target training corpus with a preset condition; and according to target training corpora of which first mask prediction characters do not match the preset condition in the target sample set, determining corresponding original training corpora in the original sample set as noise samples.
METHOD FOR IDENTIFYING NOISE SAMPLES, ELECTRONIC DEVICE, AND STORAGE MEDIUM
The method for identifying noise samples, includes: obtaining an original sample set; obtaining a target sample set by adding masks to original training corpora in the original sample set using a preset adjustment rule; performing mask prediction on a plurality of target training corpora in the target sample set using a pre-trained language model to obtain a first mask prediction character corresponding to each target training corpus; matching the first mask prediction character corresponding to each target training corpus with a preset condition; and according to target training corpora of which first mask prediction characters do not match the preset condition in the target sample set, determining corresponding original training corpora in the original sample set as noise samples.
Inverted Projection for Robust Speech Translation
The technology provides an approach to train translation models that are robust to transcription errors and punctuation errors. The approach includes introducing errors from actual automatic speech recognition and automatic punctuation systems into the source side of the machine translation training data. A method for training a machine translation model includes performing automatic speech recognition on input source audio to generate a system transcript. The method aligns a human transcript of the source audio to the system transcript, including projecting system segmentation onto the human transcript. Then the method performs segment robustness training of a machine translation model according to the aligned human and system transcripts, and performs system robustness training of the machine translation model, e.g., by injecting token errors into training data.