G10L13/086

Translation device

A storage unit stores a target term, a substitute term, a substitute translated term, and a representative term. The substitute translated term is a translation of the substitute term and is expressed in a second language. The representative term indicates a type of the target term and is expressed in the second language. A communication unit acquires a provisional translation that is a translation of a processed sentence from a first external device that has a translation function. When the storage unit does not store a target translated term that is a translation of the target term, a controller replaces the substitute translated term contained in the provisional translation with the representative term to generate a second display-purpose translated sentence, and then causes a display unit to display the second display-purpose translated sentence.
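The replacement step described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the `TermEntry` record, the `build_display_sentence` function, and the sample terms are all hypothetical, and the branch taken when a target translated term is stored is an assumption, since the abstract only describes the case where it is missing.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TermEntry:
    """One stored record (hypothetical layout, not taken from the source)."""
    target_term: str                 # first-language term the user actually wrote
    substitute_term: str             # first-language stand-in sent to the translator
    substitute_translated_term: str  # second-language translation of the stand-in
    representative_term: str         # second-language word naming the term's type
    target_translated_term: Optional[str] = None  # may be absent from storage


def build_display_sentence(provisional_translation: str, entry: TermEntry) -> str:
    """Swap the substitute translated term out of the provisional translation."""
    if entry.target_translated_term is None:
        # No stored translation of the target term: fall back to its type.
        replacement = entry.representative_term
    else:
        # Assumed branch: use the stored translation when one exists.
        replacement = entry.target_translated_term
    return provisional_translation.replace(
        entry.substitute_translated_term, replacement
    )


# Hypothetical sample data: a restaurant name with no stored translation.
entry = TermEntry(
    target_term="Sakura-tei",
    substitute_term="Tokyo-eki",
    substitute_translated_term="Tokyo Station",
    representative_term="the restaurant",
)
# Provisional translation as it might come back from the external device.
provisional = "Please turn right at Tokyo Station."
second_display_sentence = build_display_sentence(provisional, entry)
print(second_display_sentence)
```

Sending a well-known substitute term to the external translator keeps the sentence structure translatable, and the final swap restores a meaningful placeholder for the reader.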

Digitized voice alerts

Methods, systems and processor-readable media provide instant, real-time voice alerts automatically to remote electronic devices. An activity can be detected utilizing one or more sensors. A text message indicative of the activity can be generated and converted into a digitized voice alert. The activity can also be a live utterance (e.g., a live announcement), which can then be instantly converted into a digitized voice alert for automatic delivery in a selected series of languages following the base language (e.g., English). The combined digitized voice alert can then be instantly transmitted through a network for broadcast of consecutive alerts (e.g., English followed by Spanish followed by Vietnamese, etc.) to one or more remote electronic devices that communicate with the network for an automatic audio announcement of the digitized voice alert through the one or more remote electronic devices.
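The ordering of consecutive alerts (base language first, then the selected languages) can be sketched as below. This is only an illustration: `build_alert_sequence` and `placeholder_translate` are hypothetical names, the translation step is a stand-in that merely tags the text, and speech synthesis and network transmission are left out entirely.

```python
from typing import Callable, List, Tuple


def placeholder_translate(text: str, language: str) -> str:
    """Stand-in for a real translation step; it only tags the text."""
    return f"[{language}] {text}"


def build_alert_sequence(
    message: str,
    base_language: str,
    other_languages: List[str],
    translate: Callable[[str, str], str],
) -> List[Tuple[str, str]]:
    """Return (language, text) pairs: base language first, then the rest in order."""
    sequence = [(base_language, message)]
    for language in other_languages:
        if language == base_language:
            continue  # the base-language alert is already first
        sequence.append((language, translate(message, language)))
    return sequence


# Hypothetical sensor-derived message, announced in English, Spanish, Vietnamese.
alerts = build_alert_sequence(
    "Smoke detected in the lobby.", "en", ["es", "vi"], placeholder_translate
)
for language, text in alerts:
    print(language, text)
```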

AUDIO SYNTHESIS METHOD AND APPARATUS, COMPUTER READABLE MEDIUM, AND ELECTRONIC DEVICE

This application discloses a method, an apparatus, a computer readable medium, and an electronic device for audio synthesis. The method includes: acquiring mixed language text information comprising text characters corresponding to at least two language types; performing text coding processing on the mixed language text information based on the at least two language types, to obtain an intermediate semantic coding feature of the mixed language text information; acquiring a target tone feature corresponding to a target tone subject, and performing decoding processing on the intermediate semantic coding feature based on the target tone feature to obtain an acoustic feature; and performing acoustic coding processing on the acoustic feature to obtain an audio corresponding to the mixed language text information.

GENERATION OF OPTIMIZED KNOWLEDGE-BASED LANGUAGE MODEL THROUGH KNOWLEDGE GRAPH MULTI-ALIGNMENT
20220230625 · 2022-07-21

A language module is jointly trained with a knowledge module for natural language understanding by aligning a first knowledge graph with a second knowledge graph. The knowledge module is trained on the aligned knowledge graphs. The knowledge module is then integrated with the language module to generate an integrated knowledge-language module.

MULTILINGUAL TEXT-TO-SPEECH SYNTHESIS
20220084500 · 2022-03-17

A multilingual text-to-speech synthesis method and system are disclosed. The method includes receiving an articulatory feature of a speaker regarding a first language, receiving an input text of a second language, and generating output speech data for the input text of the second language that simulates the speaker's speech by inputting the input text of the second language and the articulatory feature of the speaker regarding the first language to a single artificial neural network multilingual text-to-speech synthesis model. The single artificial neural network multilingual text-to-speech synthesis model is generated by learning similarity information between phonemes of the first language and phonemes of the second language based on a first learning data of the first language and a second learning data of the second language.

Assistive listening device systems, devices and methods for providing audio streams within sound fields

Embodiments herein relate to assistive listening devices and systems for providing audio streams to device wearers within sound fields. In an embodiment an assistive listening device is included having a control circuit, an electroacoustic transducer for generating sound in electrical communication with the control circuit, a power supply circuit in electrical communication with the control circuit, and a communications circuit in electrical communication with the control circuit. The control circuit can be configured to issue a communication to an audio communication device or audio provisioning device including at least one of a language preference, a set of hearing requirements, data regarding a presentation delay, and an authorization status identifier, digital code, digital token, or digital key specific to a wearer of the assistive listening device. Other embodiments are also included herein.

ELECTRONIC APPARATUS AND CONTROLLING METHOD THEREOF
20220076660 · 2022-03-10

Disclosed is an electronic apparatus. The electronic apparatus includes a memory configured to store first voice recognition information related to a first language and second voice recognition information related to a second language, and a processor. The processor is configured to: obtain a first text corresponding to a received user voice on the basis of the first voice recognition information; based on the first text indicating that an entity name is included in the user voice, identify a segment of the user voice in which the entity name is included; obtain a second text corresponding to the identified segment on the basis of the second voice recognition information; and obtain control information corresponding to the user voice on the basis of the first text and the second text.

Information processing device and information processing method for presentation of word-of-mouth information

There is provided an information processing device and an information processing method that are able to audibly present, to a user, word-of-mouth information in accordance with the user's latent demand. The information processing device includes a controller that performs control to estimate a latent demand on the basis of a current user condition, search for word-of-mouth information corresponding to the demand, and present the retrieved word-of-mouth information to the user.

GENERATING VIDEOS WITH A CHARACTER INDICATING A REGION OF AN IMAGE
20220070550 · 2022-03-03

Methods, systems, and computer-readable media for generating videos with characters indicating regions of images are provided. For example, an image containing a first region may be received. At least one characteristic of a character may be obtained. A script containing a first segment may be received. The first segment of the script may be related to the first region of the image. The at least one characteristic of the character and the script may be used to generate a video of the character presenting the script and at least part of the image, where the character visually indicates the first region of the image while presenting the first segment of the script.

Interactive system, apparatus, and method

According to one embodiment, an interactive system includes the following units: a knowledge reference unit, an unknown keyword detection unit, a related keyword estimation unit, and a response generation unit. The knowledge reference unit refers to question-answering knowledge based on a result of analyzing an input sentence to acquire a candidate for an answer to the input sentence. The unknown keyword detection unit detects an unknown keyword in the input sentence. The related keyword estimation unit acquires, in response to the detection of the unknown keyword, one or more candidates for a related keyword having a meaning close to the unknown keyword from predetermined keywords. The response generation unit generates a response to the input sentence based on the one or more candidates for the related keyword when the unknown keyword is detected.
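The flow through the four units can be sketched roughly as follows. Everything here is a hypothetical illustration rather than the embodiment itself: the `QA_KNOWLEDGE` and `RELATED_KEYWORDS` tables, the stopword list, and the `respond` function are invented for the example, and the hand-written related-keyword table stands in for whatever method the system uses to estimate closeness of meaning.

```python
from typing import Dict, List

# Hypothetical question-answering knowledge: predetermined keyword -> answer.
QA_KNOWLEDGE: Dict[str, str] = {
    "refund": "Refunds are processed within five business days.",
    "shipping": "Standard shipping takes three to five days.",
}

# Hypothetical table mapping unknown words to predetermined keywords of
# similar meaning (a stand-in for a real semantic-similarity estimate).
RELATED_KEYWORDS: Dict[str, List[str]] = {
    "reimbursement": ["refund"],
    "delivery": ["shipping"],
}

STOPWORDS = {"how", "do", "i", "get", "a", "the", "what", "is", "about", "my"}


def respond(sentence: str) -> str:
    # Input analysis: lowercase, strip punctuation, drop stopwords.
    words = [w.strip(".,?!").lower() for w in sentence.split()]
    keywords = [w for w in words if w and w not in STOPWORDS]

    # Knowledge reference: collect answer candidates for known keywords.
    answers = [QA_KNOWLEDGE[w] for w in keywords if w in QA_KNOWLEDGE]

    # Unknown keyword detection: content words absent from the knowledge.
    unknown = [w for w in keywords if w not in QA_KNOWLEDGE]

    if unknown:
        # Related keyword estimation, then response generation from candidates.
        related = RELATED_KEYWORDS.get(unknown[0], [])
        if related:
            return f"I do not know '{unknown[0]}'. Did you mean: {', '.join(related)}?"
        return f"I do not know '{unknown[0]}'."
    if answers:
        return answers[0]
    return "Sorry, I could not find an answer."


print(respond("How do I get a reimbursement?"))
print(respond("What about my refund?"))
```

Asking the user to confirm a related keyword, instead of failing outright, lets the dialogue continue even when the input uses vocabulary that the knowledge does not cover.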