Patent classifications
G10L13/086
SPEECH GENERATION USING CROSSLINGUAL PHONEME MAPPING
Computer generated speech can be generated for cross-lingual natural language textual data streams by utilizing a universal phoneme set. In a variety of implementations, the natural language textual data stream includes a primary language portion in a primary language and a secondary language portion that is not in the primary language. Phonemes corresponding to the secondary language portion can be determined from a set of phonemes in a universal data set. These phonemes can be mapped back to a set of phonemes for the primary language. Audio data can be generated for these phonemes to pronounce the secondary language portion of the natural language textual data stream utilizing phonemes associated with the primary language.
METHODS AND SYSTEMS FOR CONTROL OF CONTENT IN AN ALTERNATE LANGUAGE OR ACCENT
Systems and methods are described herein for replaying content dialogue in an alternate language in response to a user command. While the content is playing on a media device, a first language in which the content dialogue is spoken is identified. Upon receiving a voice command to repeat a portion of the dialogue, the language in which the command was spoken is identified. The portion of the content dialogue to repeat is identified and translated from the first language to the second language. The translated portion of the content dialogue is then output. In this way, the user can simply ask in their native language for the dialogue to be repeated and the repeated portion of the dialogue is presented in the user's native language.
Generation of optimized knowledge-based language model through knowledge graph multi-alignment
A language module is joint trained with a knowledge module for natural language understanding by aligning a first knowledge graph with a second knowledge graph. The knowledge module is trained on the aligned knowledge graphs. Then, the knowledge module is integrated with the language module to generate an integrated knowledge-language module.
Speech translation method and system using multilingual text-to-speech synthesis model
A speech translation method using a multilingual text-to-speech synthesis model includes acquiring a single artificial neural network text-to-speech synthesis model having acquired learning based on a learning text of a first language and learning speech data of the first language corresponding to the learning text of the first language, and a learning text of a second language and learning speech data of the second language corresponding to the learning text of the second language, receiving input speech data of the first language and an articulatory feature of a speaker regarding the first language, converting the input speech data of the first language into a text of the first language, converting the text of the first language into a text of the second language, and generating output speech data for the text of the second language that simulates the speaker's speech.
Assistive listening device systems, devices and methods for providing audio streams within sound fields
Embodiments herein relate to assistive listening devices and systems for providing audio streams to device wearers within sound fields. In an embodiment an assistive listening device is included having a control circuit, an electroacoustic transducer for generating sound in electrical communication with the control circuit, a power supply circuit in electrical communication with the control circuit, and a communications circuit in electrical communication with the control circuit. The control circuit can be configured to issue a communication to an audio communication device or audio provisioning device including at least one of a language preference, a set of hearing requirements, data regarding a presentation delay, and an authorization status identifier, digital code, digital token, or digital key specific to a wearer of the assistive listening device. Other embodiments are also included herein.
Methods and systems for control of content in an alternate language or accent
Systems and methods are described herein for replaying content dialogue in an alternate language in response to a user command. While the content is playing on a media device, a first language in which the content dialogue is spoken is identified. Upon receiving a voice command to repeat a portion of the dialogue, the language in which the command was spoken is identified. The portion of the content dialogue to repeat is identified and translated from the first language to the second language. The translated portion of the content dialogue is then output. In this way, the user can simply ask in their native language for the dialogue to be repeated and the repeated portion of the dialogue is presented in the user's native language.
Learned condition text-to-speech synthesis
Devices and techniques are generally described for learned condition text-to-speech synthesis. In some examples, first data representing a selection of a type of prosodic expressivity may be received. In some further examples, a selection of content comprising text data may be received. First audio data may be determined that includes an audio representation of the text data. The first audio data may be generated based at least in part on sampling from a first latent distribution generated using a conditional primary variational autoencoder (VAE). The sampling from the first latent distribution may be conditioned on a first learned distribution associated with the type of prosodic expressivity. In various examples, the first audio data may be sent to a first computing device.
Multimedia processing circuit and electronic system
A multimedia processing circuit is provided. The multimedia processing circuit includes a smart interpreter engine and an audio engine. The smart interpreter engine includes a speech to text converter, a natural language processing module and a translator. The speech to text converter is utilized for converting speech data into text data corresponding to the first language. The natural language processing module is utilized for converting the text data corresponding to the first language into glossary text data corresponding to the first language according to an application program being executed in a host. The application program comprises a specific game software. The translator is utilized for converting the glossary text data corresponding to the first language into text data corresponding to a second language. The audio engine is utilized for converting the speech data corresponding to the first language into an analog speech signal corresponding to the first language.
ASSISTIVE LISTENING DEVICE SYSTEMS, DEVICES AND METHODS FOR PROVIDING AUDIO STREAMS WITHIN SOUND FIELDS
Embodiments herein relate to assistive listening devices and systems for providing audio streams to device wearers within sound fields. In an embodiment an assistive listening device is included having a control circuit, an electroacoustic transducer for generating sound in electrical communication with the control circuit, a power supply circuit in electrical communication with the control circuit, and a communications circuit in electrical communication with the control circuit. The control circuit can be configured to issue a communication to an audio communication device or audio provisioning device including at least one of a language preference, a set of hearing requirements, data regarding a presentation delay, and an authorization status identifier, digital code, digital token, or digital key specific to a wearer of the assistive listening device. Other embodiments are also included herein.
SMART INTERPRETER ENGINE AND ELECTRONIC SYSTEM
A smart interpreter engine is provided. The smart interpreter engine includes a speech to text converter, a natural language processing module and a translator. The speech to text converter is utilized for converting speech data corresponding to a first language into text data corresponding to the first language. The natural language processing module is utilized for converting the text data corresponding to the first language into glossary text data corresponding to the first language according to a game software. The translator is utilized for converting the glossary text data corresponding to the first language into text data corresponding to a second language.