G10L15/005

SMART SPEAKER, MULTI-VOICE ASSISTANT CONTROL METHOD, AND SMART HOME SYSTEM
20230052994 · 2023-02-16 ·

The invention discloses a smart loudspeaker, wherein the smart loudspeaker includes a voice input module, a language recognition module and at least two voice assistants, and the language recognition module receives a voice information from the voice input module and determines the language category based on the voice information and activates the voice assistant corresponding to the language category.

PROCESSING ACCELERATOR ARCHITECTURES
20230047378 · 2023-02-16 ·

In various embodiments, this application provides an audio information processing method, an audio information processing apparatus, an electronic device, and a storage medium. An audio information processing method in an embodiment includes: obtaining a first audio feature corresponding to audio information; performing, based on an audio feature at a specified moment in the first audio feature and audio features adjacent to the audio feature at the specified moment, an encoding on the audio feature at the specified moment to obtain a second audio feature corresponding to the audio information; obtaining decoded text information corresponding to the audio information; and obtaining, based on the second audio features and the decoded text information, text information corresponding to the audio information. According to this method, fewer parameters are used in the process of obtaining the second audio feature and obtaining, based on the second audio feature and the decoded text information, the text information corresponding to the audio information, thereby reducing computational complexity in the audio information processing process and improving audio information processing efficiency.

SPEECH RECOGNITION APPARATUS, METHOD AND PROGRAM

A score integration unit 7 obtains a new score Score (l.sub.1:n.sup.b, c) that integrates a score Score (l.sub.1:n.sup.b, c) and a score Score (w.sub.1:o.sup.b, c). This new score Score (l.sub.1:n.sup.b, c) becomes a score Score (l.sub.1:n.sup.b) in a hypothesis selection unit 8. Thus, the score Score (l.sub.1:n.sup.b) can be said to take into account the score Score (w.sub.1:o.sup.b, c). In a speech recognition apparatus, first information is extracted on the basis of the score Score (l.sub.1:n.sup.b) taking into account the score Score (w.sub.1:o.sup.b, c). Thus, speech recognition with higher performance than that in the related art can be achieved.

INPUT DISPLAY DEVICE, INPUT DISPLAY METHOD, AND COMPUTER-READABLE MEDIUM

An input display device includes: a processor to execute a program; and a memory to store the program which, when executed by the processor, results in performance of steps including: receiving an input of a track by a receiving unit; generating a track image showing the track; acquiring a character string; and displaying the character string acquired in the acquiring to be superimposed on the track image. When the character string is acquired in the acquiring before the track image is generated, the displaying the character string is stood by.

TRANSLATION APPARATUS, TRANSLATION SYSTEM, AND NON-TRANSITORY COMPUTER READABLE MEDIUM
20180011840 · 2018-01-11 · ·

A translation apparatus includes a translation unit which translates content of a document into a different language, a history creating unit which, in translation of the content from a first language into a second language, creates history information including a correspondence between original text in the first language and translated text in the second language, an extraction unit which, in translation of the content from the second language into another language, if content (present content) of the document in the second language is present in the history information, extracts content (absent content) that is not present in the history information, and a combining unit which combines a translation result obtained by translating the present content from the second language into the other language, with a replacement result obtained by replacing the absent content from the second language to the other language based on the history information.

Recommending Results In Multiple Languages For Search Queries Based On User Profile
20230237098 · 2023-07-27 ·

Systems and methods for a media guidance application that generates results in multiple languages for search queries. In particular, the media guidance application resolves multiple language barriers by taking automatic and manual user language settings and applying those settings to a variety of potential search results.

METHOD FOR MULTI-CHANNEL AUDIO SYNCHRONIZATION FOR TASK AUTOMATION
20230004731 · 2023-01-05 · ·

A method for coordinating actions between an audio channel and a synchronized non-audio channel includes receiving an indication of a start of a session associated with a user and having an audio channel that is synchronized with a non-audio channel. Thereafter, repeated determinations are made as to whether a prompt on the non-audio channel has been received from the user. In response to each determination that the prompt on the non-audio channel has not been received from the user, a signal is sent to cause an inaudible output on the audio channel to the user. In response to a determination that the prompt on the non-audio channel has been received from the user, an audible output is selected based on an activity by the user on the non-audio channel, and a signal is sent to cause the audible output to be output on the audio channel.

Signal processing apparatus, communication system, method performed by signal processing apparatus, storage medium for signal processing apparatus, method performed by communication terminal, and storage medium for communication terminal to receive text data from another communication terminal in response to a unique texting completion notice

According to one embodiment, a signal processing apparatus correlates a plurality of communication terminals as a group and enables one-to-many communications in the group. The signal processing apparatus includes processing circuitry. The processing circuitry assigns a transmission right to one of the communication terminals in the group. The processing circuitry generates text data based on voice data from said one of the communication terminals in possession of the transmission right. The processing circuitry gives a texting completion notice indicative of completion of texting processing to the communication terminals in the group. The processing circuitry transmits, after the texting completion notice is given, the generated text data to at least one of the communication terminals in the group.

SYSTEMS AND METHODS FOR AUTOMATED AUDIO TRANSCRIPTION, TRANSLATION, AND TRANSFER FOR ONLINE MEETING

The present invention discloses systems and methods for multimedia processing. For example, the present invention provides systems and methods for receiving spoken audio, converting the spoken audio to text, and transferring the text to a user. As desired, the speech or text can be translated into one or more different languages. Systems and methods for real-time conversion and transmission of speech and text are provided, including systems and methods for large scale processing of multimedia events.

Automated call requests with status updates

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, relating to synthetic call status updates. In some implementations, a method includes determining, by a task manager module, that a triggering event has occurred to provide a current status of a user call request. The method may then determine, by the task manager module, the current status of the user call request. A representation of the current status of the user call request is generated. Then, the generated representation of the current status of the user call request is provided to the user.