G10L13/00

SYSTEM AND METHOD FOR GENERATING, TRIGGERING, AND PLAYING AUDIO CUES IN REAL TIME USING A PERSONAL AUDIO DEVICE
20230044079 · 2023-02-09 ·

A system and method for generating, triggering and playinga sequence of audio files with cues for delivering a presentation for a presenter using a personal audio devicecoupled to a computing device. The system comprising the comprising a computer devicethat is coupled to a presentation data analysis server through a network. The method includes (i) generating a sequence of audio files with cues for delivering a presentation, (ii) triggering playing an audio file from the sequence of audio files, and (iii) playing the sequence of audio files one by one, on the computing device, using the personal audio devicecoupled to a computing deviceto enable the presenter to recall and speak the content based on the sequence of the audio files.

AUTONOMOUS MOBILE BODY, INFORMATION PROCESSING METHOD, PROGRAM, AND INFORMATION PROCESSING DEVICE
20230042682 · 2023-02-09 ·

The present technology relates to an autonomous mobile body, an information processing method, a program, and an information processing device, by which a user experience based on an output sound of the autonomous mobile body can be improved. The autonomous mobile body includes a recognition section that recognizes a paired device that is paired with the autonomous mobile body, and a sound control section that changes a control method for an output sound to be outputted from the autonomous mobile body, on the basis of a recognition result of the paired device, and controls the output sound in accordance with the changed control method. The present technology is applicable to a robot, for example.

METHOD AND APPARATUS FOR PROCESSING VIRTUAL VIDEO LIVESTREAMING, STORAGE MEDIUM AND ELECTRONIC DEVICE
20230039789 · 2023-02-09 ·

A method includes: receiving text data and motion data of a virtual object, the motion data including a motion identifier of a specified motion and a start position identifier of a start position that the specified motion starts being in line with text in the text data; generating audio data and expression data of the virtual object according to the text data, and generating facial images of the virtual object according to the expression data; generating a background image sequence containing the specified motion according to the start position identifier and the motion identifier, the background image sequence including at least one background image; performing image fusion processing on the facial images and the at least one background image to obtain one or more live video frames; and synthesizing the live video frames with the audio data into a live video stream in real time.

Digital audio method for creating and sharing audio books using a combination of virtual voices and recorded voices, customization based on characters, serialized content, voice emotions, and audio assembler module
11594210 · 2023-02-28 ·

A method includes receiving a text file of an author's book as input to a serialized process that creates a record of each paragraph of text and creating a character file with associated character attributes and information required for the recording process and or virtualization process. The method includes combining the serialized file with the character file to create a snippet file, assigning characters to snippets, and generating audio files from snippets using text-to-speech APIs. The snippets of text are assigned to a character, can be edited and audio played back. The method includes sharing snippets with narrators to record specific characters not represented by text-to-speech synthesized audio and concatenating all audio files from snippets, with proper time spacing, into a publishable audiobook format. The snippets are concatenated, and audio files are created through links to text-to-speech API processes. The snippets are concatenated and shared with a human narrator.

Digital audio method for creating and sharing audio books using a combination of virtual voices and recorded voices, customization based on characters, serialized content, voice emotions, and audio assembler module
11594210 · 2023-02-28 ·

A method includes receiving a text file of an author's book as input to a serialized process that creates a record of each paragraph of text and creating a character file with associated character attributes and information required for the recording process and or virtualization process. The method includes combining the serialized file with the character file to create a snippet file, assigning characters to snippets, and generating audio files from snippets using text-to-speech APIs. The snippets of text are assigned to a character, can be edited and audio played back. The method includes sharing snippets with narrators to record specific characters not represented by text-to-speech synthesized audio and concatenating all audio files from snippets, with proper time spacing, into a publishable audiobook format. The snippets are concatenated, and audio files are created through links to text-to-speech API processes. The snippets are concatenated and shared with a human narrator.

Outside ordering system
11594223 · 2023-02-28 · ·

An ordering system can be positioned partially, or completely, outside in a retail environment with an ordering device located outside of a building on a site. The ordering device receiving a first audio stream concurrently with a second audio stream from an employee and proceeds to capture the first audio stream with a first port of an on-site computing device while capturing the second audio stream with a second port of the on-site computing device. A customer strategy can be executed with an intelligence module of the on-site computing device connected to the ordering device with the on-site customer strategy directing automated interactions with a first on-site customer to compile a retail order. The employee may communicate directly with the intelligence module via the second port without interrupting the first audio stream.

Smart interactive media content guide

Methods, apparatuses and media for providing content upon request are provided. A search request for content is received from a user. A first filter is applied to the search request to modify the search request before a search algorithm searches for the content to return in response to the search request. Items of content are determined based on the search request to which the first filter is applied. A second filter is applied to the items of content to determine search results. The search results are provided to the user.

Smart interactive media content guide

Methods, apparatuses and media for providing content upon request are provided. A search request for content is received from a user. A first filter is applied to the search request to modify the search request before a search algorithm searches for the content to return in response to the search request. Items of content are determined based on the search request to which the first filter is applied. A second filter is applied to the items of content to determine search results. The search results are provided to the user.

Generating videos with a character indicating a region of an image
11595738 · 2023-02-28 · ·

Methods, systems, and computer-readable media for generating videos with characters indicating regions of images are provided. For example, an image containing a first region may be received. At least one characteristic of a character may be obtained. A script containing a first segment of the script may be received. The first segment of the script may be related to the first region of the image. The at least one characteristic of a character and the script may be used to generate a video of the character presenting the script and at least part of the image, where the character visually indicates the first region of the image while presenting the first segment of the script.

Generating videos with a character indicating a region of an image
11595738 · 2023-02-28 · ·

Methods, systems, and computer-readable media for generating videos with characters indicating regions of images are provided. For example, an image containing a first region may be received. At least one characteristic of a character may be obtained. A script containing a first segment of the script may be received. The first segment of the script may be related to the first region of the image. The at least one characteristic of a character and the script may be used to generate a video of the character presenting the script and at least part of the image, where the character visually indicates the first region of the image while presenting the first segment of the script.