H04N21/2335

REMOTELY GENERATED ENCODING METADATA FOR LOCAL CONTENT ENCODING
20210352340 · 2021-11-11 ·

Embodiments are directed towards remotely generating encoding metadata at a remote content distributor for use by a local user computing device. The remote content distributor receives and encodes content. During or after the encoding process, the remote content distributor generates encoding metadata that indicates how the content was encoded by the remote content distributor. The remote content distributor provides the encoding metadata to the user computer device. The user computing device receives the content and the encoding metadata and encodes the content based on the encoding metadata. The user computing device can then provide the encoded content to another computing device for decoding and presentation to a user.

Fragment-aligned audio coding

Audio video synchronization and alignment or alignment of audio to some other external clock are rendered more effective or easier by treating fragment grid and frame grid as independent values, but, nevertheless, for each fragment the frame grid is aligned to the respective fragment's beginning. A compression effectiveness lost may be kept low when appropriately selecting the fragment size. On the other hand, the alignment of the frame grid with respect to the fragments' beginnings allows for an easy and fragment-synchronized way of handling the fragments in connection with, for example, parallel audio video streaming, bitrate adaptive streaming or the like.

Allocation of video recording functionality
11792469 · 2023-10-17 · ·

A system may receive a plurality of requests to record a plurality of programs. Based on these requests, resources may be provisioned for transcoding variable bit rate streams into constant bit rate streams. Resources may be provisioned based on assigning transcoding tasks to an existing set of deployed devices in which the transcoding tasks correspond to the plurality of requests. Resources may further be provisioned based on incrementally deploying an additional device to the existing set of deployed devices when the transcoding capacity reaches a certain level.

Watermarking with phase shifting

Apparatus, devices, systems, methods, and articles of manufacture are disclosed for watermarking with phase shifting. Example watermark decoding apparatus disclosed herein are to identify watermark components in a media signal, determine a phase shift pattern associated with the watermark components in the media signal, the phase shift pattern based on one or more phase references, and detect a symbol of a watermark based on the phase shift pattern, the watermark associated with the watermark components in the media signal.

TEXT-TO-SPEECH AUDIO SEGMENT RETRIEVAL

A client computing system sends to a server system a presentation request for an audio presentation of electronic communications, and receives a manifest from the server system. The manifest indicates a plurality of segment-specific retrieval locations in which a different one of the plurality of segment-specific retrieval locations is indicated for each of a plurality of text-to-speech audio segments of the audio presentation. For each of the plurality of text-to-speech audio segments, the client computing system identifies a presentation order of the text-to-speech audio segment within the audio presentation; sends to the server system a segment request for the text-to-speech audio segment at the segment-specific retrieval location for that text-to-speech audio segment; receives from the server system the text-to-speech audio segment responsive to the segment request for that text-to-speech audio segment; and outputs the text-to-speech audio segment in the identified presentation order.

Method and System for Adding Subtitles and/or Audio
20230283815 · 2023-09-07 ·

A method includes that a media asset server receives an identifier and a new-language file of a target video and converts the new-language file into a new-language medium file. The media asset server finds a first index file based on the identifier of the target video, and obtains a second index file based on a storage address of the new-language medium file on the media asset server. The media asset server sends the new-language medium file and the second index file to a content delivery server. The content delivery server replaces the storage address of the new-language medium file on the media asset server in the second index file with a storage address of the new-language medium file on the content delivery server to obtain a third index file. The content delivery server generates a first URL of the target video.

Online gaming platform voice communication system
11752433 · 2023-09-12 · ·

A method and system is disclosed that permits users of an online gaming platform to communicate via voice with other user in the online gaming platform.

Watermarking with phase shifting

Apparatus, devices, systems, methods, and articles of manufacture are disclosed for watermarking with phase shifting. An example watermark encoding apparatus includes memory, machine readable instructions, and processor circuitry to execute the instructions to select a plurality of frequencies for encoding a watermark symbol, apply a phase shift pattern to the plurality of frequencies, the phase shift pattern based on a phase reference, and embed in the plurality of frequencies the applied phase shift pattern in a media signal to encode the watermark symbol in the a media signal, and embed the phase reference into the media signal.

Techniques for clock rate synchronization
11758214 · 2023-09-12 · ·

Disclosed herein are playback devices, groups of playback devices, and methods of operating playback devices and groupings thereof, wherein individual playback devices are configured to detect and correct clock rate drift by synchronizing a clock rate and perhaps clock time of a local clock to a remote time source.

Encoding media content for playback compatibility

Methods and apparatus are described for encoding and delivering secondary content (e.g., advertisements) for insertion in primary content (e.g., live events, movies, and television shows). Secondary content is encoded such that the duration of any audio component of the secondary content is equal to or longer than the video component duration. When the secondary content is inserted, the overall duration of the inserted content is specified to be equal to the video component duration.