H04N21/440236

Media message creation with automatic titling

In some implementations, a user device can be configured to create media messages with automatic titling. For example, a user can create a media messaging project that includes multiple video clips. The video clips can be generated based on video data and/or audio data captured by the user device and/or based on pre-recorded video data and/or audio data obtained from various storage locations. When the user device captures the audio data for a clip, the user device can obtain a speech-to-text transcription of the audio data in near real time and present the transcription data (e.g., text) overlaid on the video data while the video data is being captured or presented by the user device.

Space efficiency and management of content

A transcoder resource retrieves content stored in a repository. The stored content can be encoded in accordance with a first encoding format. The transcoder resource transcodes the retrieved content into a second encoding format. To increase available capacity, the transcoder resource stores the content in the second encoding format as a replacement to the content encoded according to the first encoding format. The second encoding format can support a higher compression ratio than the first encoding format. Additionally, the transcoding can be performed as one of multiple tasks executed in parallel. In such an instance, during parallel processing, a content management resource interleaves execution of the main processing task and the background task of transcoding the content into the second encoding format. The transcoding task can be assigned a lower execution priority than one or more other tasks executed in parallel.

VIDEO SYNTHESIS METHOD, APPARATUS, COMPUTER DEVICE AND READABLE STORAGE MEDIUM
20220007061 · 2022-01-06 ·

The present disclosure provides a video synthesis method, apparatus, computer device and computer-readable storage medium, which the method includes: acquiring a first video; capturing second video data photographed in real time; performing first encoding on the second video data to obtain an encoded video; synthesizing the first video and the encoded video to obtain synthesized video data; and performing second encoding on the synthesized video data to obtain a target video. By means of the method, there is less loss in the obtained target video frames and a relatively high definition of the video frames.

CONTEXT SENSITIVE ADS
20210352379 · 2021-11-11 ·

Advertisements are tailored not only to a person's profile but also to the context in which the person finds himself. Thus, for example a video-based advertisement may be reformatted or reprovisioned in audio format when the person is driving, while an audio-based advertisement may be reformatted or reprovisioned to video format in noisy conditions.

Audio content recognition method and apparatus, and device and computer-readable medium

Embodiments of the present disclosure disclose an audio content recognition method and apparatus, an electronic device and a non-transitory computer-readable medium. A specific implementation of the method includes: obtaining a voice fragment collection and a non-voice fragment collection by segmenting audio; determining a type and language information of each voice fragment in the voice fragment collection; obtaining, for each voice fragment in the voice fragment collection, a first recognition result by performing voice recognition on the voice fragment based on the type and the language information of the voice fragment. In the implementation, speaking and music fragments in the audio are recognized by different models, so that two audio contents may both have better recognition effects. Moreover, audio of different language contents is recognized by using different models, thereby further improving a voice recognition effect.

ELECTRONIC DEVICE AND CONTROL METHOD THEREFOR
20210344974 · 2021-11-04 · ·

An electronic device is disclosed. The present electronic device comprises a display, a speaker, an input unit, and a processor for controlling the display so that an image signal inputted through the input unit is displayed, controlling the speaker so that an audio signal synchronized with the displayed image signal is outputted, and controlling the display so that caption information corresponding to the audio signal outputted during a preset previous time is displayed on the basis of the point of time when a user command is inputted.

Method and apparatus for secure transfer and playback of multimedia content
11166001 · 2021-11-02 · ·

A method and apparatus for secure transfer and playback of multimedia content enables the secure transfer of multimedia content from a digital video recorder (DVR) to a personal computer (PC) and further to a handheld device. A DVR determines which devices on a Local Area Network (LAN) are authorized to share and/or retrieve content from the DVR. The DVR receives a connection request from a PC on the LAN, authorizes the connection request and establishes a secure connection between the DVR and the PC. Once the secure connection is established, the DVR receives a request for multimedia content from the PC, prepares the multimedia content for transfer and transfers the multimedia content to the PC.

Systems and methods for displaying subjects of a video portion of content

Systems and methods are described herein for displaying subjects of a portion of content. Media data of content is analyzed during playback, and a number of action signatures are identified. Each action signature is associated with a particular subject within the content. The action signature is stored, along with a timestamp corresponding to a playback position at which the action signature begins, in association with an identifier of the particular subject. Upon receiving a command, icons representing each of a number of action signatures at or near the current playback position are displayed. Upon receiving user selection of an icon corresponding to a particular signature, a portion of the content corresponding to the action signature is played back.

Video content conversion

Described are techniques for video conversion for accessibility including a technique comprising determining, using data from at least one camera, that a user is distracted based on a direction of gaze of the user with respect to a display device presenting video content. The technique further comprises converting, by a machine learning model, the video content to audio content in response to determining the user is distracted, wherein the audio content comprises a description of the video content. The technique further comprises outputting, using at least one speaker, the audio content to the user while the user is distracted.

METHOD FOR GENERATING AND REPRODUCING MULTIMEDIA CONTENT, ELECTRONIC DEVICE FOR PERFORMING SAME, AND RECORDING MEDIUM IN WHICH PROGRAM FOR EXECUTING SAME IS RECORDED
20230333725 · 2023-10-19 ·

Method for displaying multimedia content, electronic device for performing same, and recording medium in which program for executing same is recorded are disclosed. In one embodiment, a method for displaying multimedia content comprises acquiring multimedia content including video data which is reproduced as a video, and slide data including a key scene which is matched with event time point in a reproduction time period of the video data and is displayed in a slideshow manner, acquiring a text data corresponding to the multimedia content, displaying the multimedia content in a first area according to a video mode for reproducing the video data as the video or a slideshow mode for displaying the key scene in the slideshow manner, displaying at least a portion of the text data in a second area; and adjusting the displayed text data according to the displayed multimedia content.