IPIQ

H04N9/8211

Video dubbing method, apparatus, device, and storage medium

11817127 · 2023-11-14 ·

Beijing Bytedance Network Technology Co., Ltd.

The present disclosure provides a video dubbing method, an apparatus, a device, and a storage medium. The method includes: when receiving an audio recording start trigger operation for a first time point of a target video and starting from a video picture corresponding to the first time point, playing the target video based on a timeline and receiving audio data based on the timeline; and when receiving an audio recording end trigger operation for a second time point, generating an audio recording file. The audio recording file has a linkage relationship with a timeline of a video clip taking the video picture corresponding to the first time point as a starting frame and taking a video picture corresponding to the second time point as an ending frame.

Digital deposition and evidence recording system

11463650 · 2022-10-04 ·

CVISUALEVIDENCE, LLC

James Curio

Embodiments include a modular video recording system. The system includes a first module for supporting a primary input to be recorded. The system may include combinations of modules for supporting different combinations of recording inputs from video and audio sources for recording the received inputs in different combinations.

VIDEO PROCESSING METHOD AND DEVICE, TERMINAL, AND STORAGE MEDIUM

20220264053 · 2022-08-18 ·

The embodiments of the disclosure provide a video processing method and device, terminal and storage medium. The method includes: turning on a first camera located at a first side of a terminal so as to obtain a first video stream through the first camera; turning on a second camera located at a second side of the terminal so as to obtain a second video stream through the second camera; receiving a switching command and performing a preset switching operation on the first video stream and the second video stream according to the switching command; recording receiving time of the switching command; and generating timeline information according to the receiving time and the preset switching operation. In the method of the disclosure, by recording the receiving time of the switching command and generating the timeline information, more flexible choices may be provided for subsequent video presentation and editing.

Electronic device for linking music to photography, and control method therefor

11445144 · 2022-09-13 ·

Samsung Electronics Co., Ltd.

The present invention relates to a content producing device and method for matching and storing music information when an electronic device captures an image and, particularly, to a content producing device and method for storing, together with a captured image, information on music played by an electronic device or around the electronic device when the image is captured. According to one embodiment of the present disclosure, a control method for an electronic device comprises the steps of: capturing an image when a photographing instruction is inputted by a user; acquiring, during capturing of the image, sound source information on music played in a space in which the electronic device is located; and matching the sound source information on music to the captured image and storing the same.

METHOD AND DEVICE FOR GENERATING VIDEO FILE, COMPUTER APPARATUS, AND STORAGE MEDIUM

20220286218 · 2022-09-08 ·

Chintan Pandya

Embodiments of the present disclosure provide a method and device for generating a video file, a computer apparatus, and a storage medium. The method includes: obtaining a video-recording instruction, and recording a current picture on a recording interface according to the video-recording instruction; scanning an audio stream broadcasted to a predetermined channel by an audio device; in response to the scanned audio stream meeting a predetermined condition, determining the scanned audio stream to be a target audio stream, and displaying prompt information on the recording interface; and in response to receiving an operation of selecting the target audio stream according to the prompt information, combining the selected target audio stream and the recorded current picture to obtain a video file.

Recording presentations using layered keyframes

11437072 · 2022-09-06 ·

Moxtra, Inc.

A layered-keyframe-based, presentation recording service provides for presentation recording sessions, the recording of presentations, and the creation of presentation videos. A user records with the user's device the document pages and page annotations, as well audio and video streams, that are presented using the device during the course of a presentation recording session. The pages, annotations and video streams are efficiently and separately recorded as keyframes. These keyframes are used as document, annotation and video layers to create layered keyframes. A presentation video is created from the layered keyframes and the recorded audio stream. Users can then playback presentation videos at a time, place and manner that is available to, accessible by and/or convenient to them.

CONTROLLING SOUNDS OF INDIVIDUAL OBJECTS IN A VIDEO

20220214858 · 2022-07-07 ·

A method for modifying a sound produced by a sound source in a video includes capturing video and audio of a scene is disclosed. Audio is captured using a microphone array. A sound source is isolated and a direction of arrival of the sound source with respect to a capture location is identified. One or more visual objects in the captured video are identified. One of the isolated sound sources is associated with one of the identified visual objects. An input identifying one of the isolated sound sources is received during playing of the captured video and audio. The input includes a command. Responsive to receiving the input, an attribute of the identified isolated sound source is modified. The input may identify a visual object associated with a sound source. A system and article of manufacture are also disclosed.

Video tagging by correlating visual features to sound tags

11450353 · 2022-09-20 ·

Sony Interactive Entertainment Inc.

Automatically recommending sound effects based on visual scenes enables sound engineers during video production of computer simulations, such as movies and video games. This recommendation engine may be accomplished by classifying SFX and using a machine learning engine to output a first of the classified SFX for a first computer simulation based on learned correlations between video attributes of the first computer simulation and the classified SFX.

Method and device for processing multimedia information, electronic equipment and computer-readable storage medium

11272136 · 2022-03-08 ·

BEIJING MICROLIVE VISION TECHNOLOGY CO., LTD

Deping Liu

The present application discloses a method and device for processing multimedia information, an electronic equipment, and a computer-readable storage medium. The method for processing multimedia information includes: detecting whether multimedia configuration parameters have changed during a process of recording multimedia information; and recording the multimedia information based on the changed multimedia configuration parameters when detecting that the multimedia configuration parameters have changed. According to the embodiments of the present application, multimedia configuration parameters of the special effects such as stickers, make-up, filters, and mixing can be added during the recording of multimedia information, which improves the user experience.

Systems and methods for adjusting dubbed speech based on context of a scene

11151980 · 2021-10-19 ·

Rovi Guides, Inc.

Systems and methods are disclosed herein for detecting dubbed speech in a media asset and receiving metadata corresponding to the media asset. The systems and methods may determine a plurality of scenes in the media asset based on the metadata, retrieve a portion of the dubbed speech corresponding to the first scene, and process the retrieved portion of the dubbed speech corresponding to the first scene to identify a speech characteristic of a character featured in the first scene. Further, the systems and methods may determine whether the speech characteristic of the character featured in the first scene matches the context of the first scene, and if the match fails, perform a function to adjust the portion of the dubbed speech so that the speech characteristic of the character featured in the first scene matches the context of the first scene.

Patent classifications

H04N9/8211