Patent classifications
H04N9/8211
VIDEO TAGGING BY CORRELATING VISUAL FEATURES TO SOUND TAGS
Automatically recommending sound effects based on visual scenes enables sound engineers during video production of computer simulations, such as movies and video games. This recommendation engine may be accomplished by classifying SFX and using a machine learning engine to output a first of the classified SFX for a first computer simulation based on learned correlations between video attributes of the first computer simulation and the classified SFX.
Automatic processing of double-system recording
A method for automatically producing a video and audio mix at a first portable electronic device. The method receives a request to capture video and audio, performs a network discovery process to find a second portable electronic device, and sends a message to the second device indicating when to start recording audio for a double system recording session. The method initiates the recording session, such that both devices record concurrently. In response to the first device stopping the recording of audio and sound, signaling the second device to stop recording for the identified recording session. In response to the first device receiving a first audio track from the second device that contains an audio signal recorded during the recording session, automatically generating a mix of video and audio, such that one of the audio signals from the first and second tracks is ducked relative to the other.
Digital deposition and evidence recording system
Embodiments include a modular video recording system. The system includes a first module for supporting a primary input to be recorded. The system may include combinations of modules for supporting different combinations of recording inputs from video and audio sources for recording the received inputs in different combinations.
Motion-based camera mode control to reduce rolling shutter artifacts
Cameras may monitor its operation and automatically switch between operation modes thereby to best capture users' experiences. Auxiliary sensor data collected by the one or more sensors and/or captured image data may be analyzed to determine when a camera should switch to a high-motion operation mode. The auxiliary sensor data include motion information of the camera and the content of the captured images include motion information of the captured objects. When a camera or objects captured by the camera are moving rapidly, the camera is switched to operate at the high-motion operation mode to ensure image quality and minimize artifacts to best capture users' experiences. Motion of the camera may be detected or predicted by analyzing the auxiliary sensor data and motion of the captured objects may be detected by analyzing the captured image data thereby to determine whether or not the camera should switch to the high-motion operation mode.
ELECTRONIC DEVICE FOR LINKING MUSIC TO PHOTOGRAPHY. AND CONTROL METHOD THEREFOR
The present invention relates to a content producing device and method for matching and storing music information when an electronic device captures an image and, particularly, to a content producing device and method for storing, together with a captured image, information on music played by an electronic device or around the electronic device when the image is captured. According to one embodiment of the present disclosure, a control method for an electronic device comprises the steps of: capturing an image when a photographing instruction is inputted by a user; acquiring, during capturing of the image, sound source information on music played in a space in which the electronic device is located; and matching the sound source information on music to the captured image and storing the same.
INFORMATION RECORDING/REPRODUCING APPARATUS
Module compressing/encoding plural images for recording, or an image recording apparatus or image recording system including the module. At least two first and second image input module types capture an image of frames from a video signal source, and convert into digital image data for output; an image input arbitration module arbitrates to place digital image data provided by the first type before digital image data provided by the second type. The first type image input module is provided with a real-time image. The apparatus or system further includes: a memory module temporarily storing the real-time image by a processing time for the arbitration module to process digital image data provided by the second type of image input module, an image encoding module compressing and encoding the digital image data, and changing a processing speed to perform a process of compression and encoding more in real time.
Recording medium, playback method, and playback device
A playback method of reading out and playing streams from a recording medium is provided. A stream file. including a first luminance video and a second luminance video, a playlist file, first and second combination information, elementary streams that can be played together with the first luminance video, and elementary streams that can be played together with the second luminance video are recorded in the recording medium. The playback method includes, when playing the first luminance video, reading out any one of the elementary streams listed in the first combination information of the playlist file, and playing at the same time with the first luminance video, and when playing the second luminance video, reading out any one of the elementary streams listed in the second combination information of the playlist file, and playing at the same time with the second luminance video.
Methods and system for customizing immersive media content
A computing device has memory, a microphone, and an image sensor. A process plays an immersive video in a user interface region, which displays a portion of the immersive video according to a user selected visibility window. While playing the immersive video, the user adjusts the window, and the process records information that identifies placement of the window within the immersive video. The process records audio provided by the user and records video of the user. The process uses the information that identifies placement of the window to form a customized video including what was displayed in the window while playing the immersive video. The customized video also includes a visual overlay in a peripheral portion of the customized video, which includes the recorded video of the user. The customized video also includes an audio overlay using the recorded audio. The process transmits the customized video to another computer.
Audio encoding using video information
Various audio encoders and methods of using the same are disclosed. In one aspect, an apparatus is provided that includes an audio encoder and an audio encoder mode selector. The audio encoder mode selector is operable to analyze video data and adjust an encoding mode of the audio encoder based on the analyzed video data.
Systems and methods for adjusting dubbed speech based on context of a scene
Systems and methods are disclosed herein for detecting dubbed speech in a media asset and receiving metadata corresponding to the media asset. The systems and methods may determine a plurality of scenes in the media asset based on the metadata, retrieve a portion of the dubbed speech corresponding to the first scene, and process the retrieved portion of the dubbed speech corresponding to the first scene to identify a speech characteristic of a character featured in the first scene. Further, the systems and methods may determine whether the speech characteristic of the character featured in the first scene matches the context of the first scene, and if the match fails, perform a function to adjust the portion of the Dubbed Speech of the dubbed speech so that the speech characteristic of the character featured in the first scene matches the context of the first scene.