Patent classifications
H04N5/93
Systems, Methods, and Devices for Synchronization of Vehicle Data with Recorded Audio
A method for post-processing to synchronize audio data with vehicle data includes generating an artificial sound data based on time-series vehicle data. The method includes determining an offset that maximizes cross-correlation between the artificial sound data and recorded audio data. The method also includes shifting one or more of the time-series data and the recorded audio data relative to each other in time based on the offset. The shift may be used to generate or render a synchronized set of time-series data and recorded audio data.
Storage system of original frame of monitor data and storage method thereof
A storage system of original frames of monitor data and a storage method thereof are provided. The storage system includes a monitor sensor, an event marking circuit, a data storage circuit and a frame processing circuit. The monitor sensor provides a plurality of original frames. The event marking circuit has an input terminal coupling to the monitor sensor and an output terminal, and is used for determining an event intensity of a corresponding one of the original frames and marks the event intensity on the corresponding original frame. The data storage circuit is coupled to the output terminal and is used for completely storing the original frames. The frame processing circuit is coupled to the data storage circuit and is used for checking whether the original frames within the data storage circuit are deleted according to the event intensities.
Method and system for sign language translation and descriptive video service
A method and a system for a sign language translation and descriptive video service are disclosed. The method and system enables an easy preparation of video including a descriptive screen and a sign language so that a hearing-impaired person and a visually impaired person can receive a help for using a video media. The method includes extracting a character string in a text form from a caption of an original video; translating the character string in the text form extracted from the caption of the original video to a machine language; matching the character string translated to the machine language with a sign language video in a database; synchronizing the original video with the sign language video, and mixing the original video and the synchronized sign language video; and editing the sign language video with a sign language video editing tool.
Video production method, computer device, and storage medium
A video production method and apparatus, a storage medium, and a computer device are disclosed. The method includes: receiving a follow-shot instruction in a case that reference video content is played on a video play interface, the follow-shot instruction including a reference video identifier; displaying a first video display region and a second video display region on a terminal screen; playing the reference video content in the first video display region, and recording displayed real-time video content in the second video display region; and generating a target video based on the recorded real-time video content and the reference video content. The first video display region and the second video display region are displayed on the terminal screen, the reference video content is played in the first video display region, and the displayed real-time video content is recorded in the second video display region.
Systems and methods for adjusting dubbed speech based on context of a scene
Systems and methods are disclosed herein for detecting dubbed speech in a media asset and receiving metadata corresponding to the media asset. The systems and methods may determine a plurality of scenes in the media asset based on the metadata, retrieve a portion of the dubbed speech corresponding to the first scene, and process the retrieved portion of the dubbed speech corresponding to the first scene to identify a speech characteristic of a character featured in the first scene. Further, the systems and methods may determine whether the speech characteristic of the character featured in the first scene matches the context of the first scene, and if the match fails, perform a function to adjust the portion of the dubbed speech so that the speech characteristic of the character featured in the first scene snatches the context of the first scene.
Information processing apparatus and recording medium
There is provided An information processing apparatus including a perspective switching control unit configured to switch a perspective when playing back content acquired by a content acquisition unit to at least one of a first-person perspective and a third-person perspective, an editing unit configured to edit a part of the content, and a playback control unit configured to play back the content edited by the editing unit in the at least one of the first-person perspective and the third-person perspective to which the perspective has been switched by the perspective switching control unit.
Sound processing system and processing method that emphasize sound from position designated in displayed video image
A recorder receives designation of a video which is desired to be reproduced from a user. If designation of one or more designated locations where sound is emphasized on a screen of a display which displays the video is received by the recorder from the user via an operation unit during reproduction or temporary stopping of the video, a signal processing unit performs an emphasis process on audio data, that is, the signal processing unit emphasizes audio data in directions directed toward positions corresponding to the designated locations from a microphone array by using audio data recorded in the recorder. A reproducing device reproduces the emphasis-processed audio data and video data in synchronization with each other.
Sound processing system and processing method that emphasize sound from position designated in displayed video image
A recorder receives designation of a video which is desired to be reproduced from a user. If designation of one or more designated locations where sound is emphasized on a screen of a display which displays the video is received by the recorder from the user via an operation unit during reproduction or temporary stopping of the video, a signal processing unit performs an emphasis process on audio data, that is, the signal processing unit emphasizes audio data in directions directed toward positions corresponding to the designated locations from a microphone array by using audio data recorded in the recorder. A reproducing device reproduces the emphasis-processed audio data and video data in synchronization with each other.
Object detecting apparatus, image capturing apparatus, method for controlling object detecting apparatus, and storage medium
An object detecting apparatus includes a detecting unit configured to detect an area of a predetermined object from an image, a calculating unit configured to calculate an evaluation value on the area detected by the detecting unit, and a control unit configured, when the evaluation value satisfies a predetermined criterion, to determine that the area is the predetermined object. The predetermined criterion is set depending on an amount of distortion of an image displayed on a display unit.
User interface for method for creating a custom track
A system for allowing a user to create a custom track on a user apparatus, the user apparatus having a display is described. A memory stores a plurality of video clips and an audio track having a timeline. An application is stored in the memory. The application is configured to provide, on the display of the user apparatus, a plurality of video source windows, each of the plurality of video source windows corresponding to a respective one of the plurality of video clips. The application is further configured to allow the user to create the custom track while the audio track is playing by correlating portions of the plurality of video clips with the audio track by selecting respective ones of the plurality of video source windows at desired times in the timeline of the audio track.