Patent classifications
G10H2240/325
AUDIO DETECTION METHOD AND APPARATUS, COMPUTER DEVICE, AND READABLE STORAGE MEDIUM
This application provides an audio detection method performed by a computer device. The method includes: acquiring a target time point and a reference point of the target time point from target audio data; performing energy evaluation on the target time point according to an audio amplitude value of the target time point to obtain an energy evaluation value of the target time point; performing energy evaluation on the reference point according to an audio amplitude value of the reference point to obtain an energy evaluation value of the reference point; performing accuracy verification on the target time point according to the energy evaluation value of the target time point and the energy evaluation value of the reference point; and if the accuracy verification on the target time point succeeds, adding the target time point as a target stress point into a target stress point set.
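The verification step described above can be sketched as follows. This is a minimal illustration, not the claimed method: the abstract does not say how energy is computed, where the reference point lies, or how the two energy values are compared. The mean-squared-amplitude window, the `reference_offset`, and the `ratio` threshold are all assumptions, and the function names are hypothetical.

```python
from typing import List, Sequence


def energy_at(samples: Sequence[float], index: int, half_window: int = 2) -> float:
    """Mean squared amplitude in a small window centred on `index`.

    The window-based energy measure is an assumption made for this sketch.
    """
    lo = max(0, index - half_window)
    hi = min(len(samples), index + half_window + 1)
    window = samples[lo:hi]
    return sum(x * x for x in window) / len(window)


def verify_stress_points(
    samples: Sequence[float],
    candidates: Sequence[int],
    reference_offset: int = 5,   # assumed: reference point lies this many samples earlier
    ratio: float = 2.0,          # assumed: target must be this much more energetic
    half_window: int = 2,
) -> List[int]:
    """Keep the candidate time points whose energy clearly exceeds the reference energy."""
    stress_points = []
    for t in candidates:
        ref = t - reference_offset
        # Skip candidates whose target or reference point falls outside the audio.
        if not (0 <= t < len(samples)) or ref < 0:
            continue
        target_energy = energy_at(samples, t, half_window)
        reference_energy = energy_at(samples, ref, half_window)
        if target_energy > ratio * reference_energy:
            stress_points.append(t)
    return stress_points


# A quiet signal with one loud burst at samples 10..14.
audio = [0.1] * 10 + [1.0] * 5 + [0.1] * 10
accepted = verify_stress_points(audio, candidates=[12, 20])
```

Here the candidate inside the burst passes the check, while the candidate in the quiet tail is rejected because its reference window overlaps the burst.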
METHOD AND SYSTEM FOR INSTRUMENT SEPARATING AND REPRODUCING FOR MIXTURE AUDIO SOURCE
A method and a system for instrument separating and reproducing for a mixture audio source are provided. The method and/or the system includes inputting selected music into an instrument separation model for extracting features therefrom, determining audio source signals of multiple channels for the separation of all instruments, each channel containing sound of one instrument, and transmitting the signals of the different channels to multiple speakers placed at designated positions for playing, which can reproduce or recreate an immersive sound field listening experience for users.
Performance analysis method and performance analysis device
A performance analysis method realized by a computer includes sequentially estimating performance positions within a musical piece by an analysis process applied to an audio signal representing a performance sound of the musical piece, and setting a performance position at a first time point on a time axis within the musical piece to a performance position corresponding to a time series of the performance positions estimated by the analysis process in a selection period prior to and spaced away from the first time point within the musical piece.
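One possible reading of the abstract above is that the position at the first time point is extrapolated from estimates made in an earlier window that ends some distance before that time point. The sketch below follows that reading with an ordinary least-squares line fit. The fit itself, the function name, and the `gap` and `length` parameters are assumptions; the abstract does not specify how the time series is mapped to a position.

```python
from typing import Sequence


def extrapolate_position(
    times: Sequence[float],
    positions: Sequence[float],
    first_time: float,
    gap: float,      # assumed spacing between the selection period and first_time
    length: float,   # assumed duration of the selection period
) -> float:
    """Fit a line to estimates inside the selection period and evaluate it at first_time."""
    start, end = first_time - gap - length, first_time - gap
    pts = [(t, p) for t, p in zip(times, positions) if start <= t <= end]
    if len(pts) < 2:
        raise ValueError("need at least two estimates in the selection period")

    mean_t = sum(t for t, _ in pts) / len(pts)
    mean_p = sum(p for _, p in pts) / len(pts)
    var = sum((t - mean_t) ** 2 for t, _ in pts)
    if var == 0:
        raise ValueError("estimates in the selection period share one time stamp")
    cov = sum((t - mean_t) * (p - mean_p) for t, p in pts)

    slope = cov / var  # estimated performance speed
    return mean_p + slope * (first_time - mean_t)


# The estimate at t=4.5 is an outlier, but it lies inside the gap and is ignored.
estimated = extrapolate_position(
    times=[0.0, 1.0, 2.0, 3.0, 4.5],
    positions=[0.0, 2.0, 4.0, 6.0, 100.0],
    first_time=5.0,
    gap=1.0,
    length=3.0,
)
```

Leaving a gap before the first time point means that the most recent, possibly unstable estimates do not influence the result.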
SYSTEM AND METHOD FOR GENERATING AND EDITING A VIDEO
The invention provides a system and a computer-implemented method for generating and editing a video. The method includes providing a mobile communication device comprising a camera, a display, a central processing unit (CPU), a video generating application and a memory; starting the video generating application; and opening the camera and providing camera tutorials. The camera tutorials comprise instructions for camera positioning, camera moving, and camera aligning while taking videos. The method further includes taking videos of a scene following those instructions, uploading the videos to the memory, editing the videos, and producing a composite video for the scene. The camera tutorials include a “moving forward/backward” tutorial directing a user first to hold the camera still, to align a horizontal view line in the display with a marker line, and then to move the user's body forward or backward while taking a video of the scene. The editing of the videos includes slowing the videos down and matching the rhythm of the music accompanying each video to transitions between consecutive videos. The slowing down of the videos includes removing every other frame.
SERVER SIDE CROSSFADING FOR PROGRESSIVE DOWNLOAD MEDIA
Systems and methods are provided to implement and facilitate cross-fading, interstitials and other effects/processing of two or more media elements in a personalized media delivery service. Effects or crossfade processing can occur on the broadcast, publisher or server-side, but can still be personalized to a specific user, in a manner that minimizes processing on the downstream side or client device. The cross-fade can be implemented after decoding, processing, re-encoding, and rechunking the relevant chunks of each component clip. Alternatively, the cross-fade or other effect can be implemented on the relevant chunks in the compressed domain, thus obviating any loss of quality by re-encoding. A large scale personalized content delivery service can limit the processing to essentially the first and last chunks of any file, there being no need to process the full clip.
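The point that only the boundary chunks need processing can be illustrated with a small sketch on decoded samples. It assumes each clip is already split into chunks of PCM samples and that the last chunk of the outgoing clip and the first chunk of the incoming clip have equal length. The linear fade curve and the function names are assumptions; decoding, re-encoding, and the compressed-domain variant are not shown.

```python
from typing import List

Chunk = List[float]


def crossfade(tail: Chunk, head: Chunk) -> Chunk:
    """Linearly fade `tail` out while fading `head` in, sample by sample."""
    if len(tail) != len(head):
        raise ValueError("this sketch assumes equal-length boundary chunks")
    n = len(tail)
    mixed = []
    for i in range(n):
        gain_in = i / (n - 1) if n > 1 else 0.5
        gain_out = 1.0 - gain_in
        mixed.append(gain_out * tail[i] + gain_in * head[i])
    return mixed


def join_clips(clip_a: List[Chunk], clip_b: List[Chunk]) -> List[Chunk]:
    """Join two chunked clips, touching only A's last chunk and B's first chunk."""
    if not clip_a or not clip_b:
        raise ValueError("both clips need at least one chunk")
    boundary = crossfade(clip_a[-1], clip_b[0])
    # All other chunks are passed through unchanged.
    return clip_a[:-1] + [boundary] + clip_b[1:]


clip_a = [[1.0, 1.0], [1.0, 1.0, 1.0]]
clip_b = [[0.0, 0.0, 0.0], [0.0, 0.0]]
joined = join_clips(clip_a, clip_b)
```

Because the interior chunks are reused as they are, the per-user work stays constant regardless of clip length, which is the scaling property the abstract highlights.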
EXTERNAL EXTENDED DEVICE AND AUDIO PLAYBACK METHOD
An external extended device, an audio processing method and an audio playback method are provided. The external extended device is configured to receive power supplied and a signal transmitted by a television device and the external extended device includes: an integrated physical interface connecting the external extended device and the television device; and a sound-mixing processing chip electrically connected with the integrated physical interface and configured to: acquire an accompaniment audio signal transmitted by the television device via the integrated physical interface; perform sound-mixing processing on a user voice signal gathered by a microphone device and the accompaniment audio signal; and transmit a sound-mixed signal to a power amplifier circuit of the television device via the integrated physical interface.
NON-TRANSITORY COMPUTER-READABLE MEDIUM HAVING COMPUTER-READABLE INSTRUCTIONS AND SYSTEM
A sound controlling system includes a user terminal having a sound source, a wireless communication device, a digital to analog converter (DAC) and first processing electronics. The first processing electronics are configured to: provide data of a backing sound to the sound source; control the sound source to generate a sound signal based on the data; receive a first input instruction including a first instruction to transmit the sound signal and a second instruction to play back the backing sound; provide the sound signal to the wireless communication device when the first input instruction is the first instruction, and provide the sound signal to the DAC when the first input instruction is the second instruction; control the wireless communication device to convert the sound signal to a wireless signal and transmit the wireless signal; and convert the sound signal from a digital signal to an analog signal for play back of the backing sound.
AUDIOVISUAL COLLABORATION SYSTEM AND METHOD WITH SEED/JOIN MECHANIC
User interface techniques provide user vocalists with mechanisms for seeding subsequent performances by other users (e.g., joiners). A seed may be a full-length seed spanning much or all of a pre-existing audio (or audiovisual) work and mixing, to seed further contributions of one or more joiners, a user's captured media content for at least some portions of the audio (or audiovisual) work. A short seed may span less than all (and in some cases, much less than all) of the audio (or audiovisual) work. For example, a verse, chorus, refrain, hook or other limited “chunk” of an audio (or audiovisual) work may constitute a seed. A seeding user's call invites other users to join the full-length or short-form seed by singing along, singing a particular vocal part or musical section, singing harmony or other duet part, rapping, talking, clapping, recording video, adding a video clip from camera roll, etc. The resulting group performance, whether full-length or just a chunk, may be posted, livestreamed, or otherwise disseminated in a social network.
METHOD FOR CHORUS MIXING, APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIUM
The present disclosure provides a method for chorus mixing, an apparatus, an electronic device and storage media. The method includes converting a main vocal audio signal and a chorus audio signal into signals in frequency domain, respectively, wherein the chorus audio signal comprises main vocal audio played by a speaker; determining a delay between the main vocal audio signal and the chorus audio signal based on a frequency-domain signal of the main vocal audio signal and a frequency-domain signal of the main vocal audio played by the speaker included in a frequency-domain signal of the chorus audio signal; aligning the chorus audio signal with the main vocal audio signal based on the determined delay; performing an echo cancellation on the aligned chorus audio signal; and mixing audio of the main vocal audio signal and the echo-canceled chorus audio signal.
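The delay estimation and alignment steps can be sketched as below. Note the substitution: the abstract determines the delay from frequency-domain signals, whereas this sketch uses a plain time-domain cross-correlation, which is simpler to show without external libraries. The function names and the `max_delay` search bound are assumptions, and echo cancellation and mixing are omitted.

```python
from typing import List, Sequence


def estimate_delay(reference: Sequence[float], recorded: Sequence[float], max_delay: int) -> int:
    """Return the lag (in samples) at which `recorded` best matches `reference`.

    Time-domain cross-correlation, used here in place of the frequency-domain
    comparison that the abstract describes.
    """
    best_delay, best_score = 0, float("-inf")
    for d in range(max_delay + 1):
        overlap = min(len(reference), len(recorded) - d)
        score = sum(reference[i] * recorded[i + d] for i in range(overlap))
        if score > best_score:
            best_delay, best_score = d, score
    return best_delay


def align(recorded: Sequence[float], delay: int) -> List[float]:
    """Drop the leading `delay` samples so `recorded` lines up with the reference."""
    return list(recorded[delay:])


# The chorus recording hears the main vocal three samples late.
main_vocal = [0.0, 1.0, 0.0, -1.0, 0.5, 0.0, 0.0, 0.0]
chorus_recording = [0.0, 0.0, 0.0] + main_vocal
delay = estimate_delay(main_vocal, chorus_recording, max_delay=5)
aligned = align(chorus_recording, delay)
```

For realistic signal lengths, an FFT-based correlation would be the more practical choice, since the direct loop costs on the order of `max_delay` times the signal length.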
Data synchronisation
The present invention relates to a method and apparatus to synchronise audio and video data. More particularly, the present invention relates to a loop-based audio-visual mixing apparatus and method for synchronising a plurality of videos and their corresponding audio streams to create audio-visual compositions. According to one aspect, there is provided a method for creating a synchronised lineal sequence from multiple inputs of audio and video data, comprising the steps of: providing a first input, comprising audio and video data; providing one or more subsequent inputs, comprising audio and video data; determining at least one rhythm metric unit for each input; queueing the or each subsequent input such that the or each subsequent input is triggered at a beginning of a next said rhythm metric unit of a determined input.
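The queueing rule can be sketched by treating a rhythm metric unit as one bar of fixed duration and rounding a trigger request up to the next bar boundary. Treating the unit as a bar, deriving its duration from a tempo in beats per minute, and letting a request that lands exactly on a boundary fire at that boundary are all assumptions of this sketch. The function names are hypothetical.

```python
import math


def bar_duration(bpm: float, beats_per_bar: int = 4) -> float:
    """Length of one bar in seconds (assumed rhythm metric unit)."""
    return beats_per_bar * 60.0 / bpm


def next_trigger_time(request_time: float, unit_duration: float, loop_start: float = 0.0) -> float:
    """Round a trigger request up to the start of the next rhythm metric unit.

    A request exactly on a boundary is triggered at that boundary (assumption).
    """
    if unit_duration <= 0:
        raise ValueError("unit_duration must be positive")
    elapsed_units = (request_time - loop_start) / unit_duration
    return loop_start + math.ceil(elapsed_units) * unit_duration


unit = bar_duration(bpm=120)              # 2.0 seconds per bar in 4/4
trigger = next_trigger_time(3.1, unit)    # request arrives mid-bar
```

With a 120 bpm loop in 4/4, a subsequent input requested 3.1 seconds in is held until the bar that starts at 4.0 seconds, so it enters in time with the determined input.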