Patent classifications
G10H1/06
Server side crossfading for progressive download media
In exemplary embodiments of the present invention, systems and methods are provided to implement and facilitate cross-fading, interstitials and other effects/processing of two or more media elements in a personalized media delivery service, so that each client or user has a consistent, high-quality experience. The effects or crossfade processing can occur on the broadcast, publisher or server side, yet can still be personalized to a specific user, so that each user keeps an individual experience while the processing burden on the downstream side or client device is minimized. This approach enables a consistent user experience, independent of both the static and the dynamic capabilities of the client device. The cross-fade can be implemented by decoding the relevant chunks of each component clip, processing them, and then recoding and rechunking. Alternatively, in a preferred embodiment, the cross-fade or other effect can be applied to the chunks relevant to the effect in the compressed domain, thus avoiding any loss of quality from re-encoding. A large-scale personalized content delivery service can be implemented by limiting the processing to essentially the first and last chunks of any file, since there is no need to process the full clip. In exemplary embodiments of the present invention, this type of processing can easily be accommodated in cloud computing technology, where the first and last chunks may be conveniently extracted and processed within the cloud to meet the required load. Processing may also be done locally, for example by the broadcaster, given sufficient processing power to manage peak load.
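The decode-and-process path described above can be illustrated with a minimal sketch. It assumes the last chunk of the outgoing clip and the first chunk of the incoming clip have already been decoded to equal-length lists of float samples, and it uses an equal-power (cosine/sine) fade curve. The function name `crossfade` and the choice of curve are assumptions for illustration; the abstract does not specify a fade curve, and the compressed-domain variant is not shown.

```python
import math


def crossfade(tail, head):
    """Blend the decoded last chunk of one clip into the first chunk of the next.

    `tail` and `head` are equal-length lists of float samples (an assumption;
    a real service would decode them from the relevant chunks first).
    """
    if len(tail) != len(head):
        raise ValueError("chunks must have equal length")
    n = len(tail)
    mixed = []
    for i in range(n):
        t = (i + 0.5) / n                      # position within the fade, in (0, 1)
        gain_out = math.cos(t * math.pi / 2)   # outgoing clip fades from ~1 to ~0
        gain_in = math.sin(t * math.pi / 2)    # incoming clip fades from ~0 to ~1
        mixed.append(tail[i] * gain_out + head[i] * gain_in)
    return mixed


if __name__ == "__main__":
    # Fade a constant signal out into silence.
    print(crossfade([1.0] * 8, [0.0] * 8))
```

Because only these two chunks are touched, the remaining chunks of each clip can be served unchanged.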
Data-Driven Autosuggestion Within Media Content Creation Applications
A media composition application, such as a musical scorewriter or a digital audio workstation, provides in situ suggestions for continuation or completion of a media composition. The suggestions are based on some or all of the portion of the composition already composed or are based on a corpus of compositions, such as those by a particular composer or those of a specific genre. The length of the suggestions is specified by the user. The suggestions are provided within a graphical user interface of the application and displayed as a possible direct continuation of the composition within a musical stave. If the user rejects the suggestion, additional suggestions are automatically displayed in situ. Reductive, most-probable suggestions may be offered as well as exploratory suggestions that facilitate a creative compositional interaction between user and application. Data filters enable selected aspects of a data source to be used for suggestion generation.
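As a rough illustration of corpus-based continuation, the sketch below builds a first-order transition table from a corpus of note sequences and proposes a continuation of user-specified length. The first-order Markov model, the function names `build_model` and `suggest`, and the `rank` parameter (used to step to the next candidate after a rejection) are all assumptions chosen for brevity; the abstract does not state which model generates the suggestions.

```python
from collections import Counter, defaultdict


def build_model(corpus):
    """Count note-to-note transitions across a corpus of melodies."""
    model = defaultdict(Counter)
    for melody in corpus:
        for current, following in zip(melody, melody[1:]):
            model[current][following] += 1
    return model


def suggest(model, composed, length, rank=0):
    """Propose up to `length` notes continuing `composed`.

    `rank` picks the candidate for the first suggested note
    (0 = most probable); later notes always take the most probable
    transition. Rejecting a suggestion can be modeled as rank + 1.
    """
    suggestion = []
    current = composed[-1]
    for step in range(length):
        candidates = [note for note, _ in model.get(current, Counter()).most_common()]
        if not candidates:
            break  # nothing in the corpus follows this note
        use_rank = rank if step == 0 and rank < len(candidates) else 0
        current = candidates[use_rank]
        suggestion.append(current)
    return suggestion


if __name__ == "__main__":
    corpus = [["C", "D", "E", "C"], ["C", "D", "E", "G"], ["C", "D", "G"]]
    model = build_model(corpus)
    print(suggest(model, ["C"], 2))
    print(suggest(model, ["C", "D"], 1, rank=1))
```

Restricting `corpus` to one composer or genre before calling `build_model` plays the role of the data filters mentioned above.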
Systems and methods for viewing instant updates of an audio waveform with an applied effect
A method implemented in an audio editing device comprises retrieving, by the editing device, audio content and obtaining from a user, via a user interface, an effect to be applied to the audio content. The method further comprises determining a target number of samples based on the obtained effect, obtaining the target number of samples of the audio content, and identifying, within the obtained target number of samples, a maximum amplitude and a minimum amplitude. The method further comprises applying the effect to the obtained target number of samples and displaying, on the user interface, a graphical representation of the obtained target number of samples with the applied effect. The graphical representation depicts amplitude varying as a function of time, and it is displayed before an audio output file is generated with the effect applied to the audio content.
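The preview steps above can be sketched as follows. The function name `preview_effect`, the representation of an effect as a per-sample callable, and the returned dictionary are assumptions for illustration; in particular, how the target number of samples is derived from the effect is not stated in the abstract, so it is passed in directly here.

```python
def preview_effect(samples, effect, target_count):
    """Apply `effect` to the first `target_count` samples for an on-screen preview.

    Returns the amplitude range before and after the effect together with
    the processed samples, without producing any output file.
    """
    window = samples[:target_count]
    if not window:
        raise ValueError("no samples available for preview")
    # Minimum and maximum amplitude within the obtained samples.
    before = (min(window), max(window))
    processed = [effect(s) for s in window]
    after = (min(processed), max(processed))
    return {"before": before, "after": after, "samples": processed}


if __name__ == "__main__":
    # A hypothetical gain effect that halves every sample.
    result = preview_effect([0.1, -0.5, 0.8, 0.2], lambda s: s * 0.5, 3)
    print(result["before"], result["after"])
```

A user interface could use the `after` range to scale the vertical axis and plot `samples` against time.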
Server side crossfading for progressive download media
Systems and methods are provided to implement and facilitate cross-fading, interstitials and other effects/processing of two or more media elements in a personalized media delivery service. Effects or crossfade processing can occur on the broadcast, publisher or server side, yet can still be personalized to a specific user, in a manner that minimizes processing on the downstream side or client device. The cross-fade can be implemented by decoding the relevant chunks of each component clip, processing them, and then recoding and rechunking. Alternatively, the cross-fade or other effect can be applied to the chunks relevant to the effect in the compressed domain, thus avoiding any loss of quality from re-encoding. A large-scale personalized content delivery service can limit the processing to essentially the first and last chunks of any file, there being no need to process the full clip.
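To make the scaling argument concrete, the following sketch lists which chunks of a clip would need effect processing when a fade spans a given number of chunks at each end. The function name `chunks_to_process` and the `fade_chunks` parameter are hypothetical; the abstract only says that processing is limited to essentially the first and last chunks.

```python
def chunks_to_process(num_chunks, fade_chunks=1):
    """Return the indices of chunks that need effect processing.

    Only the leading and trailing `fade_chunks` chunks are selected;
    every other chunk can be delivered untouched.
    """
    if num_chunks <= 0:
        return []
    head = range(0, min(fade_chunks, num_chunks))
    tail = range(max(num_chunks - fade_chunks, 0), num_chunks)
    return sorted(set(head) | set(tail))


if __name__ == "__main__":
    print(chunks_to_process(10))
    print(chunks_to_process(10, fade_chunks=2))
```

For a clip of many chunks, the processed fraction stays small and does not grow with clip length.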
METHOD FOR SONG MULTIMEDIA SYNTHESIS, ELECTRONIC DEVICE AND STORAGE MEDIUM
The disclosure provides a method for synthesizing a song multimedia, an electronic device and a storage medium. Material obtaining modes are provided based on a song multimedia synthesis request. User audios provided by a user are obtained based on a selected material obtaining mode. A user timbre output by a timbre extraction model is obtained by inputting the user audios into the timbre extraction model. Lyrics to be synthesized and a tune to be synthesized provided by the user are obtained based on the selected material obtaining mode, and a synthesized song multimedia is obtained by inputting the user timbre, the lyrics to be synthesized and the tune to be synthesized into a song synthesis model.
Efficient Combined Harmonic Transposition
The present document relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), and to digital effect processors, e.g. so-called exciters, where generation of harmonic distortion adds brightness to the processed signal. In particular, a system configured to generate a high frequency component of a signal from a low frequency component of the signal is described. The system may comprise an analysis filter bank (501) configured to provide a set of analysis subband signals from the low frequency component of the signal; wherein the set of analysis subband signals comprises at least two analysis subband signals; wherein the analysis filter bank (501) has a frequency resolution of Δf. The system further comprises a nonlinear processing unit (502) configured to determine a set of synthesis subband signals from the set of analysis subband signals using a transposition order P; wherein the set of synthesis subband signals comprises a portion of the set of analysis subband signals phase shifted by an amount derived from the transposition order P; and a synthesis filter bank (504) configured to generate the high frequency component of the signal from the set of synthesis subband signals; wherein the synthesis filter bank (504) has a frequency resolution of FΔf; with F being a resolution factor, with F≥1; wherein the transposition order P is different from the resolution factor F.
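The phase-shifting step can be illustrated on a single complex subband sample. The sketch assumes the common phase-multiplication form of harmonic transposition, in which the magnitude is kept and the phase is multiplied by the transposition order P; the abstract only says the phase shift is "derived from" P, so this exact rule and the function name `transpose_subband` are assumptions. The filter banks themselves and the resolution factor F are not modeled.

```python
import cmath


def transpose_subband(sample, order):
    """Keep the magnitude of a complex subband sample and multiply its phase by `order`."""
    magnitude, phase = cmath.polar(sample)
    return cmath.rect(magnitude, order * phase)


if __name__ == "__main__":
    # A complex sinusoid advancing 0.1 rad per sample, transposed with P = 3,
    # advances 0.3 rad per sample, i.e. three times the frequency.
    source = [cmath.exp(1j * 0.1 * n) for n in range(6)]
    transposed = [transpose_subband(s, 3) for s in source]
    print(transposed)
```

Multiplying the phase of a sinusoid by P multiplies its frequency by P, which is how a low-frequency component can yield high-frequency content.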
Musical sound generation method, musical sound generation device, and recording medium
A musical sound generation device includes a control device and a sound generation circuit. When a performance operator among a plurality of performance operators has been operated for a part that has been set to sound a predetermined number of simulated voices of an analog synthesizer, the control device assigns to the sound generation circuit a sounding parameter of one or more voices, which form the timbre of the simulated voice of the analog synthesizer corresponding to the operated performance operator and are selected from a plurality of sounding voices. The control device also assigns to the sound generation circuit an information set selected from a plurality of information sets, each of which includes a variation value that applies a variation to the sounding parameter of the one or more voices. The sound generation circuit performs a sounding process of the one or more voices using the sounding parameter and the information set.