TRANSITION FUNCTIONS OF DECOMPOSED SIGNALS
20220269476 · 2022-08-25
Assignee
Inventors
Cpc classification
G10H2210/056
PHYSICS
H04S2400/15
ELECTRICITY
G10H2210/081
PHYSICS
G06F3/04886
PHYSICS
G06F3/165
PHYSICS
G10H2210/125
PHYSICS
G10H2220/106
PHYSICS
G10H2240/325
PHYSICS
G10H2220/101
PHYSICS
H04R5/04
ELECTRICITY
H04R2420/01
ELECTRICITY
G10H2210/241
PHYSICS
G10H2230/015
PHYSICS
H04S2400/13
ELECTRICITY
H04R2227/005
ELECTRICITY
G06F3/04847
PHYSICS
G10H2250/641
PHYSICS
H04S2420/07
ELECTRICITY
G10H2250/035
PHYSICS
G10H2250/311
PHYSICS
H04R2430/01
ELECTRICITY
International classification
G11B27/10
PHYSICS
Abstract
A device including: first and second input units providing first and second input signals of first and second audio tracks, a decomposition unit to decompose the first input audio signal to obtain decomposed signals, a playback unit to start playback of a first output signal obtained from recombining at least first and second decomposed signals at first and second volume levels, respectively, and a transition unit for performing a transition between playback of the first output signal and playback of a second output signal obtained from the second input signal. The transition unit is adapted for reducing the first/second volume levels according to first/second transition functions. The device includes an analyzing unit to analyze an audio signal to determine a song part junction between two song parts. The transition time interval of at least one of the transition functions is set such as to include the song part junction.
Claims
1. A method for processing audio signals, comprising: providing a first input signal of a first input audio track and a second input signal of a second input audio track; decomposing the first input signal to obtain a plurality of decomposed signals, comprising at least a first decomposed signal and a second decomposed signal different from the first decomposed signal; assigning a first volume level to the first decomposed signal and a second volume level to the second decomposed signal; starting playback of a first output signal obtained from recombining at least the first decomposed signal at the first volume level with the second decomposed signal at the second volume level, such that the first output signal substantially equals the first input signal; while playing the first output signal, reducing the first volume level according to a first transition function and reducing the second volume level according to a second transition function different from said first transition function, wherein each of the transition functions assigns a predetermined volume level or a predetermined change in volume level to each of a plurality of controller positions within a controller range of a user operated controller defined between a controller first end position and a controller second end position; and starting playback of a second output signal obtained from the second input signal after starting playback of the first output signal but before volume levels of all decomposed signals of the first input signal have reached substantially zero.
2. The method of claim 1, wherein each of the transition functions further assigns the predetermined volume level or the predetermined change in volume level to each of a plurality of time frames within a transition time interval defined between a transition start time (T1) and a transition end time (T3).
3. The method of claim 2, wherein the first transition function and the second transition function are defined such that the volume level is at a maximum at one or more of the transition start time (T1) or the controller first end position, and such that the volume level is at a minimum at one or more of the transition end time (T3) or the controller second end position.
4. The method of claim 2, wherein the third transition function and the fourth transition function are defined such that the volume level is at a minimum at one or more of the transition start time (T1) or the controller first end position, and such that the volume level is at a maximum at one or more of the transition end time (T3) or the controller second end position.
5. The method of claim 1, wherein at least one of the transition functions is a linear function or contains a linear portion.
6. The method of claim 1, wherein at least one of the transition functions is one or more of a continuous function or a monotonic function.
7. The method of claim 1, wherein the first transition function and the second transition function differ from each other with regard to slope and/or wherein the third transition function and the fourth transition function differ from each other with regard to slope.
8. The method of claim 1, further comprising analyzing an audio signal comprising at least one of the decomposed signals to determine a song part junction between two song parts within the first input audio track or within the second input audio track, wherein a transition time interval of at least one of the transition functions is set such as to include the song part junction.
9. The method of claim 1, further comprising: receiving a user input referring to a transition command, including at least one transition parameter; and setting at least one of the transition functions according to the transition parameter.
10. The method of claim 9, wherein the transition parameter is a transition start time (T1) of a transition time interval of at least one of the transition functions.
11. The method of claim 9, wherein the transition parameter is a transition end time (T3) of a transition time interval of at least one of the transition functions.
12. The method of claim 9, wherein the transition parameter is a length (T3-T1) of a transition time interval of at least one of the transition functions.
13. The method of claim 9, wherein the transition parameter is a transition reference time (T2) within the transition time interval of at least one of the transition functions.
14. The method of claim 9, wherein the transition parameter is a slope, shape or offset of at least one of the transition functions.
15. The method of claim 9, wherein the transition parameter is an assignment or deassignment of a preset transition function to or from a selected one of the plurality of decomposed signals.
16. A device for processing audio signals, comprising: a first input unit providing a first input signal of a first input audio track and a second input unit providing a second input signal of a second input audio track; a decomposition unit configured to decompose the first input audio signal to obtain a plurality of decomposed signals, comprising at least a first decomposed signal and a second decomposed signal different from the first decomposed signal; a playback unit configured to start playback of a first output signal obtained from recombining at least the first decomposed signal at a first volume level with the second decomposed signal at a second volume level, such that the first output signal substantially equals the first input signal; a transition unit for performing a transition between playback of the first output signal and playback of a second output signal obtained from the second input signal, wherein the transition unit has a volume control section adapted for reducing the first volume level according to a first transition function and reducing the second volume level according to a second transition function different from said first transition function; and an analyzing unit configured to analyze an audio signal to determine a song part junction between two song parts within the first input audio track or within the second input audio track, wherein a transition time interval of at least one of the transition functions is set such as to include the song part junction.
17. The device of claim 16, further comprising a user interface configured to accept a user input referring to a transition command, including at least one transition parameter, wherein the transition unit is configured to set at least one of the transition functions according to the transition parameter.
18. The device of claim 17, wherein the transition parameter is preferably selected from one or more of: a transition start time (T1) of a transition time interval of at least one of the transition functions, a transition end time (T3) of a transition time interval of at least one of the transition functions, a length of a transition time interval of at least one of the transition functions, a transition reference time (T2) within the transition time interval of at least one of the transition functions, a slope, shape or offset of at least one of the transition functions, or an assignment or deassignment of a preset transition function to or from a selected one of the plurality of decomposed tracks.
19. The device of claim 16, further comprising a display unit configured to display a graphical representation of one or more of the first input audio track or the second input audio track, wherein the user interface is configured to receive at least one transition parameter through a selection or marker applied by the user in relation to the graphical representation.
20. The device of claim 16, wherein the device includes a display unit configured to display a graphical representation of at least one of the decomposed signals, wherein the user interface is configured to allow a user to assign or deassign a preset transition function to or from a selected one of the plurality of decomposed tracks.
Description
[0067] Preferred embodiments of the present invention will be described in the following on the basis of the attached drawings, wherein
[0068]
[0069]
[0070]
[0071] A device 10 according to an embodiment of the present invention may be formed by a computer such as a tablet computer or a smartphone, which comprises standard hardware components such as input/output ports, wireless connectivity, a housing, a touchscreen, an internal storage as well as a plurality of microprocessors, RAM and ROM. Essential features of the present invention are implemented in device 10 by means of a suitable software application or a software plugin running on device 10.
[0072] The display of device 10 preferably has a first section 12a associated to a first song A and a second section 12b associated to a second song B. First section 12a includes a first waveform display region 14a which displays at least one graphical representation of song A, in particular one or more waveform signals associated to song A. For example, the first waveform display region 14a may display a waveform of song A and/or one or more waveforms of decomposed signals obtained from decomposing song A. For example, decomposition of song A may be carried out to obtain a decomposed drum signal, a decomposed vocal signal and a decomposed harmonic signal, which may be displayed within the first waveform display region 14a. Likewise, a second waveform display region 14b may be included in the second section 12b such as to display a graphical representation related to song B in the same or corresponding manner as described above for song A. Thus, the second waveform display region 14b may display one or more waveforms of song B and/or at least one waveform of a decomposed signal obtained from song B.
[0073] Furthermore, first and second waveform display regions 14a, 14b may each display a play-head 16a, 16b, respectively, which show a current playback position within song A and song B, respectively.
[0074] Each of the first and second sections 12a and 12b may further include a number of control elements for controlling playback, effects and other features related to song A and song B, respectively. For example, the first section 12a may include a play button 18a which can be pushed by a user to alternatively start and stop playback of song A (more precisely audio signals obtained from Song A, such as decomposed signals). Likewise, the second section 12b may include a play button 18b which may be pushed by a user to alternatively start and stop playback of song B (more precisely audio signals obtained from Song B, such as decomposed signals).
[0075] An output signal generated by device 10 in accordance with the settings of device 10 and with a control input received from a user may be output at an output port 20 in digital or analog format, such as to be transmitted to a further audio processing unit or directly to a PA system, speakers or head phones. Alternatively, the output signal may be output through internal speakers of device 10.
[0076] According to the present invention, device 10 can perform a smooth transition from playback of song A to playback of song B by virtue of a transition unit, which will be explained in more detail below. In the present embodiment, device 10 may comprise a transition button 22 displayed on the display of device 10, which may be pushed by a user to initiate a transition from playback of song A towards playback of song B. By a single operation of transition button 22 (pushing the button 22), device 10 starts changing individual volumes of individual decomposed signals of songs A and B according to respective transition functions such as to smoothly cross-fade from song A to song B within a predetermined transition time interval.
[0077] In addition or alternatively, device 10 may include a transition controller 24 which can be moved by a user between one controller end point referring to a playback of only song A and a second controller end point referring to playback of only song B. This allows controlling the volumes of individual decomposed signals of songs A and B using transition functions, which are based not on time but on controller position of the transition controller 24. In this manner, in particular the speed and progress of the transition can manually be controlled through the transition controller 24.
[0078]
[0079] Audio processing is based on a first input track and a second input track, which may be stored within the device 10, for example in an internal memory of the device, a hard drive or any other storage medium. First and second input tracks are preferably digital audio files of a standard compressed or uncompressed audio file format such as mp3, WAV, AIFF or the like. Alternatively, first and second input tracks may be received as continuous streams, for example via an Internet connection of device 10 or from an external playback device via an input audio interface or via a microphone.
[0080] First and second input tracks are preferably processed within first and second input units 26a and 26b, respectively, which may be configured to decrypt or decompress the audio data, if necessary, and/or may be configured to extract a segment of the first input track and a segment of the second input track in order to continue processing based on the segments. This has an advantage that time-consuming processing algorithms, such as the decomposition based on a neural network, will not have to analyze the entire first or second input track upfront, but will perform processing based on shorter segments, which allows continuing processing and eventually start playback at an earlier point in time. In addition, in case of receiving the first and second input tracks as continuous streams, it would in many cases not be feasible to wait until the complete input tracks are received before starting to process the data.
[0081] The output of the first and second input units 26a, 26b, for example the segments of the first and second input tracks, form first and second input signals, and they are input into first and second AI systems 28a, 28b of a decomposition unit 40. Each AI system 28a, 28b includes a neural network trained to decompose the first and second input signals, respectively, with respect to sound components of different timbres. Decomposition unit 40 thus decomposes the first input signal to obtain a first group of decomposed signals and decomposes the second input signal to obtain a second group of decomposed signals. In the present example, each group of decomposed signals includes a decomposed drum signal, a decomposed vocal signal and a decomposed harmonic signal, which each form a complete set of decomposed signals or a complete decomposition, which means that a sum of all decomposed signals of the first group will resemble the first input signal, and the sum of all decomposed signals of the second group will resemble the second input signal.
[0082] It should be noted that although in the present embodiment two AI systems 28a, 28b are used, decomposition unit 40 may also include only one AI system and only one neural network, which is trained and configured to determine all decomposed signals of the first input signal as well as all decomposed signals of the second input signal. As a further alternative, more than two AI systems may be used, for example a separate AI system and a separate neural network may be used to generate each of the decomposed signals.
[0083] All decomposed signals, in particular both groups of decomposed signals, are then input into a playback unit 42 in order to generate an output signal for playback. Playback unit 42 comprises a transition unit 44, which is basically adapted to recombine the decomposed signals of both groups taking into account specific volume levels associated to each of the decomposed signals. Transition unit 44 is configured to recombine the decomposed signals in such a manner as to either play only a first output signal obtained from a sum of all decomposed signals of the first input signal, or a second output signal obtained from a sum of all decomposed signals of the second input signal, or any transition in between the first and the second output signals where decomposed signals of both first and second input signals are played. In particular, transition unit 44 stores individual transition functions DA, VA, HA, DB, VB, HB for each of the decomposed signals which each define a specific volume level for each time frame within a transition interval or for each controller position of the transfer controller within a controller range. Taking into account the respective volume levels according to the respective transition functions DA, VA, HA, DB, VB, HB, all decomposed signals will then be recombined to obtain the output signal.
[0084] Playback unit 42 may further include a control unit 45, which is adapted to control at least one or the transition functions DA, VA, HA, DB, VB, HB based on a user input.
[0085] The output signal generated by playback unit 42 may then be routed to an output audio interface 46 for a sound output. At any location within the signal flow, one or more sound effects may be inserted into the audio signal by means of one or more effect chains 48. In the present example, effect chain 48 is located between playback unit 42 and output audio interface 46.
[0086]
[0087] As shown in
[0088]
[0089] As can be seen in
[0090] As can be seen in
[0091] It should be noted that although the transition functions shown in
[0092] With reference again to
[0093] In a further embodiment, device 10 may include means for determining characteristic song parts of songs A and/or B, such as a verse, a chorus, a bridge, an intro or an outro. A user may then choose to carry out a transition at a junction between two song parts, or device 10 may automatically carry out a transition at certain song part junctions and towards certain song part junctions of the other song, for example a transition from the beginning of an outro section of song A to an end of an intro section of song B.