IMPROVEMENTS TO AUDIO PITCH PROCESSING
20220343883 · 2022-10-27
Inventors
Cpc classification
G10H2210/311
PHYSICS
G10H3/187
PHYSICS
International classification
Abstract
Disclosed are a method and device for processing an audio signal, in which a pitch processed signal 21 is mixed 33 with a high pass filtered 30 version of the input signal. This produces improvements in the latency and quality of the pitch processed signal, particularly for live performance.
Claims
1.-11. (canceled)
12. A method for processing an input audio signal during live performance, using a DSP device comprising at least the steps of: (a) splitting the input audio signal into first and second copies; (b) inputting the first copy to the DSP device, to produce a pitch processed signal, (c) in parallel, inputting the second copy to a high pass filter to produce a high pass filter signal; (d) mixing the pitch processed signal and the high pass filter signal to produce an output signal.
13. A method according to claim 12, wherein the pitch processing signal and the high pass filter signal are not temporally aligned.
14. A method according to claim 13, wherein the latency of the pitch processed signal relative to the high pass filter signal is less than about 25 ms.
15. A method according to claim 12, wherein the high pass filter has a cut off frequency, and the bandwidth of the pitch processed signal has an upper bound of about the cut off frequency.
16. A method according to claim 12, wherein the two copies are produced as outputs from an analog to digital converter, and the high pass filter is implemented digitally.
17. A method according to claim 16, wherein the high pass filter is implemented within the DSP device.
18. A pitch processing device, comprising a DSP device, a splitter, a high pass filter and a mixer, the splitter being adapted to receive an input audio signal, and to operatively generate a first and a second split signal in parallel, the first split signal being connected to the DSP device and the second split signal being connected to the high pass filter, and the DSP device output and the high pass filter output being connected to the mixer, wherein operatively the DSP device output is a pitch processed signal, and the DSP device output and the high pass filtered signal are mixed by the mixer, so as to produce an output signal.
19. A device according to claim 18, wherein the wherein the pitch processed signal and the high pass filtered signal are not temporally aligned.
20. A device according to claim 19, wherein the high pass filter has a cut off frequency, and the bandwidth of the pitch processed signal has an upper bound of about the cut off frequency.
21. A device according to claim 18, wherein the high pass filter and mixer are digital, the output signal from the DSP is digital, and wherein device further comprises a digital to analog converter connected to the mixer so as to produce an analog output signal.
22. A device according to claim 21, wherein the high pass filter and mixing are performed with the DSP device.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0017] Implementations of the present invention will now be described with reference to the accompanying figures, in which:
[0018]
[0019]
[0020]
[0021]
DETAILED DESCRIPTION OF THE INVENTION
[0022] The present invention will now be described with reference to specific examples and implementations. It will be understood that these are illustrative and not limitative of the scope of the present invention. In particular, the signal processing and mixing processes described could be performed in any suitable analog or digital system, or mix thereof. The specific arrangements described are only examples of many possible implementations, as will be apparent to those skilled in the art.
[0023] The present invention may be implemented in conjunction with any suitable pitch DSP, for example the DSP in a Whammy V by Digitech®), the PitchFork® by ElectroHarmonix or the Morpheus Dive Bomber®. The present invention does not require changes to the operation of the pitch change device itself, but rather adds additional components. Of course, the invention could be implemented in a device, or using yet to be developed alternative devices. It will be appreciated that in principle the pitch DSP may be only a component of a processor or system of processors.
[0024] Pitch is not an absolute entity derived from physical stimulus—it is a perceptual attribute (akin to ‘colour’), it is a psychoacoustic phenomenon. Human perception of pitch has been the subject of hypothesis and conjecture for centuries, with no completely definitive or fully agreed explanation of its many mechanisms.
[0025] Some aspects are now generally agreed. It is believed that the perception of any particular pitch in a complex signal is strongly influenced by the balance of the ‘partials’—the fundamental frequency versus the harmonics that are present. Even here, there is room for confusion (e.g. ‘the missing fundamental effect’, August Seeback circa 1840)—and the issue becomes even more complex when discussing stringed instruments. The vibration of a string is more correctly viewed as multiple individual vibrations, where the first harmonic above the fundamental is often not an exact multiple of the fundamental frequency (e.g. a string with a fundamental frequency of 200 Hz can have a 1st harmonic at 401 Hz).
[0026] The principle which underlies the present invention is that pitch of the upper harmonics of an audio signal (e.g. from a stringed instrument like a guitar) is poorly discerned—humans have no physiological or cognitive mechanism to discern pitch above certain frequencies. It is generally agreed that there is no mechanism for pitch discrimination above 5000 Hz, but this is the theoretical boundary. In practice, empirical testing by the inventor shows this boundary to be significantly lower, especially in live performance.
[0027] This may be due to numerous perceptual factors including: inattention (listening to music is usually a social function, not a technical one); volume (pitch interpretation is confused by volume—e.g. louder music is often interpreted to have higher pitch); and distraction/masking (the presence of multiple complex signals from numerous instruments greatly lowers the ability to interpret any single pitch accurately).
[0028] The present invention accordingly seeks to exploit the imprecision of human pitch determination at higher frequencies to improve the perceived performance of pitch DSPs.
[0029] In overview, in one implementation the signal is processed as follows, and as illustrated schematically in
[0034] The HPF preferably has a moderately steep ‘slope’ (how quickly the frequencies are rolled off below the cut-off frequency, e.g. 24 dB per octave). It will be understood that the HPF output will be an analog signal composed only of harmonics above (say) 2.5 Khz, but at the original pitch of those harmonics present at the input.
[0035] The cut off frequency selected may vary depending upon the nature of the signal. For a guitar, a cut-off frequency of 2.5 kHz is more than twice the frequency of the fundamental of the highest possible note that can be played ('E′ at the 24th fret, the very top of the neck). In practice, most players will only play notes with a fundamental three times lower than a cut-off of 2.5 kHz ('A′ at the 17th fret).
[0036]
[0037]
[0038] It will be understood that the HPF signal is added to a pitch shifted signal with which it is not temporally aligned—the analog HPF output signal has transients and harmonics that are not delayed by analog processing, whereas the pitch shifted signal is delayed by DSP processing latency. In practice, it is not necessary for the harmonics and transients to be perfectly aligned to a precise relationship with the pitch shifted signal because human perception is tolerant of these discrepancies provided they are within reasonable bounds. Beneficially, the pitch shifted signal now only needs to encompass a more limited bandwidth, corresponding for example to the cut off frequency selected for the HPF. In practice, these two bandwidth roll-offs (DSP processor and HPF) need not be precisely aligned as, once again, perception does not require high levels of precision.
[0039] Perceptually, three beneficial effects are derived from suitable implementations of the present invention. First, the bandwidth of the resultant composite signal is perceptually a much better match to the input signal (the final output signal has the band-limited, pitch-shifted signal plus the non-pitch-shifted upper harmonics of the input signal). This is highly significant to players who want to maintain their original ‘tone’.
[0040] Second, the HPF signal component includes the initial transients (from striking the strings). This is beneficial, as a common problem which is encountered when pitch is shifted a significant amount by DSP processing is phase and transient smearing, leading to the processed signal sounding dull and blurred.
[0041] Third, because there is no latency in the HPF signal added to the composite output—as it is derived with analog circuitry, not computer processing—the players' perception of latency is greatly diminished. The analog circuits used in the implementation described are all common-place designs well known to those in the audio electronic arts and may be implemented with simple op-amp technology using standard I.Cs, for example the TL074, OPA2134.
[0042] In practice, several factors contribute to the overall effectiveness of any successful implemention of the present invention.
[0043] There is no perception of pitch discrepancy between the two component signals because human cognition and auditory pathways are unable to discriminate between them.
[0044] The perceived bandwidth of the composite output sounds entirely ‘natural’ because it is derived from the actual harmonics of the input signal—not a manufactured or ‘tweaked’ recreation. As a result, the fidelity of reproduction is perceived as a close match to the original signal.
[0045] Matching the HPF cut-off characteristics (slope & frequency) to complement the band-limiting characteristics (slope & frequency) in the DSP process ensures a seamless transition between the two components of the output signal. In practice, it may be preferred to have these roll-off characteristics overlapping or separated to better match the pitch transform required and/or the musical/tonal preference of the player.
[0046] The nominal latency period of contemporary pitch DSPs is, of commercial necessity, below acceptable limits (i.e. <20 mS). This does not imply it is not a deterrent to use—it is merely ‘good enough’ as a compromise. However, this latency period is short enough that the initial transient (supplied by the HPF filter) seamlessly blends with the (delayed) pitch-processed signal to form a perceptually contiguous signal. This is highly desirable in performance as it provides the illusion of near-zero latency to the player.
[0047] It will be appreciated that this implementation is a basic implementation, and that numerous alternatives and additions could be used to exploit the underlying inventive concept.
[0048] For example, selectable cut-off frequencies in the HPF could be provided to match the ‘tone’ profile required by the type or style of music, for example acoustic players will likely want the full extended bandwidth on offer while ‘heavy metal’ players may prefer a restricted band width (e.g. less extreme high frequencies) as they commonly employ high-gain amplifiers and fuzz boxes. These do not reproduce extreme high frequencies in a desirable way.
[0049] To this end, the HPF could be substituted by a band-pass filter (BPF), which limits the lower and upper frequency of the harmonics allowed to pass through. This allows additional control of the very high frequencies, discriminating against them. This can be desirable in certain music styles (e.g. ‘heavy metal’ as discussed above)
[0050] The ratio of mixing the original harmonics with the re-pitched signal can be dynamically varied with some advantage in operation. For example, a signal that is re-pitched significantly lower (e.g. one octave) may benefit from a higher level (e.g. 2:1) of original harmonics to account for the inescapable harmonic roll-off caused by the re-pitching algorithm and the normal phase/transient smearing the processing causes.
[0051] The cut-off frequency of the HPF or BPF, can be dynamically varied with some advantage in operation. For example, a signal that is re-pitched significantly lower (e.g. one octave) may benefit from a lower HPF cut-off frequency, to match with the lower pitched harmonic content produced by the re-pitching algorithm.
[0052] None of these alterations will affect the apparent latency improvement already described.
[0053] In an alternative implementation, the HPF process can be readily implemented in the digital domain. HPF code is readily implemented, as will be apparent to those skilled in the art. It can operate with very low latency (˜1-2 mS) in a simple and inexpensive processor. However, the digital HPF filtering is preferably in the pitch DSP itself. In this way, one A/D converter can feed both processes and the results of both processes can be proportionally mixed (in digital) within the DSP.
[0054] As a result, analog HPF components are eliminated, as is the need for a separate final analog mixing stage.
[0055] A digital implementation allows for discretionary time-alignment of the ‘non-pitched’ HPF signal versus the ‘re-pitched’ DSP signal by adding a defined delay to the HPF signal. Again, the delay code is simple to implement and can be done with a cheap CPU or preferably within the pitch DSP.
[0056]
[0057] A first signal is processed by the pitch processor 52 to produce the desired pitch shifted signal. A second signal is processed by the HPF process 53 in the DSP, and then processed though a delay process 54, Both signals are mixed digitally 55, to produce a digital output signal for digital to analog converter 56. The desired analog output 57 is then generated.