Sound processing method

10587238 ยท 2020-03-10

Assignee

Inventors

Cpc classification

International classification

Abstract

The invention is related to sound processing methods, especially to digital sound processing methods. In an embodiment of the invention, the method includes at least the following steps: calculating the magnitude spectrum of a segment of the sound signal, processing of the calculated magnitude spectrum to produce an outline of the spectrum indicating the peaks of the calculated magnitude spectrum, processing said outline to adjust said peaks and surrounding frequency areas of the spectrum, applying a Hilbert transform to said processed outline for forming the complex spectrum of a minimum phase impulse response, and convolving said minimum phase impulse response with said segment of the sound signal for forming a processed output signal segment.

Claims

1. A method for filtering an audio signal using a minimum phase response filter that adapts to the audio signal thereby performing a desired filtering of signal features that exist in the signal, said method comprising processing the sound signal in segments, and wherein at least the following computer implemented steps are performed: cutting said sound signal into overlapping segments, and performing at least the following computer implemented steps for each said segment of said sound signal: calculating the magnitude spectrum of said segment of the sound signal, processing of the calculated magnitude spectrum to produce an outline of the spectrum indicating the peaks of the calculated magnitude spectrum, said step of processing of the calculated magnitude spectrum further including the step of applying high pass filtering in order to remove an overall shape of the spectrum and retain the peaks of the spectrum, processing said outline to adjust said peaks and surrounding frequency areas of the spectrum, executing a Hilbert transform to said processed outline to form a complex spectrum of a minimum phase impulse response, and convolving said minimum phase impulse response with said segment of the sound signal for forming a processed output signal segment.

2. The method according to claim 1, wherein said step of processing said outline to adjust said peaks of the spectrum further includes a step of applying lowpass filtering to smooth the retained peaks.

3. The method according to claim 2, wherein said lowpass filtering is performed using a sweeping filter.

4. The method according to claim 2, wherein said step of processing said outline to adjust said peaks of the spectrum further comprises applying a bin dependent weighting function to said outline.

5. The method according to claim 1, wherein said step of processing said outline to adjust said peaks of the spectrum further comprises the steps of scaling the outline and inverting the outline.

6. A non-transitory computer-readable medium having stored thereon computer readable instructions for carrying out a method for processing a sound signal, in which method the sound signal is processed in segments, wherein said computer readable instructions comprise instructions to cause a processor of a device, when executed by said processor to perform at least the following steps: cutting said sound signal into overlapping segments, and performing at least the following steps for each said segment of said sound signal: calculating a magnitude spectrum of said segment of the sound signal, processing of the calculated magnitude spectrum to produce an outline of the spectrum indicating the peaks of the calculated magnitude spectrum, said step of processing of the calculated magnitude spectrum including the step of applying high pass filtering in order to remove an overall shape of the spectrum and retain the peaks of the spectrum, processing said outline to adjust said peaks and surrounding frequency areas of the spectrum, applying a Hilbert transform to said processed outline for forming a complex spectrum of a minimum phase impulse response, and convolving said minimum phase impulse response with said segment of the sound signal for forming a processed output signal segment.

7. The non-transitory computer-readable medium according to claim 6, wherein said step of processing said outline to adjust said peaks of the spectrum includes the step of applying lowpass filtering to smooth the retained peaks.

8. The non-transitory computer-readable medium according to claim 7, wherein said lowpass filtering is performed using a sweeping filter.

9. The non-transitory computer-readable medium according to claim 6, wherein said step of processing said outline to adjust said peaks of the spectrum further comprises applying a bin dependent weighting function to said outline.

10. The non-transitory computer-readable medium according to claim 6, wherein said step of processing said outline to adjust said peaks of the spectrum further comprises the steps of scaling the outline, and inverting the outline.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

(1) Various embodiments of the invention will be described in detail below, by way of example only, with reference to the accompanying drawings, of which

(2) FIGS. 1A, 1B, 1C, 1D, and 1E illustrate how the various processing steps of the method effect the spectrum and/or the outline of the spectrum method according to an embodiment of the invention, and

(3) FIG. 2 illustrates the steps of a method according to an embodiment of the invention.

DETAILED DESCRIPTION OF SOME EMBODIMENTS

(4) The following embodiments are exemplary. Although the specification may refer to an, one, or some embodiment(s), this does not necessarily mean that each such reference is to the same embodiment(s), or that the feature only applies to a single embodiment. Single features of different embodiments may be combined to provide further embodiments.

(5) In the following, features of the invention will be described with a simple example of a signal processing method with which various embodiments of the invention may be implemented. Only elements relevant for illustrating the embodiments are described in detail. Details that are generally known to a person skilled in the art may not be specifically described herein.

(6) In the following, an embodiment of the invention is described with reference to FIGS. 1A to 1E. First, the input signal is cut into segments i.e. windowed with overlap, for example using a Hann window or similar. Then, for each segment (window) magnitude spectrum is calculated using a Fourier transform. FIG. 1A illustrates such a calculated magnitude spectrum. The resulting spectrum is then filtered using a highpass filter in order to remove the overall shape of the spectrum and leave behind only the peaks, as illustrated in FIG. 1B. Next, lowpass filtering is applied to the spectrum in order to smooth the peaks, as illustrated in FIG. 1C.

(7) This has the advantage of reducing ringing of the desired filter response. In a further embodiment of the invention, the lowpass filtering is performed using a sweeping filter, i.e. a filter where the cutoff frequency changes for every bin of the spectrum. The spectrum can advantageously be further adjusted by applying per-bin weighting to compensate for lower bins typically having more energy in musical applications, and to emphasise or reduce peak reduction in desired areas. For example, a suitable per-bin weighting function is 1/sqrt(k), where k is the bin number. Other similar weighting functions can also be used in various embodiments of the invention.

(8) The processed spectrum can be processed further for example by linear or non-linear scaling in order to expand or limit the height of the outline of the spectrum. Further, the processed spectrum can also be inverted to produce a magnitude spectrum outline indicating frequencies to be enhanced or suppressed. FIG. 1D shows the result of scaling, adding a constant, and inverting of the outline of FIG. 1C.

(9) Finally, Hilbert transform is applied to the resulting magnitude spectrum outline to form a minimum phase response. The minimum phase response is then convolved with the original signal segment (window) in order to produce a result signal segment. The convolution of the said segment of sound and formed minimum phase impulse response can be done as a multiplication of the spectrums, or for example in the time domain.

(10) FIG. 1E illustrates the effect of the processing, showing the magnitude spectrum of the original unprocessed sound signal segment as a dotted line, and the magnitude spectrum of the final resulting sound signal segment as a continuous line. As can be seen from FIG. 1F, the spectrum peaks of the original sound signals have been attenuated, while the rest of the spectrum is nearly unchanged.

(11) FIG. 2 illustrates further embodiments of the invention. This embodiment concerns processing of a sound signal, in which method the sound signal is processed in segments. In this embodiment at least the following steps are performed for a segment of the sound signal: calculating 210 the magnitude spectrum of said segment of the sound signal, processing 220 of the calculated magnitude spectrum to produce an outline of the spectrum indicating the peaks of the calculated magnitude spectrum, processing 230 said outline to adjust said peaks and surrounding frequency areas of the spectrum, applying 240 a Hilbert transform to said processed outline for forming the complex spectrum of a minimum phase impulse response, and convolving 250 said minimum phase impulse response with said segment of the sound signal for forming a processed output signal segment.

(12) In a further embodiment of the invention said step of processing 220 of the calculated magnitude spectrum comprises the step of applying highpass filtering 222 in order to remove the overall shape of the spectrum and retain the peaks of the spectrum.

(13) In a further embodiment of the invention said step of processing 230 said outline to adjust said peaks of the spectrum comprises the step of applying lowpass filtering 232 to smooth the retained peaks.

(14) In a further embodiment of the invention, said lowpass filtering is performed using a sweeping filter.

(15) In a further embodiment of the invention said step of processing 230 said outline to adjust said peaks of the spectrum further comprises applying 234 a bin dependent weighting function to said outline.

(16) In a further embodiment of the invention said step of processing 230 said outline to adjust said peaks of the spectrum further comprises the steps of scaling 235 the outline, and inverting 236 the outline. The scaling may be linear or non-linear scaling, such as for example exponential scaling with the scaling exponent being less than 1.

(17) In further embodiments of the invention said step of processing 230 said outline to adjust said peaks of the spectrum further comprises the steps of adding 237 a constant to the outline and/or clipping 238 the outline to a specified range.

(18) The order of steps 220 to 238 in FIG. 2 is only an example. In different embodiments of the invention the order of these steps may be different.

(19) In a further embodiment of the invention, the inventive functionality is provided in a non-transitory computer-readable medium having stored thereon computer-readable instructions, wherein executing the instructions by a computing device causes the computing device to perform the inventive functionality described in this specification.

(20) According to a further group of embodiments of the invention, a non-transitory computer-readable medium is provided. The non-transitory computer-readable medium has stored thereon computer readable instructions for carrying out a method for processing a sound signal, in which method the sound signal is processed in segments, wherein said computer readable instructions cause a processor of a device when executed by said processor to perform at least the following steps for a segment of the sound signal: calculating the magnitude spectrum of said segment of the sound signal, processing of the calculated magnitude spectrum to produce an outline of the spectrum indicating the peaks of the calculated magnitude spectrum, processing said outline to adjust said peaks and surrounding frequency areas of the spectrum applying a Hilbert transform to said processed outline for forming the complex spectrum of a minimum phase impulse response, and convolving said minimum phase impulse response with said segment of the sound signal for forming a processed output signal segment.

(21) In a further embodiment of this group of embodiments, said step of processing of the calculated magnitude spectrum comprises the step of applying highpass filtering in order to remove the overall shape of the spectrum and retain the peaks of the spectrum.

(22) In a further embodiment of this group of embodiments, said step of processing said outline to adjust said peaks of the spectrum comprises the step of applying lowpass filtering to smooth the retained peaks.

(23) In a further embodiment of this group of embodiments, said lowpass filtering is performed using a sweeping filter.

(24) In a further embodiment of this group of embodiments, said step of processing said outline to adjust said peaks of the spectrum further comprises applying a bin dependent weighting function to said outline.

(25) In a further embodiment of this group of embodiments, said step of processing said outline to adjust said peaks of the spectrum further comprises the steps of scaling the outline, and inverting the outline.

(26) Different embodiments of the invention can be implemented as a software program product to be executed by a general purpose computing device, or by a digital signal processor (DSP) unit. The software program product can be in the form of an application software system that is downloadable from a server system; an application software system that is recorded on a computer-readable medium; as a software programming library for inclusion in other software product; or for example as a plugin software module for use by other software applications.

(27) The invention has many benefits.

(28) The invention allows control and dampening of strong spectral spikes, and in a dynamic fashion, matching the dampening to the incoming program material i.e. sound signal. The invention can advantageously be used for sound processing in music studios, sound post-processing, and sound reproduction systems.

(29) The invention also has the advantage that due to the dynamically adjusting nature of processing, various embodiments of the invention do not apply processing when no peaks are present in the incoming audio signal, so that e.g. noise comes through relatively unprocessed as compared to processing using conventional static or dynamic equalizers or multiband compressors.

(30) Further, in embodiments implementing non-linear scaling of the spectrum outline results in compressed dynamic response where increasing peak intensity in the input signal does not result in as much reduction as there is increase, and inversely, small input signal peaks are reduced more easily even when large peaks are present in the input signal.

(31) A further benefit is that the resulting minimum phase filter response forces filtering artefacts down-stream, eliminating pre-ringing that conventional sharp linear phase equalizer cuts would have as a result.

(32) A further benefit is that the filtering method can be performed on-the-fly, whereby the method can be used in studio and live environments.

(33) A further benefit is that the method allows for flexible parametrisation with small number of parameters, such as the highpass and lowpass cutoff frequencies. Also, per-frequency bin weighting of the spectrum outline allows limiting the processing to just the frequency areas where needed.

(34) The embodiments of the invention in which non-linear scaling of the spectrum outline in order to attenuate spectrum peaks result in higher peaks being attenuated more strongly than lower peaks, which fits equal loudness contours of the human hearing. This results in a naturally clear, pleasing filtering result for the listener.

(35) Further, the forming of the spectrum outline is phase agnostic, i.e. phase information of the original spectrum can be discarded, because the necessary information is recreated with the Hilbert transform. This provides a flexible framework allowing arbitrary manipulation of the frequency balance.

(36) In view of the foregoing description it will be evident to a person skilled in the art that various modifications may be made within the scope of the invention. While a preferred embodiment of the invention has been described in detail, it should be apparent that many modifications and variations thereto are possible, all of which fall within the true spirit and scope of the invention.

(37) It is to be understood that the embodiments of the invention disclosed are not limited to the particular structures, process steps, or materials disclosed herein, but are extended to equivalents thereof as would be recognized by those ordinarily skilled in the relevant arts. It should also be understood that terminology employed herein is used for the purpose of describing particular embodiments only and is not intended to be limiting.

(38) Reference throughout this specification to one embodiment or an embodiment means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, appearances of the phrases in one embodiment or in an embodiment in various places throughout this specification are not necessarily all referring to the same embodiment.

(39) As used herein, a plurality of items, structural elements, compositional elements, and/or materials may be presented in a common list for convenience. However, these lists should be construed as though each member of the list is individually identified as a separate and unique member. Thus, no individual member of such list should be construed as a de facto equivalent of any other member of the same list solely based on their presentation in a common group without indications to the contrary. In addition, various embodiments and example of the present invention may be referred to herein along with alternatives for the various components thereof. It is understood that such embodiments, examples, and alternatives are not to be construed as de facto equivalents of one another, but are to be considered as separate and autonomous representations of the present invention.

(40) Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided, such as examples of lengths, widths, shapes, etc., to provide a thorough understanding of embodiments of the invention. One skilled in the relevant art will recognize, however, that the invention can be practiced without one or more of the specific details, or with other methods, components, materials, etc. In other instances, well-known structures, materials, or operations are not shown or described in detail to avoid obscuring aspects of the invention.

(41) While the foregoing examples are illustrative of the principles of the present invention in one or more particular applications, it will be apparent to those of ordinary skill in the art that numerous modifications in form, usage and details of implementation can be made without the exercise of inventive faculty, and without departing from the principles and concepts of the invention. Accordingly, it is not intended that the invention be limited, except as by the claims set forth below.