DEVICE FOR REDUCING NOISE DURING THE REPRODUCTION OF AN AUDIO SIGNAL USING A HEADPHONE OR HEARING AID, AND CORRESPONDING METHOD
20250316256 ยท 2025-10-09
Inventors
Cpc classification
G10K11/17881
PHYSICS
G10K2210/511
PHYSICS
G10K2210/3028
PHYSICS
G10K2210/1081
PHYSICS
G10K11/17873
PHYSICS
H04R2430/03
ELECTRICITY
International classification
Abstract
For reducing noise when playing back an audio signal with headphones or a hearing aid, at least one sensor is provided for detecting a sensor signal based on ambient sound and/or structure-borne sound. The sensor signal or the sensor signals are first fed to a preprocessing unit for preprocessing, which carries out filtering and/or summation for active noise suppression, active suppression of the occlusion effect and/or an ambient mode. With a subsequent filter bank, the sensor signal or the output signal of the preprocessing unit is divided into frequency bands by means of several filters. One or more calculation units are provided for calculating weighting factors for the individual frequency bands, which calculate the weighting factors based on a measure of the sensor signal in the respective frequency band and a measure of the noise signal of the sensor or the output signal of the preprocessing unit in silence in this frequency band. The individual frequency bands are multiplied by the corresponding calculated weighting factors using multipliers. The weighted output signals of the filter bank are summed to form an overall output signal using an adder. A compensation signal based on the overall output signal is output using an output unit.
Claims
1. Device for reducing noise when reproducing an audio signal with a near-head audio output device, in particular headphones or a hearing aid, with at least one sensor for detecting a sensor signal based on ambient sound and/or structure-borne sound; a preprocessing unit for preprocessing the sensor signal or the sensor signals, wherein the preprocessing unit carries out filtering and/or summation for active noise suppression, active suppression of the occlusion effect and/or an ambient mode; a filter bank for dividing the sensor signal or the output signal of the preprocessing unit into frequency bands using a plurality of filters; one or more calculation units for calculating weighting factors for the individual frequency bands, the weighting factors being calculated based on a measure of the signal of the sensor or the output signal of the preprocessing unit in the respective frequency band and a measure of the noise signal of the sensor or the output signal of the preprocessing unit in silence in this frequency band; multipliers, by means of which the individual frequency bands are multiplied by the associated calculated weighting factors; an adder, by means of which the weighted output signals of the filter bank are summed to form an overall output signal; and an output unit for outputting a compensation signal based on the overall output signal.
2. Device according to claim 1, wherein the weighting factors are calculated based on the estimated power or standard deviation of the sensor signal or the output signal of the preprocessing unit in the respective frequency band and the estimated power or standard deviation of the noise signal of the sensor or the output signal of the preprocessing unit in silence in this frequency band.
3. Device according to claim 1, wherein the preprocessing unit, the filters of the filter bank, the multipliers and the adder are implemented at a first sampling rate and the calculation units are implemented at a lower second sampling rate.
4. Device according to claim 3, wherein the preprocessing unit, the filters of the filter bank, the multipliers and the adder are implemented by a first processor.
5. Device according to claim 4, wherein the calculation units are implemented by a second processor separate therefrom.
6. Device according to claim 5, wherein the output signals of the respective filters of the filter bank are transmitted from the first processor to the second processor via an interface between the processors and the calculated weighting factors for the individual bands of the filter bank are transmitted from the second processor back to the first processor.
7. Device according to claim 6, wherein the rate of the output signals of the filters of the filter bank is adapted to the sampling rate on the second processor by sampling rate converters.
8. Device according to claim 5, wherein the input signal of the filter bank is transmitted from the first processor to the second processor via an interface between the processors, wherein the rate of the input signal to be transmitted is converted by a sampling rate converter and a second filter bank is simulated on the second processor for calculating the weighting factors, which realizes the filters at a corresponding lower sampling rate and wherein the calculated weighting factors for the individual bands of the filter bank are transmitted from the second processor back to the first processor.
9. Device according to claim 1, wherein the at least one sensor comprises one or more internal microphones arranged in an earpiece for detecting a sound signal in the ear canal of a user and/or external microphones for detecting a sound signal outside the ear canal and/or acceleration sensors for detecting structure-borne sound which is transmitted to the earpiece via the ear canal, and the output unit comprises at least one loudspeaker.
10. Device according to claim 1, wherein the filters of the filter bank are realized by cascaded biquadratic filters as low-pass, high-pass and/or band-pass filters.
11. Device according to claim 1, wherein it is integrated in a near-head audio output device, in particular a headphone or hearing aid.
12. Method for reducing noise when reproducing an audio signal with a near-head audio output device, in particular headphones or a hearing aid, with the steps: detecting a sensor signal based on ambient sound and/or structure-borne sound with at least one sensor arranged in the headphones or hearing aid; preprocessing the sensor signal or the sensor signals, wherein filtering and/or summation is carried out for active noise suppression, active suppression of the occlusion effect and/or an ambient mode; applying noise suppression to the sensor signal or the output signal of the preprocessing unit, wherein the signal is divided into several frequency bands by means of a filter bank, the individual frequency bands are multiplied by weighting factors for the individual frequency bands, wherein the weighting factors are calculated based on a measure of the signal of the sensor or the output signal of the preprocessing unit in the respective frequency band and a measure of the noise signal of the sensor or the output signal of the preprocessing unit in silence in this frequency band, and the weighted output signals of the filter bank are then summed to form an overall output signal; and outputting a compensation signal based on the overall output signal.
Description
[0034] Further features of the present disclosure will become apparent from the following description and claims taken in conjunction with the figures.
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
[0045] In order to better understand the principles of the present disclosure, embodiments are explained in more detail below with reference to the figures. It is understood that the disclosure is not limited to these embodiments and that the described features can also be combined or modified without departing from the scope of the disclosure as defined in the claims.
[0046]
[0047] Closing off the ear canal 12 with headphones 10 also means that structure-borne noise, such as impact noise or speech, which is emitted into the ear canal 12 through vibrating ear canal walls, can hardly escape from the ear canal 12. This manifests itself in an amplification of low frequencies of the structure-borne noise compared to an open ear canal, which in combination with the weakened perception of ambient noise is referred to as the occlusion effect. To compensate for the occlusion effect (Active Occlusion Cancellation, AOC), additional sensors in the form of the inward-facing microphone 20 and the acceleration sensor 23 on the side of the headphones facing into the ear canal can be used to record information about the structure-borne noise in the ear canal 12.
[0048] In addition to generating acoustic transparency or compensating for the occlusion effect when wearing headphones, the sensors 20, 22, 23 and processors 24 can also be used to actively dampen loud ambient noise. With active noise suppression, an acoustic compensation signal is played via the headphone's internal loudspeaker 21 based on the sensor data, which destructively interferes with the ambient sound at the eardrum 13 and thus reduces the perceived volume of the ambient sound. Furthermore, the sensors can be used for an ambient mode in which ambient sound is processed and/or amplified, for example to improve speech intelligibility or compensate for hearing loss.
[0049] As already mentioned, a problem that is solved by the disclosed method is that the sensors not only record a useful signal x(n), with the discrete time index n, but also induce noise v(n). In the context of this disclosure, the useful signal includes not only speech but also ambient noise, some of which are considered undesirable in conventional methods for noise suppression, but here are assigned to the useful signal and not to the noise. The output signal of a sensor results from the sum of the useful signal and the sensor noise to
[0050] By processing and filtering the sensor signal, the noise can be significantly amplified, which can be perceived as unpleasant when played back by the loudspeaker for an AOC or ANC.
[0051]
[0052] For each band k a time-variant weighting factor g.sub.k(n) is calculated in a calculation unit 31, which is then multiplied by the band signal .sub.k(n). At the output of the noise suppression 63, an estimate {tilde over (x)}(n) of the useful signal is then obtained, based on the weighted sum of the band signals:
[0053]
[0054] The preprocessing unit can, for example, filter the microphone signal using a filter with the impulse response w(n), which is designed for a transparency or ambient mode, an ANC or AOC. Furthermore, the preprocessing unit can, for example, filter several sensor signals with different filters and then add them up to form an output signal.
[0055] The output signal is obtained with a filter w(n) as follows:
[0056] In order to achieve the lowest possible input-to-output latency on the processor for fast filtering 34, a high sampling rate is preferably used. The input-to-output latency should advantageously be less than 1 millisecond. Since the characteristics of the useful signal generally change only slowly, the calculation of the weighting factors can be carried out at a lower sampling rate. The calculation of the weighting factors can advantageously be carried out on one or more separate processors 35. In order to avoid aliasing effects when transferring the band signals to a lower sampling rate for calculating the weighting factors, a conversion of the sampling rate by sampling rate converters 33 is optionally provided. The weighting factors, on the other hand, do not require sampling rate conversion since the weighting factors are low-frequency signals.
[0057] Instead of converting the rate of the K output signals of the filters 30, the rate of the input signal of the filter bank can alternatively be converted and a second filter bank can be simulated on the processor or processors for calculating the weighting factors 35, which the filters 30 implement at a correspondingly lower sampling rate. The factors calculated in this way are then transmitted to the processor for fast filtering 34 as previously described and applied there. Since a filter bank on the processor is still necessary for fast filtering 34, this does not reduce the complexity of the processors 34, 35, but does reduce the number of sampling rate converters 33 and communication channels between the processors 34, 35.
[0058]
[0059] Furthermore, the sum of the bandpass filters 30
should be advantageously optimized for the following objective function
so that there are no unwanted cancellations or amplifications, especially in the transition areas when the bandpass filters are connected in parallel. The lower cutoff frequency of the first bandpass filter f.sub.g,1 can be set equal to 0 Hz, commonly known as low-pass filter, and the upper cutoff frequency of the last bandpass filter f.sub.g,K+1 can be set equal to the Nyquist frequency, commonly known as high-pass filter. In order to carry out processing that mimics the human ear, it is advantageous to distribute the cutoff frequencies f.sub.g,k linearly on a psychoacoustic frequency scale, e.g. the Bark scale. Furthermore, the filters 30 can advantageously be optimized such that the group delay of the transfer function B(z) is minimized, taking into account a target curve and a maximum deviation for the magnitude response.
[0060] Advantageously, the filters 30 can be realized as a cascade of biquadratic filters.
[0061] The filter bank therefore contributes only minimally to a delay in the signal path, which means that a high-performance ANC and a transparency mode without comb filter effects and without the double perception of plosives are still possible and can be calculated efficiently. Advantageously, the magnitude and phase response of B(z) is taken into account for the design of filters for an ANC or AOC, for example in a preprocessing unit 32.
[0062] However, an implementation corresponding to a quadrature filter bank is also possible. On the one hand, this leads to a filter bank with a linear frequency resolution and possibly an additional delay due to a sampling rate conversion in the signal path, but on the other hand to a reduction in the computational complexity.
[0063] The weighting factors can be specified based on a spectral subtraction rule using estimates of the short-term power of the disturbed wanted signal
and the noise
The subtraction rule can be derived from the cost function, presented below independently of the band index k=1, 2, . . . , K
and results, assuming that the noise and the wanted signal are uncorrelated, in
[0064] This rule can cause the weighting factor to become negative, which leads to unwanted phase reflections. Therefore, it is usually assumed that 0g(n)1. A hard limitation of g(n) to this range of values can lead to undesirable temporal artifacts due to discontinuous changes. Depending on the estimated short-term power of the undisturbed wanted signal
the mathematically optimal weighting factor is defined using the cost function as
whereby the desired range of values of g(n) is maintained. Unfortunately, x(n) and thus also
are not known.
[0065] Since the power of sensor noise is usually small compared to the power of a useful signal, it can be assumed that
when a useful signal is present, which results in the calculation rule of block 53 in the calculation unit 31 according to
can be determined for each band, e.g. by measuring in silence, listening tests, taking component specifications into account, estimating in a quiet environment during runtime or using mathematical methods. The sensor and quantization noise is also commonly known as system noise or noise floor. For a measurement in silence, for example, the digitized signals of the sensors 20, 22, 23 or the output signal of the preprocessing unit 32 can be recorded, wherein the headphones 10 are activated and located in a room acoustically protected from ambient noise. Advantageously, the values {circumflex over ()}.sub.v.sub.
of the input signal .sub.k(n) can, for example, be calculated by a block 50 using an exponential smoother with a smoothing factor 0<<<1 based on
[0066] For reasons of cost and battery efficiency, integrated systems such as headphones often use processors that only have fixed-point arithmetic. The division operation in particular is numerically poorly conditioned, which is why it is desirable to reduce the dynamics of the input values of the division operation when calculating the weighting factor. This can be achieved by the calculation unit 31 shown in
[0067] As shown in
of the noise signal can be estimated based on the output signals of the filters 30 .sub.k(n). For this purpose, established algorithms such as the so-called baseline tracer can be used in block 52, which are based on the assumption that the power of the noise is quasi time-invariant and can be estimated in pauses in the useful signal. These methods can be used to estimate the system noise in the running system when a quiet environment is detected. In contrast to hearing aid applications, the goal here is not to detect ambient noise, but to detect the system noise.
[0068] Furthermore, it is possible to set the weighting factors g.sub.k(n) so that the power of the respective weighted bandpass signals does not exceed a specified threshold. This can be used to protect the hearing of users in noisy environments.
[0069]
[0070]
[0071] The device according to the disclosure can in particular be integrated into headphones, whereby such headphones can be designed in various ways. For example, they can be earphones, hearables, or so-called in-ear monitors, which are used, for example, during live performances by musicians or TV presenters to check one's own voice, or a combination of headphones and a mouth microphone for recording speech in the form of a headset. The device can also be part of a hearing aid. In addition, the device can be integrated into smart glasses, VR/AR headsets, collar loudspeakers or bone conduction headphones. Finally, parts of the device can also be part of an external device, such as a smartphone.
[0072]
[0073] In the method, in a first step 70, ambient sound and/or structure-borne sound, which originates, for example, from a voice output of the user wearing the headphones or footsteps of this user, is recorded using at least one sensor arranged in the headphones. The corresponding sensor signals are processed in a subsequent step 71 by means of a preprocessing unit, which carries out filtering and/or summation for active noise suppression, active suppression of the occlusion effect and/or an ambient mode. In the subsequent step 72, the sensor signal or the output signal of the preprocessing unit is then subjected to noise suppression. The method uses a filter bank which is integrated into the signal path between the sensors and the loudspeaker of the headphones. The filtered sensor signal is divided into several frequency bands using the filter bank.
[0074] An algorithm sets weighting factors, which are multiplied by the respective output signals of the filter bank, so that frequency bands in which there is hardly any useful signal are noticeably attenuated. In particular, the weighting factors can be calculated based on a measure of the signal from the sensor or the output signal from the preprocessing unit (32) in the respective frequency band and a measure of the noise signal from the sensor or the output signal from the preprocessing unit (32) in silence in this frequency band. The weighted output signals of the filter bank are then summed to form an overall output signal.
[0075] In the subsequent step 73, a compensation signal based on the total output signal is then fed to a loudspeaker of the headphones and output by the latter.
[0076] In the case of headphones that include sound transducers for both ears of the user, the described procedure can be carried out separately for each ear or together for the sound transducers of both ears.
REFERENCE SYMBOL LIST
[0077] 10 In-ear headphones [0078] 11 Ear insert [0079] 12 Ear canal [0080] 13 Eardrum [0081] 20 Inner microphone [0082] 21 Loudspeaker [0083] 22 External microphone [0084] 23 Acceleration sensor [0085] 24 Signal processor [0086] 30 Filter [0087] 31 Calculation unit for weighting factor [0088] 32 Preprocessing unit [0089] 33 Sampling rate converter [0090] 34 Processor for fast filtering [0091] 35 Processor for calculating the weighting factors [0092] 40 Magnitude response of a filter [0093] 50 Power estimator of the useful signal [0094] 51 Estimator of the standard deviation of the useful signal [0095] 52 Power estimator of the noise signal [0096] 53 Block with calculation rule [0097] 60 Equalizer [0098] 61 Secondary path [0099] 62 Filter [0100] 63 Noise reduction [0101] 64 Switch for the filter output signal [0102] 65 Estimation of the secondary path [0103] 70 Process step for collecting the sensor data [0104] 71 Process step for filtering the sensor data [0105] 72 Process step for applying noise reduction [0106] 73 Process step for outputting a compensation signal