Audio processing compression system using level-dependent channels
09736583 · 2017-08-15
Assignee
Inventors
Cpc classification
H04R1/10
ELECTRICITY
International classification
Abstract
Disclosed herein, among other things, are methods and apparatus for a level-dependent compression system for hearing assistance devices, such as hearing aids. The present subject matter includes a hearing assistance device having a buffer for receiving time domain input signals and a frequency analysis module to convert time domain input signals into a plurality of subband signals. A power detector is adapted to receive the subband signals and to provide a subband version of the input signals. A nonlinear gain stage applies gain to the plurality of subband versions of the input signals, and a frequency synthesis module processes subband signals from the nonlinear gain stage and to create a processed output signal. The device also includes a filter for filtering the signals, and a level-dependent compression module. The level-dependent compression module is adapted to provide bandwidth control to the plurality of subband signals produced by the frequency analysis stage.
Claims
1. A method of operating a hearing assistance device, the method comprising: receiving time-domain input signals; converting received time domain input signals into a plurality of subband signals; processing the plurality of subband signals to: apply nonlinear gain to the plurality of subband signals; and adjust width of subbands based on detected power as a function of frequency using a level-dependent compression module adapted to provide bandwidth control to the plurality of subband signals; and converting the processed plurality of subband signals to time domain signals.
2. The method of claim 1, wherein the level-dependent compression module includes level-dependent analysis channels to control a compressive-gain signal as a function of frequency.
3. The method of claim 2, wherein the level-dependent analysis channels include channels with level-dependent bandwidths.
4. The method of claim 1, wherein power from bands of a static bandwidth are weighted and summed according to signal level.
5. The method of claim 1, wherein the level-dependent compression module includes uniformly scaled analysis filterbanks.
6. The method of claim 1, wherein the level-dependent compression module includes non-uniformly scaled analysis filterbanks.
7. The method of claim 1, wherein the level-dependent compression module is adapted for compression of audio signals.
8. The method of claim 7, wherein the audio signals include speech.
9. The method of claim 7, wherein the audio signals include music.
10. A method of operating a hearing assistance device, the method comprising: converting time domain input signals into a plurality of subband signals; processing the plurality of subband signals, including: applying nonlinear gain to the plurality of subband versions of the input signals; and adding a weighted power of a subband signal to at least one other weighted subband signal in an adjacent subband using a level-dependent compression module to provide a final instantaneous-power estimate, and converting the processed plurality of subband signals to time domain signals; and filtering the time domain signals using a filter.
11. The method of claim 10, wherein the level-dependent compression module is adapted to provide a final instantaneous power estimate for power integration.
12. The method of claim 10, wherein the level-dependent compression module is adapted to provide a final instantaneous power estimate for nonlinear gain.
13. The method of claim 10, wherein the level-dependent compression module is adapted to provide a final instantaneous power estimate for frequency synthesis.
14. The method of claim 10, wherein the level-dependent compression module is adapted to provide a final instantaneous power estimate for time-domain filtering.
15. The method of claim 10, wherein the filter includes a finite impulse response (FIR) filter.
16. The method of claim 10, wherein the weighted power is determined using weights as a function of target bandwidth.
17. The method of claim 16, wherein the weights are symmetrically distributed across adjacent bands.
18. The method of claim 16, wherein the weights are asymmetrically distributed across adjacent bands.
19. The method of claim 10, wherein the level-dependent compression module includes an unbranched architecture.
20. The method of claim 10, wherein the level-dependent compression module includes a side-branched architecture.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
DETAILED DESCRIPTION
(6) The following detailed description of the present subject matter refers to subject matter in the accompanying drawings which show, by way of illustration, specific aspects and embodiments in which the present subject matter may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice the present subject matter. References to “an”, “one”, or “various” embodiments in this disclosure are not necessarily to the same embodiment, and such references contemplate more than one embodiment. The following detailed description is demonstrative and not to be taken in a limiting sense. The scope of the present subject matter is defined by the appended claims, along with the full scope of legal equivalents to which such claims are entitled.
(7) The present subject matter includes method and apparatus for a level-dependent compression system for audio processing and hearing assistance devices, such as audio limiters, audio compressors, and hearing aids. The following examples will be provided for a hearing aid, which is only one type of hearing assistance device. It is understood however, that the disclosure is not limited to hearing aids and that the teachings provided herein can be applied to a variety of audio processing and hearing assistance devices.
(8) The present invention relates to a signal compression system and method, particularly suitable for compression of audio signals such as speech and music. In various embodiments, the present subject matter provides the use of level-dependent analysis channels to control the compressive-gain signal as a function of frequency. In various embodiments, the present level-dependent analysis channels are channels with level-dependent bandwidths. In various embodiments, powers from bands of a static bandwidth are weighted and summed according to signal level to operate on an effectively broader frequency range than a single analysis band. In various applications, the level-dependent bandwidths are a function of signal level to provide compression as a function of frequency and signal level.
(9) The present subject matter applies to compression systems using both uniformly and non-uniformly scaled analysis filterbanks. In addition, the present subject matter applies to compression systems using both unbranched and side-branched architectures.
(10) In various embodiments, this system provides an improved solution for the trade-off dilemma between preserved spectral contrast and applying frequency-specific gain compared to prior systems. The present subject matter is useful in a variety of applications involving compression of signals generally.
(11) Approaches Using Tunable Bands
(12)
(13) The bandwidth-power function should be continuous, but does not need to be monotonous. Possible choices include, but are not limited to, sigmoid curves, piecewise linear, exponential or power-law functions. In various embodiments with feedback a maximum change in bandwidth with power, i.e., the maximum absolute slope of the bandwidth-power function, is limited such that, for a white-noise input, the change in bandwidth corresponding to a 1-dB change in power results in an additional change of within-channel power of less than 1 dB. This ensures that the feedback loop is stable and converging in time. It is understood that other bandwidth-power functions may be used without departing from the scope of the present subject matter.
(14)
(15)
(16) The bandwidth-power function should be continuous, but does not need to be monotonous. Possible choices include, but are not limited to, sigmoid curves, piecewise linear, exponential or power-law functions. It is understood that other bandwidth-power functions may be used without departing from the scope of the present subject matter.
(17)
(18) Approaches Using Weighted Static Bands
(19) Alternatively, the frequency-analysis stage 202 can remain static as in
(20) Since the level-dependence of the bandwidths is realized through power summation, it is most convenient to measure the channel bandwidths in terms of equivalent rectangular bandwidths. If the bands in 202 have equal maximum passband transmission, the ERB of compression-channel n will be the weighted sum of the ERBs of the individual bands contributing to that channel, with weights w.sub.n,k. The target bandwidth b.sub.n for channel n is given by the bandwidth-power function B.sub.n, which should be continuous, but does not need to be monotonous. It is understood that other bandwidth-power functions may be used without departing from the scope of the present subject matter. There are two possible choices for the input received by the bandwidth-power function. The bandwidth can be chosen to depend on the channel power: b.sub.n=B.sub.n({tilde over (P)}.sub.n), or, alternatively, to depend on the band power: b.sub.n=B.sub.n(P.sub.n). The former results in feedback bandwidth control while the latter results in a feed-forward bandwidth control.
(21) In
(22) Another embodiment of the present subject matter includes a compression system which employs two parallel filterbank paths, one filterbank with narrow and one with broad channels, and then either weights and sums their corresponding power estimates with level-dependent weights or calculates two non-linear gain signals based on the power estimates from the two filterbanks and then weights and sums these gain signals with level-dependent weights. At low sound levels, for example, the gain is predominantly determined by the filterbank with narrow channels, while the gain at high sound levels is determined by the filterbank with broad channels.
(23) Further Considerations
(24) Compression speeds and bandwidth-power functions of the compression channels are chosen according to the objectives of the compression system. For example, the compression speed should mirror the rate of the information-carrying power fluctuations in the signal to be compressed, which can differ for speech and music. The present subject matter is not limited to the use of a particular compression speed or bandwidth-power function. However, various embodiments of the present subject matter include one or more of fast-acting compression (resolving phonemic level variations of speech) and/or channels widening with increasing level when the system is employed to compensate for hearing impairment. In various embodiments, time constants on the order of tens of milliseconds are employed to perform the fast-acting compression.
(25) If the level-dependent compression channels are widened sufficiently with increasing level, the proposed level-dependent system will preserve spectral contrast for high-level portions of sound such as vowels and vowel-consonant transitions in speech which are coded in terms of spectral-pattern cues. Furthermore, this system will prevent distortion of short-term spectral changes in high-level sounds such as frequency glides or formant transitions in speech and music. Since the compression channels will be narrow at low input levels, the system can provide adequate gain to low-level signals such as consonants in speech surrounded by spectral interferers. Furthermore, narrow channels at low levels will prevent objectionable modulation of steady background sounds by foreground sounds. If the system is sufficiently fast-acting, it can restore audibility of weak sounds rapidly following intense sounds such as weak consonants following intense vowels. It can also restore audibility in complex situations where multiple talkers are speaking at different levels. Hence, this system increases the potential for listening in both spectral and temporal dips, and taking into account the preservation of spectral contrast at high levels, it combines the advantages of both single-channel and multi-channel compression without suffering from their respective disadvantages.
(26) It should be noted that an asymmetric widening of the compression channels towards the high-frequency side with increasing level can compensate specifically for increased upward spread of masking which is often observed in hearing-impaired listeners. High-frequency sound components falling into a given compression channel will reduce the gain applied to sound components at lower frequencies and thus reduce upward spread of masking.
(27) In addition, the proposed system can normalize loudness perception in hearing-impaired listeners to a larger extent than prior systems. Normal-hearing listeners show a differential growth of loudness for narrowband and wideband sounds, due to the level-dependent bandwidth of auditory filters. For wideband stimuli at low levels, remote frequency components are compressed independently, since they fall into narrow, independent auditory filters. At higher levels, filters are broader and remote frequency components will be compressed jointly, even for wideband stimuli. As a consequence, differences in loudness between narrowband and wideband sounds decrease with increasing level. Since hearing-impaired listeners show broadened and more static auditory filters than normal-hearing listeners, they do not show the same differential growth of loudness. However, compression using channels which widen with increasing level can restore differential loudness growth for aided hearing-impaired listeners. The normalization of loudness perception may improve perceived sound quality as well as performance on involved auditory tasks such as speech perception in complex environments.
(28) The combination of level-dependent channels and fast-acting compression also bears advantages in audio limiting and output compression limiting: If the instantaneous power in a given compression channel is high, the channel will be widened and thus, power summation across frequency is accounted for by this channel. This allows for a higher limiting threshold level (the level at which compression limiting is activated) and for a smaller clipping margin (the difference between the maximum allowed band output level and broadband saturation level), resulting in improved perceived sound quality.
(29) The present subject matter is demonstrated for hearing aids. It is understood however, that the disclosure is not limited to hearing aids and that the teachings provided herein can be applied to a variety of audio processing and hearing assistance devices, including but not limited to, behind-the-ear (BTE), in-the-ear (ITE), in-the-canal (ITC), receiver-in-canal (RIC), or completely-in-the-canal (CIC) type hearing aids. It is understood that behind-the-ear type hearing aids may include devices that reside substantially behind the ear or over the ear. Such devices may include hearing aids with receivers associated with the electronics portion of the behind-the-ear device, or hearing aids of the type having receivers in the ear canal of the user, including but not limited to receiver-in-canal (RIC) or receiver-in-the-ear (RITE) designs. The present subject matter can also be used in hearing assistance devices generally, such as cochlear implant type hearing devices and such as deep insertion devices having a transducer, such as a receiver or microphone, whether custom fitted, standard, open fitted or occlusive fitted. It is understood that other hearing assistance devices not expressly stated herein may be used in conjunction with the present subject matter.
(30) This application is intended to cover adaptations or variations of the present subject matter. It is to be understood that the above description is intended to be illustrative, and not restrictive. The scope of the present subject matter should be determined with reference to the appended claims, along with the full scope of legal equivalents to which such claims are entitled.