System for modelling acoustic transfer functions and reproducing three-dimensional sound

11218832 · 2022-01-04

Assignee

Inventors

Cpc classification

International classification

Abstract

Systems and methods are disclosed for modelling of individual acoustic transfer functions relative to the audition of an individual in three-dimensional space. A method is provided for modelling sets of acoustic transfer functions specific to an individual according to a multiplicity of directions in space, where a set of acoustic transfer functions specific to the individual in a given direction is determined depending on the result of a statistical analysis of a plurality of distinct stimuli emitted in the direction of the individual. A stimulus can be dependent on at least one set of predetermined acoustic transfer functions that are associated with the given direction, and on responses received from the individual to each emitted stimulus.

Claims

1. A method for modelling individual-specific sets of acoustic transfer functions specific to an individual according to more than two directions in space for use in reproducing three-dimensional sound, the method comprising: determining an individual-specific set of acoustic transfer functions that are specific to an individual in a given direction of the more than two directions, the determining comprising calculating the individual-specific set of acoustic transfer functions using a statistical analysis of: a plurality of distinct stimuli emitted for the individual in a given direction, a stimulus of the plurality of distinct stimuli dependent upon at least one first set of predetermined acoustic transfer functions that are associated with the given direction, a first set of predetermined acoustic transfer functions comprising at least two predetermined acoustic transfer functions, and a plurality of responses received from the individual to the plurality of emitted stimuli, each response giving information related to the emission direction perceived by the individual.

2. The modelling method of claim 1, additionally comprising carrying out the statistical analysis by direction in space of the plurality of distinct stimuli and of the received responses for the given direction of the more than two directions in space.

3. The modelling method of claim 1, additionally comprising, for the given direction of the more than two directions in space: emitting, for the individual from the given direction, the plurality of distinct stimuli depending on at least one set of predetermined acoustic transfer functions that are associated with the given direction; and receiving a response of the individual to each emitted stimulus.

4. The modelling method of claim 1, additionally comprising generating, for the given direction, the plurality of distinct stimuli depending on at least one set of predetermined acoustic transfer functions that are associated with the given direction.

5. The modelling method of claim 1, wherein a stimulus results from the addition of noise to a set of average acoustic transfer functions that are associated with the given direction, the average acoustic transfer functions calculated depending on sets of acoustic transfer functions, and wherein the acoustic transfer functions are recorded in a database of acoustic transfer functions and associated with the given direction.

6. The modelling method of claim 1, additionally comprising calculating a set of average acoustic transfer functions that are associated with the given direction depending on a plurality of sets of acoustic transfer functions, wherein the acoustic transfer functions are recorded in a database of acoustic transfer functions and associated with the given direction, and wherein the stimuli are dependent on the set of calculated average acoustic transfer functions.

7. The modelling method of claim 1, wherein the statistical analysis uses a psychophysical technique of reverse correlation.

8. A non-transitory computer-readable medium comprising instructions, which when executed by a processor, cause the processor to perform the modelling method of claim 1.

9. A modeller of sets of individual-specific acoustic transfer functions specific to an individual according to more than two directions in space for use in reproducing three-dimensional sound, the modeller configured to generate individual-specific sets of acoustic transfer functions specific to an individual in a given direction of the more than two directions, the generation comprising calculating the individual-specific set of acoustic transfer functions using a statistical analysis of: a plurality of distinct stimuli emitted for the individual from the given direction, a stimulus of the plurality of distinct stimuli dependent on at least one first set of predetermined acoustic transfer functions that are associated with the given directions first set of predetermined acoustic transfer functions comprising at least two predetermined acoustic transfer functions, and a plurality of responses received from the individual to the plurality of emitted stimuli, each response giving information related to the emission direction perceived by the individual.

10. The modeller of claim 9, wherein the modeller is configured to statistically analyze the emitted stimuli and the received responses by the given direction of the at least two directions.

11. The modeller of claim 9, wherein the modeller includes: an emitter configured to emit the plurality of distinct stimuli for the individual from the given direction; and a receiver configured to receive the responses of the individual to each emitted stimulus.

12. A three-dimensional sound card including: a modeller of sets of individual-specific acoustic transfer functions specific to an individual according to more than two directions in space, the modeller configured to generate at least one set of individual-specific acoustic transfer functions specific to the individual in a given direction of the more than two directions, the generation comprising calculating the individual-specific set of acoustic transfer functions using a statistical analysis of: a plurality of distinct stimuli emitted for the individual from the given direction, a stimulus of the plurality of distinct stimuli dependent on at least one first set of predetermined acoustic transfer functions that are associated with the given direction, a first set of predetermined acoustic transfer functions comprising at least two predetermined acoustic transfer functions, and a plurality of responses received from the individual to the plurality of emitted stimuli, each response giving information related to the emission direction perceived by the individual; and a set of parallel audio outputs configured to allow a plurality of loudspeakers to be simultaneously connected to the sound card and configured to each simultaneously deliver an audio signal to be reproduced to a loudspeaker connected to the audio output, the audio signal including, during a modelling phase, the stimulus corresponding to the loudspeaker and, during a reproducing phase, the signal to be reproduced modified by the function corresponding to the loudspeaker audio output of the set of individual-specific acoustic transfer functions that are calculated for the individual using the sound card.

13. A system for reproducing three-dimensional sound, including: a modeller of sets of individual-specific acoustic transfer functions specific to an individual according to more than two directions in space, the modeller configured to generate at least one set of individual-specific acoustic transfer functions specific to the individual in a given direction of the more than two directions, the generation comprising calculating the individual-specific set of acoustic transfer functions using a statistical analysis of: a plurality of distinct stimuli emitted for the individual from the given direction, a stimulus of the plurality of distinct stimuli dependent on at least one first set of predetermined acoustic transfer functions that are associated with the given direction, a first set of predetermined acoustic transfer functions comprising at least two predetermined acoustic transfer functions, and a plurality of responses received from the individual to the plurality of emitted stimuli, each response giving information related to the emission direction perceived by the individual, and a set of loudspeakers, each loudspeaker of the set of loudspeakers able to reproduce an audio signal, the audio signal including, during a modelling phase, the stimulus corresponding to the loudspeaker and, during a reproducing phase, a signal to be reproduced modified by the function corresponding to the loudspeaker of the set of individual-specific acoustic transfer functions that are calculated for the individual using the sound card.

14. The system of claim 13, the system additionally including headphones in which two loudspeakers of the set of loudspeakers are placed such that each of the two loudspeakers is placed on one of the two ears of the individual when the headphones are placed on the head of the individual, wherein the set of acoustic transfer functions is a corresponding pair of transfer functions.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

(1) The features and advantages of the embodiments described herein will become more clearly apparent on reading the description, which is given by way of example, and the figures referred to thereby, which show:

(2) FIG. 1, a simplified schematic of a method for modelling a set of individual acoustic transfer functions.

(3) FIG. 2, a simplified schematic of a modeller of a set of individual acoustic transfer functions.

(4) FIG. 3, a simplified schematic of a system for reproducing three-dimensional sound.

DETAILED DESCRIPTION

(5) FIG. 1 illustrates a simplified schematic of a method for modelling a set of individual acoustic transfer functions. The method for modelling sets of acoustic transfer functions TFI_MD is specific to an individual according to a multiplicity of directions in space. This modelling method TFI_MD includes determining TFI_DT a set of acoustic transfer functions (tf.sub.1,di.sup.U, . . . tf.sub.N,di.sup.U) specific to an individual U in a given direction di of the multiplicity of directions depending on the result r.sub.di.sup.U of a statistical analysis of a plurality of distinct stimuli {(s.sub.1,di.sup.j, . . . s.sub.N,di.sup.j))}.sup.j emitted in the direction of the individual U, and on responses {s.sub.j,di.sup.U}, received from the individual U to each emitted stimulus. A stimulus is dependent on at least one set of predetermined acoustic transfer functions that are associated with the given direction.

(6) By direction in space associated with an acoustic transfer function what is in particular meant is a direction, relative to the user, in which a virtual source is created by means of the modelling.

(7) In particular, the modelling method TFI_MD includes a statistical analysis ST_NLZ by direction di in space of the emitted stimuli (s.sub.1,di . . . s.sub.N,di) and of the received responses a.sub.di.sup.U.

(8) In particular, the modelling method TFI_MD includes the following steps, which are carried out for the given direction di of the multiplicity of directions in space: a plurality of distinct stimuli depending on at least one set of predetermined acoustic transfer functions that are associated with the given direction di are emitted S_TR in the direction of an individual U; a response of the individual U to each emitted stimulus is received A_REC.

(9) In particular, for the given direction di, a plurality of stimuli are generated S_GN depending on at least one set of predetermined acoustic transfer functions that are associated with the given direction.

(10) In particular, a stimulus s.sub.1,di.sup.j . . . s.sub.N,di.sup.j results from the addition+of noise n.sub.j to a set of average acoustic transfer functions avg{tf.sub.1,di.sup.k}.sup.k . . . avg{tf.sub.N,di.sup.k}.sup.k that are associated with the given direction di and that are calculated depending on sets of acoustic transfer functions that are recorded in a database tf_bdd of acoustic transfer functions and that are associated with the given direction.

(11) The addition of noise to generate the stimuli allows the variation space to be explored without a priori hypotheses as to the properties of the spectral profile (of the set of individual acoustic transfer functions) that are responsible for the localization in a given direction (for example the frontal direction).

(12) In particular, the modelling method TFI_MD includes the following steps in which a set of average acoustic transfer functions that are associated with the given direction is calculated AVG depending on a plurality of sets of acoustic transfer functions that are recorded in a database of acoustic transfer functions and that are associated with the given direction; the stimuli are dependent on the set of calculated average acoustic transfer functions.

(13) By set of average acoustic transfer functions what is meant is one average acoustic transfer function per reproduction channel, in particular in the case of binaural synthesis: an average acoustic transfer function for the right ear and an average acoustic transfer function for the left ear of the user U.

(14) In particular, the statistical analysis ST_NLZ uses the psychophysical technique of reverse correlation. It is based on the high-level observation of perceptive processes and employs a testing phase during which the modelling method TFI_MD subjects the individual to a set of stimuli that are obtained by adding noise to a neutral stimulus (for example an average of acoustic transfer functions) and observes the responses of the individual U to these various stimuli. By analysing the statistical relationships between the stimuli and the responses, the modelling method TFI_MD identifies TFI_DT the perceptive filters, in the present case the individual acoustic transfer functions, associated with the studied perceptive process, i.e. the properties of the stimuli that define a given perceptive response.

(15) Thus, the modelling method is based on perception to identify the acoustic transfer functions specific to an individual.

(16) The modelling of frontal sound sources (direction of 0° azimuth and 0° elevation) is particularly critical. The use of generic binaural filters in such modelling engenders a spatialization of sound sources that is often disappointing: the listener tends to locate the source above, or even inside his head.

(17) Using the modelling method TFI_MD, a pair of neutral binaural filters (i.e. a set of what are called neutral acoustic transfer functions) is calculated by averaging AVG a plurality of sets of acoustic transfer functions HRTF, which functions are measured in the frontal direction for a large selection of individuals forming a sample group (said functions optionally being pre-recorded in a database of sets of acoustic transfer functions tf_bdd).

(18) A set of spatialized stimuli synthesized S_GN with binaural filters obtained by adding+noise n.sub.j to the pair of neutral filters is played S_TR for the intention of the listener, i.e. of the individual U for whom the modelling method TFI_MD determines the set of personalized acoustic transfer functions. The addition of noise affects the spectral profile.

(19) For each emitted stimulus, the listener U indicates whether he perceives it to be correctly spatialized (i.e. in the direction di that the modelling TFI_MD is attempting to reproduce, in the present case the frontal direction and outside his head) or not. This indication of the listener U forms the response a received A_REC during the modelling TFI_MD.

(20) The analysis ST_NLZ of the statistical relationships between the stimuli and the responses of the author make it possible to determine TFI_DT the spectral profile suited to the listener U and guaranteeing the correct reproduction of sounds in the modeled direction di, in the present case the frontal direction.

(21) This modelling method TFI_MD may be applied to any other direction.

(22) One particular embodiment of the modelling method is a program comprising program-code instructions for executing the steps of the modelling method when said program is executed by a processor.

(23) FIG. 2 shows a simplified schematic of a model of a set of individual acoustic transfer functions.

(24) The modeller 100 of sets of acoustic transfer functions specific to an individual according to a multiplicity of directions in space specific to an individual according to a multiplicity of directions in space, includes a generator 1004 of sets of acoustic transfer functions specific to an individual in a given direction of the multiplicity of directions on the basis of the result of a statistical analysis of a plurality of distinct stimuli emitted in the direction of the individual, a stimulus being dependent on at least one set of predetermined acoustic transfer functions that are associated with the given direction, and of responses received from the individual to each emitted stimulus.

(25) In particular, the modeller 100 includes a statistical analyser 1003 of the emitted stimuli and of the received responses by given direction of the multiplicity of directions.

(26) In particular, the modeller 100 includes: an emitter 1001 of a plurality of distinct stimuli in the direction of an individual depending on at least one set of predetermined acoustic transfer functions that are associated with at least one given direction of the multiplicity of directions; and a receiver 1002 of the responses of the individual to each emitted stimulus.

(27) In one particular embodiment, a three-dimensional sound card 10 includes: a modeller 100 of sets of acoustic transfer functions specific to an individual according to a multiplicity of directions in space that is able to generate at least one set of acoustic transfer functions that are specific to an individual in a given direction of the multiplicity of directions on the basis of the result of a statistical analysis of a plurality of distinct stimuli emitted in the direction of the individual, a stimulus being dependent on at least one set of predetermined acoustic transfer functions that are associated with the given direction, and of responses received from the individual to each emitted stimulus, and a set 102 of parallel audio outputs that allow a plurality of loudspeakers 2.sub.1 . . . 2.sub.N to be simultaneously connected to the sound card and that each simultaneously deliver an audio signal to be reproduced to a loudspeaker connected to the audio output, the audio signal including, during a modelling phase, the stimulus corresponding to the loudspeaker and, during a reproducing phase, the signal to be reproduced modified by the function corresponding to the loudspeaker audio output of the set of acoustic transfer functions that are modelled for the individual using the sound card.

(28) In particular, the modeller 100 includes a stimulus generator 1000 that delivers, for a given direction di, a plurality (j) of sets of stimuli (s.sub.1,di.sup.j . . . s.sub.N,dij). The generator 1000 in particular adds, for each set of stimuli (s.sub.1,di.sup.j . . . s.sub.N,di.sup.j), noise n.sub.j to a given set of predetermined acoustic transfer functions (tf.sub.1,di.sup.k′ . . . tf.sub.N,di.sup.k′). The noise n.sub.3 applied to the set of predetermined acoustic transfer functions (tf.sub.1,di.sup.k′ . . . tf.sub.N,di.sup.k′) to obtain the set of stimuli (s.sub.1,di.sup.j . . . s.sub.N,di.sup.j) is distinct from the noise n.sub.j′ applied to the same set of predetermined acoustic transfer functions (tf.sub.1,di.sup.k′ . . . tf.sub.N,di.sup.k′) to obtain the set of stimuli (s.sub.1,di.sup.j′ . . . s.sub.N,di.sup.j′).

(29) The predetermined set of acoustic transfer functions that is used to generate the stimuli is in particular a set of what are called neutral acoustic transfer functions, namely it does not reflect a specific morphology. Thus, the statistical analysis is not biased by a particular morphological model and the determination of the individual acoustic transfer functions allows a better approximation of the actual acoustic transfer functions of the individual.

(30) In particular, such a what is called neutral set of acoustic transfer functions is obtained by averaging a plurality of sets of acoustic transfer functions, which functions are recorded in a database of acoustic transfer functions. For example, those sets of acoustic transfer functions which are used to calculate this what is called neutral set of acoustic transfer functions are selected randomly from the database of acoustic transfer functions or depending on one or more morphological parameters neighbouring those of the individual, or consist of all the sets of acoustic transfer functions that are recorded in the database of acoustic transfer functions.

(31) Most often a set of acoustic transfer functions is a pair of acoustic transfer functions (for example in the particular case of binaural stimulation) that is composed of the acoustic transfer function corresponding to the right ear and of the acoustic transfer function corresponding to the left ear of an individual.

(32) The emitter 1001 emits, for at least one given direction di, a plurality of sets of stimuli (s.sub.1,di.sup.j . . . s.sub.N,di.sup.j) in the direction of the individual U for whom the modeller 100 determines a set of acoustic transfer functions in a given direction di. In particular, the emitter 1001 transmits these sets of stimuli, for example via an output assembly 102 of a 3-D sound card 10 and/or of a terminal 1 including the modeller 100, to a set of loudspeakers (2.sub.1 . . . 2.sub.N) that play the stimuli to the individual U. Each stimuli s.sub.n,di.sup.j of a set of stimuli is intended for a specific loudspeaker 2.sub.n of the set of loudspeakers (2.sub.1 . . . 2.sub.N).

(33) To each set of stimuli (s.sub.1,di.sup.j . . . s.sub.N,di.sup.j), the individual U reacts by transmitting a response a in particular by means of an interface 12 of the terminal 1 (by input, by voice command, etc.). The receiver 1002 receives the response a.sub.j.sup.U to the set j of stimuli of the individual U.

(34) For a given direction di, the analyser 1003 carries out a statistical analysis on the sets of emitted stimuli (s.sub.1,di.sup.j . . . s.sub.N,di.sup.j) and the corresponding responses a.sub.j.sup.U. The generator 1004 then determines the set (tf.sub.1,di.sup.j . . . tf.sub.N,diU) of acoustic transfer functions that is specific to this individual U for the given direction di depending on the result r.sub.di.sup.U delivered by the analyser 1003.

(35) The operation is optionally repeated for one or more other distinct directions di′.

(36) Thus, the terminal 1 including a reader 11 of a sound signal as may play a 3-D sound signal in the direction of the individual U. Specifically, the terminal 1 includes a filter 101 the filtering parameters of which are formed, for a least one direction di, by the transfer-function set delivered by the modeller 100. The filter 101 then converts the monophonic sound signal as with a set of sound signals that are played to the individual U by means of the set of loudspeakers.

(37) FIG. 3 illustrates a simplified schematic of a system for reproducing three-dimensional sound.

(38) The system for reproducing three-dimensional sound includes: a modeller 100 of sets of acoustic transfer functions specific to an individual according to a multiplicity of directions in space that is able to generate at least one set of acoustic transfer functions that are specific to an individual in a given direction of the multiplicity of directions on the basis of the result of a statistical analysis of a plurality of distinct stimuli emitted in the direction of the individual, a stimulus being dependent on at least one set of predetermined acoustic transfer functions that are associated with the given direction, and of responses received from the individual to each emitted stimulus, and a set of loudspeakers {2.sub.1, 2.sub.2} that are each able to reproduce an audio signal, the audio signal including, during a modelling phase, the stimulus corresponding to the loudspeaker and, during a reproducing phase, to a signal to be reproduced modified by the function corresponding to the loudspeaker of the set of acoustic transfer functions that are modelled for the individual using the reproducing system.

(39) In particular, the reproducing system includes headphones 20 in which the two loudspeakers 2.sub.1 and 2.sub.2 of the set of loudspeakers are placed such that each of the two loudspeakers is placed on one of the two ears of the individual U when the headphones 20 are placed on his head, the set of acoustic transfer functions being a corresponding pair of transfer functions.

(40) Thus, the modelling does not require specific equipment. It may be implemented with a simple set of headphones.

(41) The embodiments described herein also relate to a medium. The data medium may be any entity or device capable of storing the program. For example, the medium may include a storing means, such as a ROM, for example a CD-ROM or a microelectronic circuit ROM or even a magnetic recording means, for example a floppy disk or a hard disk.

(42) Furthermore, the data medium may be a transmissible medium such as an optical or electrical signal that may be transmitted via an optical or electrical cable, by radio or by other means. The program may in particular be downloaded from a network, the Internet in particular.

(43) Alternatively, the data medium may be an integrated circuit in which the program is incorporated, the circuit being suitable for executing or for being used in the execution of the method in question.

(44) In another implementation, the embodiments described herein are implemented by means of software and/or hardware components. In this light, the term module may correspond either to a software component or to a hardware component. A software component corresponds to one or more computer programs, one or more sub-programs of a program, or more generally to any element of a program or of a software package able to implement a function or a set of functions according to the above description. A hardware component corresponds to any element of a hardware assembly able to implement a function or a set of functions.

(45) In the foregoing description, specific details are given to provide a thorough understanding of the examples. However, it will be understood by one of ordinary skill in the art that the examples may be practiced without these specific details. Certain features that are described separately herein can be combined in a single embodiment, and the features described with reference to a given embodiment also can be implemented in multiple embodiments separately or in any suitable subcombination.

(46) The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.