Method and apparatus for capturing and rendering an audio scene
11259101 · 2022-02-22
Assignee
Inventors
Cpc classification
H04R1/025
ELECTRICITY
H04S2400/15
ELECTRICITY
H04S7/30
ELECTRICITY
H04R1/02
ELECTRICITY
International classification
H04R1/02
ELECTRICITY
H04R1/24
ELECTRICITY
Abstract
The method of capturing an audio scene includes acquiring sounds having first and second directivities to obtain first and second acquisition signals, respectively, the first directivity being higher than the second directivity, the steps of acquiring being performed simultaneously, and both acquisition signals together representing the audio scene; separately storing the first and second acquisition signals or mixing individual channels in the acquisition signals to obtain first and second mixed signal, respectively, and separately storing the first and second mixed signals, or transmitting the first and second mixed signals or the first and second acquisition signals to a loudspeaker setup and rendering the first mixed signal or the first acquisition signal using a loudspeaker arrangement having a first directivity and simultaneously rendering the second mixed signal or the second acquisition signal using a loudspeaker arrangement having a second directivity, the second loudspeaker directivity being lower than the first one.
Claims
1. A microphone comprising: a first electret microphone portion having a first free space; a second electret microphone portion having a second free space, wherein the first electret microphone portion and the second electret microphone portion are arranged in a back-to-back arrangement at a back-to-back interface; a vent channel configured for venting the first free space and the second free space to an ambient pressure, wherein the vent channel comprises: a third vent channel portion extending at the back-to-back interface between the first microphone portion and the second microphone portion to an ambient of the microphone, the ambient having the ambient pressure; a first vent channel portion extending from the third vent channel portion into the first free space of the first electret microphone portion; and a second vent channel portion extending from the third vent channel portion into the second free space of the second electret microphone portion; first contacts configured for deriving a first electrical signal and arranged at the first microphone portion; and second contacts configured for deriving a second electrical signal and arranged at the second microphone portion.
2. The microphone of claim 1, wherein the first free space is defined by an electrode movable in response to sound energy with respect to a first opposing counter electrode, and wherein the second free space is defined by an electrode movable in response to sound energy with respect to a second opposing counter electrode.
3. The microphone of claim 1, wherein the vent channel comprises a plurality of first vent channel portions in the first electret microphone portion, the plurality of first vent channel portions comprising the first vent channel portion, and a plurality of second vent channel portions in the second electret microphone portion, the plurality of second vent channel portions comprising the second vent channel portion, wherein the plurality of first vent channel portions and the plurality of second vent channel portions are connected by the third vent channel portion so that the first free space and the second free space communicate via the third vent channel portion with the ambient pressure.
4. The microphone of claim 1, wherein the first contacts comprise a first electrode connected to a first movable electrode of the first microphone portion and a second electrode electrically connected to a first counter electrode of the first microphone portion, wherein the second contacts comprise a first electrode electrically connected to a second movable electrode of the second microphone portion and the second contact electrically connected to a second counter electrode, wherein the second electrode of the first contacts and the second electrode of the second contacts are implemented as a single contact, and wherein the first counter electrode and the second counter electrode are short-circuited.
5. The microphone of claim 1, further comprising a signal combiner for combining the first electrical signal output by the first microphone portion and the second electrical signal output by the second microphone portion.
6. The microphone of claim 1, configured so that outer dimensions are, in a length direction, less than 20 mm, in a width direction, less than 20 mm, and in a height direction, less than 10 mm.
7. The microphone of claim 1, wherein the first electret microphone portion comprises a first membrane and wherein the second electret microphone portion comprises a second membrane, wherein the vent channel comprises a plurality of first vent channel portions in the first electret microphone portion, the plurality of first vent channel portions comprising the first vent channel portion, and a plurality of second vent channel portions in the second electret microphone portion, the plurality of second vent channel portions comprising the second vent channel portion, wherein the plurality of first vent channel portions and the plurality of second vent channel portions are connected by the third vent channel portion so that the first free space and the second free space communicate via the third vent channel portion with the ambient pressure.
8. The microphone of claim 7, wherein the plurality of first vent channel portions, the plurality of second vent channel portions and the third vent channel portion are realized in such a way that the first and the second free spaces do not build up an additional counter-pressure in addition to the ambient pressure, irrespective of the movements of the first and the second membranes, so that the first free space and the second free space are always equalized to the ambient pressure.
9. Acoustic instrument comprising: a sound emitting portion; and a microphone comprising: a first electret microphone portion having a first free space and a second electret microphone portion having a second free space, wherein the first and second electret microphone portions are arranged in a back-to-back arrangement, wherein a vent channel is provided for venting the first free space and the second free space to an ambient pressure, wherein the vent channel comprises: a third vent channel portion extending at a back-to-back interface between the first microphone portion and the second microphone portion to an ambient of the microphone, the ambient having the ambient pressure; a first vent channel portion extending from the third vent channel portion into the first free space of the first electret microphone portion; and a second vent channel portion extending from the third vent channel portion into the second free space of the second electret microphone portion, and wherein first contacts for deriving a first electrical signal are arranged at the first microphone portion, and wherein second contacts for deriving a second electrical signal are arranged at the second microphone portion, wherein the microphone is attached to the sound emitting portion.
10. Acoustic instrument of claim 9, wherein the acoustic instrument is implemented as a violin having an F-hole, and wherein the microphone is attached to the F-hole.
11. Method of manufacturing a microphone, the method comprising: providing a first electret microphone portion having a first free space and a second electret microphone portion having a second free space, fixing the first and second electret microphone portions in a back-to-back arrangement to each other, wherein a vent channel is provided for venting the first free space and the second free space to an ambient pressure, wherein the vent channel comprises: a third vent channel portion extending at a back-to-back interface between the first microphone portion and the second microphone portion to an ambient of the microphone, the ambient having the ambient pressure; a first vent channel portion extending from the third vent channel portion into the first free space of the first electret microphone portion; and a second vent channel portion extending from the third vent channel portion into the second free space of the second electret microphone portion, and wherein first contacts for deriving a first electrical signal are arranged at the first microphone portion, and wherein second contacts for deriving a second electrical signal are arranged at the second microphone portion.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) Embodiments of the present invention will be detailed subsequently referring to the appended drawings, in which:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
DETAILED DESCRIPTION OF THE INVENTION
(15)
(16) In an embodiment, the step of acquiring the sound having a first directivity comprises placing microphones 100 illustrated in
(17) Furthermore, the step 202 of
(18)
(19) As indicated in
(20) The sound acquisition concept illustrated in
(21) When an orchestra is considered, it has been found that the sound energy which is emitted directly in the front direction to the listener is composed mainly of instruments having a high directivity such as trumpets or trombones and, additionally, comes from the singers or vocalists. This “high Q” sound portion is detected by microphones 100 of
(22) Instruments having a high directivity but which do not directly emit sound in the front direction such as a tuba, different horns or wings and several wood wind instruments and, additionally, instruments having a low directivity such as string instruments, percussion, gong or triangle generate a room-like or less directed sound emission. This “low Q” sound portion is detected with a microphone set placed lateral and/or above the instruments or with respect to the sound scene. If microphones having a certain directivity are used, it is advantageous that these microphones are directed into the direction of the individual sound sources such as tuba, horns, wood wind instruments, strings, percussion, gong, triangle.
(23) These individual “high Q” and “low Q” microphone signals, i.e., the first and second acquisition signals are independently recorded from each other and further processed such as mixed, stored, transmitted or in other ways manipulated. Hence, separate high and low Q mixtures can be mixed to obtain the first and second mixed signals and these mixed signals can be stored within the storage 108 or can be rendered via separate high and low Q speakers.
(24) Dual Q loudspeaker systems illustrated in
(25) Furthermore, as indicated at 115 in
(26) Advantageously, the dual Q technology is combined with the icon technology which is described in the context of
(27) Subsequently,
(28) Furthermore, instead of or in addition to placing the microphones 102 above or lateral to the sound scene and placing the microphones 100 in front of the sound scene, microphones can also be placed selectively in a corresponding proximity to the corresponding instruments.
(29) When the audio scene, for example, comprises an orchestra having a first set of instruments emitting with a higher directivity and a second set of instruments emitting sound with a lower directivity, then the step of acquiring comprises placing the first set of microphones closer to the instruments of the first set of instruments than to the instruments of the second set of instruments to obtain the first acquisition signal and placing the second set of microphones closer to the instruments of the second set of instruments, i.e., the low directivity emitting instruments, than to the first set of instruments to obtain the second acquisition signal.
(30) Depending on the implementation, the directivity as defined by a directivity factor related to a sound source is the ratio of radiated sound intensity at the remote point on the principle axis of a sound source to the average intensity of the sound transmitted through a sphere passing through the remote point and concentric with the sound source. Advantageously, the frequency is stated so that the directivity factor is obtained for individual subbands.
(31) Regarding a sound acquisition by microphones, the directivity factor is the ratio of the square of the voltage produced by sound waves arriving parallel to the principle axis of a microphone or other receiving transducer to the mean square of the voltage that would be produced if sound waves having the same frequency and mean square pressure where arriving simultaneously from all directions with random phase. Advantageously, the frequency is stated in order to have a directivity factor for each individual subband.
(32) Regarding sound emitters such as speakers, the directivity factor is the ratio of radiated sound intensity at the remote point on the principle axis of a loudspeaker or other transducer to the average intensity of the sound transmitted through a sphere passing through the remote point and concentric with the transducer. Advantageously, the frequency is given as well in this case.
(33) However, other definitions exist for the directivity factor as well which all have the same characteristic but result in different quantitative results. For example, for a sound emitter, the directivity factor is a number indicating the factor by which the radiated power would have to be increased if the directed emitter were replaced by an isotopic radiator assuming the sane field intensity for the actual sound source and the isotropic radiator.
(34) For the receiving case, i.e., for a microphone, the directivity factor is a number indicating the factor by which the input power of the receiver/microphone for the direction of maximum reception exceeds the mean power obtained by averaging the power received from all directions of reception if the field intensity at the microphone location is equal for any direction of wave incidence.
(35) The directivity factor is a quantitative characterization of the capacity of a sound source to concentrate the radiated energy in a given direction or the capacity of a microphone to select signals incident from a given direction.
(36) When the measure of the directivity factor is from 0 to 1, then the directivity factor related to the first acquisition signal is advantageously greater than 0.6 and the directivity factor related to the second acquisition is advantageously lower than 0.4. Stated differently, it is advantageous to place the two different sets of microphones so that the values of 0.6 for the first acquisition signal and 0.4 for the second acquisition signal is obtained. Naturally, it will practically not be possible to have a first acquisition signal only having directed sound and not having any omnidirectional sound. On the other hand, it will not be possible to have a second acquisition signal only having omnidirectionally emitted sound and not having directionally emitted sound. However, the microphones are manufactured and placed in such a way that the directionally emitted sound dominates the omnidirectionally emitted sound in the first microphone signal and that the omnidirectionally emitted sound dominates over the directionally emitted sound in the second acquisition signal.
(37) A method of rendering an audio scene comprises a step of providing a first acquisition signal related to sound having a first directivity or providing a first mixed signal related to sound having the first directivity. The method of rendering additionally comprises providing a second acquisition signal related to sound having a second directivity or providing a second mixed signal related to sound having a second directivity, where the first directivity is higher than the second directivity. The steps of providing can be actually implemented by receiving, in the sound rendering portion of
(38) Furthermore, the method of rendering comprises a step of generating (210, 212) a sound signal from the first acquisition signal or the first mixed signal and the step of generating a second sound signal from the second acquisition signal or the second mixed signal. For generating the first sound signal a directional speaker arrangement 118 is used, and for generating the second signal an omnidirectional speaker arrangement 120 is used. Advantageously, the directivity of the directional speaker arrangement is higher than the directivity of the omnidirectional speaker arrangement 120, although it is clear that an ideal omnidirectional emission characteristic can almost not be generated by existing loudspeaker systems, although the loudspeaker of
(39) Advantageously, the emission characteristic of the omnidirectional speakers is close to the ideal omnidirectional characteristic within a tolerance of 30%.
(40) Subsequently, reference is made to
(41) For example, brass instruments are instruments with a mainly translatory sound generation. The human voice generates a translatorial and a rotational portion of the air molecules. For the transmission of the translation, existing microphones and speakers with piston-like operating membranes and a back enclosure are available.
(42) The rotation is generated mainly by playing bow instruments, guitar, a gong or a piano due to the acoustic short-circuit of the corresponding instrument. The acoustic short-circuit is, for example, performed via the F-holes of a violin, the sound hole for the guitar or between the upper and lower surface of the sounding board at a grand or normal piano or by the front and back phase of a gong. When generating a human voice, the rotation is excited between mouth and nose. The rotation movement is typically limited to the medium sound frequencies and can be advantageously acquired by microphones having a figure of eight characteristic, since these microphones additionally have an acoustic short-circuit. The reproduction is realized by mid-frequency speakers with freely vibratable membranes without having a backside enclosure.
(43) The vibration is generated by violins or is strongly generated by xylophones, cymbals and triangles. The vibrations of the atoms within a molecule is generation up to the ultrasound region above 60 kHz and even up to 100 kHz.
(44) Although this frequency range is typically not perceivable by the human hearing mechanism, nevertheless level and frequency-dependent demodulations effects and other effects take place, which are then made perceivable, since they actually occur within the hearing range extending between 20 Hz and 20 kHz. The authentic transmission of vibration is available by extending the frequency range above the hearing limit at about 20 kHz up to more than 60 or even 100 kHz.
(45) The detection of the directional sound portion for a correct location of sound sources involves a directional microphoning and speakers with a high emission quality factor or directivity in order to only put sound to the ears of the listeners as far as possible. For the directional sound, a separate mixing is generated and reproduced via separate speakers.
(46) The detection of the room-like energy is realized by a microphone setup placed above or lateral with respect to the sound sources. For the transmission of the room-like portion, a separate mixing is generated and reproduced by speakers having a low emission quality factor (sphere emitters) in a separate manner.
(47) Subsequently, an advantageous loudspeaker is described with respect to
(48) Furthermore, the carrier 312 comprises a tip portion having a cross-sectional area which is less than 20% of a cross-sectional area of the base portion, where the speaker arrangement 314 is fixed to the tip portion. Advantageously, as illustrated in
(49) The speaker arrangement 314 has a sphere-like carrier structure 316, which is also illustrated in
(50) Advantageously, the speaker arrangement comprises at least six individual speakers and particularly even twelve individual speakers arranged in twelve different directions, where, in this embodiment, the speaker arrangement 314 comprises a pentagonal dodekaeder (e.g. body with 12 equally distributed surfaces) having twelve individual areas, wherein each individual area is provided with an individual speaker membrane. Importantly, the loudspeaker arrangement 314 does not comprise a loudspeaker enclosure and the individual speakers are held by the supporting structure 316 so that the membranes of the individual speakers are freely suspended.
(51) Furthermore, as illustrated in
(52) Alternatively, however, the loudspeaker in
(53) The enclosure furthermore comprises a further speaker 604 which is suspended at an upper portion of the enclosure and which has a freely suspended membrane. This speaker is a low/mid speaker for a low/mid frequency range between 80 and 300 Hz and advantageously between 100 and 300 Hz. This additional speaker is advantageous, since—due to the freely suspended membrane—the speaker generates rotation stimulation/energy in the low/mid frequency range. This rotation enhances the rotation generated by the speakers 314 at low/mid frequencies. This speaker 604 receives the low/mid frequency portion of the signal provided to the speakers at 314, e.g., the second acquisition signal or the second mixed signal.
(54) In an advantageous embodiment with a single subwoofer, the subwoofer is a twelve inch subwoofer in the closed longitudinal enclosure 300 and the speaker arrangement 314 is a pentagon dodekaeder medium/high speaker arrangement with freely vibratable medium frequency membranes.
(55) Additionally, a method of manufacturing a loudspeaker comprises the production and/or provision of the enclosure, the carrier portion and the speaker arrangement, where the carrier portion is placed on top of the longitudinal enclosure and the speaker arrangement with the individual speakers is placed on top of the carrier portion or alternatively the speaker arrangement without the individual speakers is placed on top of the carrier portion and then the individual speakers are mounted.
(56) Subsequently, reference is made to
(57) The microphone comprises a first electret microphone portion 801 having a first free space and a second electret portion 802 having a second free space. The first and the second microphone portions 801, 802 are arranged in a back-to-back arrangement. Furthermore, a vent channel 804 is provided for venting the first free space and/or the second free space. Furthermore, first contacts 806a, 806b for deriving an electrical signal 806c and second contacts 808a and 806b for deriving a second electrical signal 808b are arranged at the first microphone portion 801, and the second microphone portion 802, respectively. Hence,
(58)
(59) The second electret microphone portion 802 is advantageously constructed in the same manner and comprises, from bottom to top, a metallization 820, a foil 821, a spacer 822 defining a second vented free space 823. On the spacer 822 an electret foil 824 is placed and above the electret foil 824 a counter electrode 826 is placed which forms the back plate of the second microphone portion. Hence, elements 820 to 826 represent the second electret microphone portion 802 of the
(60) Advantageously, the first and the second microphone portions have a plurality of vertical vent portions 804b, 804c, as illustrated in
(61) Advantageously, the microphone in accordance with the present invention is a back-electret double-microphone with a symmetrical construction. The metalized foils 811, 821 are moved or excited by the kinetic energy of the air molecules (sound) and therefore the capacity of the capacitor consisting of the back electrode 816, 826 and the metallization 810, 820 is changed. Due to the persistent charge on the electret foils 814, 824, a voltage U.sub.1, U.sub.2 is generated due to the equation Q=C×U, which means that U is equal to Q/C. The voltage U.sub.1 is proportional to the movement of the electrode 810, 811, and the voltage U.sub.2 is proportional to the movement of the electrode 820, 821. Two individual electret microphones are arranged in a back-to-back arrangement. The vertical vent portions 804b, 804c are useful in order to avoid a back-like closure of the free spaces 813, 823. In order to maintain this functionality additionally when the microphones are arranged in the back-to-back arrangement, the horizontal vent portions 804a are provided which communicate with the vertical vent portions 804b, 804c. Hence, even in the back-to-back arrangement, a closure of the vented free spaces 813, 823 is avoided.
(62)
(63) Naturally, an actually provided signal combiner does not necessarily have to be the controllability feature. Instead, the in-phase, out-of-phase or weighted addition functionality of the combiner can be correspondingly hardwired so that each microphone has a certain output signal characteristic with the combined C output signal, but this microphone cannot be configured. However, when the controllable combiner has the switching functionality illustrated in
(64) Advantageously, the inventive electret microphone is miniaturized and only has dimensions as are set forth in
(65) Furthermore, in order to fully detect the vibration energy, the icon microphone should have an audio bandwidth of 60 kHz and advantageously up to 100 kHz. To this end, the foils 811, 821 have to be attached to the spacer in a correspondingly stiff manner. The microphone illustrated in
(66) Although some aspects have been described in the context of an apparatus, it is clear that these aspects also represent a description of the corresponding method, where a block or device corresponds to a method step or a feature of a method step. Analogously, aspects described in the context of a method step also represent a description of a corresponding block or item or feature of a corresponding apparatus.
(67) Depending on certain implementation requirements, embodiments of the invention can be implemented in hardware or in software. The implementation can be performed using a digital storage medium, for example a floppy disk, a DVD, a CD, a ROM, a PROM, an EPROM, an EEPROM or a FLASH memory, having electronically readable control signals stored thereon, which cooperate (or are capable of cooperating) with a programmable computer system such that the respective method is performed.
(68) Some embodiments according to the invention comprise a non-transitory data carrier having electronically readable control signals, which are capable of cooperating with a programmable computer system, such that one of the methods described herein is performed or having stored thereon the first or second acquisition signals or first or second mixed signals.
(69) Generally, embodiments of the present invention can be implemented as a computer program product with a program code, the program code being operative for performing one of the methods when the computer program product runs on a computer. The program code may for example be stored on a machine readable carrier.
(70) Other embodiments comprise the computer program for performing one of the methods described herein, stored on a machine readable carrier.
(71) In other words, an embodiment of the inventive method is, therefore, a computer program having a program code for performing one of the methods described herein, when the computer program runs on a computer.
(72) A further embodiment of the inventive methods is, therefore, a data carrier (or a digital storage medium, or a computer-readable medium) comprising, recorded thereon, the computer program for performing one of the methods described herein.
(73) A further embodiment of the inventive method is, therefore, a data stream or a sequence of signals representing the computer program for performing one of the methods described herein. The data stream or the sequence of signals may for example be configured to be transferred via a data communication connection, for example via the Internet.
(74) A further embodiment comprises a processing means, for example a computer, or a programmable logic device, configured to or adapted to perform one of the methods described herein.
(75) A further embodiment comprises a computer having installed thereon the computer program for performing one of the methods described herein.
(76) In some embodiments, a programmable logic device (for example a field programmable gate array) may be used to perform some or all of the functionalities of the methods described herein. In some embodiments, a field programmable gate array may cooperate with a microprocessor in order to perform one of the methods described herein. Generally, the methods are advantageously performed by any hardware apparatus.
(77) While this invention has been described in terms of several embodiments, there are alterations, permutations, and equivalents which fall within the scope of this invention. It should also be noted that there are many alternative ways of implementing the methods and compositions of the present invention. It is therefore intended that the following appended claims be interpreted as including all such alterations, permutations and equivalents as fall within the true spirit and scope of the present invention.