H04R3/005

IMAGING DEVICE
20180007267 · 2018-01-04

An imaging device includes an optical system including a movable lens, an image capture unit configured to capture a subject image through the optical system, a sound-collecting microphone, and a control unit configured to control the optical system and the image capture unit and to receive a sound signal from the microphone. The imaging device has a first mode and a second mode as moving-image shooting modes. The control unit makes the movable lens move faster in the first mode than in the second mode when a moving image is captured. The control unit filters the sound signal with a narrower-band filter in the first mode than in the second mode.

MICROPHONE ARRAY SPEECH ENHANCEMENT
20180012616 · 2018-01-11

Speech received from a microphone array is enhanced. In one example, a noise filtering system receives audio from a plurality of microphones, determines a beamformer output from the received audio, applies a first auto-regressive moving average smoothing filter to the beamformer output, determines noise estimates from the received audio, applies a second auto-regressive moving average smoothing filter to the noise estimates, and combines the first and second smoothing filter outputs to produce a power spectral density output of the received audio with reduced noise.
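
The smoothing-and-combining step can be illustrated with a minimal sketch. The abstract does not give the filter order, the coefficients, or the rule for combining the two smoothed outputs, so everything below is an assumption: a first-order ARMA smoother with unit DC gain, applied to the per-frame power of a single frequency bin, and a floored subtraction as the combination. The names `arma_smooth` and `enhanced_psd` are hypothetical.

```python
def arma_smooth(values, a=0.8, b0=0.15, b1=0.05):
    """First-order ARMA smoother: y[n] = a*y[n-1] + b0*x[n] + b1*x[n-1].

    The coefficients are illustrative assumptions; a + b0 + b1 = 1 gives
    unit gain for a constant input.
    """
    out = []
    prev_x = None
    prev_y = None
    for x in values:
        if prev_y is None:
            y = x  # start the recursion at the first sample
        else:
            y = a * prev_y + b0 * x + b1 * prev_x
        out.append(y)
        prev_x, prev_y = x, y
    return out


def enhanced_psd(beam_psd, noise_psd, floor=0.0):
    """Combine the two smoothed tracks for one frequency bin.

    Assumption: the combination is a subtraction clamped at `floor`;
    the abstract only says the two filter outputs are combined.
    """
    smoothed_beam = arma_smooth(beam_psd)
    smoothed_noise = arma_smooth(noise_psd)
    return [max(b - n, floor) for b, n in zip(smoothed_beam, smoothed_noise)]
```

Here `beam_psd` and `noise_psd` are per-frame power values for the same bin; a full system would repeat this for every bin of the spectrum.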

Voice detection using ear-based devices

This disclosure describes techniques for detecting voice commands from a user of an ear-based device. The ear-based device may include an in-ear facing microphone to capture sound emitted in an ear of the user, and an exterior facing microphone to capture sound emitted in an exterior environment of the user. The in-ear microphone may generate an inner audio signal representing the sound emitted in the ear, and the exterior microphone may generate an outer audio signal representing sound from the exterior environment. The ear-based device may compute a ratio of a power of the inner audio signal to the outer audio signal and may compare this ratio to a threshold. If the ratio is larger than the threshold, the ear-based device may detect the voice of the user. Further, the ear-based device may set a value of the threshold based on a level of acoustic seal of the ear-based device.
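
The power-ratio test described above can be sketched as follows. This is only an illustration: the abstract does not say how power is measured or how the seal level maps to a threshold, so the mean-square power estimate, the linear mapping in `threshold_for_seal`, its direction (a tighter seal raising the threshold), and all numeric values are assumptions, and the function names are hypothetical.

```python
def signal_power(samples):
    """Mean squared amplitude of one frame of audio samples."""
    if not samples:
        raise ValueError("frame must not be empty")
    return sum(s * s for s in samples) / len(samples)


def threshold_for_seal(seal_level, loose=2.0, tight=8.0):
    """Map an acoustic seal level in [0, 1] to a ratio threshold.

    Assumption: the threshold grows linearly from `loose` to `tight`
    as the seal improves. The abstract only says the threshold is set
    based on the seal level.
    """
    seal_level = min(max(seal_level, 0.0), 1.0)
    return loose + (tight - loose) * seal_level


def detect_user_voice(inner, outer, seal_level, eps=1e-12):
    """Return True when the inner/outer power ratio exceeds the threshold."""
    ratio = signal_power(inner) / (signal_power(outer) + eps)
    return ratio > threshold_for_seal(seal_level)
```

For example, an inner frame with much higher power than the outer frame is flagged as the user's voice, while equal-power frames are not.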

MICROPHONE ARRAY NOISE SUPPRESSION USING NOISE FIELD ISOTROPY ESTIMATION
20180012617 · 2018-01-11

Noise is suppressed from a microphone array by estimating a noise field isotropy. In some examples audio is received from a plurality of microphones. A power spectral density of a beamformer output is determined and a power spectral density of microphone noise differences is determined. A noise power spectral density is determined using a transfer function and the noise power spectral density is applied to the beamformer output power spectral density to produce a power spectral density output of the received audio with reduced noise.

Headset sound leakage mitigation
11711645 · 2023-07-25

An audio system for a headset includes a plurality of speakers and an audio controller. The plurality of speakers may be in a dipole configuration that cancels sound leakage into a local area of the headset. The controller filters audio content presented by the plurality of speakers to further mitigate leakage of audio content into the local area. The audio controller determines sound filters based on environmental conditions, such as ambient noise levels, as well as based on the audio content being presented.

ADAPTIVE AUDIO CONSTRUCTION

Described herein is a method for creating an object-based audio signal from an audio input, the audio input including one or more audio channels that are recorded to collectively define an audio scene. The one or more audio channels are captured from a respective one or more spatially separated microphones disposed in a stable spatial configuration. The method includes the steps of: a) receiving the audio input; b) performing spatial analysis on the one or more audio channels to identify one or more audio objects within the audio scene; c) determining contextual information relating to the one or more audio objects; d) defining respective audio streams including audio data relating to at least one of the identified one or more audio objects; and e) outputting an object-based audio signal including the audio streams and the contextual information.

Voice Sensing using Multiple Microphones

A noise cancelling headset includes first and second earpieces, each earpiece including a respective feedback microphone, a respective feed-forward microphone, and a respective output driver. A first feedback filter receives an input from at least the first feedback microphone and produces a first filtered feedback signal. A first feed-forward filter receives an input from at least the first feed-forward microphone and produces a first filtered feed-forward signal. A first summer combines the first filtered feedback signal and the first filtered feed-forward signal and produces a first output signal. An output interface provides the first output signal as an output from the headset.
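
The signal path for the first earpiece (feedback filter, feed-forward filter, summer) can be sketched as below. The abstract does not specify the filter structure, so plain FIR filters with caller-supplied taps are assumed here purely for illustration; `fir` and `first_output_signal` are hypothetical names.

```python
def fir(samples, taps):
    """Causal FIR filter: y[n] = sum over k of taps[k] * samples[n - k]."""
    return [
        sum(h * samples[n - k] for k, h in enumerate(taps) if n - k >= 0)
        for n in range(len(samples))
    ]


def first_output_signal(feedback_mic, feedforward_mic, fb_taps, ff_taps):
    """Filter each microphone signal, then sum the two filtered signals."""
    if len(feedback_mic) != len(feedforward_mic):
        raise ValueError("microphone frames must have equal length")
    filtered_fb = fir(feedback_mic, fb_taps)      # first filtered feedback signal
    filtered_ff = fir(feedforward_mic, ff_taps)   # first filtered feed-forward signal
    return [a + b for a, b in zip(filtered_fb, filtered_ff)]  # the summer
```

The returned list corresponds to the first output signal that the output interface would provide from the headset.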

Bidirectional channel control systems, methods, devices and computer readable storage mediums

A bidirectional channel control system, method, device, and non-transitory computer-readable storage medium based on Digital Enhanced Cordless Telecommunications (DECT) is provided. The system comprises a transmitter, at least one receiver, an audio channel, and a text message control channel. The transmitter is configured to send an audio data stream to the at least one receiver through the audio channel. The transmitter is configured to send a control command to the one receiver through the text message control channel in a one-to-one single-point text message mode, and to receive a feedback result from the one receiver in response to the control command. Alternatively, the transmitter is configured to send a control command to each of the at least one receiver based on the DECT protocol through the text message control channel in at least one of a one-to-many broadcast messaging mode and a one-to-one single-point messaging mode, and to receive a feedback result from each of the at least one receiver in response to the control command in the one-to-one single-point messaging mode.

Audio data processing method, apparatus and storage medium for detecting wake-up words based on multi-path audio from microphone array

An audio data processing method is provided. The method includes: obtaining multi-path audio data in an environmental space, obtaining a speech data set based on the multi-path audio data, and separately generating, in a plurality of enhancement directions, enhanced speech information corresponding to the speech data set; matching a speech hidden feature in the enhanced speech information with a target matching word, and determining an enhancement direction corresponding to the enhanced speech information having a highest degree of matching with the target matching word as a target audio direction; obtaining speech spectrum features in the enhanced speech information, and obtaining, from the speech spectrum features, a speech spectrum feature in the target audio direction; and performing speech authentication on the speech hidden feature and the speech spectrum feature that are in the target audio direction based on the target matching word, to obtain a target authentication result.

MICROPHONE NOISE SUPPRESSION FOR COMPUTING DEVICE
20180012585 · 2018-01-11

A computing device with a microphone system is disclosed. The computing device includes a microphone system with an environment microphone and a noise microphone. The environment microphone picks up an environment microphone signal which includes (1) a desired signal component based on desired sound and (2) a noise component based on noise from a noise source. The noise microphone picks up a noise microphone signal based on the noise, and is configured such that contributions to the noise microphone signal from the desired sound, if present, are attenuated relative to the environment microphone. A controller receives and processes time samples from the noise microphone signal to yield a noise estimation of the noise component. The estimation is subtracted from the environment microphone signal to yield an end-user output.
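
The estimate-and-subtract step can be sketched as follows. The abstract does not say how the controller processes the noise microphone samples, so this sketch assumes a fixed FIR filter whose taps model the path from the noise source to the environment microphone; the taps and the names `estimate_noise` and `suppress_noise` are hypothetical.

```python
def estimate_noise(noise_mic, taps):
    """Estimate the noise component: est[n] = sum over k of taps[k] * noise_mic[n - k]."""
    estimate = []
    for n in range(len(noise_mic)):
        acc = 0.0
        for k, h in enumerate(taps):
            if n - k >= 0:
                acc += h * noise_mic[n - k]
        estimate.append(acc)
    return estimate


def suppress_noise(env_mic, noise_mic, taps):
    """Subtract the noise estimate from the environment microphone signal."""
    if len(env_mic) != len(noise_mic):
        raise ValueError("microphone frames must have equal length")
    estimate = estimate_noise(noise_mic, taps)
    return [e - v for e, v in zip(env_mic, estimate)]
```

If the environment signal is the desired sound plus twice the noise microphone signal, calling `suppress_noise` with `taps=[2.0]` recovers the desired sound. A practical system would more likely adapt the taps over time, which this sketch does not attempt.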