Patent classifications
H04R1/40
TARGET SOUND SIGNAL GENERATION APPARATUS, TARGET SOUND SIGNAL GENERATION METHOD, AND PROGRAM
Provided is a target sound extraction technique based on a steering vector generation method enabling instability in a calculation to be prevented when a neural network is trained by using an error back propagation method to reduce an estimation error of a beamformer. A target sound signal generation apparatus generates a target sound signal y.sub.t,f corresponding to a target sound included in an observed sound from an observed signal vector x.sub.t,f corresponding to the observed sound collected by using a plurality of microphones. The target sound signal generation apparatus includes a mask generation unit, a steering vector generation unit, a beamformer vector generation unit, and a target sound signal generation unit. The mask generation unit is configured as a neural network trained by using an error back propagation method. The steering vector generation unit generates a steering vector h.sub.f by determining an eigenvector corresponding to a maximum eigenvalue of a predetermined matrix generated from the observed signal vector x.sub.t,f and a mask γ.sub.t,f by using a power method.
SIGNAL PROCESSING DEVICE, SIGNAL PROCESSING METHOD, PROGRAM, AND SIGNAL PROCESSING SYSTEM
Provided is a signal processing device including a main speech detection unit configured to detect, by using a neural network, whether or not a signal input to a sound collection device assigned to each of at least two speakers includes a main speech that is a voice of the corresponding speaker, and output frame information indicating presence or absence of the main speech.
LOUDSPEAKER CONTROL
There is provided a computer-implemented method of generating audio signals for an array of loudspeakers, the method comprising: receiving a plurality of input audio signals, wherein a respective one of the plurality of input audio signals is to be reproduced, by the array, at each of a plurality of control points in an acoustic environment, and wherein each of the plurality of control points is associated with a respective one of a plurality of loudspeaker groups; receiving an estimate of a position of each of the plurality of control points; assigning, using the received estimate of the position of each of the plurality of control points, each of the loudspeakers in the array to at least one of the plurality of loudspeaker groups, wherein the assigning of a particular loudspeaker to a particular loudspeaker group is based on a relative position of the particular loudspeaker with respect to one or more of the at least one control points associated with the particular loudspeaker group; and generating a respective output audio signal for each of the loudspeakers in the array by applying a set of filters to the plurality of input audio signals, the output audio signal for a particular loudspeaker being generated according to the at least one loudspeaker group to which the particular loudspeaker is assigned.
Associated spatial audio playback
An apparatus including at least one processor and at least one memory including computer code for one or more programs, the at least one memory and the computer code configured, with the at least one processor, to cause the apparatus at least to: generate content lock information for a content lock, wherein the content lock information enables control of audio signal processing associated with audio signals related to one or more audio sources based on a position and/or orientation input.
Using classified sounds and localized sound sources to operate an autonomous vehicle
An ambient sound environment is captured by a microphone array of an autonomous vehicle traveling in the ambient sound environment. A perception module of the autonomous vehicle classifies sounds and localizes sound sources in the ambient sound environment. Classification is performed using spectrum analysis and/or machine learning. In an embodiment, sound sources within a field of view (FOV) of an image sensor of the autonomous vehicle are localized in a visual scene generated by the perception module. In an embodiment, one or more sound sources outside the FOV of the image sensors are localized in a static digital map. Localization is performed using parametric or non-parametric techniques and/or machine learning. The output of the perception module is input into a planning module of the autonomous vehicle to plan a route or trajectory for the autonomous vehicle in the ambient sound environment.
System to determine direction toward user
A device has a microphone array that acquires sound data and a camera that acquires image data. A portion of the device may be moveable by one or more actuators. Responsive to the user, the portion of the device is moved toward an estimated direction of the user. The estimated direction is based on sensor data including the sound data and the image data. First variance values for individual sound direction values are calculated. Data derived from the image data or data from other sensors may be used to modify the first variance values and determine second data comprising second variances. The second data may be processed to determine the estimated direction of the user. For example, the second data may be processed by both a forward and a backward Kalman filter, and the output combined to determine an estimated direction toward the user.
Acoustic devices
The present disclosure provides an acoustic device including a microphone array, a processor, and at least one speaker. The microphone array may be configured to acquire an environmental noise. The processor may be configured to estimate a sound field at a target spatial position using the microphone array. The target spatial position may be closer to an ear canal of a user than each microphone in the microphone array. The processor may be configured to generate a noise reduction signal based on the environmental noise and the sound field estimation of the target spatial position. The at least one speaker may be configured to output a target signal based on the noise reduction signal. The target signal may be used to reduce the environmental noise. The microphone array may be arranged in a target area to minimize an interference signal from the at least one speaker to the microphone array.
Wearable device with directional audio
A wearable device can provide an audio module that is operable to provide audio output from a distance away from the ears of the user. For example, the wearable device can be worn on clothing of the user and direct audio waves to the ears of the user. Such audio waves can be focused by a parametric array of speakers that limit audibility by others. Thus, the privacy of the audio directed to the user can be maintained without requiring the user to wear audio headsets on, over, or in the ears of the user. The wearable device can further include microphones and/or connections to other devices that facilitate calibration of the audio module of the wearable device. The wearable device can further include user sensors that are configured to detect, measure, and/or track one or more properties of the user.
METHOD AND DEVICE FOR CONTROLLING THE PROPAGATION OF ACOUSTIC WAVES ON A WALL
A method and a device for controlling the propagation of acoustic waves in the vicinity of a wall, the method and device implementing a master device for controlling a set Nc of cells primarily made up of a speaker, a set of Nm microphones connected to the speaker, and a control unit, by means of control laws that determine the intensity of the electrical signal that must be sent to each speaker so as to obtain a target determined generalized acoustic impedance for each speaker, such that a fraction of the acoustic waves is absorbed by the membrane of each speaker.
TRANSDUCER, ELECTRONIC DEVICE AND TRANSDUCER ARRAY
The present disclosure is related to a transducer. The transducer includes a substrate. A plurality of vibrating membranes, in a configuration of cantilevers, are disposed on a main surface of the substrate. A plurality of piezoelectric elements are stacked on the plurality of vibrating membranes for generating a voltage to excite each vibrating membrane. The cantilevers of the plurality of vibrating membranes extend in a direction from a reference point on the main surface toward the cantilevers, or in a direction from the cantilevers toward the reference point.