Patent classifications
G01S3/8034
STORAGE MEDIUM, SOUND SOURCE DIRECTION ESTIMATION METHOD, AND SOUND SOURCE DIRECTION ESTIMATION DEVICE
A non-transitory computer-readable storage medium storing a program that causes a processor included in a computer mounted on a sound source direction estimation device to execute a process, the process includes calculating a sound pressure difference between a first voice data acquired from a first microphone and a second voice data acquired from a second microphone and estimating a sound source direction of the first voice data and the second voice data based on the sound pressure difference, outputting an instruction to execute a voice recognition on the first voice data or the second voice data in a language corresponding to the estimated sound source direction, and controlling a reference for estimating a sound source direction based on the sound pressure difference, based on a time length of the voice data used for the voice recognition based on the instruction and a voice recognition time length.
SPEAKER RECOGNITION/LOCATION USING NEURAL NETWORK
Computing devices and methods utilizing a joint speaker location/speaker identification neural network are provided. In one example a computing device receives a multi-channel audio signal of an utterance spoken by a user. Magnitude and phase information features are extracted from the signal and inputted into a joint speaker location/speaker identification neural network that is trained via utterances from a plurality of persons. A user embedding comprising speaker identification characteristics and location characteristics is received from the neural network and compared to a plurality of enrollment embeddings extracted from the plurality of utterances that are each associated with an identity of a corresponding person. Based at least on the comparisons, the user is matched to an identity of one of the persons, and the identity of the person is outputted.
Apparatus and method for audio analysis
An apparatus comprises a receiver (201) receiving a multi-channel audio signal representing audio for a scene. An extractor (203) extracts at least one directional audio component by applying a spatial filtering to the multi-channel signal where the spatial filtering is dependent on the multi-channel audio signal. A feature processor (205) determines a set of features for the first directional audio component and a categorizer (207) determines a first audio source category out of a plurality of audio source categories for the directional audio signal in response to the set of features. An assigner (209) assigns a first audio source property to the first directional audio component from a set of audio source properties for the first audio source category. The apparatus may provide very advantageous categorization and characterization of individual audio sources/components present in a multi-channel signal. This may be advantageous e.g. for visualization of audio events.
VOICE RECOGNITION IMAGE FEEDBACK PROVIDING SYSTEM AND METHOD
Voice recognition image feedback providing system and method capable of providing conveniently various services to a user is disclosed. A voice recognition image feedback device in the system comprises a voice recognition unit configured to recognize user's voice, an image feedback unit configured to output an image corresponding to the voice in a direction where the voice is inputted and a control unit configured to control the voice recognition unit and the image feedback unit. Here, wherein the image is outputted to a user peripheral location area so that the image is shown to the user, the user peripheral location area means an area having a radius of 3 meter based on a point at which the user locates, and at least part of the image is shown in the area having the radius of 3 meter.
METHOD AND SYSTEM OF ACOUSTIC ANGLE OF ARRIVAL DETECTION
A system, article, and method of acoustic angle of arrival detection uses both same-time and delayed-time audio signal value comparisons in a time domain that are input to a classifier neural network.
DEVICE AND METHOD FOR DETERMINING A SOUND SOURCE DIRECTION
A device for determining a sound source direction determines a direction in which a source of a reached sound exists, based on at least one of a sound pressure difference between a first sound pressure that is a sound pressure of a first frequency component of a first part of the reached sound acquired by a first microphone and a second sound pressure that is a sound pressure of the first frequency component of a second part of the reached sound acquired by a second microphone, and a phase difference between a first phase that is a phase of a second frequency component of the first part of the reached sound and a second phase that is a phase of the second frequency component of the second part of the reached sound.
Floating Base Vector Sensor
Systems and methods are provided for sensing acoustic signals using a floating base vector sensor. A vector sensor according to an embodiment of the present disclosure can be used to detect and characterize low frequency sound wave(s) in a viscous medium (e.g., air, water, etc.) by detecting a periodic motion of the media particles associated with the sound wave(s). The orientation of the particle velocity deduced from such measurements can provide information regarding the wave vector of the sound wave(s), can define the direction of arrival (DOA) for the acoustic signal, and can assist locating the source of the sound of interest.
THREE-DIMENSIONAL MEASUREMENT DEVICE WITH ANNOTATION FEATURES
A three-dimensional (3D) measurement system and method is provided. The system includes a noncontact measurement device, an annotation member and a processor. The noncontact measurement device being operable to measure a distance from the noncontact measurement device to a surface. The annotation member is coupled to the noncontact measurement device. The processor is operably coupled to the noncontact measurement device and the annotation member, the processor operable to execute computer instructions when executed on the processor for determining 3D coordinates of at least one point in a field of view based at least in part on the distance, recording an annotation in response to an input from a user, and associating the annotation with the at least one point.
CONVEYING SYSTEM WITH AN AUTOMATIC TETHERING FUNCTION
A conveying system having automatic tethering function comprises an electrically driven mobile conveying device and a portable transmitting device (beacon). The conveying device automatically follows the transmitting device at a predetermined distance. The conveying and transmitting devices are coupled together via a radio link and ultrasound. The transmitting device transmits a radio, and, in time correlation, an ultrasound signal. The ultrasound signal is received by the conveying device in a spatially resolved manner, and an automatic tethering function is implemented based on the radio and ultrasound signals.
Hostile fire detection for an airborne platform
Systems and methods are presented for detecting a direction of an incoming projectile and determining a source location of the projectile. One or more resonant sensors (comprising a plate, piezo electric sensor, etc.) can be arranged, where shockwaves from the projectile (e.g., shockwaves from a bullet travelling at supersonic speeds) are incident upon the plate and cause the plate to resonate. The resonance causes an electrical signal to be generated by the piezo electric sensor (e.g., a piezo electric film sensor), the greater the degree of resonance in the plate, the higher the magnitude of signal generated by the piezo electric sensor. By comparing the magnitude of the piezo electric signals across the array of resonant sensors it is possible to determine a trajectory of the projectile and hence a location of the source of the projectile. Acoustic waves can also be generated by muzzle waves.