Patent classifications
G10L25/78
PRIVACY-PRESERVING SOCIAL INTERACTION MEASUREMENT
Various systems, devices, and methods for social interaction measurement that preserve privacy are presented. An audio signal can be captured using a microphone. The audio signal can be processed using an audio-based machine learning model that is trained to detect the presence of speech. The audio signal can be discarded such that content of the audio signal is not stored after the audio signal is processed using the machine learning model. An indication of whether speech is present within the audio signal can be output based at least in part on processing the audio signal using the audio-based machine learning model.
PRIVACY-PRESERVING SOCIAL INTERACTION MEASUREMENT
Various systems, devices, and methods for social interaction measurement that preserve privacy are presented. An audio signal can be captured using a microphone. The audio signal can be processed using an audio-based machine learning model that is trained to detect the presence of speech. The audio signal can be discarded such that content of the audio signal is not stored after the audio signal is processed using the machine learning model. An indication of whether speech is present within the audio signal can be output based at least in part on processing the audio signal using the audio-based machine learning model.
Audio output apparatus and method of controlling thereof
An audio output apparatus is disclosed. The audio output apparatus that outputs a multi-channel audio signal through a plurality of speakers disposed at different locations, the audio output apparatus includes an input interface, and a processor configured to, based on the multi-channel audio signal input through the inputter being received, obtain scene information on a type of audio included in the multi-channel audio signal and sound image angle information about an angle formed by sound image of the type of audio included in the multi-channel audio signal based on a virtual user, and generate an output signal to be output through the plurality of speakers from the multi-channel audio signal based on the obtained scene information and sound image angle information, wherein the type of audio includes at least one of sound effect, shouting sound, music, and voice, and a number of the plurality of speakers is equal to or greater than a number of channels of the multi-channel audio signal.
Audio output apparatus and method of controlling thereof
An audio output apparatus is disclosed. The audio output apparatus that outputs a multi-channel audio signal through a plurality of speakers disposed at different locations, the audio output apparatus includes an input interface, and a processor configured to, based on the multi-channel audio signal input through the inputter being received, obtain scene information on a type of audio included in the multi-channel audio signal and sound image angle information about an angle formed by sound image of the type of audio included in the multi-channel audio signal based on a virtual user, and generate an output signal to be output through the plurality of speakers from the multi-channel audio signal based on the obtained scene information and sound image angle information, wherein the type of audio includes at least one of sound effect, shouting sound, music, and voice, and a number of the plurality of speakers is equal to or greater than a number of channels of the multi-channel audio signal.
Audio system and signal processing method of voice activity detection for an ear mountable playback device
An audio system for an ear mountable playback device comprises a speaker, an error microphone predominantly sensing sound being output from the speaker and a feed-forward microphone predominantly sensing ambient sound. The audio system further comprises a voice activity detector which is configured to record a feed-forward signal from the feed-forward microphone. Furthermore, an error signal is recorded from the error microphone. A detection parameter is determined as a function of the feed-forward signal and the error signal. The detection parameter is monitored and a voice activity state is set depending on the detection parameter.
Audio system and signal processing method of voice activity detection for an ear mountable playback device
An audio system for an ear mountable playback device comprises a speaker, an error microphone predominantly sensing sound being output from the speaker and a feed-forward microphone predominantly sensing ambient sound. The audio system further comprises a voice activity detector which is configured to record a feed-forward signal from the feed-forward microphone. Furthermore, an error signal is recorded from the error microphone. A detection parameter is determined as a function of the feed-forward signal and the error signal. The detection parameter is monitored and a voice activity state is set depending on the detection parameter.
STREAMING DATA PROCESSING FOR HYBRID ONLINE MEETINGS
Techniques of streaming data processing for hybrid online meetings are disclosed herein. In one example, a method includes receiving, at the remote server, a video stream captured by a camera in the conference room. The video stream captures images of multiple local participants of an online meeting. The method also includes determining identities of the captured images of the multiple local participants in the received video stream using meeting information of the online meeting and generating a set of individual video streams each corresponding to one of the multiple local participants. The set of individual video streams can then be transmitted to the second computing device corresponding to a remote participant of the online meeting as if the multiple local participants are virtually joining the online meeting.
Spoken notifications
An example method includes, at an electronic device: receiving an indication of a notification; in accordance with receiving the indication of the notification: obtaining one or more data streams from one or more sensors; determining, based on the one or more data streams, whether a user associated with the electronic device is speaking; and in accordance with a determination that the user is not speaking: causing an output associated with the notification to be provided.
Spoken notifications
An example method includes, at an electronic device: receiving an indication of a notification; in accordance with receiving the indication of the notification: obtaining one or more data streams from one or more sensors; determining, based on the one or more data streams, whether a user associated with the electronic device is speaking; and in accordance with a determination that the user is not speaking: causing an output associated with the notification to be provided.
Detection of live speech
A method of detecting live speech comprises: receiving a signal containing speech; obtaining a first component of the received signal in a first frequency band, wherein the first frequency band includes audio frequencies; and obtaining a second component of the received signal in a second frequency band higher than the first frequency band. Then, modulation of the first component of the received signal is detected; modulation of the second component of the received signal is detected; and the modulation of the first component of the received signal and the modulation of the second component of the received signal are compared. It may then be determined that the speech may not be live speech, if the modulation of the first component of the received signal differs from the modulation of the second component of the received signal.