H04M2203/509

Audio assisted auto exposure

Systems, methods, and computer-readable media for automatically setting exposure levels in a session based on an active participant. In some embodiments, a method can include detecting faces of one or more participants in one or more images in a captured video feed at a location of the participants illuminated at one or more illumination levels at the location. The one or more detected faces can be associated with brightness levels of the one or more participants based on the one or more illumination levels at the location. Audio input can be received for the one or more participants at the location and a first participant can be identified as an active participant using the audio input. Further, an exposure level of the captured video feed can be set based on the first participant acting as the active participant according to a brightness level in the one or more images associated with a face detection of the first participant in the one or more images.
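
The exposure step described in this abstract can be sketched roughly as follows. This is a minimal illustration only, not the claimed method: the `FaceDetection` type, the `exposure_gain_for_active_speaker` function, the 0-to-1 brightness scale, the target brightness, and the gain limits are all hypothetical assumptions. The active participant is assumed to have been identified already from the audio input.

```python
from dataclasses import dataclass


@dataclass
class FaceDetection:
    participant_id: str
    brightness: float  # assumed scale: mean face luma, 0.0 (black) to 1.0 (white)


def exposure_gain_for_active_speaker(faces, active_id, target_brightness=0.5,
                                     current_gain=1.0, min_gain=0.125, max_gain=8.0):
    """Return a gain that moves the active speaker's face toward target_brightness.

    All parameters and limits here are illustrative assumptions.
    """
    for face in faces:
        if face.participant_id == active_id:
            if face.brightness <= 0.0:
                # A fully dark face gets the maximum allowed gain.
                return max_gain
            gain = current_gain * target_brightness / face.brightness
            # Clamp to the assumed range supported by the camera.
            return max(min_gain, min(max_gain, gain))
    # No face detection matches the active speaker: leave exposure unchanged.
    return current_gain


faces = [FaceDetection("alice", 0.25), FaceDetection("bob", 0.8)]
print(exposure_gain_for_active_speaker(faces, "alice"))
```

In this sketch a dimly lit active speaker raises the gain and a brightly lit one lowers it, while the other detected faces are ignored.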

Collecting and correlating microphone data from multiple co-located clients, and constructing 3D sound profile of a room
10587756 · 2020-03-10

An overlay network platform facilitates a multi-party conference. End users participate in the conference using client-based web browser software and a protocol such as WebRTC. According to this disclosure, an enhanced audio experience for the conference is provided by collecting and correlating microphone data from multiple co-located clients, and then constructing (at the platform) a three-dimensional (3D) sound profile of the room in which the clients are co-located. By processing in the platform (as opposed to locally at each client), the approach enables platform-side creation of an ad-hoc, high-quality microphone array that identifies the relative positions and orientations of the microphones that are being used by the clients. Individual audio streams received from the microphones are combined, and the relative position information (of the individual microphones) is used to render a single audio stream that represents a high-quality recording of the audio in the common physical space. Other clients in the conference request, receive, and play back this high-quality stream to obtain a high-fidelity 3D representation of the audio, as if they were physically present in the room.

Microphone Array System

A microphone array system or microphone array unit for a conference system is provided that includes a front board, side walls and a plurality of microphone capsules arranged in or on the front board mountable on or in a ceiling of a conference room. The microphone array system or unit is adapted for generating a steerable beam within a maximum detection angle range. The microphone array system or microphone array unit includes a processing unit which is configured to receive the output signals of the microphone capsules and to steer the beam based on the received output signal of the microphone array. The processing unit is configured to control the microphone array to limit the detection angle range to exclude at least one predetermined exclusion sector in which a noise source is located.
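
One way to picture the exclusion-sector limit is the one-dimensional sketch below. It is an assumption-laden simplification: the abstract does not say how steering angles are represented, so a single angle in degrees, the `steer_angle` function, and the "move to the nearest sector edge" policy are all hypothetical.

```python
def steer_angle(requested_deg, max_angle_deg, exclusion_sectors):
    """Limit a requested beam angle to the allowed detection range.

    requested_deg: desired beam direction in degrees (assumed 1-D model).
    max_angle_deg: assumed symmetric maximum detection angle (+/-).
    exclusion_sectors: list of (low_deg, high_deg) ranges that contain a
        noise source and must not be steered into.
    """
    # First clamp to the maximum detection angle range.
    angle = max(-max_angle_deg, min(max_angle_deg, requested_deg))
    for low, high in exclusion_sectors:
        if low < angle < high:
            # Inside an excluded sector: move to the nearest sector edge.
            # Simplification: overlapping sectors are not handled.
            angle = low if (angle - low) <= (high - angle) else high
    return angle


print(steer_angle(50, 60, [(40, 55)]))
```

A request that falls inside the sector `(40, 55)` is redirected to its nearer boundary, and requests outside any sector pass through after range clamping.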

ARRAY MICROPHONE MODULE AND SYSTEM

A microphone module comprises a housing, an audio bus, and a first plurality of microphones in communication with the audio bus. The microphone module further comprises a module processor in communication with the first plurality of microphones and the audio bus. The module processor is configured to detect the presence of an array processor in communication with the audio bus, detect the presence of a second microphone module in communication with the audio bus, and configure the audio bus to pass audio signals from both the first plurality of microphones and the second microphone module to the array processor.

SYSTEM AND METHOD FOR CAPTURING SOUND SOURCE
20240098406 · 2024-03-21

A method for capturing a sound source includes: capturing a space where a microphone array is located to generate an image by a camera, wherein the microphone array is configured to receive a sound generated by the sound source and generate a sound source coordinate of the sound source relative to the microphone array; searching for a sub-image belonging to the microphone array within the image by a computing device connected to the camera; calculating a microphone coordinate of the microphone array relative to the camera by the computing device according to the sub-image; calculating a required control parameter by the computing device at least according to the sound source coordinate and the microphone coordinate; and adjusting a capturing direction by the camera to capture the sound source at least according to the required control parameter.
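
The coordinate step can be illustrated with the sketch below. It assumes, purely for illustration, that the microphone array and the camera share the same axis orientation (so the two coordinates simply add), that the camera looks along +z with +x to the right and +y up, and that the "required control parameter" is a pan/tilt pair in degrees. The function name `pan_tilt_to_source` is hypothetical.

```python
import math


def pan_tilt_to_source(source_rel_mic, mic_rel_camera):
    """Return (pan_deg, tilt_deg) that points the camera at the sound source.

    source_rel_mic: (x, y, z) of the source relative to the microphone array.
    mic_rel_camera: (x, y, z) of the microphone array relative to the camera.
    Assumes both frames have the same orientation (no rotation between them).
    """
    # Source position in the camera frame is the sum of the two offsets.
    x = source_rel_mic[0] + mic_rel_camera[0]
    y = source_rel_mic[1] + mic_rel_camera[1]
    z = source_rel_mic[2] + mic_rel_camera[2]
    pan = math.degrees(math.atan2(x, z))                  # left/right rotation
    tilt = math.degrees(math.atan2(y, math.hypot(x, z)))  # up/down rotation
    return pan, tilt


pan, tilt = pan_tilt_to_source((1.0, 0.0, 0.0), (0.0, 0.0, 1.0))
print(round(pan, 1), round(tilt, 1))
```

If the two frames were rotated relative to each other, the source coordinate would first need to be rotated into the camera frame; that step is omitted here.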

AUDIO ASSISTED AUTO EXPOSURE
20190379839 · 2019-12-12

Systems, methods, and computer-readable media for automatically setting exposure levels in a session based on an active participant. In some embodiments, a method can include detecting faces of one or more participants in one or more images in a captured video feed at a location of the participants illuminated at one or more illumination levels at the location. The one or more detected faces can be associated with brightness levels of the one or more participants based on the one or more illumination levels at the location. Audio input can be received for the one or more participants at the location and a first participant can be identified as an active participant using the audio input. Further, an exposure level of the captured video feed can be set based on the first participant acting as the active participant according to a brightness level in the one or more images associated with a face detection of the first participant in the one or more images.

Optimal view selection method in a video conference
10491809 · 2019-11-26

A system for ensuring that the best available view of a person's face is included in a video stream when the person's face is being captured by multiple cameras at multiple angles at a first endpoint. The system uses one or more microphone arrays to capture direct-reverberant ratio information corresponding to the views, and determines which view most closely matches a view of the person looking directly at the camera, thereby improving the experience for viewers at a second endpoint.
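
A rough sketch of view selection from direct-reverberant ratio (DRR) information follows. The abstract does not state the selection rule, so this assumes that the view whose microphone array measures the highest DRR is the one the person is facing. The function names and the per-view energy pairs are hypothetical.

```python
import math


def drr_db(direct_energy, reverberant_energy):
    """Direct-to-reverberant ratio in decibels (both energies must be > 0)."""
    return 10.0 * math.log10(direct_energy / reverberant_energy)


def select_view(view_energies):
    """Pick the view with the highest DRR.

    view_energies: dict mapping a view name to a
        (direct_energy, reverberant_energy) pair for that view.
    Assumption: a higher DRR means the talker is facing that camera.
    """
    return max(view_energies, key=lambda view: drr_db(*view_energies[view]))


views = {"cam_left": (1.0, 2.0), "cam_center": (4.0, 1.0), "cam_right": (1.0, 1.0)}
print(select_view(views))
```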

Array microphone module and system

A microphone module comprises a housing, an audio bus, and a first plurality of microphones in communication with the audio bus. The microphone module further comprises a module processor in communication with the first plurality of microphones and the audio bus. The module processor is configured to detect the presence of an array processor in communication with the audio bus, detect the presence of a second microphone module in communication with the audio bus, and configure the audio bus to pass audio signals from both the first plurality of microphones and the second microphone module to the array processor.

Multitalker optimised beamforming system and method

A method of processing a series of microphone inputs of an audio conference, the method including the steps of: (a) conducting a spatial analysis and feature extraction of the audio conference based on current audio activity; (b) aggregating historical information to obtain information about the approximate relative location of recent sound objects relative to the series of microphone inputs; (c) utilizing the relative location or distance of the sound objects from the series of microphone inputs to determine if beam forming should be utilized to enhance the audio reception from recent sound objects.
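
Steps (b) and (c) might be sketched as below. The abstract does not give the aggregation rule or the decision criterion, so this assumes exponential smoothing of per-source distance estimates and a simple rule that beamforming is enabled for sources farther than a threshold. The function name, smoothing factor, and threshold are hypothetical.

```python
def should_beamform(distance_history_m, threshold_m=1.5, alpha=0.3):
    """Decide whether to beamform toward a sound object.

    distance_history_m: past distance estimates (metres) of one sound
        object from the microphones, oldest first.
    threshold_m, alpha: assumed tuning values, not from the source.
    """
    if not distance_history_m:
        return False  # no history yet, so no basis for beamforming
    # Step (b): aggregate historical estimates with exponential smoothing.
    smoothed = distance_history_m[0]
    for distance in distance_history_m[1:]:
        smoothed = alpha * distance + (1.0 - alpha) * smoothed
    # Step (c): assumed rule that distant talkers benefit from beamforming.
    return smoothed > threshold_m


print(should_beamform([3.0, 2.8, 3.1]))
print(should_beamform([0.5, 0.6, 0.4]))
```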

Data transmission method and system, and related device
10405241 · 2019-09-03

A data transmission method in which a host acquires parameter information, namely a signal-to-noise ratio or a bandwidth, of a wireless communication channel between a wireless microphone array and the host. When the acquired parameter information satisfies a first preset condition, the host reduces the sampling frequency of the wireless microphone array or decreases the quantity of data transmission paths between the wireless microphone array and the host, so that the bandwidth occupied when the wireless microphone array transmits data is reduced.
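
The adaptation described here can be sketched as follows. The "first preset condition" is not specified in the abstract, so this assumes it means the signal-to-noise ratio falling below a threshold. The function name, the threshold, the halving policy, and the minimum sampling rate are hypothetical.

```python
def adapt_transmission(snr_db, sample_rate_hz, channels,
                       snr_threshold_db=15.0, min_rate_hz=8000):
    """Return the (sample_rate_hz, channels) the host should request.

    Assumed preset condition: snr_db below snr_threshold_db.
    """
    if snr_db >= snr_threshold_db:
        # Channel is good enough: keep the current settings.
        return sample_rate_hz, channels
    if sample_rate_hz // 2 >= min_rate_hz:
        # Prefer lowering the sampling frequency first (assumed policy).
        return sample_rate_hz // 2, channels
    # Otherwise drop one data transmission path, keeping at least one.
    return sample_rate_hz, max(1, channels - 1)


print(adapt_transmission(10.0, 48000, 8))
```

Either branch lowers the bandwidth the array occupies; which one to prefer is a design choice the abstract leaves open.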