IPIQ

H04M3/568

SPATIAL AUDIO CONVERSATIONAL ANALYSIS FOR ENHANCED CONVERSATION DISCOVERY

20220415327 · 2022-12-29 ·

Microsoft Technology Licensing, LLC

Systems and methods for providing enhanced teleconferencing. An example method includes receiving audio streams from a plurality of client devices of participants of a teleconference; converting the audio streams for a first conversation within the teleconference into first text; converting the audio streams for a second conversation within the teleconference into a second text; analyzing the first text to identify one or more topics being discussed in the first conversation; analyzing the second text to identify one or more topics being discussed in the second conversation; and presenting, in a teleconference user interface, at least one of the one or more topics being discussed in the first conversation or the one or more topics being discussed in the second conversation.

AUTOMATED SESSION PARTICIPATION ON BEHALF OF ABSENT PARTICIPANTS

20220417297 · 2022-12-29 ·

The technology disclosed herein enables an absent participant to participate in a communication session. In a particular embodiment, a method includes identifying a meeting for an automated attendee to attend on behalf of a user. At a time for the meeting, the method includes joining the automated attendee to a communication session for the meeting. In the automated attendee, the method provides monitoring, in real-time, user communications exchanged between two or more other users over the communication session. During the monitoring, upon identifying a portion of the user communications that is relevant to the user, the method provides notifying the user about the portion.

NOISE DETECTOR FOR TARGETED APPLICATION OF NOISE REMOVAL

20220415335 · 2022-12-29 ·

Techniques for performing conditional or controlled noise removal from audio that may contain background noise. The techniques involve obtaining audio from an environment that may have one or more unwanted noise sources, and converting the audio to digital audio data. The digital audio data is analyzed to detect whether there is noise in the audio. When noise is detected in the audio, noise removal is performed on the digital audio data to remove the noise from the audio. When noise is not detected in the audio, the digital audio data is further processed without performing noise removal on the digital audio data.
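The conditional flow in this abstract can be illustrated with a minimal sketch. The abstract does not say how noise is detected or removed, so the detector below (the RMS level of the quietest frame compared with a fixed threshold) and the noise gate are stand-in assumptions, and the names `noise_detected`, `remove_noise`, and `process_audio` are hypothetical.

```python
import math


def rms(frame):
    """Root-mean-square level of one frame of samples."""
    return math.sqrt(sum(s * s for s in frame) / len(frame)) if frame else 0.0


def noise_detected(samples, frame_size=4, floor_threshold=0.05):
    """Assumed detector: treat the quietest frame as the noise floor.

    If even the quietest frame is louder than floor_threshold, background
    noise is presumed present. The abstract does not specify a detector.
    """
    if not samples:
        return False
    frames = [samples[i:i + frame_size] for i in range(0, len(samples), frame_size)]
    return min(rms(f) for f in frames) > floor_threshold


def remove_noise(samples, gate=0.1):
    """Placeholder noise removal: a simple gate that zeroes quiet samples."""
    return [s if abs(s) >= gate else 0.0 for s in samples]


def process_audio(samples):
    """Apply noise removal only when noise is detected.

    Returns (processed_samples, noise_removal_applied).
    """
    if noise_detected(samples):
        return remove_noise(samples), True
    # No noise detected: pass the audio on without noise removal.
    return list(samples), False
```

The point of the branch is that clean audio skips the removal stage entirely, which avoids both the processing cost and any artifacts the removal step might introduce.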

VOICE COMMUNICATION APPARATUS AND HOWLING DETECTION METHOD

20220415336 · 2022-12-29 ·

Panasonic Intellectual Property Management Co., Ltd.

Shinichi TAKAYAMA

A voice communication apparatus includes a communication unit configured to communicate with one or more other terminals via a network, a voice signal processing unit configured to acquire a first voice signal collected from a voice input terminal, acquire a second voice signal output from another terminal, and detect whether there is howling based on the first and second voice signals, a control unit configured to determine whether a device connected to the voice input terminal or a device connected to the voice output terminal is a howling cause based on a detection result of the voice signal processing unit, and an alert notification unit configured to generate and output an alert screen when the control unit determines that the device connected to the voice input terminal or the device connected to the voice output terminal is the howling cause.

PROACTIVE AUDIO OPTIMIZATION FOR CONFERENCES

20220417364 · 2022-12-29 ·

In one embodiment, a device obtains audio data from one or more past conferences. A prediction model that predicts when participants of a conference will speak is generated based on the audio data from the one or more past conferences. The device uses the prediction model to predict a speech distribution for participants of a particular conference. Then, the device proactively optimizes audio parameters of the particular conference based on its predicted speech distribution.

Holographic Calling for Artificial Reality

20220413434 · 2022-12-29 ·

A holographic calling system can capture and encode holographic data at a sender-side of a holographic calling pipeline and decode and present the holographic data as a 3D representation of a sender at a receiver-side of the holographic calling pipeline. The holographic calling pipeline can include stages to capture audio, color images, and depth images; densify the depth images to have a depth value for each pixel while generating parts masks and a body model; use the masks to segment the images into parts needed for hologram generation; convert depth images into a 3D mesh; paint the 3D mesh with color data; perform torso disocclusion; perform face reconstruction; and perform audio synchronization. In various implementations, different ones of these stages can be performed sender-side or receiver-side. The holographic calling pipeline also includes sender-side compression, transmission over a communication channel, and receiver-side decompression and hologram output.

Audio conferencing using a distributed array of smartphones

11539844 · 2022-12-27 ·

Dolby Laboratories Licensing Corporation

Described is a method of hosting a teleconference among a plurality of client devices arranged in two or more acoustic spaces, each client device having an audio capturing capability and/or an audio rendering capability, the method comprising: grouping the plurality of client devices into two or more groups based on their belonging to respective acoustic spaces, receiving first audio streams from the plurality of client devices, generating second audio streams from the first audio streams for rendering by respective client devices among the plurality of client devices, based on the grouping of the plurality of client devices into the two or more groups, and outputting the generated second audio streams to respective client devices. Further described are corresponding computation devices, computer programs, and computer-readable storage media.
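One way to picture the grouping step is the sketch below. The abstract only states that the second audio streams are generated "based on the grouping"; the specific rule used here (each device receives a mix of streams captured in the other acoustic spaces and none from its own room) is an assumption for illustration, as are the names `group_by_space` and `mix_for_devices`.

```python
from collections import defaultdict


def group_by_space(device_spaces):
    """Group device ids by the acoustic space (room) each one belongs to."""
    groups = defaultdict(list)
    for device, space in device_spaces.items():
        groups[space].append(device)
    return dict(groups)


def mix_for_devices(device_spaces, first_streams):
    """Build one output ("second") stream per device.

    Assumed rule: a device renders the sample-wise sum of the streams
    captured in OTHER acoustic spaces, so people in the same room do not
    hear themselves or their neighbours played back.
    """
    second_streams = {}
    for device, space in device_spaces.items():
        remote = [
            first_streams[other]
            for other, other_space in device_spaces.items()
            if other_space != space and other in first_streams
        ]
        # zip(*remote) walks the remote streams sample by sample.
        second_streams[device] = [sum(frame) for frame in zip(*remote)] if remote else []
    return second_streams
```

For example, with devices `a` and `b` in one room and `c` in another, `a` and `b` would each receive only the stream from `c`, while `c` would receive the sum of the streams from `a` and `b`.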

Variable-volume audio streams

11539845 · 2022-12-27 ·

Microsoft Technology Licensing, LLC

Systems and methods for enhanced teleconferencing. An example method includes generating a teleconference interface with a plurality of user-controlled participant interface elements representing participants of the teleconference; identifying a first conversation based on positions, in the teleconference interface, of a first subset of the participant interface elements; identifying a second conversation based on the positions, in the teleconference interface, of a second subset of the participant interface elements; accessing supplemental data, from at least one of a networking or social media database, for the participants of the teleconference; and presenting, within the participant interface elements, the supplemental data.
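Identifying conversations from on-screen positions could, for instance, be done by proximity clustering. The abstract does not name a clustering rule, so the following is a sketch under the assumption that participant elements within a fixed distance of one another (directly or through a chain of neighbours) form one conversation; `identify_conversations` and `max_distance` are hypothetical names.

```python
import math


def identify_conversations(positions, max_distance=100.0):
    """Cluster participant elements into conversations by proximity.

    positions maps a participant name to an (x, y) position in the
    teleconference interface. Elements linked by a chain of neighbours,
    each no farther apart than max_distance, share a conversation.
    """
    unvisited = set(positions)
    conversations = []
    for name in sorted(positions):
        if name not in unvisited:
            continue
        unvisited.remove(name)
        cluster, frontier = [name], [name]
        while frontier:
            current = frontier.pop()
            near = [
                other for other in sorted(unvisited)
                if math.dist(positions[current], positions[other]) <= max_distance
            ]
            for other in near:
                unvisited.remove(other)
                cluster.append(other)
                frontier.append(other)
        conversations.append(sorted(cluster))
    return conversations
```

Because participants move their own interface elements, rerunning the clustering after each move would let the set of conversations follow the layout.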

WIRELESS COMMUNICATION DEVICE, WIRELESS COMMUNICATION BASE STATION, AND WIRELESS COMMUNICATION METHOD

20220407626 · 2022-12-22 ·

A wireless communication device comprising a communication unit, a storage, an operation unit, a communication quality measuring unit, and a controller that controls the communication unit and the storage, wherein the controller further controls the storage to store, when the operation unit detects a transmission starting operation while every channel frequency is in use by other wireless communication devices, the voice data until the operation unit detects a transmission stopping operation;

controls the communication unit to transmit, when every channel frequency is not used and the bit error rate is smaller than a first threshold, all pieces of voice data that have not been transmitted, and to transmit, when every channel frequency is not used and the bit error rate is between the first threshold and a second threshold, a piece of divided data having a predetermined length from a head of the voice data which has not been transmitted.
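The two-threshold transmission rule can be sketched as a small decision function. Several details are assumptions rather than statements from the abstract: the flag `channel_available` reads "every channel frequency is not used" as meaning a channel is free for transmission, "between" the thresholds is treated as `first_threshold <= BER < second_threshold`, and nothing is sent in the cases the abstract does not cover. The function name is hypothetical.

```python
def select_transmission(channel_available, bit_error_rate, pending,
                        first_threshold, second_threshold, chunk_len):
    """Decide how much stored voice data to send now.

    Returns (data_to_send, data_still_pending).
    """
    if not channel_available:
        # Assumed behaviour: keep buffering while no channel can be used.
        return b"", pending
    if bit_error_rate < first_threshold:
        # Good link quality: send everything that has not been transmitted.
        return pending, b""
    if bit_error_rate < second_threshold:
        # Marginal link quality: send only a fixed-length piece from the head.
        return pending[:chunk_len], pending[chunk_len:]
    # Assumed behaviour: link too poor, send nothing for now.
    return b"", pending
```

Calling the function repeatedly as the measured bit error rate changes would drain the buffer quickly on a good link and in fixed-length pieces on a marginal one.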

SPEECH ENHANCEMENT TECHNIQUES THAT MAINTAIN SPEECH OF NEAR-FIELD SPEAKERS

20220392478 · 2022-12-08 ·

An endpoint selectively enhances a captured audio signal based on an operating mode. The endpoint obtains an audio input signal of multiple users in a physical location. The audio input signal is captured by a microphone. The endpoint separates voice signals from the audio input signal and determines an operating mode for an audio output signal. The endpoint selectively adjusts each of the voice signals based on the operating mode to generate the audio output signal.
