Patent classifications
H04L65/60
Advanced packet-based sample audio concealment
In a reliable multi-cast, a concealment scheme may be applied to recover or conceal lost or otherwise corrupted packets of audio information for one channel based on the audio information of other channels in the reliable multi-cast. The concealment scheme may employ correction factors for channels derived from the channel relationships.
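As a rough illustration of concealing one channel from another, the sketch below rebuilds a lost packet by scaling the matching packet of a reference channel. The abstract does not say how the correction factor is derived; the RMS-level ratio over recent history is an assumption, and all function names are hypothetical.

```python
import math


def rms(samples):
    """Root-mean-square level of a list of samples (0.0 for an empty list)."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def correction_factor(history_lost, history_ref):
    """Assumed correction factor: ratio of the two channels' recent RMS levels."""
    ref_level = rms(history_ref)
    if ref_level == 0.0:
        return 0.0  # no usable reference energy, so conceal with silence
    return rms(history_lost) / ref_level


def conceal_packet(history_lost, history_ref, ref_packet):
    """Estimate the lost packet by scaling the reference channel's packet."""
    gain = correction_factor(history_lost, history_ref)
    return [gain * s for s in ref_packet]
```

For example, if the lost channel has recently run at half the level of the reference channel, `conceal_packet` returns the reference packet at half amplitude.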
Method and apparatus for stream descriptor binding in a streaming environment
A method, apparatus and computer program product provide a stream binding mechanism that supports stream data pulling and pushing in a distributed or cloud-based streaming environment. The method, apparatus and computer program product receive a stream register message associated with a stream from a streaming entity. The stream register message includes a binding descriptor. The method, apparatus and computer program product transmit a create connection message to a stream broker. The method, apparatus and computer program product transmit an endpoint message including a set of connection parameters of an endpoint to the streaming entity. The method, apparatus and computer program product receive a query for the stream from a stream processing node. The method, apparatus and computer program product then transmit a response to the query to the stream processing node. The response includes a set of connection parameters of the stream broker.
Multi-stream target-speech detection and channel fusion
Audio processing systems and methods include an audio sensor array configured to receive a multichannel audio input and generate a corresponding multichannel audio signal, target-speech detection logic, and an automatic speech recognition engine or VoIP application. An audio processing device includes a target speech enhancement engine configured to analyze a multichannel audio input signal and generate a plurality of enhanced target streams; a multi-stream target-speech detection generator comprising a plurality of target-speech detector engines, each configured to determine a probability of detecting a specific target speech of interest in the stream, wherein the multi-stream target-speech detection generator is configured to determine a plurality of weights associated with the enhanced target streams; and a fusion subsystem configured to apply the plurality of weights to the enhanced target streams to generate an enhancement output signal.
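The weighting and fusion step can be sketched as a weighted sum of the enhanced streams. The abstract does not specify how the weights follow from the detector probabilities; normalizing the probabilities so they sum to one is an assumption here, and the function names are hypothetical.

```python
def fusion_weights(probabilities):
    """Assumed weighting: normalize detector probabilities so they sum to 1."""
    total = sum(probabilities)
    if total == 0:
        # No detector fired; fall back to equal weights.
        return [1.0 / len(probabilities)] * len(probabilities)
    return [p / total for p in probabilities]


def fuse_streams(streams, weights):
    """Apply one weight per stream and sum sample by sample."""
    length = len(streams[0])
    return [
        sum(w * stream[i] for w, stream in zip(weights, streams))
        for i in range(length)
    ]
```

With detector probabilities of 0.75 and 0.25, the first enhanced stream contributes three quarters of each output sample and the second contributes one quarter.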
Information processing device and attendance state management method
An information processing device includes a receiving unit for receiving face image data of a student, who takes a class in a classroom, a plurality of times during the class from a camera provided in the classroom; a control unit for comparing the face image data with registered face image data of the student and counting the number of times the student of the registered face image data is photographed by the camera during the class; and a transmission unit for transmitting the number of times of photographing to a terminal device used by a teacher who teaches the class.
Filtering video content items
Methods and systems for filtering video content items are described herein. The system identifies a plurality of video content items that are linked to respective image content items. The system determines, for each of the plurality of video content items, whether a video content item corresponds to a respective image content item. In response to the determining, the system causes to be provided information identifying the plurality of video content items excluding video content items that do not correspond to respective image content items.
Scalable extended reality video conferencing
Some embodiments of the present inventive concept provide for improved telepresence and other virtual sessions through dynamic scaling and/or assignment of computing resources. An XR telepresence platform can allow for immersive multi-user video conferencing from within a web browser or other medium. The platform can support spatial audio and/or user video. The platform can scale to hundreds or thousands of concurrent users in one or more virtual environments. Disclosed herein are resource allocation techniques for dynamically allocating client connections across multiple servers.
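One simple way to allocate client connections across servers is a least-loaded policy with a per-server capacity. The abstract does not describe the actual allocation technique, so the policy, the capacity limit, and the function name below are all assumptions made for illustration.

```python
def assign_client(server_loads, capacity):
    """Place one client on the least-loaded server that still has room.

    server_loads maps a server name to its current connection count and is
    updated in place. Returns the chosen server name, or None if all are full.
    """
    candidates = {
        name: load for name, load in server_loads.items() if load < capacity
    }
    if not candidates:
        return None  # a real platform might start a new server here
    chosen = min(candidates, key=candidates.get)
    server_loads[chosen] += 1
    return chosen
```

Calling `assign_client` once per incoming connection keeps the load roughly even, and a `None` result signals that more capacity is needed.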