Patent classifications
H04R2227/005
AUDIO OUTPUT CONFIGURATION FOR MOVING DEVICES
Described herein is a system for recalibrating an audio configuration for mobile or moving devices. The system may configure a multi-device output group to generate synchronous output audio using multiple devices. For example, the output group may include a first device generating a first portion of output audio corresponding to a first channel and a second device generating a second portion of the output audio corresponding to a second channel. If the second device detects motion and/or movement indicating a change in its location, the system may recalibrate the output group to continue generating the output audio without the second device. For example, the first device or a new device can generate the second portion of the output audio instead of the second device. When the second device returns, the system can recalibrate the output group to include the second device again.
Smart audio system capable of determining speaker type and position
There is provided a smart audio system including multiple audio devices and a central server. The central server confirms a model of every audio device and a position thereof in an operation area in a scan mode. The central server confirms a user position or a user state to accordingly control output power of a speaker of each of the multiple audio devices in an operation mode.
Media content based on playback zone awareness
Systems and methods are provided for providing media content based on playback zone awareness. In one aspect, a computing system receives, via a network interface, zone data from the media playback system, wherein the zone data includes an indication of a particular zone of the media playback system, and wherein the particular zone comprises at least one playback device. The computing system identifies audio content based on (i) the indication of the particular zone and (ii) contextual data associated with the particular zone, and provides, via the network interface, an indication of the identified audio content to the media playback system.
Synchronizing playback by media playback devices
Example systems, apparatus, and methods receive audio information including a plurality of frames from a source device, wherein each frame of the plurality of frames includes one or more audio samples and a time stamp indicating when to play the one or more audio samples of the respective frame. In an example, the time stamp is updated for each of the plurality of frames using a time differential value determined between clock information received from the source device and clock information associated with the device. The updated time stamp is stored for each of the plurality of frames, and the audio information is output based on the plurality of frames and associated updated time stamps. A number of samples per frame to be output is adjusted based on a comparison between the updated time stamp for the frame and a predicted time value for play back of the frame.
Systems and methods for generating labeled data to facilitate configuration of network microphone devices
Systems and methods for generating training data are described herein. Pieces of metadata captured by a plurality of networked sensor systems can be captured, where each piece of metadata is associated with a specific set of sensor data captured by one of the plurality of networked sensor systems and includes a set of characteristics for the specific set of captured sensor data. A probabilistic model can be generated based on the received metadata and simulations can be performed based upon a training corpus by generating multiple scenarios, and, for each scenario, a scenario specific version of a particular annotated sample is generated by performing a simulation using the particular annotated sample. The scenario specific versions of annotated samples from the training corpus can be stored as a training data set on the at least one network device.
Bidirectional channel control systems, methods, devices and computer readable storagemeduums
A bidirectional channel control system, method, device, and non-transitory computer-readable storage medium based on Digital Enhanced Cordless Telecommunications is provided. The system comprises a transmitter, at least one receiver, an audio channel, and a text message control channel. The transmitter is configured to send an audio data stream to the at least one receiver through the audio channel. The transmitter is configured to send a control command to the one receiver through the text message control channel in a one-to-one single-point text message mode, and to receive a feedback result from the one receiver in response to the control command. Alternatively, the transmitter is configured to send a control command to each of the at least one receiver based on DECT protocol through the text message control channel in at least one of a one-to-many broadcast messaging mode and a one-to-one single-point messaging mode, and to receive a feedback result from each of the at least one receiver in response to the control command in the one-to-one single-point messaging mode.
MEDIA PLAYBACK BASED ON SENSOR DATA
Example techniques relate to playback based on acoustic signals in a system including a first network device and a second network device. A first network device may detect a presence of a user using a camera and/or infrared sensors. The first network device sends, in response to detecting the presence of the user, a particular signal via the first network interface. The second network device receives data corresponding to the particular signal and plays back an audio output corresponding to the particular signal.
Display assistant device having a monitoring mode and an assistant mode
A display assistant device comprises a display, a camera, a speaker, microphones, a processor and memory. The memory stores programs comprising instructions that, when executed by the processor, enable a plurality of modes of the display assistant device. The modes include a monitoring mode and an assistant mode. In the monitoring mode, the device is configured to perform a remote monitoring function in which first video captured by the camera is streamed to a remote server system for monitoring uses. The monitoring uses include transmission of the first video to remote client devices authorized to access the first video. In the assistant mode, the device is configured to perform a second plurality of functions that excludes the monitoring function and includes a video communication function in which second video captured by the camera is transmitted to second devices participating in a video communication with a first user of the device.
Methods, systems, and media for identifying a plurality of sets of coordinates for a plurality of devices
Methods, systems, and media for identifying a plurality of sets of coordinates for a plurality of devices are provided. In some embodiments, the method comprises: identifying each device in a plurality of devices associated with a user account; instructing the plurality of devices to perform an audio sequence; receiving a plurality of transit times from the plurality of devices; determining a plurality of distances based on the plurality of transit times; determining a plurality of sets of coordinates based on the plurality of distances; associating to each of the plurality of devices a corresponding unique one of the plurality of sets of coordinates; and causing at least one of the plurality of devices to play spatial audio determined from the plurality of sets of coordinates.
Noise classification for event detection
In one aspect, a network microphone device includes a plurality of microphones and is configured to detect sound via the one or more microphones. The network microphone device may capture sound data based on the detected sound in a first buffer, and capture metadata associated with the detected sound in a second buffer. The network microphone device may classify one or more noises in the detected sound and cause the network microphone device to perform an action based on the classification of the respective one or more noises.