Patent classifications
H04R27/00
Vocal guidance engines for playback devices
Systems and methods for vocal guidance for playback devices are disclosed. A playback device can include a first wireless transceiver for communication via a first data network and a second wireless transceiver for communication via a second data network. The device includes one or more processors and is configured to maintain a library that includes one or more source device names and corresponding audio content, the audio content configured to be played back via an amplifier to indicate association of a particular source device with the playback device via the first data network. The device receives, via the second data network, information from one or more remote computing devices, and based on the information, updates the library by: (i) adding at least one new source device name and corresponding audio content; (ii) changing at least one source device name or its corresponding audio content; or both (i) and (ii).
Audio communication device
An audio communication device includes: a sound position determiner that determines sound localization positions for N audio signals in a virtual space having first and second walls; N sound localizers each performing sound localization processing to localize sound in the sound localization position determined by the sound position determiner, and outputting localized sound signals; an adder that sums the N localized sound signals, and outputs a summed localized sound signal. Each sound localizer performs the processing using: a first head-related transfer function (HRTF) assuming that a sound wave emitted from the sound localization position of the sound localizer determined by the sound position determiner directly reaches each ear of a hearer virtually present at the hearer position; and a second HRTF assuming that the sound wave emitted from the sound localization position reaches each ear of the hearer after being reflected by closer one of the first and second walls.
Audio communication device
An audio communication device includes: a sound position determiner that determines sound localization positions for N audio signals in a virtual space having first and second walls; N sound localizers each performing sound localization processing to localize sound in the sound localization position determined by the sound position determiner, and outputting localized sound signals; an adder that sums the N localized sound signals, and outputs a summed localized sound signal. Each sound localizer performs the processing using: a first head-related transfer function (HRTF) assuming that a sound wave emitted from the sound localization position of the sound localizer determined by the sound position determiner directly reaches each ear of a hearer virtually present at the hearer position; and a second HRTF assuming that the sound wave emitted from the sound localization position reaches each ear of the hearer after being reflected by closer one of the first and second walls.
Dynamic Player Selection for Audio Signal Processing
In one aspect, a first playback device is configured to (i) receive a set of voice signals, (ii) process the set of voice signals using a first set of audio processing algorithms, (iii) identify, from the set of voice signals, at least two voice signals that are to be further processed, (iv) determine that the first playback device does not have a threshold amount of computational power available, (v) receive an indication of an available amount of computational power of a second playback device, (vi) send the at least two voice signals to the second playback device, (vii) cause the second playback device to process the at least two voice signals using a second set of audio processing algorithms, (viii) receive, from the second playback device, the processed at least two voice signals, and (ix) combine the processed at least two voice signals into a combined voice signal.
OPTIMIZATION OF NETWORK MICROPHONE DEVICES USING NOISE CLASSIFICATION
Systems and methods for optimizing network microphone devices using noise classification are disclosed herein. In one example, individual microphones of a network microphone device (NMD) detect sound. The sound data is analyzed to detect a trigger event such as a wake word. Metadata associated with the sound data is captured in a lookback buffer of the NMD. After detecting the trigger event, the metadata is analyzed to classify noise in the sound data. Based on the classified noise, at least one performance parameter of the NMD is modified.
NETWORKED MICROPHONE DEVICES, SYSTEMS, & METHODS OF LOCALIZED ARBITRATION
A first playback device is configured to perform functions comprising: detecting sound, identifying a wake word based on the sound as detected by the first device, receiving an indication that a second playback device has also detected the sound and identified the wake word based on the sound as detected by the second device, after receiving the indication, evaluating which of the first and second devices is to extract sound data representing the sound and thereby determining that the extraction of the sound data is to be performed by the second device over the first device, in response to the determining, foregoing extraction of the sound data, receiving VAS response data that is indicative of a given VAS response corresponding to a given voice input identified in the sound data extracted by the second device, and based on the VAS response data, output the given VAS response.
Adjusting volume levels
In general, user interfaces for controlling a plurality of multimedia players in groups are disclosed. According to one aspect of the present invention, a user interface is provided to allow a user to group some of the players according to a theme or scene, where each of the players is located in a zone. When the scene is activated, the players in the scene react in a synchronized manner. For example, the players in the scene are all caused to play a multimedia source or music in a playlist, wherein the multimedia source may be located anywhere on a network. The user interface is further configured to illustrate graphically a size of a group, the larger the group appears relatively, the more plays there are in the group.
Adjusting volume levels
In general, user interfaces for controlling a plurality of multimedia players in groups are disclosed. According to one aspect of the present invention, a user interface is provided to allow a user to group some of the players according to a theme or scene, where each of the players is located in a zone. When the scene is activated, the players in the scene react in a synchronized manner. For example, the players in the scene are all caused to play a multimedia source or music in a playlist, wherein the multimedia source may be located anywhere on a network. The user interface is further configured to illustrate graphically a size of a group, the larger the group appears relatively, the more plays there are in the group.
Sound wave dead spot generation
A speaker system uses destructive wave interference to generate “dead spots” with respect to an audio presentation. The signal for the dead spot generating device can be an inverted signal generated using the audio signal. In one embodiment, the inverted signal is generated using the audio signal, an indication of loudness at one or more active speakers, and a determination of the characteristics of the sound path from the one or more active speakers (including delay and attenuation).
Sound wave dead spot generation
A speaker system uses destructive wave interference to generate “dead spots” with respect to an audio presentation. The signal for the dead spot generating device can be an inverted signal generated using the audio signal. In one embodiment, the inverted signal is generated using the audio signal, an indication of loudness at one or more active speakers, and a determination of the characteristics of the sound path from the one or more active speakers (including delay and attenuation).