G06F3/16

Alias-based access of entity information over voice-enabled digital assistants

In one embodiment, a domain-name based framework implemented in a digital assistant ecosystem uses domain names as unique identifiers for request types, requesting entities, responders, and target entities embedded in a natural language request. Further, the framework enables interpreting natural language requests according to domain ontologies associated with different responders. A domain ontology operates as a keyword dictionary for a given responder and defines the keywords and corresponding allowable values to be used for request types and request parameters. The domain-name based framework thus enables the digital assistant to interact with any responder that supports a domain ontology to generate precise and complete responses to natural language based requests.

Method and device for recognizing speech in vehicle

The present disclosure relates to a method and a device for recognizing speech in a vehicle. The method for recognizing the speech in the vehicle may include collecting one or more types of information, determining information to be linked with each other for speech recognition based on an information processing priority predefined corresponding to each type of the collected information, analyzing the determined information to perform the speech recognition for a signal input through a microphone, and extracting at least one of a wake up voice or a command voice through the speech recognition to control the vehicle. Therefore, the present disclosure has an advantage of more accurately performing the speech recognition by linking collected various information in the vehicle with each other.

Method and device for recognizing speech in vehicle

The present disclosure relates to a method and a device for recognizing speech in a vehicle. The method for recognizing the speech in the vehicle may include collecting one or more types of information, determining information to be linked with each other for speech recognition based on an information processing priority predefined corresponding to each type of the collected information, analyzing the determined information to perform the speech recognition for a signal input through a microphone, and extracting at least one of a wake up voice or a command voice through the speech recognition to control the vehicle. Therefore, the present disclosure has an advantage of more accurately performing the speech recognition by linking collected various information in the vehicle with each other.

Intelligent media queue

Systems, methods, and non-transitory computer-readable storage media for intelligently managing a playlist of digital media provide an intelligent dynamic queue that is configured to manage the playback of digital media. The queue can transition between passive playback mode, active playback mode, and mixed playback mode. The queue can handle the playback of the songs in the queue according to the playback mode and/or a queue status field that is associated with each song in the queue.

Drainage channel for a submersible camera with drainage ports on two surfaces
11579515 · 2023-02-14 · ·

A camera system with six faces and a front housing is configured to capture images and audio content from external the camera body. The camera system includes an interior audio assembly protected from external environments by a waterproof membrane. The camera system includes drainage ports on the bottom face and the left face of the camera system to encourage moisture to drain from the system. A first drainage channel couples the internal audio assembly to the first drainage port on left face of the camera system and a second drainage channel couples the drainage port on the left face of the camera system to the drainage port on the bottom face of the camera system. A third drainage channel exists between the front face of the camera system and the front housing, the third drainage channel coupling the first and second drainage channels.

Drainage channel for a submersible camera with drainage ports on two surfaces
11579515 · 2023-02-14 · ·

A camera system with six faces and a front housing is configured to capture images and audio content from external the camera body. The camera system includes an interior audio assembly protected from external environments by a waterproof membrane. The camera system includes drainage ports on the bottom face and the left face of the camera system to encourage moisture to drain from the system. A first drainage channel couples the internal audio assembly to the first drainage port on left face of the camera system and a second drainage channel couples the drainage port on the left face of the camera system to the drainage port on the bottom face of the camera system. A third drainage channel exists between the front face of the camera system and the front housing, the third drainage channel coupling the first and second drainage channels.

Password-based authorization for audio rendering

A method and device for processing one or more audio streams based on password-based privacy restrictions is described. A device may be configured to receive unrestricted audio streams of the one or more audio streams based on privacy restrictions associated with a password, wherein the one or more audio streams are from audio elements represented in an acoustic environment that comprises one or more sub-acoustic spaces, each of the one or more audio streams representative of a respective soundfield, and generate the respective soundfields of the unrestricted audio streams.

Video-based competition platform

A video-based competition platform supports video-based competitions between possibly geographically distributed competitors. The video-based competition platform enables users of electronic communication devices to create, compete, view, and vote in video-based competitions. In at least some embodiments, a video-based competition is presented to a user with two or more video clips played in conjunction. The video clips may be synchronized to a time base and/or common audio clip.

SELECTIVELY ACTIVATING ON-DEVICE SPEECH RECOGNITION, AND USING RECOGNIZED TEXT IN SELECTIVELY ACTIVATING ON-DEVICE NLU AND/OR ON-DEVICE FULFILLMENT

Implementations can reduce the time required to obtain responses from an automated assistant by, for example, obviating the need to provide an explicit invocation to the automated assistant, such as by saying a hot-word/phrase or performing a specific user input, prior to speaking a command or query. In addition, the automated assistant can optionally receive, understand, and/or respond to the command or query without communicating with a server, thereby further reducing the time in which a response can be provided. Implementations only selectively initiate on-device speech recognition responsive to determining one or more condition(s) are satisfied. Further, in some implementations, on-device NLU, on-device fulfillment, and/or resulting execution occur only responsive to determining, based on recognized text form the on-device speech recognition, that such further processing should occur. Thus, through selective activation of on-device speech processing, and/or selective activation of on-device NLU and/or on-device fulfillment, various client device resources are conserved.

DETECTION OF ENVIRONMENT FOR VOICE CONTROL OF MOTION TRACKING SYSTEM

A method for allowing or disallowing control of a motion tracking system by means of voice, comprising: digitally processing sound detected by each microphone of the at least one microphone; digitally computing SNR by computing both first energy of a voice signal in the detected sound and second energy of noise in the detected sound; digitally processing electromagnetic waves captured by each antenna of the at least one antenna so as to detect data packets transmitted to the computing apparatus by each sensor of the plurality of sensors, each data packet including RSSI of a respective sensor; digitally computing distance between each sensor and the computing apparatus based on the RSSI of the data packets received from the respective sensor; digitally computing a percentage of sensors of the plurality of sensors having at least one of: a distance exceeding a predetermined maximum distance threshold, and a change in distance exceeding a predetermined maximum changing distance threshold; and digitally setting allowance or disallowance of voice control based on both the SNR computed and the percentage of sensors computed.