Patent classifications
G10L17/00
Matching Active Speaker Pose Between Two Cameras
Described are multiple cameras in a conference room, each pointed in a different direction. A primary camera includes a microphone array to perform sound source localization (SSL). The SSL is used in combination with a video image to identify the speaker from among multiple individuals that appear in the video image. Pose information of the speaker is developed. Pose information of each individual identified in each other camera is developed. The speaker pose information is compared to the pose information of the individuals from the other cameras. The best match for each other camera is selected as the speaker in that camera. The speaker views of each camera are compared to determine the speaker view with the most frontal view of the speaker. That camera is selected to provide the video sent to the far end.
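The matching and selection steps described above can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say how poses are represented or compared, so the `Pose` class, the view-independent `features` descriptor, the Euclidean distance metric, and the `yaw_deg` measure of frontality are all hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Dict, Sequence, Tuple


@dataclass
class Pose:
    # Hypothetical view-independent pose descriptor (e.g. joint angles).
    features: Tuple[float, ...]
    # Hypothetical head yaw relative to the camera, in degrees; 0 means frontal.
    yaw_deg: float


def pose_distance(a: Pose, b: Pose) -> float:
    """Euclidean distance between two descriptors (assumed metric)."""
    return math.dist(a.features, b.features)


def match_speaker(speaker: Pose, candidates: Sequence[Pose]) -> int:
    """Return the index of the candidate whose pose best matches the speaker."""
    return min(range(len(candidates)),
               key=lambda i: pose_distance(speaker, candidates[i]))


def most_frontal_camera(speaker_views: Dict[str, Pose]) -> str:
    """Return the camera whose view of the speaker has the smallest absolute yaw."""
    return min(speaker_views, key=lambda cam: abs(speaker_views[cam].yaw_deg))


# Speaker pose from the primary camera (located via SSL in the abstract).
speaker = Pose(features=(0.1, 0.9, 0.4), yaw_deg=30.0)
# Individuals detected by a second camera.
others = [Pose((0.8, 0.2, 0.1), -10.0), Pose((0.12, 0.88, 0.41), 5.0)]

best = match_speaker(speaker, others)
chosen = most_frontal_camera({"primary": speaker, "secondary": others[best]})
```

Here the second individual is the closest pose match, and the secondary camera is chosen because its view of the speaker has the smaller yaw.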
DIARISATION AUGMENTED REALITY AIDE
An image of a real-world environment including one or more users is received from an image capture device. A mask status of a first user is determined by a processor based on the image. A stream of audio including speech from one or more users is captured from one or more audio transceivers. A first user speech is identified by the processor from the stream of audio. The stream of audio is parsed by the processor, based on the first user speech and on an audio processing technique, to create a first user speech element. An augmented view that includes the first user speech element is generated for a wearable computing device, based on the first user speech and on the mask status.
Multi-user devices in a connected home environment
A device implementing a system for sharing a voice profile includes a processor configured to receive a request to share a first voice profile corresponding to a first user account associated with a first device, with a second device associated with a second user account, the second device being voice-enabled, the first voice profile being stored on a first data store associated with the first user account. The processor is further configured to update a second data store associated with the second user account to include a reference to the first voice profile stored on the first data store, and to send, to the second device, a notification that the second data store has been updated to include the reference to the first voice profile.
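A minimal sketch of the sharing flow, assuming simple in-memory data stores: the second store receives only a reference to the profile, which stays in the first store. The `DataStore` class, the `(owner, profile_id)` reference format, and the `notify` callback are hypothetical names chosen for illustration, not taken from the patent.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

ProfileRef = Tuple[str, str]  # (owning account, profile id); assumed format


@dataclass
class DataStore:
    owner: str
    profiles: Dict[str, bytes] = field(default_factory=dict)     # profiles stored here
    shared_refs: List[ProfileRef] = field(default_factory=list)  # references to others' profiles


def share_voice_profile(first: DataStore, second: DataStore, profile_id: str,
                        notify: Callable[[str], None]) -> ProfileRef:
    """Add a reference to first's profile in second's store, then notify the second device."""
    if profile_id not in first.profiles:
        raise KeyError(f"unknown profile: {profile_id}")
    ref = (first.owner, profile_id)
    if ref not in second.shared_refs:
        second.shared_refs.append(ref)  # reference only; the profile data is not copied
    notify(f"data store of {second.owner} now references {ref}")
    return ref


alice = DataStore("alice", profiles={"voice-1": b"embedding-bytes"})
bob = DataStore("bob")
messages: List[str] = []
ref = share_voice_profile(alice, bob, "voice-1", messages.append)
```

Storing a reference rather than a copy keeps a single authoritative profile under the first user account.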
User authentication as a service
Systems, methods, and devices for adaptably authenticating a user are disclosed. A device captures a user input, and sends data corresponding thereto to a system. The system determines natural language understanding (NLU) results representing the user input. A user authentication component of the system receives the NLU results and determines a skill configured to perform an action responsive to the user input. The user authentication component adaptably performs user authentication based on a user authentication condition associated with the skill. If the user can be authenticated to the satisfaction of the condition, the NLU results data are sent to the skill, along with an indicator representing the user was authenticated by the system.
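The skill-dependent authentication step can be illustrated as below. This is a sketch under assumptions: the abstract does not define what a "user authentication condition" is, so it is modeled here as a minimum confidence score per skill. The intent-to-skill table, the thresholds, and the payload layout are all hypothetical.

```python
from typing import Dict, Optional

# Hypothetical mapping from NLU intent to the skill that handles it.
INTENT_TO_SKILL: Dict[str, str] = {
    "check_balance": "banking",
    "get_forecast": "weather",
}

# Hypothetical per-skill conditions, modeled as a minimum authentication confidence.
AUTH_CONDITIONS: Dict[str, float] = {
    "banking": 0.95,
    "weather": 0.0,
}


def dispatch(nlu_intent: str, auth_confidence: float) -> Optional[dict]:
    """Return the payload for the skill if its condition is met, else None."""
    skill = INTENT_TO_SKILL.get(nlu_intent)
    if skill is None:
        return None
    # Skills without a listed condition default to the strictest threshold.
    required = AUTH_CONDITIONS.get(skill, 1.0)
    if auth_confidence < required:
        return None
    return {"skill": skill, "nlu": nlu_intent, "user_authenticated": True}


low = dispatch("check_balance", 0.80)    # banking condition not met
high = dispatch("check_balance", 0.97)   # banking condition met
open_skill = dispatch("get_forecast", 0.10)
```

Keeping the condition with the skill lets a low-risk skill accept a weakly identified user while a sensitive skill demands stronger evidence.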
MAN-MACHINE DIALOGUE MODE SWITCHING METHOD
The present disclosure discloses a man-machine dialogue mode switching method, which is applicable to an electronic device. The method includes receiving a current user sentence spoken by a current user; determining whether a dialogue field to which the current user sentence belongs is a preset dialogue field; if yes, switching the current dialogue mode to a full-duplex dialogue mode; and if not, switching the current dialogue mode to a half-duplex dialogue mode. In the present disclosure, the dialogue mode is switched by determining whether the dialogue field to which the current user sentence belongs is the preset dialogue field, and the dialogue mode can be automatically switched and adjusted according to the dialogue field, such that the man-machine dialogue always uses the most suitable dialogue mode and proceeds smoothly.
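The switching rule can be sketched in a few lines. The set of preset dialogue fields and the keyword-based `classify_field` function are assumptions made for illustration; the abstract does not say how the dialogue field of a sentence is determined.

```python
from enum import Enum
from typing import FrozenSet


class DialogueMode(Enum):
    FULL_DUPLEX = "full-duplex"
    HALF_DUPLEX = "half-duplex"


# Hypothetical preset dialogue fields that warrant full-duplex interaction.
PRESET_FIELDS: FrozenSet[str] = frozenset({"navigation", "music"})

# Hypothetical keyword table standing in for a real dialogue field classifier.
KEYWORD_TO_FIELD = {
    "navigate": "navigation",
    "route": "navigation",
    "play": "music",
    "song": "music",
}


def classify_field(sentence: str) -> str:
    """Return the dialogue field of a sentence, or 'other' if no keyword matches."""
    for word in sentence.lower().split():
        if word in KEYWORD_TO_FIELD:
            return KEYWORD_TO_FIELD[word]
    return "other"


def select_mode(sentence: str, preset_fields: FrozenSet[str] = PRESET_FIELDS) -> DialogueMode:
    """Full-duplex if the sentence belongs to a preset field, otherwise half-duplex."""
    if classify_field(sentence) in preset_fields:
        return DialogueMode.FULL_DUPLEX
    return DialogueMode.HALF_DUPLEX


mode_music = select_mode("play a song")
mode_other = select_mode("what is the weather")
```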
Situationally Aware Social Agent
A system for providing a situationally aware social agent includes processing hardware and a memory storing a software code. The processing hardware executes the software code to receive radar data and audio data, process the radar data and the audio data to obtain radar-based location data and audio-based location data each corresponding to a location of one or more user(s), and process the radar data and the audio data to obtain radar-based venue data and audio-based venue data each corresponding to an environment surrounding the user(s). The software code further determines, using the radar-based location data and the audio-based location data, the location of the user(s), determines, using the radar-based venue data and the audio-based venue data, the environment surrounding the user(s), and identifies, based on the location and the environment, an interactive expression for use by the situationally aware social agent to interact with the user(s).
Secure nonscheduled video visitation system
Described are methods and systems in which the censorship and supervision tasks normally performed by secured facility personnel are augmented or automated entirely by a Secure Nonscheduled Video Visitation System. In embodiments, the Secure Nonscheduled Video Visitation System performs voice biometrics, speech recognition, non-verbal audio classification, fingerprint and other biometric authentication, image object classification, facial recognition, body joint location determination analysis, and/or optical character recognition on the video visitation data. The Secure Nonscheduled Video Visitation System utilizes these various analysis techniques in concert to determine if all rules and regulations enforced by the jurisdiction operating the secured facility are being followed by the parties to the video visitation session.
Autonomous/semi-autonomous driving method and apparatus with trusted data collection, retention and/or sharing
An apparatus, method, and computer-readable medium associated with autonomous/semi-autonomous driving are disclosed herein. In embodiments, an apparatus for autonomous/semi-autonomous driving may comprise a management system to be disposed in an autonomous/semi-autonomous vehicle. The management system may include a reservation subsystem to receive, from a cloud server, a reservation of the autonomous or semi-autonomous vehicle for a passenger or a driver, and an access control subsystem to control access to the autonomous or semi-autonomous vehicle, the access control subsystem including a trust function to gain the trust of the passenger or driver that the passenger's or driver's data privacy requirements will be met when the passenger or driver attempts to exercise the reservation. Other embodiments may be disclosed or claimed.
Display device and method for controlling the same
A display device and a method for controlling the same are provided. The display device includes a rollable display screen, a voice acquisition unit, an identification control unit, a drive control unit and a display control unit. The voice acquisition unit is configured to acquire a first voice command. The identification control unit is configured to identify the first voice command acquired by the voice acquisition unit as a voice process command, and the voice process command includes a rolling operation command and a display drive command. The drive control unit is configured to perform an operation corresponding to the rolling operation command on the rollable display screen according to the rolling operation command. The display control unit is configured to control a display state of the rollable display screen according to the display drive command.
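The flow from a voice command to the two control units might look like the sketch below. The phrase table, the command values (`"unroll"`, `"roll_up"`, `"on"`, `"off"`), and the class names are hypothetical; the abstract states only that a voice process command includes a rolling operation command and a display drive command.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class VoiceProcessCommand:
    rolling: str   # rolling operation command, e.g. "unroll" or "roll_up" (assumed values)
    display: str   # display drive command, e.g. "on" or "off" (assumed values)


# Hypothetical phrase table standing in for the identification control unit.
PHRASES: Dict[str, VoiceProcessCommand] = {
    "open the screen": VoiceProcessCommand("unroll", "on"),
    "close the screen": VoiceProcessCommand("roll_up", "off"),
}


class RollableDisplay:
    def __init__(self) -> None:
        self.unrolled = False
        self.lit = False

    def drive(self, rolling: str) -> None:
        """Drive control unit: perform the rolling operation."""
        self.unrolled = rolling == "unroll"

    def set_display(self, display: str) -> None:
        """Display control unit: set the display state."""
        self.lit = display == "on"


def identify(text: str) -> Optional[VoiceProcessCommand]:
    """Identify a recognized utterance as a voice process command, if known."""
    return PHRASES.get(text.strip().lower())


def handle_voice(text: str, screen: RollableDisplay) -> bool:
    """Apply both parts of the command; return False for unknown utterances."""
    command = identify(text)
    if command is None:
        return False
    screen.drive(command.rolling)
    screen.set_display(command.display)
    return True


screen = RollableDisplay()
handled = handle_voice("Open the screen", screen)
```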