Patent classifications
G06F3/167
Method and apparatus for evaluating user intention understanding satisfaction, electronic device and storage medium
A method and apparatus for generating a user intention understanding satisfaction evaluation model, a method and apparatus for evaluating a user intention understanding satisfaction, an electronic device and a storage medium are provided, relating to intelligent voice recognition and knowledge graphs. The method for generating a user intention understanding satisfaction evaluation model is: acquiring a plurality of sets of intention understanding data, at least one set of which comprises a plurality of sequences corresponding to multi-round behaviors of an intelligent device in multi-round man-machine interactions; and learning the plurality of sets of intention understanding data through a first machine learning model, to obtain the user intention understanding satisfaction evaluation model after the learning, wherein the user intention understanding satisfaction evaluation model is configured to evaluate user intention understanding satisfactions of the intelligent device in the multi-round man-machine interactions according to the plurality of sequences corresponding to the multi-round man-machine interactions.
Smart audio system capable of determining speaker type and position
There is provided a smart audio system including multiple audio devices and a central server. The central server confirms a model of every audio device and a position thereof in an operation area in a scan mode. The central server confirms a user position or a user state to accordingly control output power of a speaker of each of the multiple audio devices in an operation mode.
Quiet hours for notifications
In some implementations, a computing device can be configured to automatically turn off notifications when generating a notification would cause a disturbance or be unwanted by a user. The device can be configured with quiet hours during which notifications that would otherwise be generated by the computing device can be suppressed. In some implementations, quiet hours can be configured as a time period with a start time and an end time. In some implementations, quiet hours can be derived from application data. For example, calendar data, alarm clock data, map data, etc. can be used to determine when quiet hours should be enforced. In some implementations, the device can be configured with exceptions to quiet hour notification suppression. In some implementations, the user can identify contacts to which the quiet hours notification suppression should not be applied.
Modification of audio-based computer program output
Modifying computer program output in a voice or non-text input activated environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify a computer program to invoke. The computer program can identify a dialog data structure. The system can modify the identified dialog data structure to include a content item. The system can provide the modified dialog data structure to a computing device for presentation.
Head orientation training devices
Described herein are head orientation training devices for sports training and systems including such devices. A head orientation training device (HOTD) can include a position sensor configured to sense a tilt of the device, continually, while the device is activated. The HOTD can also include a computing device, configured to retrieve the tilt of the device from the position sensor, repeatedly, while the apparatus is activated. Also, the computing device of the HOTD can be configured to activate an alarm (such an audible or a visual alert) when the tilt of the apparatus exceeds a tilt threshold. The tilt threshold can be user definable as well.
Photo album management method, storage medium and electronic device
The present disclosure provides a photo album management method. The method includes obtaining voice search information from a user, performing intent recognition on the voice search information to obtain an intent recognition result which indicates an intent of the user for a photo album, obtaining a voiceprint feature from the voice search information to determine identity information of the user, sending the intent recognition result and the identity information of the user, and opening the photo album according to the intent recognition result and the identity information.
In-vehicle speech processing apparatus
An in-vehicle apparatus is connectable to a device that includes a voice assistant function. The in-vehicle apparatus includes: a voice detector that performs voice recognition of an audio signal input from a microphone and that controls functions of the in-vehicle apparatus based on a result of the voice recognition; and an interface that communicates with the device. When being informed of a detection of a predetermined word in the audio signal as the result of the voice recognition of the audio signal performed by the voice detector, the interface sends to the device, not via the voice detector, the audio signal input from the microphone. The predetermined word is for activating the voice assistant function of the device.
Medical query answering apparatus
The application relates to providing medical information in response to a query, in particular, although not exclusively, to a method of querying a medical database using voice recognition. According to an embodiment, there is provided a medical query answering apparatus that is configured to convert detected audio signals into a natural language data stream; determine from the natural language data stream whether a voice command for information relating to a medical device or a medicament has been issued by a user; in response to a positive determination, convert at least part of the natural language data stream into a query for querying a medical database; supply the query to the medical database and receive a response therefrom; and provide an audio output based at least in part on the response from the medical database.
Methods and systems for displaying virtual objects from an augmented reality environment on a multimedia device
Methods and systems are disclosed for displaying an augmented reality virtual object on a multimedia device. One method comprises detecting, in an augmented reality environment displayed using a first device, a virtual object; detecting, within the augmented reality environment, a second device, the second device comprising a physical multimedia device; and generating, at the second device, a display comprising a representation of the virtual object.
Speaker based anaphora resolution
A speech-processing system configured to determine entities corresponding to ambiguous words such as anaphora (“he,” “she,” “they,” etc.) included in an utterance. The system may associate incoming utterances with a speaker identification (ID), device ID, and other data. The system then tracks entities referred to in utterances so that if a later utterance includes an ambiguous entity reference, the system may take the speaker ID, device ID, etc. from the ambiguous reference, along with the text of the utterance and other data, and compare that information to previously mentioned entities (or other entities that may be relevant) to identify the entity mentioned in the ambiguous statement. Once the entity is determined, the system may then complete command processing of the utterance using the identified entity.