G10L15/30

Systems and methods for voice identification and analysis
11580986 · 2023-02-14 ·

Obtaining configuration audio data including voice information for a plurality of meeting participants. Generating localization information indicating a respective location for each meeting participant. Generating a respective voiceprint for each meeting participant. Obtaining meeting audio data. Identifying a first meeting participant and a second meeting participant. Linking a first meeting participant identifier of the first meeting participant with a first segment of the meeting audio data. Linking a second meeting participant identifier of the second meeting participant with a second segment of the meeting audio data. Generating a GUI indicating the respective locations of the first and second meeting participants, and the GUI indicating a first transcription of the first segment and a second transcription of the second segment. The first transcription is associated with the first meeting participant in the GUI, and the second transcription is associated with the second meeting participant in the GUI.

Artificial intelligence device and method of operating artificial intelligence device
11580969 · 2023-02-14 · ·

An artificial intelligence device includes a microphone configured to receive a speech command, a speaker, a communication unit configured to perform communication with an external artificial intelligence device, and a processor configured to receive a wake-up command through the microphone, acquire a first speech quality level of the received wake-up command, receive a second speech quality level of the wake-up command input to the external artificial intelligence device from the external artificial intelligence device through the communication unit, output a notification indicating that the artificial intelligence device is selected as an object to be controlled through the speaker, when the first speech quality level is larger than the second speech quality level, receive an operation command through the microphone, acquire an intention of the received operation command and transmit the operation command to an external artificial intelligence device which will perform operation corresponding to the operation command according to the acquired intention through the communication unit.

Artificial intelligence device and method of operating artificial intelligence device
11580969 · 2023-02-14 · ·

An artificial intelligence device includes a microphone configured to receive a speech command, a speaker, a communication unit configured to perform communication with an external artificial intelligence device, and a processor configured to receive a wake-up command through the microphone, acquire a first speech quality level of the received wake-up command, receive a second speech quality level of the wake-up command input to the external artificial intelligence device from the external artificial intelligence device through the communication unit, output a notification indicating that the artificial intelligence device is selected as an object to be controlled through the speaker, when the first speech quality level is larger than the second speech quality level, receive an operation command through the microphone, acquire an intention of the received operation command and transmit the operation command to an external artificial intelligence device which will perform operation corresponding to the operation command according to the acquired intention through the communication unit.

Configurable conversation engine for executing customizable chatbots

A conversation engine performs conversations with users using chatbots customized for performing a set of tasks that can be performed using an online system. The conversation engine loads a chatbot configuration that specifies the behavior of a chatbot including the tasks that can be performed by the chatbot, the types of entities relevant to each task, and so on. The conversation may be voice based and use natural language. The conversation engine may load different chatbot configurations to implement different chatbots. The conversation engine receives a conversation engine configuration that specifies the behavior of the conversation engine across chatbots. The system may be a multi-tenant system that allows customization of the chatbots for each tenant.

Configurable conversation engine for executing customizable chatbots

A conversation engine performs conversations with users using chatbots customized for performing a set of tasks that can be performed using an online system. The conversation engine loads a chatbot configuration that specifies the behavior of a chatbot including the tasks that can be performed by the chatbot, the types of entities relevant to each task, and so on. The conversation may be voice based and use natural language. The conversation engine may load different chatbot configurations to implement different chatbots. The conversation engine receives a conversation engine configuration that specifies the behavior of the conversation engine across chatbots. The system may be a multi-tenant system that allows customization of the chatbots for each tenant.

User-specific acoustic models

Systems and processes for providing user-specific acoustic models are provided. In accordance with one example, a method includes, at an electronic device having one or more processors, receiving a plurality of speech inputs, each of the speech inputs associated with a same user of the electronic device; providing each of the plurality of speech inputs to a user-independent acoustic model, the user-independent acoustic model providing a plurality of speech results based on the plurality of speech inputs; initiating a user-specific acoustic model on the electronic device; and adjusting the user-specific acoustic model based on the plurality of speech inputs and the plurality of speech results.

User-specific acoustic models

Systems and processes for providing user-specific acoustic models are provided. In accordance with one example, a method includes, at an electronic device having one or more processors, receiving a plurality of speech inputs, each of the speech inputs associated with a same user of the electronic device; providing each of the plurality of speech inputs to a user-independent acoustic model, the user-independent acoustic model providing a plurality of speech results based on the plurality of speech inputs; initiating a user-specific acoustic model on the electronic device; and adjusting the user-specific acoustic model based on the plurality of speech inputs and the plurality of speech results.

Electronic apparatus and control method thereof
11580988 · 2023-02-14 · ·

Disclosed is an electronic apparatus. The electronic apparatus includes a first communicator, a second communicator; and a processor configured to determine whether or not an external electronic apparatus outputting input speech is connectable to a network connectable through the first communicator, based on the input speech, and to transmit a signal for controlling the external electronic apparatus to the external electronic apparatus through the first communicator or the second communicator depending on whether or not the external electronic apparatus is connectable to the network.

Electronic apparatus and control method thereof
11580988 · 2023-02-14 · ·

Disclosed is an electronic apparatus. The electronic apparatus includes a first communicator, a second communicator; and a processor configured to determine whether or not an external electronic apparatus outputting input speech is connectable to a network connectable through the first communicator, based on the input speech, and to transmit a signal for controlling the external electronic apparatus to the external electronic apparatus through the first communicator or the second communicator depending on whether or not the external electronic apparatus is connectable to the network.

Methods and systems for pushing audiovisual playlist based on text-attentional convolutional neural network
11580979 · 2023-02-14 · ·

In some embodiments, methods and systems for pushing audiovisual playlists based on a text-attentional convolutional neural network include a local voice interactive terminal, a dialog system server and a playlist recommendation engine, where the dialog system server and the playlist recommendation engine are respectively connected to the local voice interactive terminal. In some embodiments, the local voice interactive terminal includes a microphone array, a host computer connected to the microphone array, and a voice synthesis chip board connected to the microphone array. In some embodiments, the playlist recommendation engine obtains rating data based on a rating predictor constructed by the neural network; the host computer parses the data into recommended playlist information; and the voice terminal synthesizes the results and pushes them to a user in the form of voice.