G10L2015/228

APPARATUS FOR PROCESSING USER COMMANDS AND OPERATION METHOD THEREOF
20220383873 · 2022-12-01

Methods and devices are provided in which one or more identifiers are registered based on a user setting made by a user of an electronic device. Each of the one or more identifiers corresponds to at least one activated service module. A registered identifier, of the one or more identifiers, is extracted from a user command input through an input device. The user command is changed using a basic identifier preset for a first service module corresponding to the registered identifier. The changed user command is transmitted to a server configured to control execution of the first service module. A result of executing the changed user command based on the first service module is received from the server.
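The identifier substitution described in this abstract can be sketched as follows. This is a minimal illustration, not the claimed implementation: the registry contents, the module names, and the function `rewrite_command` are all hypothetical, and the step that transmits the changed command to a server is left out.

```python
import re

# Hypothetical registry built from user settings:
# user-chosen identifier -> activated service module.
REGISTERED = {"jarvis": "music", "chef": "recipes"}

# Hypothetical preset ("basic") identifier for each service module.
BASIC = {"music": "MusicService", "recipes": "RecipeService"}


def rewrite_command(command):
    """Swap the first registered identifier found for its module's basic identifier.

    Returns (changed_command, module), or (command, None) when the command
    contains no registered identifier.
    """
    for ident, module in REGISTERED.items():
        pattern = re.compile(rf"\b{re.escape(ident)}\b", re.IGNORECASE)
        if pattern.search(command):
            return pattern.sub(BASIC[module], command, count=1), module
    return command, None


if __name__ == "__main__":
    print(rewrite_command("Jarvis, play some jazz"))
```

Matching on word boundaries keeps an identifier such as "chef" from firing inside a longer word.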

NATURAL LANGUAGE INTERFACES

It is not trivial to implement speech and natural language processing in offline embedded systems. Voice control of devices in various settings and applications can benefit from an embedded speech and natural language processing solution. One feature that helps to correct automatic speech recognition outputs is grammar projection. Another feature addresses situations where information is imperfect or incomplete by providing an application programming interface that enables structured queries and responses between an interpreter and an application.
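The abstract does not define grammar projection. One plausible reading is that a recognizer hypothesis is mapped onto the closest sentence the grammar accepts. The sketch below illustrates that reading only, using Python's standard `difflib` as a stand-in for whatever matching the system actually uses; the phrase list and the function `project_onto_grammar` are hypothetical.

```python
import difflib

# Hypothetical closed set of sentences accepted by the grammar.
GRAMMAR_PHRASES = [
    "turn on the light",
    "turn off the light",
    "set temperature to twenty",
]


def project_onto_grammar(asr_text, phrases=GRAMMAR_PHRASES, cutoff=0.6):
    """Return the grammar phrase closest to the ASR output, or None.

    cutoff is the minimum similarity ratio (0..1) accepted as a match;
    hypotheses below it are treated as out of grammar.
    """
    normalized = asr_text.lower().strip()
    matches = difflib.get_close_matches(normalized, phrases, n=1, cutoff=cutoff)
    return matches[0] if matches else None


if __name__ == "__main__":
    print(project_onto_grammar("set temprature to twenty"))
    print(project_onto_grammar("order a pizza"))
```

Returning `None` for a poor match gives the application a chance to ask a follow-up question instead of acting on a wrong correction.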

SYSTEM AND METHOD FOR FEDERATED, CONTEXT-SENSITIVE, ACOUSTIC MODEL REFINEMENT
20220383859 · 2022-12-01

A system and method for federated, context-sensitive, acoustic model refinement comprise a federated language model server and a plurality of edge devices. The federated language model server may comprise one or more machine learning models trained and developed centrally on the server, and may distribute these models to edge devices, where they may be operated locally. The edge devices may gather or generate context data that can be used by a speech recognition engine, and the local language models contained therein, to develop adaptive, context-sensitive, user-specific language models. Periodically, the federated language model server may select a subset of edge devices from which to receive uploaded local model parameters. These parameters may be aggregated to perform central model updates, and the updated model parameters may then be sent back to the edge devices to update the local model parameters.
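The select, upload, and aggregate cycle can be sketched as below. The abstract does not say how parameters are aggregated; this sketch assumes a weighted average in the style of federated averaging, with weights given by the number of local examples. The function names and the flat list-of-floats parameter format are hypothetical.

```python
import random


def select_devices(device_ids, k, seed=None):
    """Pick k edge devices at random to upload their local parameters."""
    rng = random.Random(seed)
    return rng.sample(list(device_ids), k)


def aggregate(updates):
    """Weighted average of uploaded parameter vectors.

    updates is a list of (params, num_examples) pairs, where params is a
    list of floats and all vectors have the same length.
    Assumption: weighting by example count, as in federated averaging.
    """
    total = sum(n for _, n in updates)
    dim = len(updates[0][0])
    return [sum(p[i] * n for p, n in updates) / total for i in range(dim)]


if __name__ == "__main__":
    chosen = select_devices(["edge-1", "edge-2", "edge-3", "edge-4"], k=2, seed=0)
    print(chosen)
    # The aggregated vector would then be sent back to the edge devices.
    print(aggregate([([1.0, 2.0], 1), ([3.0, 4.0], 3)]))
```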

ENABLING NATURAL LANGUAGE INTERACTIONS WITH USER INTERFACES FOR USERS OF A SOFTWARE APPLICATION
20220383869 · 2022-12-01

A user specifies a natural language command to a device. Software on the device generates contextual metadata about the user interface of the device, such as data about all visible elements of the user interface, and sends the contextual metadata along with the natural language command to a natural language understanding engine. The natural language understanding engine parses the natural language command using a stored grammar (e.g., a grammar provided by a maker of the device), identifies information about the command as a result of the parsing (e.g., the user interface elements referenced by the command), and provides that information to the device. The device uses the provided information to respond to the command.
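As a rough illustration of how contextual metadata can resolve a command to a user interface element, the sketch below matches the labels of visible elements against the command text. The tiny "grammar" (a set of action verbs followed by an element label), the metadata shape, and the function `parse_command` are hypothetical stand-ins for the stored grammar the abstract mentions.

```python
# Hypothetical action verbs recognized by the toy grammar.
ACTIONS = {"click", "tap", "open", "select"}


def parse_command(command, visible_elements):
    """Resolve a command against the currently visible UI elements.

    visible_elements is a list of dicts with "id" and "label" keys
    (hypothetical metadata format). Returns a dict naming the action
    and the referenced element, or None if nothing matches.
    """
    words = command.lower().split()
    action = next((w for w in words if w in ACTIONS), None)
    text = " ".join(words)
    matches = [e for e in visible_elements if e["label"].lower() in text]
    if action is None or not matches:
        return None
    # Prefer the longest label so "Save as draft" beats "Save".
    target = max(matches, key=lambda e: len(e["label"]))
    return {"action": action, "element_id": target["id"]}


if __name__ == "__main__":
    elements = [
        {"id": "btn-1", "label": "Save"},
        {"id": "btn-2", "label": "Save as draft"},
    ]
    print(parse_command("Click save as draft", elements))
```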

Third party account linking for voice user interface

Methods and systems for adding functionality to an account of a language processing system where the functionality is associated with a second account of a first application system are described herein. In a non-limiting embodiment, an individual may log into a first account of a language processing system and log into a second account of a first application system. While logged into both the first account and the second account, a button included within a webpage provided by the first application may be invoked. A request capable of being serviced using the first functionality may be received by the language processing system from a device associated with the first account. The language processing system may send first account data and the second account data to the first application system to facilitate an action associated with the request, thereby enabling the first functionality for the first account.

Information processing device and information processing method
11514903 · 2022-11-29

The present technology relates to an information processing device and an information processing method that make it possible to generate interaction data at lower cost. The information processing device includes a processor that generates, on the basis of interaction history information, a coupling context to be coupled to a context of interest among a plurality of contexts. The present technology is applicable as a server-side service of a voice interaction system, for example.

Method, device, and system of selectively using multiple voice data receiving devices for intelligent service

An electronic device is provided that includes a user interface, at least one communication module, a microphone, at least one speaker, at least one processor operatively connected with the user interface, the at least one communication module, the microphone, and the at least one speaker, and at least one memory operatively connected with the at least one processor. The at least one memory stores instructions that, when executed while the electronic device is connected, by wire or wirelessly, with an access point (AP) connected with at least one external electronic device, instruct the at least one processor to: (1) after receiving, through the microphone, part of a wake-up utterance to invoke a voice-based intelligent assistant service, broadcast identification information about the electronic device and receive identification information broadcast from the external electronic device; (2) after receiving the whole wake-up utterance through the microphone, individually transmit first information related to the wake-up utterance received through the microphone to the at least one external electronic device, and individually receive, from the external electronic device, second information related to the wake-up utterance received by the at least one external electronic device; and (3) determine, based on at least part of the first information and the second information, whether to transmit voice information received after the wake-up utterance to an external server. Other various embodiments are possible as well.
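The final decision step, in which each device compares its own wake-up information with what its peers reported, might look like the sketch below. The abstract does not say what the exchanged information contains; this sketch assumes a single numeric quality score per device (for example, a signal-to-noise estimate) and a tie-break on device identifier. Both are assumptions, as is the function name.

```python
def should_transmit(own_id, own_score, peer_scores):
    """Decide whether this device forwards the follow-up voice data.

    own_score stands in for the "first information" and peer_scores
    (device id -> score) for the "second information" received from
    external devices. Assumption: the highest score wins, and ties go
    to the largest device id so every device reaches the same verdict.
    """
    candidates = dict(peer_scores)
    candidates[own_id] = own_score
    winner = max(candidates, key=lambda d: (candidates[d], d))
    return winner == own_id


if __name__ == "__main__":
    print(should_transmit("speaker-kitchen", 0.9, {"tv-livingroom": 0.4}))
```

Because every device runs the same deterministic rule on the same exchanged data, exactly one of them ends up sending audio to the server.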

Voice command recognition device and method thereof

A voice command recognition device and a method thereof are provided. The voice command recognition device includes a processor and a storage. The processor analyzes one or more voice commands repeatedly used by a user, or a voice command utterance pattern of the user, and registers the voice commands selected by that analysis to generate one package command. The storage stores data or an algorithm used by the processor for speech recognition.
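One simple way to find commands worth bundling into a package command is to count how often the same sequence of commands recurs in the user's history. The sketch below does that with a sliding window; the window length, the repetition threshold, and the function `propose_package` are hypothetical choices, not details from the abstract.

```python
from collections import Counter


def propose_package(history, length=2, min_count=3):
    """Suggest a package command from repeated command sequences.

    history is the user's voice commands in the order they were spoken.
    Returns the most frequent run of `length` consecutive commands if it
    occurred at least `min_count` times, otherwise None.
    """
    windows = Counter(
        tuple(history[i:i + length])
        for i in range(len(history) - length + 1)
    )
    if not windows:
        return None
    sequence, count = windows.most_common(1)[0]
    return list(sequence) if count >= min_count else None


if __name__ == "__main__":
    spoken = [
        "lights on", "play music",
        "lights on", "play music",
        "weather",
        "lights on", "play music",
    ]
    print(propose_package(spoken))
```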

INTELLIGENT VOICE RECOGNITION METHOD AND APPARATUS
20220375469 · 2022-11-24

An intelligent voice recognition method and apparatus are disclosed. An intelligent voice recognition apparatus according to one embodiment of the present invention recognizes speech of the user and outputs a response determined on the basis of the speech, wherein, when a plurality of candidate responses related to the speech exist, the response is determined from among the plurality of candidate responses on the basis of device state information about the voice recognition apparatus, and thus ambiguity in a conversation between a user and the voice recognition apparatus can be reduced so that more natural conversation processing is possible. The intelligent voice recognition apparatus and/or an artificial intelligence (AI) apparatus of the present invention can be associated with an AI module, a drone (an unmanned aerial vehicle (UAV)), a robot, an augmented reality (AR) device, a virtual reality (VR) device, a device related to a 5G service, and the like.
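To illustrate how device state can settle an ambiguous utterance, the sketch below scores each candidate response by how many of its preconditions hold in the current state and returns the best match. The candidate format, the state keys, and the scoring rule are hypothetical; the abstract only says that the response is chosen on the basis of device state information.

```python
def choose_response(candidates, device_state):
    """Pick the candidate whose conditions best match the device state.

    Each candidate is a dict with a "response" string and a "when" dict
    of state conditions (hypothetical format). On a tie, the candidate
    listed first wins.
    """
    def score(candidate):
        return sum(
            1 for key, value in candidate["when"].items()
            if device_state.get(key) == value
        )
    return max(candidates, key=score)["response"]


if __name__ == "__main__":
    # Two readings of the ambiguous utterance "turn it up".
    options = [
        {"response": "Raising the volume.", "when": {"media_playing": True}},
        {"response": "Raising the brightness.",
         "when": {"media_playing": False, "screen_on": True}},
    ]
    print(choose_response(options, {"media_playing": True, "screen_on": False}))
```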

ELECTRONIC DEVICE FOR TRANSLATING VOICE OR TEXT AND METHOD THEREOF
20220374615 · 2022-11-24

An electronic device is provided. The electronic device includes an input unit configured to receive a voice or text, an output unit, and a processor. The processor is configured to determine context information, translate the received voice or text based on the context information, convert the translated voice or text, and output the converted voice or text using the output unit.