IPIQ

G10L25/00

Method, apparatus, device and computer-readable storage medium for voice interaction

11393490 · 2022-07-19 ·

Baidu Online Network Technology (Beijing) Co., Ltd.

According to embodiments of the present disclosure, a method, apparatus, device, and computer-readable storage medium for voice interaction are provided. The method includes determining, based on a voice feature of a received voice signal, a text corresponding to the voice signal. The method further includes determining, based on the voice feature and the text, a matching degree between a reference voice feature of an element in the text and a target voice feature of the element. The method further includes determining, based on the text, a first possibility that the voice signal is an executable command. The method further includes determining, based on the voice feature, a second possibility that the voice signal is the executable command.

Spoken language understanding system

11393456 · 2022-07-19 ·

Amazon Technologies, Inc.

A system is provided for a self-learning policy engine that can be used by various spoken language understanding (SLU) processing components. The system also provides for sharing contextual information from processing performed by an upstream SLU component to a downstream SLU component to facilitate decision making by the downstream SLU component. The system also provides for an SLU component to select from a variety of actions to take. An SLU component may implement an instance of the self-learning policy that is specifically configured for the particular SLU component.

EXPANDABLE DIALOGUE SYSTEM

20220093081 · 2022-03-24 ·

Microsoft Technology Licensing, LLC

A system that allows non-engineer administrators, without programming, machine language, or artificial intelligence system knowledge, to expand the capabilities of a dialogue system. The dialogue system may have a knowledge system, user interface, and learning model. A user interface allows non-engineers to utilize the knowledge system, defined by a small set of primitives and a simple language, to annotate a user utterance. The annotation may include selecting actions to take based on the utterance and subsequent actions and configuring associations. A dialogue state is continuously updated and provided to the user as the actions and associations take place. Rules are generated based on the actions, associations, and dialogue state that allow for computing a wide range of results.

Content reproducer, sound collector, content reproduction system, and method of controlling content reproducer

11289114 · 2022-03-29 ·

Yamaha Corporation

Akihiko Suyama

A content reproducer according to the present disclosure includes a sound collector configured to collect a speech, and a controller configured to obtain speech input direction information about the speech and determine a content output direction based on the speech input direction information. Alternatively, a content reproducer according to the present disclosure includes a communicator configured to obtain speech input direction information, and a controller configured to determine a content output direction based on the speech input direction information.

Device and system with integrated customer service components

11222505 · 2022-01-11 ·

The present disclosure relates generally to a server that manages a plurality of gaming machines. As a non-limiting example, the server may include instructions that receive a message indicating that a user has requested assistance in connection with a gaming machine, instructions that analyze the message to determine a type of assistance required to satisfy the user request, instructions that determine a destination address for a service communication device, where the service communication device is selected based on the type of assistance required to satisfy the user request, and instructions that cause a service request message to be transmitted to the destination address.
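
The routing flow described in this abstract (receive a message, determine the type of assistance, pick a destination address, send a service request) could be sketched roughly as follows. This is only an illustrative sketch: the keyword table, the addresses, and the names `ServiceRequest`, `classify_assistance`, and `route_request` are hypothetical and do not come from the patent, and simple keyword matching stands in for whatever message analysis the disclosure actually uses.

```python
from dataclasses import dataclass

# Hypothetical keyword table: assistance type -> trigger words (assumption).
ASSISTANCE_KEYWORDS = {
    "technical": ("jam", "error", "frozen"),
    "payout": ("jackpot", "cash", "ticket"),
    "beverage": ("drink", "water", "coffee"),
}

# Hypothetical destination addresses of service communication devices (assumption).
DEVICE_ADDRESSES = {
    "technical": "tech-desk@floor.example",
    "payout": "cashier@floor.example",
    "beverage": "host@floor.example",
    "general": "attendant@floor.example",
}


@dataclass
class ServiceRequest:
    destination: str
    machine_id: str
    assistance_type: str
    text: str


def classify_assistance(text):
    """Return the first assistance type whose keywords appear in the message."""
    lowered = text.lower()
    for kind, words in ASSISTANCE_KEYWORDS.items():
        if any(word in lowered for word in words):
            return kind
    return "general"  # fallback when no keyword matches


def route_request(machine_id, text):
    """Build the service request addressed to the device for this assistance type."""
    kind = classify_assistance(text)
    return ServiceRequest(DEVICE_ADDRESSES[kind], machine_id, kind, text)
```

For example, `route_request("GM-17", "The screen is frozen")` would produce a request addressed to the hypothetical technical desk, while a message with no known keyword falls back to the general attendant address.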

Method and Apparatus for Detecting Correctness of Pitch Period

20210335377 · 2021-10-28 ·

A method and an apparatus for detecting correctness of a pitch period, where the method for detecting correctness of a pitch period includes determining, according to an initial pitch period of an input signal in a time domain, a pitch frequency bin of the input signal, where the initial pitch period is obtained by performing open-loop detection on the input signal, determining, based on an amplitude spectrum of the input signal in a frequency domain, a pitch period correctness decision parameter, associated with the pitch frequency bin, of the input signal, and determining correctness of the initial pitch period according to the pitch period correctness decision parameter.
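
The three steps in this abstract (map the time-domain pitch period to a frequency bin, derive a decision parameter from the amplitude spectrum around that bin, then decide) can be illustrated with a minimal sketch. The abstract does not say which decision parameter is used, so the peak-to-mean amplitude ratio and the `threshold` value below are assumptions chosen for illustration, and the function names are hypothetical. A naive DFT is used to keep the example dependency-free.

```python
import cmath
import math


def amplitude_spectrum(signal):
    """Naive DFT amplitude spectrum for bins 0..N/2 (fine for short frames)."""
    n = len(signal)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(signal)))
        for k in range(n // 2 + 1)
    ]


def pitch_frequency_bin(pitch_period, frame_len):
    """Map a pitch period (in samples) to the nearest DFT bin index."""
    return round(frame_len / pitch_period)


def pitch_period_is_correct(signal, pitch_period, threshold=3.0):
    """Assumed decision rule: spectral peak near the pitch bin vs. mean amplitude."""
    spectrum = amplitude_spectrum(signal)
    k = pitch_frequency_bin(pitch_period, len(signal))
    lo, hi = max(1, k - 1), min(len(spectrum) - 1, k + 1)
    peak = max(spectrum[lo:hi + 1])           # strongest bin within +/-1 of the pitch bin
    mean = sum(spectrum[1:]) / (len(spectrum) - 1)  # mean amplitude, DC excluded
    if mean == 0:
        return False  # silent frame: nothing to confirm
    return peak / mean >= threshold
```

With a 128-sample cosine of period 16, the pitch bin is 8 and the check accepts a candidate period of 16 but rejects a doubled candidate of 32, which is the kind of open-loop error such a check is meant to catch.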

Systems and methods for artificial dubbing

11159597 · 2021-10-26 ·

VIDUBLY LTD

Methods, systems, and computer-readable media for artificially generating a revoiced media stream are provided. In one implementation, a system may receive a media stream including an individual with a particular voice speaking in an origin language. The system may obtain a transcript of the media stream including utterances spoken in the origin language and translate the transcript to a target language. The translated transcript may include a set of words in the target language for each of at least some of the utterances spoken in the origin language. The system may analyze the media stream to determine a voice profile for the individual. Thereafter, the system may determine a synthesized voice, similar to the particular voice, for a virtual entity intended to dub the individual. Then, the system may generate a revoiced media stream in which the translated transcript in the target language is spoken by the virtual entity.

Machine learning classifications of aphasia

11145321 · 2021-10-12 ·

Omniscient Neurotechnology Pty Limited

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing aphasia assessment. One of the methods includes receiving a recording, generating a text transcript of the recording, and generating speech quantifying and comprehension scores which can be used to determine an aphasia classification. Another method includes performing an aphasia assessment on a brain image to obtain an aphasia classification.
