G10L25/60

INFORMATION PROCESSING APPARATUS, NON-TRANSITORY COMPUTER READABLE MEDIUM, AND INFORMATION PROCESSING METHOD
20230098333 · 2023-03-30 · ·

An information processing apparatus includes: a processor configured to instantaneously acquire quality information indicative of quality of utterer's voice on a listener's side; and instantaneously present improvement information for improving the quality to the utterer in a case where the quality indicated by the acquired quality information does not satisfy a predetermined condition.

INFORMATION PROCESSING APPARATUS, NON-TRANSITORY COMPUTER READABLE MEDIUM, AND INFORMATION PROCESSING METHOD
20230098333 · 2023-03-30 · ·

An information processing apparatus includes: a processor configured to instantaneously acquire quality information indicative of quality of utterer's voice on a listener's side; and instantaneously present improvement information for improving the quality to the utterer in a case where the quality indicated by the acquired quality information does not satisfy a predetermined condition.

ELECTRONIC DEVICE AND CONTROL METHOD THEREOF

An electronic device including a memory storing signal information corresponding to a trigger speech; a microphone; a communication interface; and a processor configured to identify whether a first speech signal received through the microphone corresponds to the trigger speech based on the signal information, obtain a first speech sharpness value of the first speech signal based on the identifying, obtain a second speech sharpness value from the at least one external device through the communication interface, based on the first speech sharpness value being greater than the second speech sharpness value, identify a speech command included in the second speech signal received through the microphone by entering a speech recognition mode, and control the electronic device based on the identifying of the speech command, and the processor is further configured to, based on the speech command being unidentified, control the communication interface to transmit a control signal to the at least one external device based on the second speech sharpness value.

ELECTRONIC DEVICE AND CONTROL METHOD THEREOF

An electronic device including a memory storing signal information corresponding to a trigger speech; a microphone; a communication interface; and a processor configured to identify whether a first speech signal received through the microphone corresponds to the trigger speech based on the signal information, obtain a first speech sharpness value of the first speech signal based on the identifying, obtain a second speech sharpness value from the at least one external device through the communication interface, based on the first speech sharpness value being greater than the second speech sharpness value, identify a speech command included in the second speech signal received through the microphone by entering a speech recognition mode, and control the electronic device based on the identifying of the speech command, and the processor is further configured to, based on the speech command being unidentified, control the communication interface to transmit a control signal to the at least one external device based on the second speech sharpness value.

METHODS AND SYSTEMS FOR AUDIO SAMPLE QUALITY CONTROL
20230033305 · 2023-02-02 ·

The present disclosure provides methods and systems that may be used for providing quality control for audio samples. The audio samples may be speech samples of a user. The user may be participating in an audio interview.

METHODS AND SYSTEMS FOR AUDIO SAMPLE QUALITY CONTROL
20230033305 · 2023-02-02 ·

The present disclosure provides methods and systems that may be used for providing quality control for audio samples. The audio samples may be speech samples of a user. The user may be participating in an audio interview.

Acoustic based speech analysis using deep learning models

A method and system for detecting one or more speech features in speech audio data includes receiving speech audio data, performing preprocessing on the speech audio data to prepare the speech audio data for use as an input into one or more models that detect one or more speech features, providing the preprocessed speech audio data to a stacked machine learning model, and analyzing the preprocessed speech audio data via the stacked ML model to detect the one or more speech features. The stacked ML model includes a feature aggregation model, a sequence to sequence model, and a decision-making model.

Methods, apparatus and computer-readable mediums related to biometric authentication
11487861 · 2022-11-01 · ·

Embodiments of the disclosure provide a mechanism for performing a biometric algorithm on ear biometric data acquired from a user. The mechanism may be used for biometric authentication, or in-ear detect, for example. In one embodiment, a method is provided in which a quality metric of an input signal to a transducer and/or a signal on a return path from the transducer is monitored. One or more steps of a biometric process, comprising monitoring of a parameter related to an admittance of the transducer, comparison of the parameter to a stored profile for an authorised user, generation of a score based on the comparison, comparison of the score to one or more threshold values, and initiation of one or more actions, may be performed responsive to the quality metric meeting one or more criteria.

System and method of enhancing intelligibility of audio playback

A personal listening system and a method of using the personal listening system to enhance speech intelligibility of audio playback, are described. The method includes determining a speech intelligibility metric, such as a speech reception threshold, of a user. Based on the speech intelligibility metric, a tuning parameter is applied to an audio input signal. The speech reception threshold is compared to an environmental signal-to-noise ratio to determine whether enhancement of the audio input signal is warranted. Application of the tuning parameter to the audio input signal generates an audio output signal having reduced noise, making playback of the audio output signal more intelligible to the user. Other aspects are also described and claimed.

System and method of enhancing intelligibility of audio playback

A personal listening system and a method of using the personal listening system to enhance speech intelligibility of audio playback, are described. The method includes determining a speech intelligibility metric, such as a speech reception threshold, of a user. Based on the speech intelligibility metric, a tuning parameter is applied to an audio input signal. The speech reception threshold is compared to an environmental signal-to-noise ratio to determine whether enhancement of the audio input signal is warranted. Application of the tuning parameter to the audio input signal generates an audio output signal having reduced noise, making playback of the audio output signal more intelligible to the user. Other aspects are also described and claimed.