Patent classifications
G10L21/057
METHOD OF OPERATING A HEARING DEVICE AND A HEARING DEVICE PROVIDING SPEECH ENHANCEMENT BASED ON AN ALGORITHM OPTIMIZED WITH A SPEECH INTELLIGIBILITY PREDICTION ALGORITHM
A method of training an algorithm for optimizing intelligibility of speech components of a sound signal in hearing aids, headsets, etc., comprises a) providing a first database comprising a multitude of predefined time segments of first electric input signals representing sound and corresponding measured speech intelligibilities; b) determining optimized first parameters of a first algorithm by optimizing it with said predefined time segments and said corresponding measured speech intelligibilities, the first algorithm providing corresponding predicted speech intelligibilities; c) providing a second database comprising a multitude of time segments of second electric input signals representing sound, d) determining optimized second parameters of a second algorithm by optimizing it with said multitude of time segments, said second algorithm being configured to provide processed second electric input signals exhibiting respective predicted speech intelligibilities estimated by said first algorithm, said optimizing being conducted under a constraint of maximizing said predicted speech intelligibility.
Dynamic latency control
As may be implemented in a manner consistent with one or more embodiment, aspects of the disclosure are directed to latency control with signals, such as audio signals. For instance, a quality characteristic of an audio signal having time-sequenced frames exhibiting a signal quality can assessed, and an output indicative of the signal quality is provided based on the assessment. An amount of latency in the audio signal is dynamically adjusted based on the output, and the latency can be used in processing the time-sequenced frames, such as to use future frames in assessing or correcting a current frame during a time period facilitated via the latency.
Dynamic latency control
As may be implemented in a manner consistent with one or more embodiment, aspects of the disclosure are directed to latency control with signals, such as audio signals. For instance, a quality characteristic of an audio signal having time-sequenced frames exhibiting a signal quality can assessed, and an output indicative of the signal quality is provided based on the assessment. An amount of latency in the audio signal is dynamically adjusted based on the output, and the latency can be used in processing the time-sequenced frames, such as to use future frames in assessing or correcting a current frame during a time period facilitated via the latency.
Methods and systems for transcription playback with variable emphasis
Methods and systems are provided for assisting operation of a vehicle using speech recognition and transcription using text-to-speech for transcription playback with variable emphasis. One method involves analyzing a transcription of an audio communication with respect to the vehicle to identify an operational term pertaining to a current operational context of the vehicle within the transcription, creating an indicator identifying the operational term within the transcription for emphasis when the operational term pertains to the current operational context of the vehicle, identifying a user-configured playback rate; and generating an audio reproduction of the transcription of the audio communication in accordance with the user-configured playback rate, wherein the operational term is selectively emphasized within the audio reproduction based on the indicator.
Determining a playback rate of media for a requester
A method, a system, and a computer program product for providing media to a requester at a particular playback rate associated with the requester. The method includes receiving a request from a requester for a playback session of media that includes a time varying content. In response to receiving the request, a profile associated with the requester is accessed to determine a playback rate of the media for the requester. In response to determining the playback rate of the media for the requester, the media is provided to the requester at the determined playback rate. The method further includes monitoring the playback session of the media for playback changes by the requester and dynamically adapting the playback rate associated with the requester based on the type and frequency of playback changes.
Determining a playback rate of media for a requester
A method, a system, and a computer program product for providing media to a requester at a particular playback rate associated with the requester. The method includes receiving a request from a requester for a playback session of media that includes a time varying content. In response to receiving the request, a profile associated with the requester is accessed to determine a playback rate of the media for the requester. In response to determining the playback rate of the media for the requester, the media is provided to the requester at the determined playback rate. The method further includes monitoring the playback session of the media for playback changes by the requester and dynamically adapting the playback rate associated with the requester based on the type and frequency of playback changes.
Improving speech intelligibility by varying gains of sound types based on scene information
Provided are an electronic apparatus and a controlling method thereof. The electronic apparatus includes an inputter and a processor configured to, based on receiving an audio signal through the inputter, obtain a speech intelligibility for the audio signal, and modify the audio signal so that the speech intelligibility becomes a target intelligibility that is set based on scene information regarding a type of audio included in the audio signal, and the type of audio includes at least one of a sound effect, shouting, music, or a speech.
Improving speech intelligibility by varying gains of sound types based on scene information
Provided are an electronic apparatus and a controlling method thereof. The electronic apparatus includes an inputter and a processor configured to, based on receiving an audio signal through the inputter, obtain a speech intelligibility for the audio signal, and modify the audio signal so that the speech intelligibility becomes a target intelligibility that is set based on scene information regarding a type of audio included in the audio signal, and the type of audio includes at least one of a sound effect, shouting, music, or a speech.
DYNAMIC LATENCY CONTROL
As may be implemented in a manner consistent with one or more embodiment, aspects of the disclosure are directed to latency control with signals, such as audio signals. For instance, a quality characteristic of an audio signal having time-sequenced frames exhibiting a signal quality can assessed, and an output indicative of the signal quality is provided based on the assessment. An amount of latency in the audio signal is dynamically adjusted based on the output, and the latency can be used in processing the time-sequenced frames, such as to use future frames in assessing or correcting a current frame during a time period facilitated via the latency.
INFORMATION PROCESSING DEVICE AND INFORMATION PROCESSING METHOD
An information processing device determines a priority sound data item from among a plurality of sound data items of which sound generation timings overlap each other and which are directed at an avatar of a first user in a virtual space. The information processing device performs control to notify the first user of contents of the priority sound data item by reproducing the priority sound data item at a first timing. The information processing device performs control to notify the first user of contents of a non-priority sound data item without reproducing the non-priority sound data item at the first timing. The non-priority sound data item is not determined to be the priority sound data item from among the plurality of sound data items, without reproducing the non-priority sound data item at the first timing.