G10L21/057

DEVICES AND METHODS FOR AUDITORY REHABILITATION FOR INTERAURAL ASYMMETRY
20230049597 · 2023-02-16 ·

A device, system and related methods to provide assessment and treatment of amblyaudia through standardized methods that do not require advanced training or a booth with loudspeakers for the operator to administer. The ARIA stimuli protocols for both assessment and treatment, encoded in or to be used by a software program or application, are transferred to a stand-alone set of specialized noise-cancelling headphones attached or connected to, wired or wirelessly, a software platform on an electronic computing device. or integrated with the headphones. The program administers assessment tests to individuals through the noise-cancelling earphones. The device enables someone with minimal instructions to administer automatically or semi-automatically both assessment and treatment protocols, generate results, make interpretations, store data, and produce reports. The device or system may be loaded with standard protocols for English-speaking individuals, as well as dichotic speech material in any language.

Outside ordering system
11594223 · 2023-02-28 · ·

An ordering system can be positioned partially, or completely, outside in a retail environment with an ordering device located outside of a building on a site. The ordering device receiving a first audio stream concurrently with a second audio stream from an employee and proceeds to capture the first audio stream with a first port of an on-site computing device while capturing the second audio stream with a second port of the on-site computing device. A customer strategy can be executed with an intelligence module of the on-site computing device connected to the ordering device with the on-site customer strategy directing automated interactions with a first on-site customer to compile a retail order. The employee may communicate directly with the intelligence module via the second port without interrupting the first audio stream.

PERSONALIZED VOICE CONVERSION SYSTEM

A personalized voice conversion system includes a cloud server and an intelligent device that communicates with the cloud server. The intelligent device upstreams an original voice signal to the cloud server. The cloud server converts the original voice signal into an intelligible voice signal based on an intelligible voice conversion model. The intelligent device downloads and plays the intelligible voice signal. Based on the original voice signal and the corresponding intelligible voice signal, the cloud server and the intelligent device train an off-line voice conversion model provided to the intelligent device. When the intelligent device stops communicating with the cloud server, the intelligent device converts a new original voice signal into a new intelligible voice signal based on the off-line voice conversion model and plays the new intelligible voice signal.

METHODS AND SYSTEMS FOR TRANSCRIPTION PLAYBACK WITH VARIABLE EMPHASIS

Methods and systems are provided for assisting operation of a vehicle using speech recognition and transcription using text-to-speech for transcription playback with variable emphasis. One method involves analyzing a transcription of an audio communication with respect to the vehicle to identify an operational term pertaining to a current operational context of the vehicle within the transcription, creating an indicator identifying the operational term within the transcription for emphasis when the operational term pertains to the current operational context of the vehicle, identifying a user-configured playback rate; and generating an audio reproduction of the transcription of the audio communication in accordance with the user-configured playback rate, wherein the operational term is selectively emphasized within the audio reproduction based on the indicator.

SELECTIVE FINE-TUNING OF SPEECH

Speech conveyed over a network, such as during an electronic conference may be more difficult to understand if the recipient has difficulty understanding the speech of users having a particular speech attribute. However, other recipients may have no difficulty understanding the speech. As provided herein, speech provided by a user may have phonemes comprising accents or other speech pattern that, if removed, are more readily understood by a particular user. Such alterations are provided only to the users that require it, such as by a server or a specific user's communication device, without affecting the speech concurrently presented to other users.

HEARING DEVICE COMPRISING AN ADAPTIVE FILTER BANK

A hearing device comprises a) at least one input transducer configured to pick up sound from an acoustic environment around the user when the user is wearing the hearing device, the at least one input transducer providing at least one electric input signal representative of said sound, b) at least one analysis filter bank configured to provide said at least one electric input signal as a multitude of frequency sub-band signals, the at least one analysis filter bank comprising b1) a plurality of M first filters h.sub.m(n), whose impulse responses are modulated from a first prototype filter h(n), where m=0, 1, . . . , M−1 is a frequency band index, and n is a time index, c) a processor for processing said at least one electric input signal provided by said at least one analysis filter bank, or a signal originating therefrom, and providing a processed signal, d) an output transducer configured to provide stimuli perceivable as sound to the user in dependence of said processed signal, and e) a controller for controlling said analysis filter bank by applying a different first prototype filter to said at least one filter bank in dependence of said current acoustic environment. A method of operating a hearing device is further disclosed.

HEARING DEVICE COMPRISING AN ADAPTIVE FILTER BANK

A hearing device comprises a) at least one input transducer configured to pick up sound from an acoustic environment around the user when the user is wearing the hearing device, the at least one input transducer providing at least one electric input signal representative of said sound, b) at least one analysis filter bank configured to provide said at least one electric input signal as a multitude of frequency sub-band signals, the at least one analysis filter bank comprising b1) a plurality of M first filters h.sub.m(n), whose impulse responses are modulated from a first prototype filter h(n), where m=0, 1, . . . , M−1 is a frequency band index, and n is a time index, c) a processor for processing said at least one electric input signal provided by said at least one analysis filter bank, or a signal originating therefrom, and providing a processed signal, d) an output transducer configured to provide stimuli perceivable as sound to the user in dependence of said processed signal, and e) a controller for controlling said analysis filter bank by applying a different first prototype filter to said at least one filter bank in dependence of said current acoustic environment. A method of operating a hearing device is further disclosed.

AUDIO PROCESSING METHOD, AUDIO PROCESSING APPARATUS AND COMPUTER STORAGE MEDIUM

An audio processing method applied to a first terminal is described, and includes: in response to receiving of audio data input by a user at the first terminal, and determination that a voice change function is turned on, determining change parameters; and based on the change parameters, performing change processing on the audio data.

AUDIO PROCESSING METHOD, AUDIO PROCESSING APPARATUS AND COMPUTER STORAGE MEDIUM

An audio processing method applied to a first terminal is described, and includes: in response to receiving of audio data input by a user at the first terminal, and determination that a voice change function is turned on, determining change parameters; and based on the change parameters, performing change processing on the audio data.

Method and apparatus for processing speech

Embodiments of the present disclosure provide a method and apparatus for processing a speech. The method may include: acquiring an original speech; performing speech recognition on the original speech, to obtain an original text corresponding to the original speech; associating a speech segment in the original speech with a text segment in the original text; recognizing an abnormal segment in the original speech and/or the original text; and processing a text segment indicated by the abnormal segment in the original text and/or the speech segment indicated by the abnormal segment in the original speech, to generate a final speech. A speech segment in the original speech is associated with a text segment in the original text to realize visual processing of the speech.