Patent classifications
G10L13/02
VOICE DATA CREATION DEVICE
A voice data creation device is a device configured to create voice data including an additional word which is a word to be added to a recognition target in a speech recognition system, and includes: a sentence example extraction unit configured to extract one or more text corpora including the additional word from a text corpus group including a plurality of text corpora consisting of sentence examples including a plurality of words; a sentence example selection unit configured to select a text corpus having a highest measure indicating a likelihood of occurrence as a sentence among the text corpora extracted by the sentence example extraction unit 11 as an optimal sentence example for the additional word; and a voice creation unit configured to output a synthesized voice of the optimal sentence example generated by a predetermined voice synthesis system as voice data corresponding to the additional word.
VOICE CONVERSION METHOD AND RELATED DEVICE
A voice conversion method and a related device are provided to implement diversified human voice beautification. A method in embodiments of this application includes: receiving a mode selection operation input by a user, where the mode selection operation is for selecting a voice conversion mode. A plurality of provided selectable modes include: a style conversion mode, for performing speaking style conversion on a to-be-converted first voice; a dialect conversion mode, for adding an accent to or removing an accent from the first voice; and a voice enhancement mode, for implementing voice enhancement on the first voice. The three modes have corresponding voice conversion networks. Based on a target conversion mode selected by the user, a target voice conversion network corresponding to the target conversion mode is selected to convert the first voice, and output a second voice obtained through conversion.
Configuring An External Presentation Device Based On An Impairment Of A User
A mobile device communicates content to an external presentation device, such as a display device, for display or other presentation at the external presentation device. The mobile device identifies an impairment of a user of the mobile device. One or more configuration settings for a user interface of the external presentation device are determined based on the impairment of the user. These configuration settings are, for example, an indication of a particular audio level, an indication to perform text to speech, an indication of a particular type of color blindness, and so forth. The configuration settings are communicated to the external presentation device, allowing the external presentation device to be configured for or adapted to the impairment of the user.
Configuring An External Presentation Device Based On An Impairment Of A User
A mobile device communicates content to an external presentation device, such as a display device, for display or other presentation at the external presentation device. The mobile device identifies an impairment of a user of the mobile device. One or more configuration settings for a user interface of the external presentation device are determined based on the impairment of the user. These configuration settings are, for example, an indication of a particular audio level, an indication to perform text to speech, an indication of a particular type of color blindness, and so forth. The configuration settings are communicated to the external presentation device, allowing the external presentation device to be configured for or adapted to the impairment of the user.
SPEECH SYNTHESIS METHOD AND SYSTEM
Disclosed is a speech synthesis method including: acquiring fundamental frequency information and acoustic feature information from original speech; generating an impulse train from the fundamental frequency information, and inputting it to a harmonic time-varying filter; inputting the acoustic feature information into a neural network filter estimator to obtain corresponding impulse response information; generating noise signal by a noise generator; determining, by the harmonic time-varying filter, harmonic component information through filtering processing on the impulse train and the impulse response information; determining, by a noise time-varying filter, noise component information based on the impulse response information and the noise; and generating a synthesized speech from the harmonic component information and the noise component information. Acoustic features are processed to obtain corresponding impulse response information, and harmonic component information and noise component information are modeled respectively, thereby reducing computation of speech synthesis and improving the quality of the synthesized speech.
Interactive method and device of robot, and device
Embodiments of the present disclosure provide an interactive method of a robot, an interactive device of a robot and a device. The method includes: obtaining voice information input by an interactive object, and performing semantic recognition on the voice information to obtain a conversation intention; obtaining feedback information corresponding to the conversation intention based on a conversation scenario knowledge base pre-configured by a simulated user; and converting the feedback information into a voice of the simulated user, and playing the voice to the interactive object.
SYSTEM FOR TRANSCRIBING AND PERFORMING ANALYSIS ON PATIENT DATA
Methods, apparatuses, and systems for transcribing and performing analysis on patient data are disclosed. Data is collected from one or more medical professionals as well as sensors and imaging devices positioned on or oriented towards a patient. An analysis is performed on the patient data and the data is presented to a medical professional via a verbal interface in a conversational manner, allowing the medical professional to provide additional data such as observations or instructions which may be used for further analysis or to perform actions related to the patient's care.
SYSTEM FOR TRANSCRIBING AND PERFORMING ANALYSIS ON PATIENT DATA
Methods, apparatuses, and systems for transcribing and performing analysis on patient data are disclosed. Data is collected from one or more medical professionals as well as sensors and imaging devices positioned on or oriented towards a patient. An analysis is performed on the patient data and the data is presented to a medical professional via a verbal interface in a conversational manner, allowing the medical professional to provide additional data such as observations or instructions which may be used for further analysis or to perform actions related to the patient's care.
SYSTEM FOR ADMINISTERING A QUALITATIVE ASSESSMENT USING AN AUTOMATED VERBAL INTERFACE
Using artificial intelligence and data observed using sensors or imaging devices to prompt a patient to provide responses or perform actions and then observing the patient's responses to the prompts and performing an assessment resulting in a quantitative result. The quantitative result is then used to complete a clinical qualitative assessment of the patient.
SYSTEM FOR ADMINISTERING A QUALITATIVE ASSESSMENT USING AN AUTOMATED VERBAL INTERFACE
Using artificial intelligence and data observed using sensors or imaging devices to prompt a patient to provide responses or perform actions and then observing the patient's responses to the prompts and performing an assessment resulting in a quantitative result. The quantitative result is then used to complete a clinical qualitative assessment of the patient.