G10L19/0018

CLIENT, SYSTEM AND METHOD FOR CUSTOMIZING VOICE BROADCAST
20200394992 · 2020-12-17

Embodiments of the present disclosure provide a client for customizing voice broadcast. The client includes an acquisition module, an extraction module, a sample generation module and a voice playing module. The acquisition module is configured to acquire an original audio. The extraction module is configured to extract a voiceprint feature from the original audio. The sample generation module is configured to produce a sample sound effect based on the extracted voiceprint feature. The voice playing module is configured to play information to be played based on the sample sound effect.

Haptic communication system using cutaneous actuators for simulation of continuous human touch

A haptic communication device includes an array of cutaneous actuators to generate haptic sensations corresponding to actuator signals received by the array. The haptic sensations include at least a first haptic sensation and a second haptic sensation. The array includes at least a first cutaneous actuator to begin generating the first haptic sensation at a first location on a body of a user at a first time. A second cutaneous actuator begins generating the second haptic sensation at a second location on the body of the user at a second time later than the first time.

Machine communication system using haptic symbol set

A haptic device comprises a signal generator that is configured to receive an input word that is a unit of a language. The signal generator converts the input word into one or more phonemes of the input word. The signal generator further converts the one or more phonemes into a sequence of actuator signals. The sequence of actuator signals is formed from a concatenation of sub-sequences of actuator signals, with each phoneme corresponding to a unique sub-sequence of actuator signals. The haptic device further comprises a two-dimensional array of cutaneous actuators configured to receive the sequence of actuator signals from the signal generator, each of the actuator signals being mapped to a cutaneous actuator of the two-dimensional array.
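The word-to-phoneme-to-actuator conversion described above can be sketched as follows. This is only an illustration under stated assumptions: the lexicon, the phoneme labels, the (row, column, duration) signal format and the function name `word_to_signals` are all hypothetical and do not come from the abstract.

```python
from typing import Dict, List, Tuple

# Hypothetical actuator signal: (row, column, duration in ms) on a 2D array.
Signal = Tuple[int, int, int]

# Hypothetical pronunciation lexicon mapping a word to its phonemes.
LEXICON: Dict[str, List[str]] = {
    "hi": ["HH", "AY"],
    "no": ["N", "OW"],
}

# Hypothetical table giving each phoneme its own unique sub-sequence.
PHONEME_SIGNALS: Dict[str, List[Signal]] = {
    "HH": [(0, 0, 50)],
    "AY": [(0, 1, 50), (1, 1, 50)],
    "N": [(1, 0, 50)],
    "OW": [(2, 0, 50), (2, 1, 50)],
}


def word_to_signals(word: str) -> List[Signal]:
    """Convert a word to phonemes, then concatenate their sub-sequences."""
    phonemes = LEXICON[word.lower()]
    sequence: List[Signal] = []
    for phoneme in phonemes:
        sequence.extend(PHONEME_SIGNALS[phoneme])
    return sequence
```

Each tuple in the returned list addresses one actuator of the array, so a driver could play the list in order to render the word on the skin.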

Audio encoding for functional interactivity
10839853 · 2020-11-17

Some examples include receiving audio content through a microphone of an electronic device and determining whether embedded data is included in the received audio content. The electronic device may decode the received audio content to extract the embedded data. In addition, the electronic device may perform at least one of: sending a communication to a computing device over a network based on the extracted embedded data, or presenting information on a display of the electronic device based on the extracted embedded data.

System and method for preserving privacy of data in the cloud

A system and method for preserving the privacy of data while the data is processed in a cloud. The system comprises a computer program application and a client encryption key. The system is operable to encrypt the computer program application and data using the client encryption key; upload the encrypted computer program application and encrypted data to the cloud; enable the computer platform to undertake processing of the encrypted data in the cloud using the encrypted computer program application; output encrypted processing results; and enable decryption of the encrypted processing results using the client encryption key.

Characterizing, Selecting And Adapting Audio And Acoustic Training Data For Automatic Speech Recognition Systems
20200312349 · 2020-10-01

A system for and method of characterizing a target application acoustic domain analyzes one or more speech data samples from the target application acoustic domain to determine one or more target acoustic characteristics, including a CODEC type and bit-rate associated with the speech data samples. The determined target acoustic characteristics may also include other aspects of the target speech data samples such as sampling frequency, active bandwidth, noise level, reverberation level, clipping level, and speaking rate. The determined target acoustic characteristics are stored in a memory as a target acoustic data profile. The data profile may be used to select and/or modify one or more out of domain speech samples based on the one or more target acoustic characteristics.
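As a rough sketch of how a stored target acoustic data profile might be used to select out-of-domain samples, consider the code below. The field names, the `select_samples` function and the matching rule (same codec and sampling frequency, noise level within a tolerance) are assumptions made for illustration; the abstract does not specify a selection criterion.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AcousticProfile:
    """Hypothetical subset of the acoustic characteristics named above."""
    codec: str
    bit_rate_kbps: float
    sampling_hz: int
    noise_level_db: float


def select_samples(
    target: AcousticProfile,
    candidates: List[AcousticProfile],
    noise_tolerance_db: float = 5.0,
) -> List[AcousticProfile]:
    """Keep candidates whose profile is close to the target profile.

    Assumed rule: codec and sampling frequency must match exactly, and the
    noise level must lie within noise_tolerance_db of the target.
    """
    return [
        c
        for c in candidates
        if c.codec == target.codec
        and c.sampling_hz == target.sampling_hz
        and abs(c.noise_level_db - target.noise_level_db) <= noise_tolerance_db
    ]
```

Samples rejected by such a filter could instead be modified, for example re-encoded with the target codec, which is the other use of the profile that the abstract mentions.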

Haptic communication using interference of haptic outputs on skin

Embodiments relate to enhancing haptic communication by using two or more cutaneous actuators to create constructive or destructive interference patterns on the receiving user's skin. The actuator signals for the two or more cutaneous actuators are shaped and generated so that the actuators cause vibrations on a patch of the receiving user's skin to increase or decrease. In this way, various enhancements to haptic communication can be achieved.

METHOD FOR COMMUNICATING A NON-SPEECH MESSAGE AS AUDIO
20200251088 · 2020-08-06

A method is provided for communicating a non-speech message as audio from a first device to a second device such that information can be passed between the first and second devices. The method includes: encoding the non-speech message as a dissimilar speech message having a plurality of phonemes; transmitting the speech message over one or more audio communications channels from the first device; receiving the speech message at the second device; recognizing the speech message; and decoding the dissimilar speech message to the non-speech message. By using existing audio functionality and increasingly reliable voice recognition applications, an improved method is provided for sharing complex data messages over commonly available communication channels.
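The encode and decode steps can be illustrated with a minimal sketch. It assumes, purely for illustration, that the non-speech message is a byte string and that each 4-bit half of a byte is spoken as one of 16 hypothetical syllables; the abstract does not describe the actual encoding, and transmission and speech recognition are left out.

```python
# Hypothetical 16-syllable alphabet, one syllable per 4-bit value.
SYLLABLES = [
    "ba", "da", "fa", "ga", "ka", "la", "ma", "na",
    "pa", "ra", "sa", "ta", "va", "za", "wa", "ya",
]
INDEX = {syllable: value for value, syllable in enumerate(SYLLABLES)}


def encode(message: bytes) -> str:
    """Turn each byte into two syllables (high nibble, then low nibble)."""
    parts = []
    for byte in message:
        parts.append(SYLLABLES[byte >> 4])
        parts.append(SYLLABLES[byte & 0x0F])
    return " ".join(parts)


def decode(spoken: str) -> bytes:
    """Rebuild the bytes from the recognized syllable sequence."""
    nibbles = [INDEX[syllable] for syllable in spoken.split()]
    if len(nibbles) % 2 != 0:
        raise ValueError("expected an even number of syllables")
    return bytes((hi << 4) | lo for hi, lo in zip(nibbles[0::2], nibbles[1::2]))
```

In a full system the string returned by `encode` would be synthesized as audio on the first device, and the second device would pass its recognizer's transcript to `decode`.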

Audio Recording Optimization for Calls Serviced by an Artificial Intelligence Agent
20200243097 · 2020-07-30

Artificial agents utilized for voice interactions continue to improve in their capacity to conduct more sophisticated interactions. Rather than just presenting a limited set of options, artificial agents are continuing to narrow the gap between generated speech and natural human speech. A requirement is often in place that spoken interactions be recorded. However, storing speech, even with data compression, is a resource-demanding task. Generated speech may be provided from content, such as text, and speech data. By recording an identifier of the content and associated speech data, storage processing and space requirements can be greatly reduced. Playback may be provided from a waveform of audio provided by the human participant and by selecting the content associated with the content identifier and generating speech of the content utilizing settings provided by the speech data.
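A minimal sketch of the storage idea follows: the agent's side of the call is kept as a content identifier plus speech settings, and audio is regenerated on playback. The record layout, the `CONTENT_STORE` table and the `synthesize` callback are hypothetical placeholders, not parts of the disclosed system or of any real text-to-speech library.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class AgentUtterance:
    """What is stored instead of the agent's audio waveform."""
    content_id: str
    speech_data: Dict[str, str]  # assumed settings, e.g. voice and rate


# Hypothetical content table keyed by content identifier.
CONTENT_STORE: Dict[str, str] = {
    "greeting-01": "Hello, how can I help you today?",
}


def replay(
    record: AgentUtterance,
    synthesize: Callable[[str, Dict[str, str]], bytes],
) -> bytes:
    """Regenerate the agent's speech from the stored identifier and settings."""
    text = CONTENT_STORE[record.content_id]
    return synthesize(text, record.speech_data)
```

Only the human participant's audio needs to be stored as a waveform; each agent turn shrinks to a short identifier and a handful of settings.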

Speech recognition method and device based on a similarity of a word and N other similar words and similarity of the word and other words in its sentence
10714089 · 2020-07-14

The disclosure provides a speech recognition method and device that recognize words in a sentence text after determining the sentence text corresponding to an input speech, and replace wrong words that do not conform to the application scenario, so as to improve the accuracy of speech recognition. The speech recognition method according to embodiments of the present disclosure includes: recognizing a sentence text corresponding to an input speech; recognizing wrong words in the sentence text; determining substitute words corresponding to the wrong words; and substituting the wrong words with the substitute words. A word is determined to be wrong by comparing the average similarity of the word with N other similar words against a first threshold and comparing the maximum similarity between the word and the other words in the sentence against a second threshold.
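The two-threshold test for wrong words can be sketched as below. Several points are assumptions, since the abstract does not settle them: the N similar words are taken to be the N most similar entries of a scenario vocabulary, a word is flagged only when both values fall below their thresholds, and the `similarity` function is supplied by the caller.

```python
from typing import Callable, Sequence


def is_wrong_word(
    word: str,
    sentence: Sequence[str],
    vocabulary: Sequence[str],
    similarity: Callable[[str, str], float],
    n: int,
    first_threshold: float,
    second_threshold: float,
) -> bool:
    """Flag a word using the average and maximum similarity checks."""
    # Average similarity to the N most similar vocabulary words (assumed).
    scores = sorted(
        (similarity(word, v) for v in vocabulary if v != word),
        reverse=True,
    )[:n]
    average = sum(scores) / len(scores) if scores else 0.0

    # Maximum similarity to the other words of the same sentence.
    maximum = max(
        (similarity(word, w) for w in sentence if w != word),
        default=0.0,
    )

    # Assumed decision rule: wrong only if both values are below threshold.
    return average < first_threshold and maximum < second_threshold
```

Words flagged this way would then go through the substitution steps of the method, which are not shown here.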