IPIQ

G10L17/16

Acoustic model training method, speech recognition method, apparatus, device and medium

11030998 · 2021-06-08 ·

Ping An Technology (Shenzhen) Co., Ltd.

An acoustic model training method, a speech recognition method, an apparatus, a device and a medium. The acoustic model training method comprises: performing feature extraction on a training speech signal to obtain an audio feature sequence; training the audio feature sequence by a phoneme mixed Gaussian Model-Hidden Markov Model to obtain a phoneme feature sequence; and training the phoneme feature sequence by a Deep Neural Net-Hidden Markov Model-sequence training model to obtain a target acoustic model. The acoustic model training method can effectively save time required for an acoustic model training, improve the training efficiency, and ensure the recognition efficiency.

ACOUSTIC MODEL TRAINING METHOD, SPEECH RECOGNITION METHOD, APPARATUS, DEVICE AND MEDIUM

20210125603 · 2021-04-29 ·

PING AN TECHNOLOGY (SHENZHEN) CO., LTD.

Method, apparatus and system for speaker verification

10937430 · 2021-03-02 ·

Beijing Didi Infinity Technology And Development Co., Ltd.

The present disclosure relates to a method, apparatus, and system for speaker verification. The method includes: acquiring an audio recording; extracting speech signals from the audio recording; extracting features of the extracted speech signals; and determining whether the extracted speech signals represent speech by a predetermined speaker based on the extracted features and a speaker model trained with reference voice data of the predetermined speaker.

Method, apparatus and system for speaker verification

10937430 · 2021-03-02 ·

Beijing Didi Infinity Technology And Development Co., Ltd.

Methods and system for distributing information via multiple forms of delivery services

11057337 · 2021-07-06 ·

Greenfly, Inc.

A content distribution facilitation system is described comprising configured servers and a network interface configured to interface with a plurality of terminals in a client server relationship and optionally with a cloud-based storage system. A request from a first source for content comprising content criteria is received, the content criteria comprising content subject matter. At least a portion of the content request content criteria is transmitted to a selected content contributor. If recorded content is received from the first content contributor, the first source is provided with access to the received recorded content. The recorded content may be transmitted via one or more networks to one or more destination devices. Optionally, a voice analysis and/or facial recognition engine are utilized to determine if the recorded content is from the first content contributor.

Methods and system for distributing information via multiple forms of delivery services

11057337 · 2021-07-06 ·

Greenfly, Inc.

METHODS AND SYSTEM FOR DISTRIBUTING INFORMATION VIA MULTIPLE FORMS OF DELIVERY SERVICES

20210006531 · 2021-01-07 ·

METHODS AND SYSTEM FOR DISTRIBUTING INFORMATION VIA MULTIPLE FORMS OF DELIVERY SERVICES

20210006531 · 2021-01-07 ·

Method for microphone selection and multi-talker segmentation with ambient automated speech recognition (ASR)

10847171 · 2020-11-24 ·

Nuance Communications, Inc.

Disclosed methods and systems are directed to determining a best microphone pair and segmenting sound signals. The methods and systems may include receiving a collection of sound signals comprising speech from one or more audio sources (e.g., meeting participants) and/or background noise. The methods and systems may include calculating a TDOA and determining, based on the TDOA and via robust statistics, the best pair of microphones. The methods and systems may also include segmenting sound signals from multiple sources.

Method for microphone selection and multi-talker segmentation with ambient automated speech recognition (ASR)

10847171 · 2020-11-24 ·

Nuance Communications, Inc.

Patent classifications

G10L17/16