IPIQ

G10L15/193

Meeting-adapted language model for speech recognition

11636854 · 2023-04-25 ·

Microsoft Technology Licensing, Llc

A system includes acquisition of meeting data associated with a meeting, determination of a plurality of meeting participants based on the acquired meeting data, acquisition of e-mail data associated with each of the plurality of meeting participants, generation of a meeting language model based on the acquired e-mail data and the meeting data, and transcription of audio associated with the meeting based on the meeting language model.

Speech recognition method and apparatus

11664020 · 2023-05-30 ·

Alibaba Group Holding Limited

A speech recognition method comprises: generating, based on a preset speech knowledge source, a search space comprising preset client information and for decoding a speech signal; extracting a characteristic vector sequence of a to-be-recognized speech signal; calculating a probability at which the characteristic vector corresponds to each basic unit of the search space; and executing a decoding operation in the search space by using the probability as an input to obtain a word sequence corresponding to the characteristic vector sequence.

No Loss-Optimization for Weighted Transducer

20170352348 · 2017-12-07 ·

Techniques and architectures may be used to generate and perform a process using weighted finite-state transducers involving generic input search graphs. The process need not pursue theoretical optimality and instead search graphs may be optimized without an a priori optimization step. The process may result in an automatic speech recognition (ASR) decoder that is substantially faster than ASR decoders the include the optimization step.

No Loss-Optimization for Weighted Transducer

20170352348 · 2017-12-07 ·

Hybrid decoding using hardware and software for automatic speech recognition systems

11676585 · 2023-06-13 ·

Amazon Technologies, Inc.

Pushkaraksha Gejji

Embodiments describe a method for decoding speech including receiving speech input at an audio input device, generating speech data that is a digital representation of the speech input; extracting acoustic features of the speech data, assigning acoustic scores to the acoustic features, receiving data representing the acoustic features and the acoustic scores, decoding the data representing the acoustic features into a word, having a word score, by referencing a WFST language model, modifying the word score into a new word score based on a personalized grammar model stored in the external memory device, the processor is separate from and external to the WFST accelerator, and determining an intent represented by a plurality of words outputted by the WFST accelerator, where the plurality of words include the word and the new word score.

MEETING-ADAPTED LANGUAGE MODEL FOR SPEECH RECOGNITION

20220358912 · 2022-11-10 ·

MEETING-ADAPTED LANGUAGE MODEL FOR SPEECH RECOGNITION

20220358912 · 2022-11-10 ·

Meeting-adapted language model for speech recognition

11430433 · 2022-08-30 ·

Microsoft Technology Licensing, Llc

Meeting-adapted language model for speech recognition

11430433 · 2022-08-30 ·

Microsoft Technology Licensing, Llc

LANGUAGE AND GRAMMAR MODEL ADAPTATION

20220036893 · 2022-02-03 ·

Systems and methods described herein relate to adapting a language model for automatic speech recognition (ASR) for a new set of words. Instead of retraining the ASR models, language models and grammar models, the system only modifies one grammar model and ensures its compatibility with the existing models in the ASR system.

Patent classifications

G10L15/193