Patent classifications
G10L25/39
Technologies for end-of-sentence detection using syntactic coherence
Technologies for detecting an end of a sentence in automatic speech recognition are disclosed. An automatic speech recognition device may acquire speech data, and identify phonemes and words of the speech data. The automatic speech recognition device may perform a syntactic parse based on the recognized words, and determine an end of a sentence based on the syntactic parse. For example, if the syntactic parse indicates that a certain set of consecutive recognized words form a syntactically complete and correct sentence, the automatic speech recognition device may determine that there is an end of a sentence at the end of that set of words.
Technologies for end-of-sentence detection using syntactic coherence
Technologies for detecting an end of a sentence in automatic speech recognition are disclosed. An automatic speech recognition device may acquire speech data, and identify phonemes and words of the speech data. The automatic speech recognition device may perform a syntactic parse based on the recognized words, and determine an end of a sentence based on the syntactic parse. For example, if the syntactic parse indicates that a certain set of consecutive recognized words form a syntactically complete and correct sentence, the automatic speech recognition device may determine that there is an end of a sentence at the end of that set of words.
Automated Speech Recognition Proxy System for Natural Language Understanding
An interactive response system mixes HSR subsystems with ASR subsystems to facilitate overall capability of voice user interfaces. The system permits imperfect ASR subsystems to nonetheless relieve burden on HSR subsystems. An ASR proxy is used to implement an IVR system, and the proxy dynamically determines how many ASR and HSR subsystems are to perform recognition for any particular utterance, based on factors such as confidence thresholds of the ASRs and availability of human resources for HSRs.
Automated Speech Recognition Proxy System for Natural Language Understanding
An interactive response system mixes HSR subsystems with ASR subsystems to facilitate overall capability of voice user interfaces. The system permits imperfect ASR subsystems to nonetheless relieve burden on HSR subsystems. An ASR proxy is used to implement an IVR system, and the proxy dynamically determines how many ASR and HSR subsystems are to perform recognition for any particular utterance, based on factors such as confidence thresholds of the ASRs and availability of human resources for HSRs.
SPEECH RECOGNITION METHOD AND DEVICE
This patent disclosure relates to a voice technology and discloses a voice recognition method and electronic device. In some embodiments of this disclosure, soft clustering calculation is performed in advance according to N gausses obtained by model training, to obtain M soft clustering gausses; when voice recognition is performed, voice is converted to obtain an eigenvector, and top L soft clustering gausses with highest scores are calculated according to the eigenvector, wherein the L is less than the M; and member gausses among the L soft clustering gausses are used as gausses that need to participate in calculation in an acoustic model in a voice recognition process to calculate likelihood of the acoustic model.
SPEECH RECOGNITION METHOD AND DEVICE
This patent disclosure relates to a voice technology and discloses a voice recognition method and electronic device. In some embodiments of this disclosure, soft clustering calculation is performed in advance according to N gausses obtained by model training, to obtain M soft clustering gausses; when voice recognition is performed, voice is converted to obtain an eigenvector, and top L soft clustering gausses with highest scores are calculated according to the eigenvector, wherein the L is less than the M; and member gausses among the L soft clustering gausses are used as gausses that need to participate in calculation in an acoustic model in a voice recognition process to calculate likelihood of the acoustic model.
TECHNOLOGIES FOR END-OF-SENTENCE DETECTION USING SYNTACTIC COHERENCE
Technologies for detecting an end of a sentence in automatic speech recognition are disclosed. An automatic speech recognition device may acquire speech data, and identify phonemes and words of the speech data. The automatic speech recognition device may perform a syntactic parse based on the recognized words, and determine an end of a sentence based on the syntactic parse. For example, if the syntactic parse indicates that a certain set of consecutive recognized words form a syntactically complete and correct sentence, the automatic speech recognition device may determine that there is an end of a sentence at the end of that set of words.
TECHNOLOGIES FOR END-OF-SENTENCE DETECTION USING SYNTACTIC COHERENCE
Technologies for detecting an end of a sentence in automatic speech recognition are disclosed. An automatic speech recognition device may acquire speech data, and identify phonemes and words of the speech data. The automatic speech recognition device may perform a syntactic parse based on the recognized words, and determine an end of a sentence based on the syntactic parse. For example, if the syntactic parse indicates that a certain set of consecutive recognized words form a syntactically complete and correct sentence, the automatic speech recognition device may determine that there is an end of a sentence at the end of that set of words.
DRONE DETECTION AND CLASSIFICATION WITH COMPENSATION FOR BACKGROUND CLUTTER SOURCES
A system, method, and apparatus for detecting drones are disclosed. An example method includes receiving a digital sound sample and partitioning the digital sound sample into segments. The method also includes applying a frequency and power spectral density transformation to each of the segments to produce respective sample vectors. For each of the sample vectors, the example method determines a combination of drone sound signatures and background sound signatures that most closely match the sample vector. The method further includes determining, for the sample vectors, if the drone sound signatures in relation to the background sound signatures that are included within the respective combinations are indicative of a drone. Conditioned on determining that the drone sound signatures are indicative of a drone, an alert message indicative of the drone is transmitted.
DRONE DETECTION AND CLASSIFICATION WITH COMPENSATION FOR BACKGROUND CLUTTER SOURCES
A system, method, and apparatus for detecting drones are disclosed. An example method includes receiving a digital sound sample and partitioning the digital sound sample into segments. The method also includes applying a frequency and power spectral density transformation to each of the segments to produce respective sample vectors. For each of the sample vectors, the example method determines a combination of drone sound signatures and background sound signatures that most closely match the sample vector. The method further includes determining, for the sample vectors, if the drone sound signatures in relation to the background sound signatures that are included within the respective combinations are indicative of a drone. Conditioned on determining that the drone sound signatures are indicative of a drone, an alert message indicative of the drone is transmitted.