G10L17/12

ELECTRONIC DEVICE
20200135194 · 2020-04-30

Disclosed herein is an electronic device. The electronic device according to an embodiment of the present disclosure includes: an input unit configured to receive a speech input including a wakeup word and a command word from a sound source; a communication unit configured to communicate with one or more other electronic devices; and an artificial intelligence unit configured to obtain a degree of recognition of the wakeup word in the electronic device, receive a degree of recognition of the wakeup word in each of the one or more other electronic devices, and perform a function corresponding to the command word when the electronic device has a highest priority based on the degree of recognition of the wakeup word in the electronic device and the degree of recognition of the wakeup word in each of the one or more other electronic devices. The degree of recognition of the wakeup word in the electronic device is obtained, in the electronic device, based on at least one of a score of the wakeup word or location information of the sound source.
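
The arbitration described in this abstract can be illustrated with a minimal Python sketch. The abstract does not say how the wakeup-word score and the location information are combined, nor how ties are resolved, so the weighted sum, the distance-based proximity term, the tie-break on device ID, and every name below (`Recognition`, `degree_of_recognition`, `has_highest_priority`) are assumptions made only for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Recognition:
    device_id: str
    wakeup_score: float       # assumed: wakeup-word detector confidence in [0, 1]
    source_distance_m: float  # assumed: estimated distance to the sound source


def degree_of_recognition(r: Recognition,
                          score_weight: float = 0.7,
                          max_distance_m: float = 10.0) -> float:
    """Combine score and location into one value (assumed weighted sum)."""
    # Closer sources give a proximity near 1, distant ones near 0.
    proximity = max(0.0, 1.0 - r.source_distance_m / max_distance_m)
    return score_weight * r.wakeup_score + (1.0 - score_weight) * proximity


def has_highest_priority(local: Recognition, remotes: List[Recognition]) -> bool:
    """True if the local device should perform the command.

    Ties are broken on device_id (an assumption) so that every device
    running the same comparison reaches the same conclusion.
    """
    def key(r: Recognition):
        return (degree_of_recognition(r), r.device_id)

    return all(key(local) > key(r) for r in remotes)


tv = Recognition("tv", wakeup_score=0.9, source_distance_m=2.0)
speaker = Recognition("speaker", wakeup_score=0.8, source_distance_m=5.0)

if has_highest_priority(tv, [speaker]):
    print("tv performs the function for the command word")
```

Each device would run the same comparison on its own value and the values received from the others, so only one device acts on the command word.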

SPEAKER VERIFICATION

A method of speaker verification comprises: comparing a test input against a model of a user's speech obtained during a process of enrolling the user; obtaining a first score from comparing the test input against the model of the user's speech; comparing the test input against a first plurality of models of speech obtained from a first plurality of other speakers respectively; obtaining a plurality of cohort scores from comparing the test input against the plurality of models of speech obtained from a plurality of other speakers; obtaining statistics describing the plurality of cohort scores; modifying said statistics to obtain adjusted statistics; normalising the first score using the adjusted statistics to obtain a normalised score; and using the normalised score for speaker verification.
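
The normalisation steps listed above can be sketched in Python as follows. The abstract does not state which statistics are used or how they are modified, so this sketch assumes the statistics are the cohort mean and standard deviation and that the adjustment is a lower bound (floor) on the standard deviation. The function names, the `min_std` floor, and the decision threshold are hypothetical.

```python
import statistics
from typing import Sequence


def normalised_score(first_score: float,
                     cohort_scores: Sequence[float],
                     min_std: float = 0.1) -> float:
    """Normalise the enrolled-user score against cohort score statistics."""
    # Statistics describing the cohort scores (assumed: mean and std).
    mean = statistics.fmean(cohort_scores)
    std = statistics.pstdev(cohort_scores)

    # Adjusted statistics (assumed): floor the standard deviation so a
    # tightly clustered cohort cannot inflate the normalised score.
    adjusted_std = max(std, min_std)

    return (first_score - mean) / adjusted_std


def verify(first_score: float,
           cohort_scores: Sequence[float],
           threshold: float = 2.0) -> bool:
    """Accept the speaker if the normalised score reaches the threshold."""
    return normalised_score(first_score, cohort_scores) >= threshold


# Score of the test input against the user's model, and against
# three models of other speakers (illustrative values only).
print(verify(4.0, [1.0, 2.0, 3.0]))
```

Without the floor, a cohort whose scores are nearly identical would give a standard deviation close to zero and an arbitrarily large normalised score; the floor is one simple way to avoid that.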

Speaker recognition based on vibration signals

An embodiment of a semiconductor package apparatus may include technology to acquire vibration information corresponding to a speaker, and identify the speaker based on the vibration information. Other embodiments are disclosed and claimed.

Method and apparatus for performing speaker recognition

Embodiments of the present invention perform speaker identification and verification by first prompting a user to speak a phrase that includes a common phrase component and a personal identifier. Then, the embodiments decompose the spoken phrase to locate the personal identifier. Finally, the embodiments identify and verify the user based on the results of the decomposing.

SPEAKER IDENTIFICATION ASSISTED BY CATEGORICAL CUES

Methods, computer program products, and systems are presented. The methods include, for instance: obtaining a media file including speech by one or more speakers. The language of the speech is identified and biographic data of a speaker of the speech is generated by analyzing semantics and vocal characteristics of the speech. The speaker is diarized and confidence in a resulting speaker label is evaluated against a threshold. The speaker label is adjusted with the language of the speech and biographic data of the speaker and produced as speaker metadata of the media file.

Generating dialogue based on verification scores

An example apparatus for generating dialogue includes an audio receiver to receive audio data including speech. The apparatus also includes a verification score generator to generate a verification score based on the audio data. The apparatus further includes a user detector to detect that the verification score exceeds a lower threshold but does not exceed a higher threshold. The apparatus includes a dialogue generator to generate dialogue to solicit additional audio data to be used to generate an updated verification score in response to detecting that the verification score exceeds a lower threshold but does not exceed a higher threshold.
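
The two-threshold behaviour described above can be sketched in Python. The abstract only specifies the middle band, where the score exceeds the lower threshold but not the higher one and dialogue is generated to solicit more audio. Treating scores above the higher threshold as "accept" and scores at or below the lower threshold as "reject" is an assumption, as are the threshold values, the `max_turns` limit, and the names `decide`, `run_dialogue`, and `score_fn`.

```python
from typing import Callable, Iterable, List


def decide(score: float, lower: float = 0.4, higher: float = 0.8) -> str:
    """Map a verification score to an action using two thresholds."""
    if score > higher:
        return "accept"              # assumed behaviour above the band
    if score > lower:
        return "solicit_more_audio"  # the band described in the abstract
    return "reject"                  # assumed behaviour below the band


def run_dialogue(score_fn: Callable[[List[bytes]], float],
                 utterances: Iterable[bytes],
                 max_turns: int = 3) -> str:
    """Keep soliciting audio while the updated score stays in the band.

    score_fn is a hypothetical scorer that returns an updated verification
    score from all audio collected so far.
    """
    collected: List[bytes] = []
    action = "solicit_more_audio"
    for turn, audio in enumerate(utterances, start=1):
        collected.append(audio)
        action = decide(score_fn(collected))
        if action != "solicit_more_audio" or turn >= max_turns:
            break
    return action


# Illustrative scorer: the score improves once a second utterance arrives.
scores_by_turn = {1: 0.5, 2: 0.9}
result = run_dialogue(lambda chunks: scores_by_turn[len(chunks)],
                      [b"first utterance", b"second utterance"])
print(result)
```

If the score is still inside the band when the turn limit is reached or the audio runs out, this sketch returns `"solicit_more_audio"` and leaves the fallback policy to the caller.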