VOICE CAPTCHA
20230142081 · 2023-05-11
Assignee
Inventors
- John Benjamin FISLER (Belle Vernon, PA, US)
- Nikos POLIS (West Orange, NJ, US)
- Christopher JENNISON (Nashua, NH, US)
- Andrew Matkin (Bolton, MA, US)
- David ARDMAN (Quebec, CA)
- Nirvana Tikku (Brooklyn, NY, US)
Cpc classification
G10L17/24
PHYSICS
G06F21/32
PHYSICS
International classification
G06F21/32
PHYSICS
G10L17/24
PHYSICS
Abstract
A method of Completely Automated Public Turing test to tell Computers and Humans Apart (CAPTCHA) includes: recording, by a voice CAPTCHA module, a speech spoken by a user; determining, by a voice biometric service (VBS), whether a voiceprint matching the user's speech exists; and if a voiceprint matching the user's speech exists, verifying the user as a human user by the VBS. If a voiceprint matching the user's speech does not exist, the VBS i) generates a unique voiceprint for the user based on the user's speech, and/or ii) determines whether the user's speech is at least one of a synthetically generated speech and a previously recorded audio being played back. The user can perform a guest checkout without logging into the voice CAPTCHA module, in which case the VBS compares previously used voiceprints to the user's speech.
Claims
1. A method of Completely Automated Public Turing test to tell Computers and Humans Apart (CAPTCHA), comprising: recording, by a voice CAPTCHA module, a speech spoken by a user; determining, by a voice biometric service (VBS), whether a voiceprint matching the user's speech exists; and if a voiceprint matching the user's speech exists, verifying the user as a human user by the VBS.
2. The method of claim 1, further comprising: if a voiceprint matching the user's speech does not exist, generating by the VBS a unique voiceprint for the user based on the user's speech.
3. The method of claim 1, further comprising: if a voiceprint matching the user's speech does not exist, determining by the VBS whether the user's speech is at least one of a synthetically generated speech and a previously recorded audio being played back.
4. The method of claim 1, further comprising: presenting, by the voice CAPTCHA module, a login screen to the user; wherein the VBS determines whether the voiceprint matching the user's speech exists after the user has logged in.
5. The method of claim 2, further comprising: presenting, by the voice CAPTCHA module, a login screen to the user; wherein the VBS determines whether the voiceprint matching the user's speech exists after the user has logged in.
6. The method of claim 3, wherein the voice CAPTCHA module enables the user to perform a guest checkout without logging into the voice CAPTCHA module.
7. The method of claim 6, further comprising: comparing, by the VBS, previously used voiceprints to the user's speech.
8. The method of claim 2, further comprising: if a voiceprint matching the user's speech does not exist, determining by the VBS whether the user's speech is one of a synthetically generated speech and a previously recorded audio being played back.
9. The method of claim 8, wherein if the user's speech is not one of a synthetically generated speech and a previously recorded audio being played back, the VBS determines the user's speech to be a unique and authentic human voice.
10. The method of claim 9, wherein the unique voiceprint for the user is generated by the VBS after determining the user's speech is a unique and authentic human voice.
11. A system for implementing a method of Completely Automated Public Turing test to tell Computers and Humans Apart (CAPTCHA), comprising: a voice CAPTCHA module configured to record a speech spoken by a user; and. a voice biometric service (VBS) configured to: i) determine whether a voiceprint matching the user's speech exists, and ii) if a voiceprint matching the user's speech exists, verifying the user as a human user.
12. The system of claim 11, wherein: the VBS is configured to generate a unique voiceprint for the user based on the user's speech if a voiceprint matching the user's speech does not exist.
13. The system of claim 11, wherein: if a voiceprint matching the user's speech does not exist, the VBS is configured to determine whether the user's speech is at least one of a synthetically generated speech and a previously recorded audio being played back.
14. The system of claim 11, wherein: the voice CAPTCHA module is configured to present a login screen to the user; and the VBS is configured to determine whether the voiceprint matching the user's speech exists after the user has logged in.
15. The system of claim 12. wherein: the voice CAPTCHA module is configured to present a login screen to the user; and the TBS is configured to determine whether the voiceprint matching the user's speech exists after the user has logged in.
16. The system of claim 13, wherein: the voice CAPTCHA. module is configured to enable the user to perform a guest checkout without logging into the voice CAPTCHA module.
17. The system of claim 16, wherein: the VI3S is configured to compare previously used voiceprints to the user's speech.
18. The system of claim 12, wherein: if a voiceprint matching the user's speech does not exist, the VBS is configured to determine whether the user's speech is at least one of a synthetically generated speech and a previously recorded audio being played back.
19. The system of claim 18, wherein: if the user's speech is not one of a synthetically generated speech and a previously recorded audio being played back, the VBS is configured to determine the user's speech to be a unique and authentic human voice.
20. The system of claim 19, wherein: the VBS is configured to generate the unique voiceprint for the user after determining the user's speech is a unique and authentic human voice.
Description
BRIEF DESCRIPTION OF THE FIGURES
[0011] FIG.1a is a schematic diagram of various components of an example system for implementing the voice CAPTCHA method according to the present disclosure.
[0012]
[0013]
[0014]
[0015] FIG, 4 illustrates an example signal flow in an example system for implementing the voice CAPTCHA method for the case in which the user performs a “guest checkout”.
DETAI LED DESCRIPTION
[0016]
[0017]
[0018]
[0019] Continuing with
[0020]
[0021] Continuing with
[0022]
[0023] Continuing with
[0024] The MW 102 then sends a request to enroll the user with the VBS 103, as shown by the process arrow 4015. Once the VBS 103 sends to the MW 102 an indication that sufficient audio material from the user has been collected for training, as shown by the process arrow 4016, the MW 102 sends a request to the VBS 103 (as shown by the process arrow 4017) to start the training process to build a unique voiceprint. Once the training process for the voiceprint of the user has been completed, the VBS 103 sends to the MW 102 an indication that the unique voiceprint for the user has been successfully trained, as shown by the process arrow 4018.
[0025] As a summary, several examples of the method and the system according to the present disclosure are provided.
[0026] A first example of the method according to the present disclosure provides a method of Completely Automated Public Turing test to tell Computers and Humans Apart (CAPTCHA), comprising: recording, by a voice CAPTCHA module, a speech spoken by a user; determining, by a voice biometric service (VBS), whether a voiceprint matching the user's speech exists; and if a voiceprint matching the user's speech exists, verifying the user as a human user by the VBS.
[0027] A second example of the method modifying the first example of the method, the second method further comprising: if a voiceprint matching the user's speech does not exist, generating by the VBS a unique voiceprint for the user based on the user's speech.
[0028] A third example of the method modifying the first example of the method, the third method further comprising: if a voiceprint matching the user's speech does not exist, determining by the VBS whether the user's speech is at least one of a synthetically generated speech and a previously recorded audio being played back.
[0029] A fourth example of the method modifying the first example of the method, the fourth method further comprising: presenting, by the voice CAPTCHA module, a login screen to the user; wherein the VBS determines whether the voiceprint matching the user's speech exists after the user has logged in.
[0030] A fifth example of the method modifying the second example of the method, the fifth method further comprising: presenting, by the voice CAPTCHA module, a login screen to the user; wherein the VBS determines whether the voiceprint matching the user's speech exists after the user has logged in.
[0031] In a sixth example of the method modifying the third example of the method, the voice CAPTCHA module enables the user to perform a guest checkout without logging into the voice CAPTCHA module.
[0032] A seventh example of the method modifying the sixth example of the method, the seventh method further comprising: comparing, by the VBS, previously used voiceprints to the user's speech.
[0033] An eighth example of the method modifying the second example of the method, the eight method further comprising: if a voiceprint matching the user's speech does not exist, determining by the \IBS whether the user's speech is one of a synthetically generated speech and a previously recorded audio being played back.
[0034] In a ninth example of the method modifying the eighth example of the method, if the user's speech is not one of a synthetically generated speech and a previously recorded audio being played back, the VBS determines the user's speech to be a unique and authentic human voice.
[0035] In a tenth example of the method modifying the ninth example of the method, the unique voiceprint for the user is generated by the VBS after determining the user's speech is a unique and authentic human voice.
[0036] A first example of the system according to the present disclosure provides a system for implementing a method of Completely Automated. Public Turing test to tell Computers and Humans Apart (CAPTCHA), comprising: a voice CAPTCHA module configured to record a speech spoken by a user; and a voice biometric service (VBS) configured to: i) determine whether a voiceprint matching the user's speech exists, and ii) if a voiceprint matching the user's speech exists, verifying the user as a human user.
[0037] In a second example of the system modifying the first example of the system, the VBS is configured to generate a unique voiceprint for the user based on the user's speech if a voiceprint matching the user's speech does not exist.
[0038] In a third example of the system modifying the first example of the system, if a voiceprint matching the user's speech does not exist, the VBS is configured to determine whether the user's speech is at least one of a synthetically generated speech and a previously recorded audio being played back.
[0039] In a fourth example of the system modifying the first example of the system, the voice CAPTCHA module is configured to present a login screen to the user; and the VBS is configured to determine whether the voiceprint matching the user's speech exists after the user has logged in.
[0040] In a fifth example of the system modifying the second example of the system, the voice CAPTCHA module is configured to present a login screen to the user; and the VBS is configured to determine whether the voiceprint matching the user's speech exists after the user has logged in.
[0041] In a sixth example of the system modifying the third example of the system, the voice CAPTCHA module is configured to enable the user to perform a guest checkout without logging into the voice CAPTCHA module.
[0042] In a seventh example of the system modifying the sixth example of the system, the VBS is configured to compare previously used voiceprints to the user's speech.
[0043] In an eighth example of the system modifying the second example of the system, if a voiceprint matching the user's speech does not exist, the VBS is configured to determine whether the user's speech is at least one of a synthetically generated speech and a previously recorded audio being played back.
[0044] In a ninth example of the system modifying the eighth example of the system, if the user's speech is not one of a synthetically generated speech and a previously recorded audio being played back, the VBS is configured to determine the user's speech to be a unique and authentic human voice.
[0045] In a tenth example of the system modifying the ninth example of the system, the VBS is configured to generate the unique voiceprint for the user after determining the user's speech is a unique and authentic human voice.