Speech recognition method of and system for determining the status of an answered telephone during the course of an outbound telephone call
10320982 ยท 2019-06-11
Assignee
Inventors
- Lucas Merrow (Rollinsford, NH, US)
- Alexandra Drane (Winchester, MA, US)
- John Krocker (Hamilton, MA, US)
- Oleg Boulanov (Beverly, MA, US)
- Nasreen R. Quibria (Newton, MA, US)
- Michael R. Robinson (Scottsdale, AZ, US)
Cpc classification
H04M1/64
ELECTRICITY
H04M3/487
ELECTRICITY
H04M2203/2016
ELECTRICITY
H04M1/82
ELECTRICITY
International classification
H04M3/51
ELECTRICITY
H04M1/64
ELECTRICITY
H04M1/82
ELECTRICITY
Abstract
A system for determining the status of an answered telephone during the course of an outbound telephone call includes an automated telephone calling device for placing a telephone call to a location having a telephone number at which a target person is listed, upon the telephone call being answered, initiating a prerecorded greeting which asks for the target person and receiving a spoken response from an answering person and a speech recognition device for performing a speech recognition analysis on the spoken response to determine a status of the spoken response. If the speech recognition device determines that the answering person is the target person, the speech recognition device initiates a speech recognition application with the target person.
Claims
1. A method of determining the status of an answered telephone during the course of an outbound telephone call comprising: placing, with an automated calling system, a telephone call to a location having a telephone number at which a target person is listed; upon the telephone call being answered, initiating a prerecorded greeting which asks for the target person; determining the telephone call was not answered by an answering machine; receiving a spoken response from an answering person; performing a speech recognition analysis on the spoken response received from an answering person to determine a status of the spoken response; initiating, in response to the status of the spoken response, a prerecorded query selected from a plurality of prerecorded queries asking for the target person; wherein when the speech recognition analysis determines that the answering person is the target person, initiating a speech recognition application for interaction with the target person; and wherein the speech recognition application produces a series of acoustic outputs as prompts, which comprise queries about a particular topic, and are presented to the target person.
2. The method of claim 1, wherein the speech recognition application includes an application file programmed into the automated calling system.
3. The method of claim 1, wherein the speech recognition application records responses to the queries about the particular topic from the target person.
4. The method of claim 1, further comprising: eliminating a line noise that causes the speech recognition analysis to incorrectly detect the spoken response from the answering person.
5. The method of claim 1, wherein the speech recognition analysis comprises using a web access application, an educational application, a learning and lesson application, or a compliance application.
6. The method of claim 1, wherein the automated calling system includes a database of persons to whom the system is capable of initiating telephone calls.
7. The method of claim 1, further comprising: determining whether the spoken response during the prerecorded greeting is shorter than a predetermined time.
8. The method of claim 7, wherein the predetermined time is greater than 1600 milliseconds.
9. The method of claim 1, wherein determining the telephone call was not answered by the answering machine includes determining whether a beep occurs during the prerecorded greeting.
10. The method of claim 1, further comprising: playing an identification prompt in response to initiating the prerecorded query selected from the plurality of prerecorded queries asking for the target person.
11. An automated calling system, comprising: a processor; a memory including instructions stored thereupon, the instructions upon execution by the processor causes the processor to: initiate an automated telephone call to a location having a telephone number at which a target person is listed; upon the telephone call being answered, initiate a prerecorded greeting which asks for the target person; determine the telephone call was not answered by an answering machine; receive a spoken response from an answering person; perform a speech recognition analysis on the spoken response received from an answering person to determine a status of the spoken response; and initiate, in response to the status of the spoken response, a prerecorded query selected from a plurality of prerecorded queries asking for the target person; wherein when the speech recognition analysis includes a determination that the answering person is the target person, and initiation of a speech recognition application with the target person; and wherein the speech recognition application is configured to produce a series of acoustic outputs as prompts, which comprise queries about a particular topic for presentation to the target person.
12. The automated calling system of claim 11, wherein the speech recognition application includes an application file programmed into the automated calling system.
13. The automated calling system of claim 11, wherein the speech recognition application records responses to the queries about the particular topic from the target person.
14. The automated calling system of claim 11, wherein the processor is further configured to: eliminate a line noise that causes the speech recognition analysis to incorrectly detect the spoken response from the answering person.
15. The automated calling system of claim 11, wherein the speech recognition analysis comprises using a web access application, an educational application, a learning and lesson application, or a compliance application.
16. The automated calling system of claim 11, including a database of persons to whom the system is capable of initiating telephone calls.
17. The automated calling system of claim 11, wherein the processor is further configured to: determine whether the spoken response during the prerecorded greeting is shorter than a predetermined time.
18. The automated calling system of claim 17, wherein the predetermined time is greater than 1600 milliseconds.
19. The automated calling system of claim 11, wherein the processor is further configured to: determine whether a beep occurs during the prerecorded greeting.
20. The automated calling system of claim 11, wherein the processor is further configured to: play an identification prompt in response to initiating the prerecorded query selected from the plurality of prerecorded queries asking for the target person.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The foregoing and other objects of this invention, the various features thereof, as well as the invention itself may be more fully understood from the following description when read together with the accompanying drawings in which:
(2)
(3)
(4)
DETAILED DESCRIPTION
(5) Referring now to
(6) The automated telephone calling system 14 may also include a network interface that facilitates receipt of audio information by any of a variety of a networks, such as telephone networks, cellular telephone networks, the Web, Internet, local area networks (LANs), wide area networks (WANs), private networks, virtual private networks (VPNs), intranets, extranets, wireless networks, and the like, or some combination thereof. The system 12 may be accessible by any one or more of a variety of input devices capable of communicating audio information. Such devices may include, but are not limited to, a standard telephone or cellular telephone 18.
(7) Automated telephone calling system 14 includes a database of persons to whom the system 12 is capable of initiating telephone calls, each such person being referred to hereinafter as the target person, a telephone number associated with each person and a recorded data file that includes the target person's name. Such automated telephone calling devices are known in the art. As is described below, the automated telephone calling system 14 is capable of initiating a telephone call to a target person and playing a prerecorded greeting prompt asking for the target person. The system 14 then interacts with speech recognition system 16 to analyze responses received from the person on telephone 18.
(8) Speech recognition system 16 is an automated system on which a speech recognition application, including a series of acoustic outputs called prompts, which comprise queries about a particular topic, are programmed so that they can be presented to a respondent, preferably by means of a telephonic interaction between the querying party and the respondent. However, a speech recognition application may be any interactive application that collects, provides, and/or shares information. As examples, in the present invention, a speech application may be any of a group of interactive applications, including consumer survey applications; Web access applications; educational applications, including computer-based learning and lesson applications and testing applications; screening applications; consumer preference monitoring applications; compliance applications, including applications that generate notifications of compliance related activities, including notifications regarding product maintenance; test result applications, including applications that provide at least one of standardized tests results, consumer product test results, and maintenance results; and linking applications, including applications that link two or more of the above applications.
(9) In the preferred embodiment, each speech recognition application includes an application file programmed into the speech recognition system 16. Preferably, the series of queries that make up the application is designed to obtain specific information from the respondents to aid in customer or consumer service, education and research and development of particular products or services or other functions. For example, a particular speech application could be designed to ask respondents specific queries about a particular product or service. The entity that issues the application may then use this information to further develop the particular product or service. An application may also be used to provide specific information to a particular person.
(10) When the automated telephone calling device 12 initiates a call to the target person, it must be able to determine if the call was answered by a live person or by an answering machine. The method carried out by the system 12 is shown in flow diagram 20 of
(11) While the greeting prompt is being played, the speech recognition system 16 also attempts to detect the short tone or beep output by an answering machine, step 30. If no beep tone is detected, step 36, the speech recognition system 16 determines that a live person has answered the telephone and waits for a response to the greeting prompt, step 34. If a beep tone is detected, step 36, the speech recognition system 16 determines that an answering machine has answered the telephone, step 40.
(12) Once the speech recognition system 16 has determined that an answering machine has answered the telephone 18, at step 38 or 40, the greeting prompt is interrupted, step 41, and a prerecorded answering machine message prompt is played to the answering machine, step 42.
(13) As is the case during the playing of the greeting prompt, if, during the playing of the answering machine message prompt, a spoken response is detected which is greater than the predetermined time, step 44, or a beep tone is detected, step 46, the answering machine message prompt is interrupted and replayed, step 42. This insures that the answering machine message prompt is properly aligned with the answering machine recorder. When the answering machine message prompt has been played in its entirety without the detection of a spoken response, step 44 or a beep tone, step 46, the process ends, step 48.
(14) As described above, when the telephone 18 is answered, the speech recognition system 16 plays a greeting prompt, step 26,
(15) In step 50, the speech recognition system 16 determines that the answering person is the target person based on the answer to the greeting prompt. Typical answers that indicate that the answering person is indeed the target person include:
(16) A. Yes
(17) B. This is him/her/he/she/me/Y (name asked for)
(18) C. Yes it is
(19) D. Speaking
(20) E. Go ahead
(21) F. You got him/her
(22) When either of these responses is received by speech recognition system 16, the system analyzes the response and determines that the answering person is the target person. The speech recognition system then initiates the speech recognition application with the target person, step 52.
(23) If the spoken response to the greeting prompt received by speech recognition system 16 is determined to be one of the following:
(24) A. He/she is not here
(25) B. He/she is out (now)
(26) C. He/she is away
(27) the speech recognition system 16 determines that the target person is not available, step 54. Based on this determination, the speech recognition system 16 plays a message prompt asking the answering person if he/she will take a message, step 56. As shown in
(28) If the spoken response to the greeting prompt received by speech recognition system 16 is determined to be one of the following:
(29) A. No
(30) B. It isn't
(31) C. It isn't he/she/him/her
(32) D. No, this is not he/she/him/her
(33) E. No, this is not
(34) F. This is his/her wife/husband
(35) the speech recognition system 16 determines that the answering person has indicated that the answering person is not the target person, step 64. Based on this determination, the speech recognition system plays a hold prompt asking the answering person if the speech recognition system 16 should hold for the target person, step 66. As shown in
(36) If the spoken response to the greeting prompt received by speech recognition system 16 is determined to be one of the following:
(37) A. Who is calling?
(38) B. Who am I speaking with?
(39) C. What can I do for you?
(40) D. Can I ask what this is for?
(41) E. Can I ask what this is about?
(42) F. Can I tell him/her who this is?
(43) G. What is this/it for/about?
(44) H. Who is this?
(45) I. Who are you?
(46) the speech recognition system 16 determines that the answering person has asked for the identity of the entity responsible for the call, step 72. Based on this determination, the speech recognition system plays a prerecorded message that identifies the entity and, optionally, the reason for the call, step 74. The speech recognition system then plays the hold prompt asking the answering person if the speech recognition system 16 should hold for the target person, step 66. The speech recognition device 16 receives the spoken response to this prompt and analyzes it to determine the answer to the prompt. If the answer is determined to be No, the speech recognition system 16 plays the message prompt, step 56, and proceeds through the message procedure described above. If the answer is determined to be Yes, the system waits, step 68, until a further spoken response is detected. When a spoken response is received by the speech recognition system 16, the system plays an identification prompt, step 70, asking if the person is the target person. If the answer to the identification prompt is determined to be No, the speech recognition system 16 plays the message prompt, step 56, and proceeds through the message procedure described above. If the answer is determined to be Yes, the speech recognition system then initiates the speech recognition application with the target person, step 52, as indicated by connector 71.
(47) If the spoken response to the greeting prompt received by speech recognition system 16 is determined to be one of the following:
(48) A. Hold
(49) B. He's/she's here, hold on
(50) C. Just a minute/moment/second
(51) D. Yes, hold on
(52) the speech recognition system 16 determines that the answering person has asked the system to hold, step 76. Based on this determination, the system waits, step 68, until a further spoken response is detected. When a spoken response is received by the speech recognition system 16, the system plays an identification prompt, step 70, asking if the person is the target person. If the answer to the identification prompt is determined to be No, the speech recognition system 16 plays the message prompt, step 56, and proceeds through the message procedure described above. If the answer is determined to be Yes, the speech recognition system then initiates the speech recognition application with the target person, step 52, as indicated by connector 71.
(53) If, after the greeting prompt is played, step 26, the spoken response to the greeting prompt received by the speech recognition system 16 is determined to be Hello?, step 78, the speech recognition system 16 determines that the phone has been picked up by the target person. Based on this determination, the system plays the identification prompt, step 70, asking if the person is the target person. If the answer to the identification prompt is determined to be No, the speech recognition system 16 plays the message prompt, step 56, and proceeds through the message procedure described above. If the answer is determined to be Yes, the speech recognition system then initiates the speech recognition application with the target person, step 52, as indicated by connector 71.
(54) If the spoken response to the greeting prompt received by the speech recognition system 16 is determined to be one of the following:
(55) A. You (must) have the wrong number
(56) B. You've got the wrong number
(57) C. There is no one here by that name
(58) D. No one by that name lives here
(59) the speech recognition system 16 determines that it has dialed a wrong number, step 80. Based on this determination, the speech recognition system 16 plays the termination prompt, step 82, thanking the answering person, and the call is terminated.
(60) In the case in which the spoken response to the greeting prompt is unrecognized by the speech recognition system 16, or there is no response within a predetermined time period, such that none of the preceding steps can be followed, step 84, the speech recognition system 16 determines if any of the words of the response were recognized, step 86. If one or more of the words of the response are recognized, the speech recognition system 16 may play the identification prompt, asking if the answering person is the target person, step 88. In order to indicate to the answering person that the system did not understand the initial spoken response, the identification prompt may be modified to I did not hear you well, is this Y?
(61) If none of the words of the spoken response to the greeting prompt are recognized, step 86, or if no response is received, the system jumps to step 66, as indicated by connector 90 in
(62) Accordingly, the present invention provides a method of and system for determining the status of an answered telephone during the course of an outbound call. The system includes an automated telephone calling system which initiates a telephone call to a target person listed at a particular telephone number. When the telephone call is answered, the system first determines whether a live person or an answering machine has answered the telephone. If an answering machine is detected, the system leaves a message for the target person. If a live person answers the telephone, the system utilizes a speech recognition analysis to determine the status of the answering person and to attempt to locate the target person, if the answering person is not the target person. If the target person is available, the system initiates a speech recognition application. Otherwise, the system either attempts to leave a message for the target person, or simply terminates the call.
(63) The invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. For example, the system may be utilized to connect a live person to the target person. In this instance, once the system has determined that the target person is on the line, a live person may be connected to the target person, rather than the speech recognition algorithm, for the purpose of conducting a live conversation with the target person. The present embodiments are therefore to be considered in respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of the equivalency of the claims are therefore intended to be embraced therein.