Ultrasonic-based person detection system and method
11867848 · 2024-01-09
Assignee
Inventors
Cpc classification
International classification
Abstract
An ultrasonic-based person detection method. The method comprising the steps of: (a) emitting, from an emitter, an ultrasonic signal, the ultrasonic signal including a component at a first frequency; (b) receiving reflections of the ultrasonic signal, the received signal including components at frequencies greater than and less than the first frequency; (c) determining a difference between an upper portion of the received signal containing a frequency higher than the first frequency, and a lower portion of the received signal containing a frequency lower than the first frequency; and (d) determining, based on the difference between the upper portion and the lower portion, whether a person is present.
Claims
1. An ultrasonic-based person detection method, comprising: (a) emitting, from an emitter, an ultrasonic signal, the ultrasonic signal including a component at a first frequency, f.sub.0; (b) receiving a reflected signal generated from reflections of the ultrasonic signal, the reflected signal including components at frequencies greater than and less than the first frequency, including an upper portion of the reflected signal containing a frequency higher than the first frequency and a lower portion of the reflected signal containing a frequency lower than the first frequency; (c) determining a difference between the upper portion of the reflected signal containing the frequency higher than the first frequency, and the lower portion of the reflected signal containing the frequency lower than the first frequency; and (d) determining, based on the difference between the upper portion and the lower portion, whether a person is present.
2. The method of claim 1, wherein the upper portion of the reflected signal contains higher frequencies immediately adjacent to the first frequency, and the lower portion of the reflected signal contains lower frequencies immediately adjacent to the first frequency.
3. The method of claim 1, wherein the method includes dividing the reflected signal into a plurality of bins, each bin representing a range of frequencies in the reflected signal, and wherein the upper portion is an upper frequency bin, containing portions of the reflected signal which are higher in frequency than the first frequency, and the lower portion is a lower frequency bin, containing portions of the reflected signal which are lower in frequency than the first frequency.
4. The method of claim 3, wherein the determination is performed based on a difference between a normalised power estimate of the upper frequency bin and a normalised power estimate of the lower frequency bin.
5. The method of claim 4, wherein a normalisation factor is a sum of the power estimates of the upper frequency bin and the lower frequency bin.
6. The method of claim 3, wherein determining whether the person is present includes determining a logit function of a normalised power estimate of the upper frequency bin.
7. The method of claim 6, wherein the logit function takes a form:
8. The method of claim 1, wherein steps (b)-(d) are repeated at a predetermined rate.
9. The method of claim 1, further comprising, after it has been determined that a person is present, determining whether the person is moving towards or away from a receiver receiving the reflected signal.
10. The method of claim 9, wherein determining whether the person is moving towards or away from the receiver is further based on a first likelihood ratio test, for determining whether the person is moving towards the receiver; and a second likelihood ratio test, for determining whether the person is moving away from the receiver.
11. The method of claim 10, wherein a log-likelihood ratio is derived for each likelihood ratio, and is computed recursively from a previous value of the respective log-likelihood ratio.
12. The method of claim 1, wherein when it has been determined that a person is present, the method includes taking a video conferencing device out of standby mode.
13. A system for detecting a person, the system including: an emitter, configured to emit an ultrasonic signal including a component at a first frequency, f.sub.0; one or more receivers, configured to receive a reflected signal generated from reflections of the ultrasonic signal; and one or more processors, configured, in response to the one or more receivers receiving the reflected signal, the reflected signal including components at frequencies greater than and less than the first frequency, including an upper portion of the reflected signal containing a frequency higher than the first frequency and a lower portion of the reflected signal containing a frequency lower than the first frequency, to: (a) determine a difference between the upper portion of the reflected signal containing the frequency higher than the first frequency, and the lower portion of the reflected signal containing the frequency lower than the first frequency; and (b) determine, based on the difference between the upper portion and the lower portion, whether a person is present.
14. The system of claim 13, wherein the upper portion of the reflected signal contains higher frequencies immediately adjacent to the first frequency, and the lower portion of the reflected signal contains lower frequencies immediately adjacent to the first frequency.
15. The system of claim 13, wherein the processor(s) are further configured to divide the reflected signal into a plurality of bins, each bin representing a range of frequencies in the reflected signal, and wherein the upper portion is an upper frequency bin, containing portions of the reflected signal which are higher in frequency than the first frequency, and the lower portion is a lower frequency bin, containing portions of the reflected signal which are lower in frequency than the first frequency.
16. The system of claim 15, wherein the determination is performed based on a difference between a normalised power estimate of the upper frequency bin and a normalised power estimate of the lower frequency bin.
17. The system of claim 16, wherein a normalisation factor is a sum of the power estimates of the upper frequency bin and the lower frequency bin.
18. The system of claim 15, wherein determining whether the person is present includes determining a logit function of a normalised power estimate of the upper frequency bin.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) Embodiments of the invention will now be described by way of example with reference to the accompanying drawings in which:
(2)
(3)
(4)
(5)
(6)
when a wideband signal is received;
(7)
(8)
(9)
(10)
DETAILED DESCRIPTION AND FURTHER OPTIONAL FEATURES
(11) Aspects and embodiments of the present invention will now be discussed with reference to the accompanying figures. Further aspects and embodiments will be apparent to those skilled in the art.
(12)
(13) As the ultrasonic signal propagates through the room, it reflects from various objects and/or interfaces. For example, after reflecting from a wall, an un-shifted reflection 104, i.e. one still at f.sub.0, is returned to the receiver 103. This un-shifted reflection is ignored, as it provides little information on the presence of people (indicated by movement) within the room. Whereas, after reflecting from person 105, who is moving towards the receiver 103, upshifted reflection 106 is returned to the receiver. The upshifted reflection 106 has a frequency higher than f.sub.0. This upshifted reflection provides information relating to the presence of a person within the room, particularly that the person is moving towards the receiver 103. Similarly, after reflecting from person 107, who is moving away from the receiver 103, downshifted reflection 108 is returned to the receiver. The downshifted reflection 108 has a frequency lower than f.sub.0. This downshifted reflection also provides information relating to the presence of a person within the room, and particularly that the person is moving away from the receiver 103.
(14) However, as discussed previously, transient noises 110 such as those generated by a door 109 slamming or hands clapping (which may originate from outside of the room) have a relatively broad frequency range and may contain components which have the same or similar frequency to the upshifted or downshifted components. These transient noises, which do not originate from emitter 101, can be interpreted by the receiver (or the processors connect thereto) as indicating the presence of a person.
(15)
(16) Next, at time t.sub.2, a transient signal 204 is received by the receiver. The signal is transient in that it has a limited presence in the x axis. However, the transient signal includes components at the same frequency as the upshifted signal 201, further upshifted signal 202, and downshifted signal 203. There is a risk then, that a processor connected to the receiver may interpret transient signal 204 as being indicative of a person being present.
(17)
(18) Next, in step 303, a logit of the normalised Doppler shift power is computed. Let f.sub.0 denote the frequency bin index that contains the emitted tone's frequency (e.g. 22000 Hz). The logit of the normalised power of the Doppler shift is then defined as:
(19)
(20) Where | . . . | denotes the absolute value, and logit
(21)
is the logit function for p. The argument of the logit function, i.e.
(22)
is the normalised power estimate of the frequency bin above f.sub.0 and the normalisation factor is the sum of the power estimates of the frequency bins above and below f.sub.0.
(23) This means that p is a number between zero and one, and can be likened to a probability. The logit function then transforms this probability such that it can take on values between .
(24) After this has been calculated for a given time-window, the method moves to steps 304 and 307 which are performed simultaneously. In step 304, a first log-likelihood ratio, log-likelihood ratio 0, is updated based on the computed logit, to indicate how likely it is that there is movement towards the receiver. At the same time, in step 307, a second log-likelihood ratio, log-likelihood ratio 1, is updated based on the computed logit, to indicate how likely it is that there is movement away from the receiver.
(25) In general, likelihood ratios do not have closed form expression, and so it can be computationally expensive to compute one. However, since the values of L(t) have been found to be approximately independent and normally distributed, simple expressions for the log-likelihood ratio can be derived.
(26) Log-likelihood ratios, of the type known per se in the art, have the general expression:
(27)
(28) Where p.sub.x|h.sub.
(29) Further, the log-likelihood ratios can be computed recursively, using the previous value and the new value of L(t). The initialisation of the log-likelihood ratios may include initialising them to zero, meaning that the initial likelihood ratio is one. This means that, at initialisation, the likelihood for motion is the same as the likelihood for no motion. Letting LLR.sub.0(t) denote the log-likelihood ratio of motion towards the receiver and LLR.sub.1(t) denote the log-likelihood ratio of motion away from the receiver, the update equations of the log-likelihood ratios can be specified as:
(30)
(31) In these expressions, is the expected change in magnitude, i.e. the expected deviation in the mean of L(t) from zero mean upon motion. This is a constant which is set during an initialisation stage. The variance of L(t) is denoted as var. This is either set to a fixed value during the initialisation stage, or estimated as the values of L(t) are computed.
(32) Once the log-likelihood ratios are calculated using some or all of the information from the computed logit, each log-likelihood ratio is compared to a threshold in steps 305 and 308. If one of the likelihood ratios exceeds its threshold, Yes in steps 305 and/or 308, then motion towards or away from the receiver can be determined in steps 306 and 309 respectively.
(33) Once motion has been detected, or not (No in steps 305 and 308) the method returns to step 302 for a new time-window. In this way, the motion detection method can operate continuously. In the example discussed below, the value of was selected as 5, and var was estimated from the values of L(t). In one example, an estimate value for var is obtained using the maximum likelihood estimator for L(t) in a time window when it was known that no motion was present. The maximum likelihood estimate can be calculated as the average of L(t).sup.2, for t in the time window when it is known that there is not motion.
(34) The logit function discussed above is particularly well suited for motion detection, for three reasons: (1) transient noise immunity; (2) normally distributed values; and (3) indicative of the direction of motion.
(35) Taking point (1) first,
(36)
when a transient, broadband signal is received.
(37) The upper graph in
(38) Thus, as seen in the lower graph in
(39) Turning next to point (2), the normally distributed values,
(40) Next, and with relation to point (3) the detection of motion,
(41)
(42) As can be seen, line 701 rises above threshold 703 between 4 and 5 seconds, and gives an indication that there is motion towards the receiver. At around 7 seconds, line 702 rises above the threshold 703 whilst line 701 falls below it, and gives an indication that there is motion away from the receiver.
(43)
(44) The logit function discussed with respect to
(45) Accordingly, L.sub.1(t) can be formulated as:
(46)
(47) i.e. replacing X(t, f.sub.01) in L(t) with X(t, f.sub.03). This results in a more robust signal for detection of motion towards the video conferencing device, since with normal walking speeds few Doppler shifts are received as low as f.sub.03. Further, noise immunity is still good, as broadband noises such as a door slamming or hands clapping have a very similar amount of energy in both frequency bins f.sub.03 and f.sub.0+1. However, logit function L.sub.1(t) does not perform as well when motion is directed away from the video conferencing device. Therefore the second logit function, L.sub.2(t), is employed which is formulated as:
(48)
(49) This is shown in Steps 303a-309a and 303b-309b for both logit functions, which are executed in parallel.
(50) While the invention has been described in conjunction with the exemplary embodiments described above, many equivalent modifications and variations will be apparent to those skilled in the art when given this disclosure. Accordingly, the exemplary embodiments of the invention set forth above are considered to be illustrative and not limiting. Various changes to the described embodiments may be made without departing from the spirit and scope of the invention.