STATISTICAL ACOUSTIC SENSING-BASED SYSTEM AND METHOD FOR IN-VEHICLE CHILD PRESENCE DETECTION
20240329239 ยท 2024-10-03
Inventors
Cpc classification
B60Q9/00
PERFORMING OPERATIONS; TRANSPORTING
A61B2503/06
HUMAN NECESSITIES
International classification
G01S15/50
PHYSICS
B60Q9/00
PERFORMING OPERATIONS; TRANSPORTING
Abstract
The present invention provides a statistical acoustic sensing (SAS)-based method and system for in-vehicle Child Presence Detection. The SAS-based method comprises: transmitting acoustic signal to a subject in a vehicle cabin; receiving acoustic multipath signals scattered by the subject; and processing the acoustic multipath signals to detect presence of the subject by: extracting a plurality of channel impulse response (CIR) data from the received acoustic multipath signals; aggregating the extracted CIR data to estimate acoustic channel state information (CSI); obtaining an autocorrelation function (ACF) of the acoustic CSI based on a statistical acoustic sensing (SAS) model; and performing motion detection and breath tracking on basis of the ACF to detect presence of the subject in the vehicle cabin. The present invention can leverage in-car audio systems to detect presence of young children including newborns in an accurate and responsive manner.
Claims
1. A system for detecting presence of a subject in a vehicle cabin, comprising: one or more transmitters, each configured to transmit an acoustic signal to the subject in the vehicle cabin; and at least one receiver configured to receive a plurality of acoustic multipath signals scattered by the subject; a controller configured to generate one or more driving signals to control the one or more transmitters to transmit the acoustic signal; and a processor coupled with the controller and configured to receive the plurality of acoustic multipath signals from the receiver and process the plurality of acoustic multipath signals to detect presence of the subject in the vehicle cabin; and wherein the presence of the subject is detected by: extracting a plurality of channel impulse response (CIR) data from the plurality of received acoustic multipath signals; aggregating the plurality of extracted CIR data to estimate acoustic channel state information (CSI); obtaining an autocorrelation function (ACF) of the acoustic CSI based on a statistical acoustic sensing (SAS) model; and performing one or more physiological activity monitoring on basis of the ACF to detect presence of the subject in the vehicle cabin.
2. The system of claim 1, wherein the acoustic CSI is given by:
H(f,t)=?.sub.i?R.sub.
3. The system of claim 2, wherein the autocorrelation function (ACF) is given by:
4. The system of claim 3, wherein the one or more physiological activity monitoring includes motion detection; and the motion detection is performed by: calculating a channel gain of the acoustic CSI from the ACF; comparing the channel gain against a threshold; determining that motion is detected if the channel gain is equal or greater than the threshold.
5. The system of claim 3, wherein the channel gain is associated with the ACF by:
g(f)={tilde over (?)}(f,?)=?(f,?)+n(f,?), where g(f) denotes the channel gain; {tilde over (?)} (f, ?) is the sampled ACF calculated from a time series of CSI measurements with the noise term n(f, ?); and the channel gain is approximated as:
6. The system of claim 1, wherein the one or more physiological activity detection includes breathing tracking; and the breathing tracking is performed by: searching peaks in the autocorrelation function (ACF) over time corresponding to a cycle time of breathing; and determining that breathing is detected and tracked if the peaks are found.
7. The system of claim 6, wherein an optimized ACF is obtained by combining one or more autocorrelation function (ACF) corresponding to one or more subcarriers through a maximal ratio combining (MRC) algorithm; and the one or more physiological activity detection includes breathing tracking and the breathing tracking is performed by: searching peaks in the optimized ACF over time corresponding to a cycle time of breathing; and determining that breathing is detected and tracked if the peaks are found.
8. The system of claim 1, wherein the one or more driving signals are modulated with a pseudo-noise sequence.
9. The system of claim 1, wherein the pseudo-noise sequence is a Kasami sequence.
10. The system of claim 1, further comprising a first high-pass filter applied on the transmitted acoustic signal and a second high-pass filter applied on the received acoustic signal.
11. A method for detecting presence of a subject in a vehicle cabin, the method comprising: transmitting an acoustic signal to the subject in the vehicle cabin; receiving a plurality of acoustic multipath signals scattered by the subject; processing the plurality of acoustic multipath signals to detect presence of the subject in the vehicle cabin by: extracting a plurality of channel impulse response (CIR) data from the plurality of received acoustic multipath signals; aggregating the plurality of extracted CIR data to estimate acoustic channel state information (CSI); obtaining an autocorrelation function (ACF) of the acoustic CSI based on a statistical acoustic sensing (SAS) model; and performing one or more physiological activity monitoring on basis of the ACF to detect presence of the subject in the vehicle cabin.
12. The method of claim 1, wherein the acoustic CSI is given by:
H(f,t)=?.sub.i?R.sub.
13. The method of claim 12, wherein the autocorrelation function (ACF) is given by:
14. The method of claim 13, wherein the one or more physiological activity monitoring includes motion detection; and the motion detection is performed by: calculate a channel gain of the acoustic CSI from the ACF; comparing the channel gain against a threshold; determining that motion is detected if the channel gain is equal or greater than the threshold.
15. The method of claim 13, wherein the channel gain is associated with the ACF by:
g(f)={tilde over (?)}(f,?)=?(f,?)+n(f,?), where g(f) denotes the channel gain; {tilde over (?)}(f, ?) is the sampled ACF calculated from a time series of CSI measurements with the noise term n(f, ?); and the channel gain is approximated as:
16. The method of claim 11, wherein the one or more physiological activity detection includes breathing tracking; and the breathing tracking is performed by: searching peaks in the autocorrelation function (ACF) over time corresponding to a cycle time of breathing; and determining that breathing is detected and tracked if the peaks are found.
17. The method of claim 16, wherein an optimized ACF is obtained by combining one or more autocorrelation function (ACF) corresponding to one or more subcarriers through a maximal ratio combining (MRC) algorithm; and the one or more physiological activity detection includes breathing tracking and the breathing tracking is performed by: searching peaks in the optimized ACF over time corresponding to a cycle time of breathing; and determining that breathing is detected and tracked if the peaks are found.
18. The method of claim 11, wherein the one or more driving signals are modulated with a pseudo-noise sequence.
19. The method of claim 11, wherein the pseudo-noise sequence is a Kasami sequence.
20. The method of claim 11, further comprising: applying filtering the transmitted acoustic signal and the received acoustic signal with a high-pass filter respectively.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0014] Embodiments of the invention are described in more details hereinafter with reference to the drawings, in which:
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
DETAILED DESCRIPTION
[0044] In the following description, a method and a system for detecting presence of a subject in a vehicle cabin and the likes are set forth as preferred examples. It will be apparent to those skilled in the art that modifications, including additions and/or substitutions may be made without departing from the scope and spirit of the invention. Specific details may be omitted so as not to obscure the invention; however, the disclosure is written to enable one skilled in the art to practice the teachings herein without undue experimentation.
[0045]
[0046] Accordingly,
[0050] The presence of the subject is detected by: extracting a plurality of channel impulse response (CIR) data from the plurality of received acoustic multipath signals; aggregating the plurality of extracted CIR data to obtain an acoustic channel state information (CSI); obtaining an autocorrelation function (ACF) of the acoustic CSI based on a statistical acoustic sensing (SAS) model; and performing one or more physiological activity monitoring on basis of the ACF to detect presence of the subject in the vehicle cabin.
[0051] CSI, also known as Channel Frequency Response (CFR), is the frequency-domain counterpart of CIR. CSI for an acoustic multipath channel of frequency fat time t is denoted as the following equation:
[0052] where a.sub.r (t) and ?.sub.r (t) are the complex amplitude and propagation delay of the r-th reflection path, respectively, while R denotes the total number of paths.
[0053] While previous works mostly only focus on reflections from the range of interest and segment others out, the present invention properly aggregates multipath distortions caused by a target at a certain range and extract useful information for sensing. As shown in
[0054] From a rich-scattering perspective, each reflection path can be treated as a scatterer that scatters the incoming energy back to the receiver (i.e., microphone). Thus, it can infer the following equation:
where H.sub.i(f,t) denotes the component contributed by the i th scatterer, N(f,t) is the noise term with variance ?.sub.N.sup.2, and R.sub.S and R.sub.D denote the set of static and dynamic scatterers, respectively.
[0055] Assuming all scatterers are statistically independent of each other, each with the same variance ?.sub.i.sup.2(f) and approximately zero means, the ACF of H(f,t) obeys the 0.sup.th-order Bessel function of the first kind. That is, denoting ?.sub.i (f,?) as the ACF of H.sub.i (f,t) with time lag ?, it can be inferred that ?.sub.i (f,?)=J.sub.0(kvi?), where
v.sub.i is the moving speed of H.sub.i(f, t), k is the wavenumber.
[0056] Suppose there is one single moving target, and thus all dynamic scatterers have approximately the same speed v, v.sub.i?v, ?i?R.sub.D. This assumption is realistic because, for human subjects, the torso scatterers dominate others and have a similar speed. Then motion detection may be performed by: calculating a channel gain of the acoustic CSI from the ACF; comparing the channel gain against a threshold; determining that motion is detected if the channel gain is equal or greater than the threshold.
[0057] More specifically, the ACF ?(f, ?) of H(f, t) can be associated with the target's moving speed v as follows:
where ?( ) is the Dirac's delta function and g(f) is defined as the channel gain of H(f, t).
[0058] The channel gain is then associated with the ACF by: g(f)={tilde over (?)}(f, t)=?(f, ?)+n(f, ?), where {tilde over (?)}(f, ?) is the sampled ACF calculated from a time series of CSI measurements with the noise term n(f, ?).
[0059] From Eq. (3), it can be inferred that g(f)=lim.sub.?.fwdarw.0?(f, ?) since lim.sub.?.fwdarw.0J.sub.0(kv?)=1. Hence, given a sufficient CSI sampling rate F.sub.s, g(f) can approximate as the value of the first tap of the ACF, i.e.,
[0060] If any motion presents, the value of g(f) is greater than zero; otherwise g(f).fwdarw.0. In other words, the defined channel gain g(f) is used as a sensitive and robust indicator for acoustic motion detection.
[0061]
[0062] Breathing signals, which are periodic signals induced by repeated chest movements, are detected from the ACF. If a breathing signal is captured by CSI, the ACF will observe a prominent peak at the time lag Tb corresponding to the cycle time, as shown in
[0063] As indicated by Eq. (3), the ACF of CSI is a function of speed v, which underpins a statistical approach entirely different from the Doppler effect for speed estimation. Specifically, as shown in
where ?.sub.s is the time lag corresponding to the first local peak of the ACF ?(f, ?) and ?(f) is the wavelength of subcarrier f.
[0064] As seen, a peak in the ACF can either indicate a speed signal or a periodic signal. However, the peak locations for breathing (e.g., 1-5 s for breathing rates 60-12 BPM) are usually of magnitude longer than those for speed (e.g., <0.5 s for 0.5 m/s using 10 kHz sound, and the faster the speed, the smaller the delay), a sufficient difference to determine whether to estimate breathing or speed.
[0065] Several unique characteristics of sound waves make CSI measurements (or channel estimation) particularly challenging. First, the sound wave speed is orders of magnitude slower than that of light and EM waves, which imposes limitations on the max possible CSI sampling rate of the acoustic channel. For example, given the in-air sound speed of around 343 m/s, the propagation delay of a path of 7 meters in length will be greater than 20 ms, requiring a minimum channel measurement internal larger than 20 ms to avoid signal mixture. Second, acoustic sensing is vulnerable to environmental sound interference, especially when it is limited to a frequency band under 24 kHz on commodity devices. Ambient interference like the human voice, music, and natural sounds, can smear channel measurements for certain frequency bands. Moreover, concurrent sensing signals transmitted on multiple speakers, if used, may also interfere with each other.
[0066] To address above challenges, pulse coded modulation (PCM) samples are generated from a pseudo-noise sequence and applied on the speaker for channel estimation (or CSI estimation. In other words, the acoustic signal is PCM signal modulated with the pseudo-noise sequence. The PN sequence may be selected from, but not limited to, m-sequence, Golay sequence, GSM training sequence, and ZadoffChu (ZC).
[0067] Preferably, Kasami sequence is selected because of its superior properties of orthogonality and noise tolerance.
[0068]
[0069] Since a correlation operation is equivalent to a conjugate multiplication in the frequency domain, the measured CSI {tilde over (H)}(f) using Kasami sequence can be represented as
[0070] where H(f) denotes the ideal CSI, and S(f) and N(f) are the frequency-domain representations of Kasami sequence and sound noises respectively. S(f) is a wideband signal spanning over the whole spectrum, and S*(f) is its conjugate. The term ?S(f)?.sub.2.Math.H(f) approximates to a scaled version of H(f). An example of the measured CIR is shown in
[0071] There are two issues with the above channel estimation process. First, the Kasami sequences composed of 1's and ?1's with sharp transitions in between can be intrusive to human cars. Second, by transforming CIR, the obtained CSI spans the full spectrum, which might be polluted by the ambient sound noises, especially on the audible frequency band.
[0072] To circumvent these problems, high-pass filtering is applied on both the transmitted and received signals. The passband can be set flexibly, and the SAS-based detection system can work reliably even with only the inaudible pseudo-ultrasound band, e.g., above 18 kHz. Here, an empirical passband of 10 kHz is used as an example.
[0073] When the filter is applied on the transmitter side, as shown in
[0074] On the receiver side, the term N(f). S*(f) in Eq. 5 is eliminated by high-pass filtering. This is because typical daily sound interference, such as traffic and human voice, mostly occurs in the frequency band below 10 kHz.
[0075]
[0076] Due to hardware imperfections and software latency, however, the speakers and microphones are not perfectly synchronized, which makes it difficult to measure accurate channel response. Fortunately, synchronization errors only introduce phase offsets in CSI, which does not affect the detection accuracy because CSI is measured consecutively without blanks in between and the amplitude is only used.
[0077] Sound reflection off human bodies is considerably weak, a major reason confining the coverage of human-centric acoustic sensing. The problem is aggravated when the target subject is an infant/toddler in CPD applications.
[0078] In some embodiments, subcarrier diversity, a phenomenon attributed to frequency selective fading, is exploited to optimize the receiving SNR for better coverage. More specifically, a maximal ratio combining (MRC) approach is employed to combine multiple subcarriers optimally to utilize all multipath signals effectively. Since the noise terms on different subcarriers are statistically independent, the signal SNR by MRC can be maximized as
where {circumflex over (?)}(t) is the combined ACF, w(f) denotes the normalized weight for combining subcarrier f (i.e., ?.sub.f?Fw(f)=1), and F is the set of all subcarriers. The optimal weight w(f) should be linearly proportional to the gain on each subcarrier.
[0079] Following, the normalized g(f) is adapted, defined in Eq. (4), as the weight w(f) in the SAS-based detection system. Note that, some intuitive criteria commonly used like mean/variance of CSI amplitude cannot serve as the optimal weights for MRC, as they are not linearly proportional to the gain g(f) and subcarriers with higher amplitude means/variances do not necessarily better capture the sensing signals, as shown by
[0080] MRC can be applied here because, by taking the ACF, the sensing signals (either breathing or speed) are synchronized across different subcarriers (
[0081] In some embodiments, the subject is a child or an infant, and the one or more physiological activity monitoring may include motion detection and breath tracking such that the SAS-based detection system can be used as a CPD system.
[0082] For motion detection, the gains g(f) is averaged across all subcarriers and obtain
Then given a preset motion threshold ?, the system detects motion at any given time t if
[0083] For tracking breathing, there is need to find whether there exists a dominant peak in the enhanced sensing signal {circumflex over (?)}(t). To achieve so, similar criteria is adopted in for peak finding. Basically, the peak prominence is examined, width, and amplitude to identify potential peaks. Then, the peak location is checked to sift out those beyond the typical range of human breathing rates, e.g., 10-60 BPM. The motion level
[0084] In real-time CPD, a sliding window is employed on the continuous CSI to calculate the ACF. A shorter window of CSI (e.g., 1 s) is employed for calculating the ACF for motion detection to make it more responsive while saving computation. While for breathing, a minimum window larger than a typical breathing cycle (e.g., 6 s, which can be shorter for children who usually have higher breathing rates) is desired. As motion is more common and the computation is more efficient, only further perform breathing estimation is performed when no motion can be detected. Note that the system can output detection decisions as fast as every CSI sample, or at a predefined lower rate, e.g., every 1 second, to save energy. Once the time series of motion/breathing decisions are obtained, and they are checked within a certain window, e.g., 5 s, and child presence is claimed if there is a certain amount of motion/breathing detection, e.g., >30% of the window.
Evaluation
[0085] For performance evaluation, a prototype of the SAS-based detection system is implemented using a programming audio prototype, which consists of a MiniDSP UMA-8SP USB microphone array with 7 built-in Knowles SPH1668LM4H microphones (only one of them is used) and PUI Audio AS07104PO-R speakers connected to the MiniDSP board via cables, as shown in
[0086] A longer period of Kasami Sequence allows higher SNR for channel estimation, which, however, creates an immediate conflict with sampling rates. To tradeoff, a sequence of period 2.sup.10?1 modulated is used into 0.02 s, which allows a desired sampling rate of 50 Hz to use in the SAS-based detection system. By default, 3 seconds of CSI is used for motion calculation and use 8 seconds for breathing rate estimation.
[0087] By applying a high-pass filter, most of the daily environmental noises are eliminated. However, if there are sharp and short impulse-like noises (e.g., horn honk/beep), the impacts may go above 10 KHz and cause false motion detection. That these kinds of sharp noises will impose a sudden change in the CSI amplitudes, which translates into a special ACF pattern, which linearly decreases first and then linearly increases (See Appendix A.3 for more details). Therefore, a detector is designed to identify this linear decrease-then-increase pattern and skip CPD during the interfered period. By doing so, the SAS-based detection system becomes immune to sharp noise like horn beep, an important feature making it more practical. Although this would reduce the effective protection time (the system is not working in presence of such noises), the impact is minimal because these noises are usually short (?1 s) while the SAS-based detection system detects so rapidly that it can find a period for detection.
[0088] The prototype is first evaluated with comprehensive indoor experiments to validate motion detection and breathing estimation. Motion and presence ground truths are manually labelled. Ground truth for adult breathing is measured by Plux piezoelectric Respiration (PZT) sensor. Infant simulators (e.g. SimNewB) have a preset fixed breathing rate. The ground truth breathing rate of children participants did not record as it is difficult to have their cooperation.
[0089] Detection rate (DR) and false alarm rate (FAR) are used as the evaluation metrics for motion, breathing, and overall presence detection, while the mean absolute breathing rate error is evaluated. To show the system's performance under extreme responsiveness constraints, by default a 2-second window is used for the decision. A higher DR is expected if a longer decision window is applied. Also, only one speaker is used for evaluation unless otherwise specified. Using more speakers is expected to provide larger coverage.
[0090] The motion detection performance is evaluated in a 7 m?5 m conference room. One microphone and one speaker are set up in the corner. The room is in an office building, with constant noise from the central fan and occasional footstep sounds when people pass by the outside corridor. An adult is asked to sit in a chair, at various distances from 1 m to 5 m, and only move his one hand slowly to mimic the tiny motion of a child. The system is also tested with the speaker facing different angles with respect to the subject. As shown in
[0091] The breathing estimation performance is evaluated in the same environment. First, the system is also tested with an adult subject at different distances, sitting still in a chair.
[0092] As shown in
[0093] A feasibility study is carried out with the SimNewB newborn simulator in a clinical facility, which features tens of beds and has continuous machinery and HVAC noises. The experimental setup is illustrated in
[0094] A real-world CPD study is conducted with young children in different cars and parking scenarios, such as parking lots, roadside parking, garage, etc. 15 young children are recruited and perform CPD in 7 different cars including sedan and SUV.
[0095] For each child, cases of different locations are tested, with either forward-facing or rear-facing car seats as regulated. For older children who can sit/crawl independently, cases of seats without the baby car seat are also tested. All the children wear their regular winter coats. Motion (awake) cases are tested for every child and evaluate breathing for children who are able to get asleep (or stay very quiet) during the test. The data collection for each child lasts about 30-60 minutes. During tests, the car is parked and locked with windows closed, the typical scenario that hot-car deaths may occur. There are cars parking around and/or passing by, and parents and the experimenters talking/standing/walking around the car. There are frequent traffic noises during most of the tests, done in central downtown Hong Kong. In total, 15 children (aged 7 months, 12 months, 18 months, 2 (2?),3 (4?),4 (2?),5 (3?), and 10 years old, respectively) tested in 7 different cars, including Lexus LS430, BMW 330, Mercedes-Benz C200, Mercedes-Benz S320, Tesla Model 3, Honda Jazz, Nissan Serena. One or two speakers is used for the real-world study, considering not all cars have four or more, and always use one single microphone. In most cases, the LOS condition is occluded, provided that the devices are installed in the front row while the kids are seated in the back.
[0096] The overall presence DR is mainly focused for this CPD test.
[0097] Furthermore, the performance is analyzed at different in-car locations. As shown in
[0098] To further understand the detection coverage in a car, a small toy car is used to simulate tiny motions at nine different on-seat and on-floor locations as shown in
[0099] It is also studied whether increasing number of speakers can increase the sensing coverage of the SAS-based detection system. A case study is presented with two speakers in a meeting room, where an adult sits in a chair moving one hand. As portrayed in
[0100] Besides the high accuracy, it is also critical to study false alarms, especially over a long period in diverse noisy environments like busy streets, noisy garages, etc. It is noted that the above real-world experiments were conducted in noisy urban areas (including noisy garages, busy streets, parking lots next to highways, etc) in downtown Hong Kong in the presence of cars, sirens, pedestrians, etc. To further understand the performance in different environments, it is carried out a relatively long-term evaluation in the busy Beijing City. The car is parked, without kids inside, in a busy garage and a crowded street for about 10 hours, respectively. A false alarm is reported if motion is detected for over 10% of the time for a sliding window of 2 s. The results show that the SAS-based detection system observes a FAR of 0.12% in the garage and 0.28% for the roadside parking case. In practice, a CPD system may not need to run for a long time, but perhaps only for a few minutes after the car is parked and locked, which will further reduce the chance of observing false alarms.
[0101] As a time-critical task, the detection latency of the SAS-based detection system is analyzed. To do so, the delay of the first decision for each test is evaluated. A 3 s window for ACF calculation is used for motion and an 8 s window for breathing, and then use another 2 s window for presence detection. Hence, the minimum delay will be 5 s if motion is detected and 10 s if there is no motion but breathing. With this configuration, the results show that the SAS-based detection system can output the first detection within 5.7 s for 81.9% of the time, 11.2 s for 95.2%, and 15.2 s for 98.8%. The minimum delays and thus the overall latency can be reduced by using a shorter window (e.g., 1 s) for motion detection, the most common case for CPD.
[0102] The SAS-based detection system is compared with the state-of-the-art approach BreathJunior, the closest to our work which successfully uses white noise for infant breathing monitoring. BreathJunior is implemented and performed comparison experiments using an infant simulator. The results demonstrate that the SAS-based detection system outperforms BreathJunior in both accuracy and coverage. As shown in
[0103] As said, the SAS-based detection system can work with any channel estimation methods that output CIR. The performance of using Kasami Sequence against using different CIR estimation methods are compared, including chirp signals (FMCW), Golay Sequence, MLS, Gold Sequence. As shown in
[0104] The SAS-based detection system's performance is examined on different devices. The SAS-based detection system is evaluated using JBL Stage1 621 car speaker and Linhuipad car microphone, both adopted in existing automobile audio systems. As shown in
[0105] The impacts of various factors and validate the robustness of the SAS-based detection system are evaluated. For more controllable data collection, the infant simulator is used instead of real babies for this study, and focuses more on breathing estimation.
[0106] The impact of background sound interference of different types is studied, including human voices, traffic noise, rain sound, wind sound, hailstone sound, and music. To better control the experiments, sound files of these noises are downloaded and play them through a loudspeaker around 50 dB next to the SAS-based detection system. As shown in
[0107] The performance is also evaluated under various background noise levels. Traffic noise and human voices are also focused for this test. Sound files of noise is played through a loudspeaker at various powers and distances and record the actual sound level received at the microphone. As shown in
[0108] The transmitting power of the speaker can affect performance. To verify this, the transmitted sound is varied from 46 dB to 53 dB and evaluate the breathing estimation error accordingly. As seen in
[0109] The band above 10 kHz is used by default in the experiments, which may still be intrusive to human cars. The performance with narrower and higher frequency bands is studied. To do so, the passband of the high-pass filter from 10 kHz to 22 kHz with a 2 kHz step is adapted.
[0110] As sound speed depends on temperature, the SAS-based detection system was also tested under high temperatures. In the experiment, surrounding air is heated to about 120? F. and then let it naturally cool down in a warm room of about 70? F. The system is kept running during the process and show the breathing estimation results in
[0111] Large synchronization errors are introduced to show that the SAS-based detection system is resilient to phase offsets. Particularly, the starting point of the received signals is shifted by an amount of time ranging from 0 to 1 s with a step of 0.1 s. As shown in
[0112] Infants and toddlers usually have higher breathing rates than adults. The performance of the SAS-based detection system with respect to a range of breathing rates from 30 BPM to 60 BPM is evaluated. The breathing rate for each run is fixed by controlling the motor of the infant simulator. The results show insignificant differences for various breathing rates.
[0113] The system is benchmarked overhead on a desktop (Intel i7-11700 @ 4.9 GHZ CPU), a MacBook Air M1, and a Raspberry Pi 3 Model B+, on which the SAS-based detection system use 0.52 s, 0.73 s, and 3.97 s respectively to process 10 s of the data stream. The results show that the SAS-based detection system can run in real-time on embedded devices, promising its integration into existing car control systems.
[0114] A sufficient sampling rate of CSI is required to estimate speed. Given a sound frequency f with wavelength ?(f), a moving speed v is expected to experience a peak at the delay of
Assume that it needs at least Q samples to reliably detect a peak, which corresponds to a delay of ?.sub.min=Q/F.sub.s. Then, it can derive the minimum sampling rate required to measure a speed of v by
which implies
In other words, the maximum speed can be calculated as
which becomes about 0.1 m/s at f=20 kHz (wavelength 1.7 cm), about 0.2 m/s at f=10 kHz, and about 2 m/s at f=1 kHz, assuming Q=5 and a sampling rate of about 50 Hz (considering the sound speed of c=343 m/s). Using lower frequencies immediately allows to support higher speed, which however may suffer more from ambient noises. How to break down the sampling rate limitations and achieve estimation of daily speed (e.g., 0.5 m/s to 2 m/s) using pseudo-ultrasound frequencies remains worthwhile direction.
[0115] For breathing signals, since the periodicity is independent of subcarrier frequency, MRC can be directly performed across subcarriers. However, a further trick is needed to combine speed signals because, for acoustic signals from 10 kHz to 24 kHz, the difference in the wavelengths cannot be neglected (the wavelength at 10 kHz is approximately twice of that at 24 kHz). Recall
Given the same speed v, the first local peaks of the ACF on different subcarriers will appear at different delays ?.sub.s. Hence, to combine subcarriers for speed signals, it needs to first compensate the linear offsets due to different wavelengths. Specifically, the ACF{tilde over (?)}(f, ?) can be expressed as a unit linearly proportional to ?(f), i.e.,
and then average on {tilde over (?)}(f, ?). The operation is equivalent to scaling the ACF in the time lag dimension, which can be achieved by interpolation.
[0116] Here it shows a simple but stringent proof of that synchronization errors do not affect the SAS-based detection system. Denote the CIR measured under synchronization offsets as {tilde over (h)} (t):
{tilde over (h)}(t)=circshift(h(t),?.sub.off)
where h (t) is the true CIR, ?.sub.off is the timing offset caused by asynchronization, and circshift (.) represents circular shift. The time offsets correspond to phase shifts in the frequency domain. Thus, the asychronized CSI can be inferred as the following equation:
where H(f) is the true CSI. Thus, |{tilde over (H)} (f)|=|H(f)| is obtained, meaning that the SAS-based detection system is resilient to synchronization errors.
[0117] The functional units and modules of the SAS-based detection system in accordance with the embodiments disclosed herein may be implemented using computing devices, computer processors, or electronic circuitries including but not limited to application specific integrated circuits (ASIC), field programmable gate arrays (FPGA), microcontrollers, and other programmable logic devices configured or programmed according to the teachings of the present disclosure. Computer instructions or software codes running in the computing devices, computer processors, or programmable logic devices can readily be prepared by practitioners skilled in the software or electronic art based on the teachings of the present disclosure.
[0118] All or portions of the methods in accordance to the embodiments may be executed in one or more computing devices including server computers, personal computers, laptop computers, mobile computing devices such as smartphones and tablet computers.
[0119] The embodiments may include computer storage media, transient and non-transient memory devices having computer instructions or software codes stored therein, which can be used to program or configure the computing devices, computer processors, or electronic circuitries to perform any of the processes of the present invention. The storage media, transient and non-transient memory devices can include, but are not limited to, floppy disks, optical discs, Blu-ray Disc, DVD, CD-ROMs, and magneto-optical disks, ROMs, RAMs, flash memory devices, or any type of media or devices suitable for storing instructions, codes, and/or data.
[0120] Each of the functional units and modules in accordance with various embodiments also may be implemented in distributed computing environments and/or Cloud computing environments, wherein the whole or portions of machine instructions are executed in distributed fashion by one or more processing devices interconnected by a communication network, such as an intranet, Wide Area Network (WAN), Local Area Network (LAN), the Internet, and other forms of data transmission medium.
[0121] The foregoing description of the present invention has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations will be apparent to the practitioner skilled in the art.
[0122] The embodiments were chosen and described in order to best explain the principles of the invention and its practical application, thereby enabling others skilled in the art to understand the invention for various embodiments and with various modifications that are suited to the particular use contemplated.