Active three-dimensional scene information acquisition method based on dimensionality-reduced sparse representation

Abstract

An active three-dimensional scene information acquisition method based on dimensionality-reduced sparse representation is provided. The method jointly processes multiple one-dimensional active detection signals collected synchronously to achieve three-dimensional positioning of objects in a detected scene or three-dimensional reconstruction of a scene structure. Through an active detection system equipped with one transmitter and multiple receivers, simultaneous three-dimensional positioning of multiple targets in a scene or three-dimensional reconstruction of the geometry of the scene is achieved.

Claims

1. An active three-dimensional scene information acquisition method based on dimensionality-reduced sparse representation, comprising the following steps: S1: calibrating an active detection system, the active detection system comprising a transmitter and a plurality of one-dimensional detection signal receivers; S2: detecting a detected scene, and collecting a multi-channel detection signal within a detection period; and S3: establishing and solving a sparse representation optimization model according to a calibration result and the collected detection signal.

2. The active three-dimensional scene information acquisition method based on dimensionality-reduced sparse representation according to claim 1, wherein a specific process of step S1 is: making the transmitter and the receivers of the active detection system face towards an open area in which there is only one object, starting the transmitter to transmit a detection signal s, recording a waveform of a single reflected signal received by each receiver, the received waveforms being recorded as s.sub.1, s.sub.2, . . . , s.sub.n, where n is a number of the receivers, and completing the calibration for the active detection system after completely recording the received waveforms.

3. The active three-dimensional scene information acquisition method based on dimensionality-reduced sparse representation according to claim 2, wherein a specific process of step S2 is: placing a plurality of receivers at a position that is not coplanar with the transmitter, a distance from the transmitter being not less than L, and for a detection signal with a wave velocity of v and a duration of t, L≥vt; making the transmitter and the receivers face the detected scene, when target information in the scene is to be acquired, activating step S21, and when a three-dimensional structure of the scene is to be acquired, activating step S22; S21: transmitting a detection signal s to a scene without a target, recording signals b.sub.1, b.sub.2, . . . , b.sub.n received by each receiver, when a target appears in a scene, transmitting a detection signal s to the scene with the target, and recording signals z.sub.1, z.sub.2, . . . , z.sub.n received by each receiver, y.sub.1=z.sub.1−b.sub.1, . . . , y.sub.2=z.sub.2−b.sub.2, . . . , y.sub.n=z.sub.n−b.sub.n; and S22: transmitting a detection signal s to a scene, and recording signals y.sub.1, y.sub.2, . . . , y.sub.n, received by each receiver.

4. The active three-dimensional scene information acquisition method based on dimensionality-reduced sparse representation according to claim 3, wherein a specific process of step S3 is: S31: dividing the detected scene into a plurality of voxels, and establishing sparse representation dictionaries D.sub.1, D.sub.2, . . . , D.sub.n,wherein a j.sup.th column of elements of the j.sup.th dictionary D.sub.i is a waveform that should be received by the j.sup.th receiver when the j.sup.th voxel has an object, and may be approximated by s.sub.i translating to a corresponding position; S32: constructing an overall sparse representation dictionary $D = [\begin{matrix} D_{1} \\ D_{2} \\ .Math. \\ D_{n} \end{matrix}]$ and an overall received signal $y = [\begin{matrix} y_{1} \\ y_{2} \\ .Math. \\ y_{n} \end{matrix}],$ the dictionary D being a matrix of p×q, estimating a noise level σ.sub.D of the dictionary and a noise level σ.sub.s of the received signal, if σ.sub.D.sup.2/∥D∥.sub.F.sup.2≤τ and σ.sub.s.sup.2/∥y∥.sub.2.sup.2≤τ, activating step S33, otherwise, activating step S34, a threshold τ being 0.05; S33: establishing and solving an optimization model $\min_{x} \frac{1}{2} {.Math. y - DWx .Math.}_{2}^{2} + λ {.Math. Wx .Math.}_{1},$ where λ=σ.sub.s√{square root over (2 log q)}, so as to obtain three-dimensional information Wx of the scene; and S34: assuming m=min (p,q), and taking an appropriate k, so that ${.Math.}_{i = 1}^{k} ω_{i}^{2} / {.Math.}_{i = 1}^{m} ω_{i}^{2} \approx σ_{s}^{2} / {.Math. y .Math.}_{2}^{2};$ performing singular value decomposition on the dictionary D to obtain D=U ΣV.sup.T; calculating D.sub.k=U.sub.k Σ.sub.k V.sup.T.sub.k: , where U.sub.k is the first k columns of U, V.sub.k is the first k columns of V, and a diagonal matrix Σ.sub.k is an intersection matrix of the first k rows and first k columns of Σ; and establishing and solving a dimensionality-reduced sparse representation optimization model $\min_{x} \frac{1}{2} {.Math. U_{k}^{T} y - {.Math.}_{k} V_{k}^{T} Wx .Math.}_{2}^{2} + λ {.Math. Wx .Math.}_{1},$ where λ=σ.sub.s.sup.2 log q, so as to obtain three-dimensional information Wx of the scene.

5. The active three-dimensional scene information acquisition method based on dimensionality-reduced sparse representation according to claim 4, wherein in steps S33 and S34, if a target information in the scene is acquired, a matrix W is an identity matrix; and if a structure information of the scene is acquired, a matrix W is a transformation matrix that makes Wx sparse.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

(1) FIG. 1 is a flowchart of a method of the disclosure;

(2) FIG. 2 is a schematic arrangement diagram of an indoor positioned scene;

(3) FIG. 3 is a coefficient solved by different models; and

(4) FIG. 4 is scene target information reconstructed by different models.

DETAILED DESCRIPTION OF THE EMBODIMENTS

(5) The drawings are only for illustrative purposes and should not be construed as limiting the patent.

(6) In order to better illustrate the present embodiment, some components in the drawings may be omitted, enlarged, or reduced, and do not represent the size of an actual product.

(7) It will be understood by those skilled in the art that some well-known structures and their descriptions in the drawings may be omitted.

(8) The technical solution of the disclosure will be further described below with reference to the accompanying drawings and embodiments.

Embodiment 1

(9) As shown in FIG. 1, anactive three-dimensional scene information acquisition method based on dimensionality-reduced sparse representation includes the following steps.

(10) S1: An active detection system is calibrated, the active detection system including a transmitter and multiple one-dimensional detection signal receivers.

(11) S2: A detected scene is detected, and a multi-channel detection signal within a detection period is collected.

(12) S3: A sparse representation optimization model is established and solved according to a calibration result and the collected detection signal.

(13) Further, a specific process of step S1 is that:

(14) the transmitter and the receivers of the system face towards an open area in which there is only one object with a size not greater than h.sub.1×h.sub.2×h.sub.3, the transmitter is started to transmit a detection signal s, a waveform of a single reflected signal received by each receiver is recorded, the received waveforms being recorded as s.sub.1, s.sub.2, . . . s.sub.n, where n is the number of receivers, and the calibration for the system is completed after completely recording the received waveforms.

(15) A specific process of step S2 is that:

(16) the multiple receivers are placed at a position that is not coplanar with the transmitter, a distance from the transmitter being not less than L, and for a detection signal with a wave velocity of v and a duration of t, L≥vt;

(17) the transmitter and the receivers face the detected scene, when target information in the scene is to be acquired, step S21 is activated, and when a three-dimensional structure of the scene is to be acquired, step S22 is activated;

(18) S21: a detection signal s is transmitted to a scene without a target, signals b.sub.1, b.sub.2, . . . , b.sub.n received by each receiver are recorded, when a target appears in a scene, a detection signal s is transmitted to the scene with the target, and signals z.sub.1, z.sub.2, . . . , z.sub.n received by each receiver are recorded, y.sub.1=z.sub.1−b.sub.1, y.sub.2=z.sub.2−b.sub.2, . . . , y.sub.n=z.sub.n−b.sub.n; and

(19) S22: a detection signal s is transmitted to a scene, and signals y.sub.1, y.sub.2, . . . , y.sub.n received by each receiver are recorded.

(20) A specific process of step S3 is that:

(21) S31: the detected scene is divided into multiple voxels having a size of h.sub.1×h.sub.2×h.sub.3, and sparse representation dictionaries D.sub.1, D.sub.2, . . . , D.sub.n are established, wherein a j.sup.th column of elements of the i.sup.th dictionary D.sub.i is a waveform that should be received by the j.sup.th receiver when the j.sup.th voxel has an object, and may be approximated by s.sub.i translating to a corresponding position;

(22) S32: an overall sparse representation dictionary

(23) $D = [\begin{matrix} D_{1} \\ D_{2} \\ .Math. \\ D_{n} \end{matrix}]$
and an overall received signal

(24) $y = [\begin{matrix} y_{1} \\ y_{2} \\ .Math. \\ y_{n} \end{matrix}]$
are constructed, the dictionary D being a matrix of p×q, a noise level σ.sub.D of the dictionary and a noise level σ.sub.s of the received signal are estimated, if σ.sub.D.sup.2/∥D∥.sub.F.sup.2≤τ and σ.sub.s.sup.2/∥y∥.sub.2.sup.2.fwdarw.τ, step S33 is activated, otherwise, step S34 is activated, a threshold τ being 0.05;

(25) S33: an optimization model

(26) $\min_{x} \frac{1}{2} {.Math. y - DWx .Math.}_{2}^{2} + λ {.Math. Wx .Math.}_{1}$
is established and solved, where λ=σ.sub.s√{square root over (2 log q)}, so as to obtain three-dimensional information Wx of the scene; and

(27) S34: it is assumed that m=min (p,q), and an appropriate k is taken, so that

(28) ${.Math.}_{i = 1}^{k} ω_{i}^{2} / {.Math.}_{i = 1}^{m} ω_{i}^{2} \approx σ_{s}^{2} / {.Math. y .Math.}_{2}^{2};$
singular value decomposition is performed on the dictionary D to obtain D=UΣV.sup.T; D.sub.k=U.sub.kΣ.sub.kV.sub.k.sup.T, where U.sub.k is the first k columns of U, V.sub.k is the first k columns of V, and a diagonal matrix Σ.sub.k is an intersection matrix of the first k rows and first k columns of Σ; and a dimensionality-reduced sparse representation optimization model

(29) 0 $\min_{x} \frac{1}{2} {.Math. U_{k}^{T} y - {.Math.}_{k} V_{k}^{T} Wx .Math.}_{2}^{2} + λ {.Math. Wx .Math.}_{1}$
is established and solved, where λ=σ.sub.s.sup.2 log q, so as to obtain three-dimensional information Wx of the scene.

(30) A scene of indoor positioning using ultrasonic wave is simulated on a computer. The scene size is 10 m×10 m×4 m, and evenly divided into 3200 voxels (the voxel size is 0.5 m×0.5 m×0.5 m), the speed of sound propagation is set to 350 m/s, and a system sampling rate is 7000 samples/second. A coordinate axis is established with one corner of the room as an origin, and a transmitter is placed at the origin. In addition, four non-coplanar receivers are placed with coordinates of (10, 10, 4), (6.5, 10, 2.5), (10, 6.5, 2.5) and (10, 10, 0.5).

(31) After environment initialization, a pulse amplitude modulated signal is generated as a detection signal, and based on the detection signal, sparse representation dictionaries D.sub.1, . . . , D.sub.n are constructed for each receiver. First, simulated detection is performed on a scene without a target. By simulating the indoor propagation, attenuation and reflection of the detection signal, and adding Gaussian white noise, background reflection signals that should be received by each receiver may be calculated, and record as b.sub.1, . . . , b.sub.4. Then, five objects are randomly placed in the scene, and the indoor propagation, attenuation and reflection of the signal are simulated again, and an overall reflected signal that should be received by each receiver may be calculated, and recorded as z.sub.1, . . . , z.sub.4. y.sub.1=z.sub.1−b.sub.1, y.sub.2=z.sub.2−b.sub.2, . . . , y.sub.4=z.sub.4−b.sub.4. An overall sparse representation dictionary

(32) $D = [\begin{matrix} D_{1} \\ .Math. \\ D_{4} \end{matrix}]$
and an overall received signal

(33) $y = [\begin{matrix} y_{1} \\ .Math. \\ y_{4} \end{matrix}]$
are constructed. By respectively constructing a sparse representation model

(34) $\begin{matrix} \min_{x} \frac{1}{2} {.Math. y - Dx .Math.}_{2}^{2} + λ {.Math. x .Math.}_{1} & (1) \end{matrix}$
and a dimensionality-reduced sparse representation model

(35) $\begin{matrix} \min_{x} \frac{1}{2} {.Math. U_{k}^{T} y - {.Math.}_{k} V_{k}^{T} x .Math.}_{2}^{2} + λ {.Math. x .Math.}_{1}, & (2) \end{matrix}$
a three-dimensional information vector x of the scene is solved. x solved in the model (1) is as shown in FIG. 3 (a), and x solved in the model (2) is as shown in FIG. 3 (b). The red line in the figure represents a real coefficient of the signal, and the blue line represents an estimated coefficient. Because the received signal contains noise, coefficients estimated by model (1) are not very accurate. In contrast, coefficients recovered by the model (2) are closer to a true value, and it can be seen that the model (2) is more suitable for situations with higher noise levels. By rearranging x, the elements of x are arranged in each voxel, and a 3D target positioning result shown in FIG. 4 is obtained. The left figure is a real scene, the middle figure is a 3D target recovered from the solution of the model (1), and the right figure is a 3D target recovered from the solution of the model (2). It can be seen that the model (2) is more suitable for recovering real 3D targets when there is noise.

(36) The same or similar reference numerals correspond to the same or similar parts.

(37) The description of the positional relationship in the drawings is only for illustrative purposes, and should not be understood as a limitation on this patent.

(38) Obviously, the foregoing embodiments of the disclosure are merely examples for clearly explaining the disclosure, and are not intended to limit the embodiments of the disclosure. For those of ordinary skill in the art, other different forms of changes or modifications can be made on the basis of the above description. There is no need and cannot be exhaustive for all implementations. Any modification, equivalent replacement and improvement made within the spirit and principle of the disclosure shall be included in the protection scope of the claims of the disclosure.

Active three-dimensional scene information acquisition method based on dimensionality-reduced sparse representation

Assignee

Inventors

Cpc classification

Classification Explorer

G01S13/003

PHYSICS

Classification Explorer

G01S15/89

PHYSICS

Classification Explorer

G01S15/42

PHYSICS

Classification Explorer

G01S7/539

PHYSICS

International classification

Classification Explorer

G01S7/539

PHYSICS

Classification Explorer

G01S13/00

PHYSICS

Abstract

Claims

Description