METHOD AND APPARATUS FOR DECODING MULTI-USER MULTIPLE-INPUT MULTIPLE OUTPUT WIRELESS SIGNAL
20170230150 · 2017-08-10
Inventors
Cpc classification
H04L25/03955
ELECTRICITY
H04L25/03331
ELECTRICITY
H04L27/362
ELECTRICITY
H04L5/0025
ELECTRICITY
International classification
Abstract
A method of an RFC chip for a station, the RFC chip including a WLAN communication module to decode a DL MU-MIMO wireless signal, the method including: determining an FS layer column based on a product of a channel matrix of the station and an estimate of a steering matrix of the station; ordering columns of a sub-matrix of an effective channel matrix of the station to determine one or more SS layer columns from the sub-matrix, the sub-matrix being determined by removing, from the effective channel matrix, a column corresponding to the determined FS layer column; configuring a modified effective channel matrix based on the determined FS layer column and the ordered columns of the sub-matrix, the determined FS layer column being determined as an FS layer column of the modified effective channel matrix; and determining a symbol of the station based on the modified effective channel matrix.
Claims
1. A method of a radio frequency communication (RFC) chip for a station, the RFC chip comprising a wireless local area network (WLAN) communication module to decode a downlink (DL) multi-user multiple-input multiple-output (MU-MIMO) wireless signal, the method comprising: determining a full search (FS) layer column based on a product of a channel matrix of the station and an estimate of a steering matrix of the station; ordering columns of a sub-matrix of an effective channel matrix of the station to determine one or more single search (SS) layer columns from the sub-matrix, the sub-matrix being determined by removing, from the effective channel matrix, a column corresponding to the determined FS layer column; configuring a modified effective channel matrix based on the determined FS layer column and the ordered columns of the sub-matrix, the determined FS layer column being determined as an FS layer column of the modified effective channel matrix; and determining a symbol of the station based on the modified effective channel matrix.
2. The method of claim 1, wherein: the effective channel matrix of the station is a product of the channel matrix of the station and an estimate of a steering matrix associated with a plurality of stations, the plurality of stations comprising the station; and ordering columns of the sub-matrix of the effective channel matrix of the station comprises: ordering, based on a comparison associated with magnitudes of the columns of the sub-matrix, the columns of the sub-matrix after removing the column corresponding to the determined FS layer column, the ordered columns of the sub-matrix each being classified into an FS layer column of the modified effective channel matrix or an SS layer column of the modified effective channel matrix.
3. The method of claim 2, wherein: the station is one of the plurality of stations to receive the DL MU-MIMO wireless signal associated with symbols for the plurality of stations; and the effective channel matrix
4. The method of claim 1, the determining the symbol of the station comprises: determining a full rank matrix based on the modified effective channel matrix, where the full rank matrix is an M×M full rank matrix, and M is a total number of reception antennas of a plurality of stations comprising the station; and obtaining the symbol of the station based on the full rank matrix, wherein the DL MU-MIMO wireless signal is associated with symbols for the plurality of stations.
5. The method of claim 1, wherein a number of FS layer columns of the product is greater than or equal to ┌√{square root over (N.sub.i)}−1┐, where N is a total number of reception antennas of the station.
6. The method of claim 1, wherein: the effective channel matrix of the station is a product of the channel matrix of the station and an estimate of a steering matrix associated with a plurality of stations, the plurality of stations comprising the station; the station is one of the plurality of stations to receive the DL MU-MIMO wireless signal associated with symbols for the plurality of stations; and ordering the columns of the sub-matrix comprises: determining an index vector used for a fixed-complexity sphere decoder (FSD) ordering scheme for the sub-matrix, based on a squared norm of each row of a pseudo-inverse of the sub-matrix.
7. The method of claim 2, wherein a number of FS layer columns of the modified effective channel matrix is greater than or equal to ┌M−1┐, where M is a total number of reception antennas of the plurality of stations.
8. A radio frequency communication (RFC) chip for a station, the RFC chip comprising: a fixed-complexity sphere decoder (FSD) configured to decode a downlink (DL) multi-user multiple-input multiple-output (MU-MIMO) wireless signal associated with symbols for a plurality of stations, wherein the FSD is configured to: determine a full search (FS) layer column based on a product of a channel matrix of the station and an estimate of a steering matrix of the station; order, based on a comparison associated with magnitudes of columns of an effective channel matrix of the station, the columns of the effective channel matrix of the station; generate a modified effective channel matrix by reordering the ordered columns of the effective channel matrix such that a column of the effective channel matrix corresponding to the determined FS layer column is reordered to be an FS layer column of the modified effective channel matrix; determine a full rank matrix based on the modified effective channel matrix; and determine a symbol of the station based on the full rank matrix.
9. The RFC chip of claim 8, wherein the FSD is further configured to: generate the modified effective channel matrix by removing a column of the effective channel matrix.
10. The RFC chip of claim 9, wherein the FSD is further configured to: perform an interference whitening based on the removed column.
11. The RFC chip of claim 8, wherein: the station corresponds to an i-th station among the plurality of stations; and the effective channel matrix H.sub.i{circumflex over (F)} of the station corresponds to
12. The RFC chip of claim 8, wherein: the station corresponds to an i-th station among the plurality of stations; the FS layer column is determined by ordering the columns of the product H.sub.i{circumflex over (F)}.sub.i based on a determination of an index vector used for a fixed-complexity sphere decoder (FSD) ordering scheme for the product, based on a squared norm of each row of a pseudo-inverse of the product; and H.sub.i corresponds to the channel matrix of the i-th station, and {circumflex over (F)}.sub.i corresponds to the estimate of the steering matrix of the i-th station.
13. The RFC chip of claim 12, wherein a number of FS layer columns of the product is greater than or equal to ┌√{square root over (N.sub.i)}−1┐ and N is a number of reception antennas of the i-th station.
14. The RFC chip of claim 8, wherein: the station corresponds to an i-th station among the plurality of stations; each of the plurality of stations comprises a plurality of reception antennas; and the columns of the effective channel matrix of the i-th station are ordered based on a determination of an index vector used for a fixed-complexity sphere decoder (FSD) ordering scheme for the effective channel matrix, based on a squared norm of each row of a pseudo-inverse of the effective channel matrix.
15. The RFC chip of claim 14, wherein a number of FS layer columns of the effective channel matrix is greater than or equal to ┌√{square root over (M)}−1┐, and M is a total number of reception antennas of the plurality of stations.
16. The RFC chip of claim 8, wherein reordering of the ordered columns of the effective channel matrix is performed according to an FS layer index set ū.sub.FS.sup.i of the modified effective channel matrix and an SS layer index set ū.sub.SS.sup.−i of the modified effective channel matrix.
17. The RFC chip of claim 16, wherein: the FS layer index set ū.sub.FS.sup.i is an FS layer index set of the modified effective channel matrix
18. The RFC chip of claim 16, wherein: the FS layer index set ū.sub.FS.sup.i is an FS layer index set of the modified effective channel matrix
19. A mobile station comprising: a display; a system-on-chip (SoC) comprising a processor configured to process instructions associated with an application program, a system memory configured to store data, and a graphic processing unit (GPU) configured to process graphic data to be displayed on the display of the mobile station; multiple antennas configured to receive a downlink (DL) multi-user multiple-input multiple-output (MU-MIMO) wireless signal associated with symbols for a plurality of stations; and a fixed-complexity sphere decoder (FSD) configured to decode the DL MU-MIMO wireless signal, wherein the FSD is configured to: determine a full search (FS) layer column based on a product of a channel matrix of the mobile station and an estimate of a steering matrix of the mobile station; order, based on a comparison associated with magnitudes of columns of an effective channel matrix of the mobile station, the columns of the effective channel matrix of the mobile station; generate a modified effective channel matrix by reordering the ordered columns of the effective channel matrix such that a column of the effective channel matrix corresponding to the determined FS layer column is reordered to be an FS layer column of the modified effective channel matrix; determine a full rank matrix based on the modified effective channel matrix; and determine a symbol of the mobile station based on the full rank matrix.
20. The mobile station of claim 19, wherein the FSD is implemented in the SoC or implemented in a radio frequency communication (RFC) chip configured to communicate with the SoC.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0014] The accompanying drawings, which are included to provide a further understanding of the inventive concept, and are incorporated in and constitute a part of this specification, illustrate exemplary embodiments of the inventive concept, and, together with the description, serve to explain principles of the inventive concept.
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
DETAILED DESCRIPTION OF THE ILLUSTRATED EMBODIMENTS
[0029] In the following description, for the purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of various exemplary embodiments. It is apparent, however, that various exemplary embodiments may be practiced without these specific details or with one or more equivalent arrangements. In other instances, well-known structures and devices are shown in block diagram form in order to avoid unnecessarily obscuring various exemplary embodiments.
[0030] Unless otherwise specified, the illustrated exemplary embodiments are to be understood as providing exemplary features of varying detail of various exemplary embodiments. Therefore, unless otherwise specified, the features, blocks, components, modules, elements, and/or aspects of the various illustrations may be otherwise combined, separated, interchanged, and/or rearranged without departing from the disclosed exemplary embodiments. Further, in the accompanying figures, the size and relative sizes of blocks, components, modules, elements, etc., may be exaggerated for clarity and descriptive purposes. Also, like reference numerals may denote like or similar elements.
[0031] When an element is referred to as being “connected to” or “coupled to” another element, it may be directly connected to or coupled to the other element or intervening elements or may be present. When, however, an element or layer is referred to as being “directly connected to” or “directly coupled to” another element, there are no intervening elements present. For the purposes of this disclosure, “at least one of X, Y, and Z” and “at least one selected from the group consisting of X, Y, and Z” may be construed as X only, Y only, Z only, or any combination of two or more of X, Y, and Z, such as, for instance, XYZ, XYY, YZ, and ZZ. Like numbers refer to like elements throughout. As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items.
[0032] Although the terms “first,” “second,” etc. may be used herein to describe various elements, blocks, components, regions, and/or sections, these elements, blocks, components, regions, and/or sections should not be limited by these terms. These terms are used to distinguish one element, block, component, region, and/or section from another element, block, component, region, and/or section. Thus, a first element, block, component, region, and/or section discussed below could be termed a second element, block, component, region, and/or section without departing from the teachings of the present disclosure.
[0033] The terminology used herein is for the purpose of describing particular embodiments and is not intended to be limiting. As used herein, the singular forms, “a,” “an,” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. Moreover, the terms “comprises,” “comprising,” “includes,” and/or “including,” when used in this specification, specify the presence of stated features, integers, steps, operations, elements, components, and/or groups thereof, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
[0034] Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure is a part. Terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and will not be interpreted in an idealized or overly formal sense, unless expressly so defined herein.
[0035]
[0036] Although
[0037] The AP 110 may communicate with one or more user stations 120 on the downlink (DL) and uplink (UL). The downlink is the communication link from an AP to one or more user stations, and the uplink is the communication link from one or more user stations to an AP, e.g., AP 110. A user station may also directly communicate with another user station. A system controller 130 may provide coordination and control for a plurality of APs including the AP 110.
[0038] The MIMO system 100 employs multiple transmit and multiple receive antennas for data transmission on the downlink and the uplink. The AP 110 may have a number M of antennas, which represents multiple-input for downlink transmissions and multiple-output for uplink transmissions. For MU-MIMO, single-user (SU)-MIMO, spatial division multiple access (SDMA), or the like, a set Σ.sub.i=1.sup.KN.sub.i of antennas of selected user stations 120 collectively represents the multiple-output for downlink transmissions and the multiple-input for uplink transmissions, where K is the number of selected user stations 120, and N.sub.i is the number of antennas of the i-th user station 120. According to one or more exemplary embodiments, it may be desirable to have M≧Σ.sub.i=1.sup.1N.sub.i≧1 if the data symbol streams for the selected K user stations 120 are not multiplexed in code, frequency, or time by a multiplexing configuration, e.g., code division multiple access (CDMA), orthogonal frequency division multiplexing (OFDM), time division multiple access (TDMA), and the like.
[0039] The MIMO system 100 may be a time division duplex (TDD) system or a frequency division duplex (FDD) system. In a TDD system, the downlink and uplink share the same frequency band. In an FDD system, the downlink and uplink use different frequency bands. Further, the MIMO system 100 may support protocols conforming to various Institute of Electrical and Electronics Engineers (IEEE) 802.11 standards, e.g., IEEE 802.11ac, IEEE 802.11n, etc.
[0040]
[0041] Referring to
[0042] System Model for Interference Mitigation
[0043] In general, the DL MU-MIMO model can be modeled as communications between an AP that employs M transmit antennas and K user stations, where the i-th user station (STA.sub.i) includes N.sub.i receiving antennas (1=1, 2, . . . , K). As such, the DL MU-MIMO may be used in various wireless communication environments, e.g., wireless communication environments conforming to IEEE 802.11ac, Long Term Evolution (LTE), LTE-Advanced (LTE-A), etc. For example, if K user stations have the same number of antennas N, then N.sub.i=N for i=1, 2, . . . , K. Further, according to one or more exemplary embodiments, Σ.sub.i=1.sup.KN.sub.i=M. The received signal y.sub.i of the STA.sub.i may be represented as a linear combination of the transmitted data symbol vector s corrupted by additive Gaussian noise ν.sub.i, as designated in Equation 1.
[0044] where s.sub.i is a desired signal of STA.sub.i, H.sub.i is an N.sub.i×M channel matrix of STA.sub.i, and F is an M×Σ.sub.i=1.sup.KN.sub.i steering matrix for beamforming, which may be referred to as a precoding matrix. Further, ν.sub.i represents N.sub.i×1 complex Gaussian noise, e.g., additive white Gaussian noise (AWGN), with E[ν.sub.i.sup.Tν.sub.i]=σ.sup.2I.sub.N.sub.
[0045] which is a steering matrix of STA.sub.i,
and F=[F.sub.1 . . . F.sub.i−1 F.sub.i F.sub.i+1 . . . F.sub.K]. In some matrices, α.sub.pq.sup.i denotes the element at the p-th row and q-th column of a matrix A.sub.i. For example, h.sub.pq.sup.i denotes the element at the p-th row and q-th column of a matrix H.sub.i, and f.sub.pq.sup.i denotes the element at the p-th row and q-th column of a matrix F.sub.i. In some matrices, α.sub.p.sup.i denotes the element at the p-th row and the first column of a matrix A.sub.i having a single column (i.e., of a column vector A.sub.i) or α.sub.p.sup.i denotes the element at the first row and the p-th column of a matrix A.sub.i having a single row (i.e., of a row vector A.sub.i). In some embodiments, α.sub.p.sup.i denotes elements in the p-th column of a matrix A.sub.i. The desired signal s.sub.i may be uniformly distributed over a discrete and finite set x.sub.m.sub.
[0046]
F.sub.li=[F.sub.1 . . . F.sub.i−1 F.sub.i+1 . . . F.sub.K]
s.sub.li.sup.T=[s.sub.1.sup.T . . . s.sub.i−1.sup.Ts.sub.i+1.sup.T . . . s.sub.K.sup.T] Eq. 2
[0047] The steering matrix F is, in an ideally implemented device, orthogonally organized in a manner that suppresses interference. However, in reality, compressed feedback is used to reconstruct the steering matrix {circumflex over (F)}, which may lead to interference terms. More specifically, the effect of the interference term H.sub.iF.sub.lis.sub.li, may not be negligible in comparison with SU MIMO, owing to a quantization error.
[0048] For descriptive convenience, an illustrative example will be described with respect to the number of antennas of the AP 210 being four (M=4), the number of selected user stations 221, 222 being two (K=2), and the number of antennas per selected user station 221, 222 being two (N.sub.i=2 for 1=1, 2).
[0049] In the aforementioned configuration,
[0050] Utilizing channel state information (CSI) feedback from the selected user stations 221, 222, the channel matrices H.sub.1, H.sub.2 and the steering matrices F.sub.1, F.sub.2 may be determined (estimated).
[0051] To mitigate decoder degradation in an MU-MIMO environment, interference-aware decoding schemes may be employed, such as interference whitening, joint detection, and the like. One or more exemplary embodiments will be described with respect to joint detection with or without interference whitening.
[0052] Joint Detection Model for Interference Mitigation
[0053] In a typical interference-ignored MIMO decoder, when an interference component H.sub.iF.sub.lis.sub.li is regarded as noise, maximum-likelihood decoding minimizes the error probability P.sub.s(ŝ.sub.MLD,i≠s.sub.i) as expressed in Equation 3:
[0054] where “argmin,” the argument of the minimum, is the set of points of the given argument for which the given function attains its minimum value (for example, argmin.sub.xf(x)={x|∀y:f(y)≧f(x)}), the squared norm of x.sub.i, ∥x.sub.i∥.sup.2=x.sub.i.sup.Hx.sub.i, x.sub.i.sup.H is a Hermitian matrix of x.sub.i or ∥x.sub.i∥.sup.2=x.sub.i.sup.ex.sub.i, x.sub.i.sup.e is conjugate transpose of x.sub.i, {circumflex over (F)}.sub.i is an estimate of M×N.sub.i steering matrix of STA.sub.i determined using a compressed beamforming procedure, e.g., the compressed beamforming procedure described in IEEE Computer Society, “Wireless LAN medium access control (MAC) and physical layer (PHY) specifications: Enhancements for very high throughput for operation in bands below 6 GHz,” IEEE P802.11ac/D1.0 Std., January 2011. The symbol “̂” indicates an estimate of the corresponding signal or matrix. For example, ŝ indicates an estimate of the signal s.
[0055] When M=4 and N.sub.i=2, an estimate of s under maximum-likelihood detection may be determined according to Equation 3-1:
[0056]
[0057] When m.sub.i=16, 64, 256, and the like, the N.sub.i-dimensional square lattice is spanned by an m.sub.i-QAM constellation in each dimension. If 4-QAM is used, possible constellation points m.sub.1 in each of the two dimension N.sub.1 are four as shown in
is a vector of two received signals received by two antennas of the user station (STA.sub.T) 221.
[0058] As shown in
For ŝ.sub.1.sup.1 estimation, one of four points (a.sub.1, b.sub.1, c.sub.1, d.sub.1) is a possible candidate. For ŝ.sub.2.sup.1 estimation, one of four points (a.sub.2, b.sub.2, c.sub.2, d.sub.2) is a possible candidate. Thus, in the MLD of Equation 3-1, a total 4.sup.2(=16) combinations for (ŝ.sub.1.sup.1, ŝ.sub.2.sup.1) estimation are possible in 4-QAM constellation, i.e., (a.sub.1, a.sub.2), (a.sub.1, b.sub.2), (a.sub.1, c.sub.2), (a.sub.1, d.sub.2), (b.sub.1, a.sub.2), (b.sub.1, b.sub.2), (b.sub.1, c.sub.2), (b.sub.1, d.sub.2), (c.sub.i, a.sub.2), (c.sub.1, b.sub.2), (c.sub.1, c.sub.2), (c.sub.1, d.sub.2), (d.sub.1, a.sub.2), (d.sub.1, b.sub.2), (d.sub.1, c.sub.2), (d.sub.1, d.sub.2).
[0059]
[0060] However, as previously mentioned, the interference component H.sub.iF.sub.lis.sub.li may increase to such a degree that it can no longer be ignored. As such, to detect the entire streams, including the interference at STA.sub.i, namely ŝ.sub.MLD.sup.i.sup.
[0061] where Ω is a discrete and finite set of s, and
[0062] In the interference ignoring receiver indicated in Equation 3, the total noise strength is E[|H.sub.i{circumflex over (F)}.sub.lis.sub.li+ν.sub.i|.sup.2]; whereas the noise strength is |ν.sub.i|.sup.2 in the interference-aware receiver detector shown in Equation 4. In the case of E[|H.sub.i{circumflex over (F)}.sub.lis.sub.li+ν.sub.i|.sup.2]>E[|ν.sub.i|.sup.2], the noise strength of Equation 4 becomes lower than that of Equation 3, thus leading to the improvement of the receiver performance.
[0063] However, the increased degree of channel matrices significantly increases the calculation complexity of the detector in Equation 4 compared to that of Equation 3. To resolve this complexity problem, the fixed-complexity sphere decoder (FSD) algorithm, known as the low-complexity algorithm in the SU-MIMO, is examined and modified FSD algorithms for MU-MIMO will be described below. According to one or more exemplary embodiments, a method and apparatus for applying a reordered FSD scheme to detect MU-MIMO symbols will be provided.
[0064] Model of Channel Matrix Expansion on Generalized Sphere Decoder
[0065] In Equation 4, the signal detection of the entire s on STA.sub.i, ŝ.sub.MLD.sup.i, should be first performed to detect ŝ.sub.MLD,i.sup.i. However, because effective channel matrix
ŝ.sub.MLD.sup.i=argmin.sub.sεΩ∥
where
G.sup.i=
[0066] Consequently, effective channel matrix
[0067] Extended Fixed-Complexity Sphere Decoder for Downlink MU-MIMO System
[0068] A detector without considering interferences suffers from performance degradation by low signal-to-interference ratio (SIR). On the other hand, the optimal joint MU-MIMO detector suffers from an implementation complexity problem. In this sense, the FSD based algorithm would be attractive solution promising low complexity for supporting MU-MIMO communication system. In this disclosure, a new extended FSD decoder based on reconstruction of interference channel matrix for interference mitigation on MU-MIMO communication system is provided. One or more exemplary embodiments provides methods based on the channel matrix expansion using the generalized SD (GSD) and a new channel matrix reconstruction to offer enhanced receiver performance for each STA.
[0069] Adaptation of Conventional Fixed-Complexity Sphere Decoder
[0070] The FSD algorithm is one of the algorithms designed to simplify the calculation complexity problem in which the MLD expressed through Equation 3 exponentially increases based on modulation and to the degrees of the transmission antenna. The FSD considerably reduces calculation complexity using S.sub.i⊂x.sub.m.sub.
ŝ.sub.FSD,i=argmin.sub.s.sub.
[0071] Thus, the value of n.sub.i,j, the j.sup.th element of n.sub.S.sub.
[0072] To obtain quasi-ML performance, when the number of single-search (SS) layers and full-search (FS) layers are expressed as l.sub.i,SS and l.sub.i,FS respectively, the FSD needs to satisfy the constraint conditions shown in Equation 7.
l.sub.i,FS≧┌√{square root over (N.sub.i)}−1┐,l.sub.i,SS=N.sub.i−l.sub.i,FS Eq. 7
[0073] Under these conditions, the number of symbols searched by the FSD algorithm is m.sub.i.sup.l.sup.
[0074] After determining the l.sub.i,FS and l.sub.i,SS, The FSD channel matrix ordering which determines the detection ordering of the signal s.sub.i,j, for j=1, . . . , N.sub.i, may be performed with channel matrix
to achieve quasi-ML performance (L. G. Barbero and J. S. Thompson, “Fixing the complexity of the sphere decoder for MIMO detection,” IEEE Trans. Wireless Commun., vol. 7, pp. 2131-2142, June 2008.). In this process, the N.sub.i columns of H.sub.iH.sub.i{circumflex over (F)}.sub.i are ordered iteratively so that the signals with largest post-processing noise amplification are detected in the FS layers. On the other hand, the signals with smallest post processing noise amplification are detected in the SS layers.
[0075] When u.sub.i is the index vector used to the FSD ordering, it can be determined in every iteration according to the following, for k=N.sub.i, N.sub.i−1, . . . , 1.
[0076] (A) When, the k+1th index element of u.sub.i, u.sub.i,k+1 is obtained through the previous repetition process and H.sub.i(u.sub.
[0077] (B) When (H.sub.i,(u.sub.
[0078] where H.sub.i,(u.sub.
[0079] Using the index vector u.sub.i, the ordered channel matrix {tilde over (H)}.sub.i by ordering columns of H.sub.i can be obtained as below:
[0080] where
[0081] Then the ŝ.sub.FSD,i can be obtained by sequencing H.sub.i using u.sub.i=(u.sub.SS,i,u.sub.FS,i) which are obtained by repeating the process above and by performing the FSD tree search: it is similar to SD tree search in finding minimum Euclidean distance from N−1 to 0 layer, but its search tree isn't expanded to cover all m.sub.i-QAM constellation in u.sub.SS,i layers.
[0082] The basic FSD algorithm is known as the symbol detection method for the suboptimal MIMO that can be realized with low complexity in the typical MIMO communication system. Hereinafter, a method for effectively applying this algorithm in the MU-MIMO communication system will be provided.
[0083] Extended Scheme Using Reconstruction of Channel Matrix for Interference Mitigation
[0084] In common, the FSD algorithm has two components affecting receiver performance: 1) the structure of search tree and 2) channel matrix ordering. In conventional FSD, the search tree is divided into full-search (FS) layers and single-search (SS) layers according to Equation 7 for offering quasi-ML performance i.e., satisfying Equation 10.
P.sub.e(ŝ.sub.FSD,i≠s.sub.i)≦P.sub.e(ŝ.sub.MLD,i≠s.sub.i)+P.sub.e(s.sub.iε) Eq. 10
[0085] In particular, whenever P.sub.e(s.sub.iε)<<P.sub.e(s.sub.MLD,i≠s.sub.i), the FSD will experience close to optimal performance, and it is proved in the paper, M. O. Damen, K. Abed-Meraim, and J.-C. Belfiore, “Generalised sphere decoder for asymmetrical space-time communication architecture,”Electronics Lett., vol. 36, pp. 166-167, January 2000. In addition, through ordering, the channel matrix is ordered so that a signal with low signal strength is detected in the FS layers, in contrast to that in the SS layers (See J. Jalden, L. G. Barbero, B. Ottersten and J. S. Thompson, “The Error probability of the fixed-complexity sphere decoder,” IEEE Trans. Signal Processing, vol. 57, pp. 2711-2720, July 2009 (“Jalden et al.”).).
[0086] However, in case this method is applied to interference-aware joint, the search tree corresponding to extended channel matrix D′ may be divided into FS layers with l.sub.FS.sup.i≧┌√{square root over (M)}−1┐ and SS layers to satisfy l.sub.SS.sup.i=M−l.sub.FS.sup.i (when Σ.sub.i=1.sup.KN.sub.1=M). Through channel matrix ordering described in previous subsection, the interference channels are input to the FS layers because the signal strengths of the interference signal streams are mostly low. i.e., the channels corresponding to the desired signals are input to the SS layers.
[0087] Thus, the error probability P.sub.e(ŝ.sub.FSD.sup.i≠s) for the entire ŝ.sub.FSD.sup.i detected in STA.sub.i has a quasi-ML performance according to Jalden et al. However, the error probability of the desired signal P.sub.s(ŝ.sub.FSD,i.sup.i≠s.sub.i) cannot have a quasi-ML performance because each ŝ.sub.FSD,i.sup.i symbol operates in the SS layer, i.e., the error probability P.sub.e(ŝ.sub.FSD.sup.i≠s) is not optimal because P.sub.s(s.sub.iε) cannot assume less then P.sub.s(ŝ.sub.MLD,i≠s.sub.i).
[0088] Therefore, to ensure a quasi-ML performance of P.sub.e(ŝ.sub.FSD.sup.i.Math.s) and P.sub.e(s.sub.FSD,i.sup.i≠s.sub.i), columns that are larger than ┌√{square root over (M)}−1┐ in the effective channel matrix of the entire signal streams need to be included in the FS layers. Similarly, columns those are larger than ┌√{square root over (N.sub.i)}−1┌ in the matrix H.sub.i {circumflex over (F)}.sub.i of the desired signal streams need to be included in the FS layers.
[0089] Under this condition, the modified FSD ordering is characterized as follows, according to one or more exemplary embodiments.
[0090]
[0091] Referring to
[0092] According to one or more exemplary embodiments, the effective channel matrix
[0093] When u is the index vector used to the FSD ordering, it can be determined in every iteration according to the following, for k=M, M−1, . . . , 1.
[0094] (A) When, the k+1th index element of u, u.sub.k+1 is obtained through the previous repetition process and
[0095] (B) When (
[0096] where
[0097] Using the index vector u, the ordered channel matrix {umlaut over (H)}.sup.i by ordering columns of {umlaut over (H)}.sup.i can be obtained as below:
{umlaut over (H)}.sub.i±[
[0098] where is
[0099] From the {umlaut over (H)}.sup.i, which is ordered from
[0100] In operation S520, by conducting FSD ordering using H.sub.i{circumflex over (F)}.sub.i, the ordered index set u.sub.FS,i.sup.l of the FS layer of {tilde over (H)}.sub.i and the ordered index set u.sub.SS,i.sup.i of the SS layer of {tilde over (H)}.sub.i are obtained.
[0101] From the {tilde over (H)}.sub.i, which is ordered from H.sub.i{circumflex over (F)}.sub.i, columns that are larger than ┌√{square root over (N)}.sub.i−1┐ from the rightmost column of {tilde over (H)}.sub.i are included in the FS layers of {tilde over (H)}.sub.i. For example, as described above with respect to Equations 8 and 9, if N.sub.i is 4, at least one column, h.sub.i,u.sub.
[0102] In operation S530, by using the ordered index set obtained previously, the modified ordered index set ū.sub.FS.sup.i and u.sub.SS.sup.i may be obtained using the following equations:
ū.sub.FS.sup.i=u.sub.FS.sup.i∪u.sub.FS,i.sup.i
[0103] Consequently, the number of FS layers l.sub.FS.sup.i and the number of SS layers l.sub.SS.sup.i are obtained through the following conditions under which both P.sub.s(ŝ.sub.ML,i.sup.i≠s.sub.i) and P.sub.s(ŝ.sub.ML.sup.i≠s) satisfy the quasi-ML performance:
l.sub.FS.sup.i=|ū.sub.FS.sup.i|
l.sub.SS.sup.i=M−l.sub.FS.sup.i=|ū.sub.SS.sup.i| Eq. 12
[0104] where |ū.sub.FS.sup.i| is the number of elements of the set ū.sub.FS.sup.i, and |ū.sub.SS.sup.i| is the number of elements of the set ū.sub.SS.sup.i.
[0105] In operation S540,
[0106] Here, the extended full rank matrix
[0107] In operation S560, an FSD decoding may be performed using a joint detection scheme to obtain
[0108] where ⊂Ω). Columns of
[0109] In addition, unlike a comparative example shown in Koo et al., “An extendable Fixed-Complexity Sphere Decoder for Downlink Multi-User MIMO Communication System,” The journal of Korea Information and Communications Society, vol. 39A No. 04, pp. 180-187, April 2014, the example of
[0110] The value of n.sub.j, the j.sup.th element of n.sub.s, is either one or m.sub.i, according to the required receiver performance. For example, n.sub.s., may be (1, 1, . . . , m.sub.i, m.sub.s) where the elements corresponding to one are associated with the SS layers and the elements corresponding to m.sub.i are associated with the FS layers.
[0111] In an example, when M=8, N.sub.1=4, N.sub.2=4, K=2,
[0112] Further, a 4×4 matrix H.sub.1{circumflex over (F)}.sub.1=[
[0113] According to S530 and S540,
[0114] In addition, the process described with respect to
[0115] Further, according to one or more exemplary embodiments, the second right most column
[0116] Although the above examples are described when Σ.sub.i=1.sup.KN.sub.i=M, in general,
[0117]
[0118] Referring to
[0119] In operation S620 (similar to the operation S520 of
[0120] In operation S630, by using the ordered index set obtained previously, the modified ordered index set ū.sub.FS.sup.i and ū.sub.SS.sup.i are obtained using the following equations:
u.sub.FS,i.sup.i⊂ū.sub.FS.sup.i⊂(u.sub.FS.sup.i∪u.sub.FS,i.sup.i)
u.sub.SS,i.sup.i⊂ū.sub.SS.sup.i⊂(u.sub.FS.sup.i∪u.sub.SS.sup.i)\ū.sub.FS.sup.i={u:uε(u.sub.FS.sup.i∪u.sub.SS.sup.i) and u⊂ū.sub.FS.sup.i} Eq. 15
[0121] Consequently, the number of FS layers l.sub.FS.sup.i and the number of SS layers l.sub.SS.sup.i are obtained through the following conditions under which both P.sub.s(ŝ.sub.ML,i.sup.i≠s.sub.i) an P.sub.s(ŝ.sub.ML.sup.i≠s) satisfy the quasi-ML performance:
l.sub.FS.sup.i=|ū.sub.FS.sup.i|
l.sub.SS.sup.i=|ū.sub.SS.sup.i Eq. 16
[0122] where |ū.sub.FS.sup.i| is the number of elements of the set ū.sub.FS.sup.i, and |ū.sub.SS.sup.i| is the number of elements of the set ū.sub.SS.sup.i. Unlike the embodiment of
[0123] In operation S640,
[0124] In an example, when M=8, N.sub.1=4, N.sub.2=4, K=2, and |ū.sub.n.sup.i=2,
[0125] Further, a 4×4 matrix H.sub.1{tilde over (F)}.sub.1=[
[0126] According to operations S630 and S640,
[0127] In operation S650, a full rank matrix
[0128] In operation S660, an FSD decoding may be performed using a joint detection scheme to obtain
[0129]
[0130] Referring to
[0131] In operation S720 (similar to the operation S520 of
[0132] In operation S730, by using the ordered index set obtained previously, the modified ordered index set ū.sub.FS.sup.i and ū.sub.SS.sup.i are obtained using the following equations:
u.sub.FS,i.sup.i
u.sub.SS,i.sup.i
ū.sub.n.sup.i=(u.sub.FS.sup.i∪u.sub.SS.sup.i)\(ū.sub.FS.sup.i∪ū.sub.SS.sup.i) Eq. 17
[0133] Consequently, the number of FS layers l.sub.FS.sup.i and the number of SS layers l.sub.SS.sup.i are obtained through the following conditions under which both P.sub.s(ŝ.sub.ML,i.sup.i≠s.sub.i) and P.sub.e(ŝ.sub.ML.sup.i≠s) satisfy the quasi-ML performance:
l.sub.FS.sup.i=|ū.sub.FS.sup.i|
l.sub.SS.sup.i=|ū.sub.SS.sup.i| Eq. 18
[0134] where |ū.sub.FS.sup.i| is the number of elements of the set ū.sub.FS.sup.i, and |ū.sub.SS.sup.i| is the number of elements of the set ū.sub.SS.sup.i. Unlike the embodiment of
[0135] In operation S740,
[0136] In an example, when M=8, N.sub.1=4, N.sub.2=4, K=2, and |ū.sub.n.sup.2|=2 for i=2 (for the station having index i=2),
[0137] Further, a 4×4 matrix H.sub.2{circumflex over (F)}.sub.2=[
[0138] According to operations S730 and S740,
[0139] In operation S741, it is determined whether |ū.sub.n.sup.i|>0. If (|ū.sub.n.sup.i=0), operation S750 is performed. In the operation S750, a full rank matrix,
[0140] In operation S760, an FSD decoding may be performed using a joint detection scheme to obtain
[0141] If (|ū.sub.n.sup.i|>0), operation S745 is performed. In the operation S745, an interference whitening may be applied to interference components eliminated from the joint detection scheme, e g an interference whitening for
[0142] In an example, an interference whitening may be performed using the following equations:
[0143] If |ū.sub.n.sup.i|=n, total n columns are removed from
[0144] In operation S755, an (M−n)×(M−n) full rank matrix,
[0145] Here, the extended full rank matrix
[0146] In operation S765, an FSD decoding may be performed using a joint detection scheme to obtain
[0147] where is the restricted search space of Ω=Π.sub.i=1.sup.Kχ.sub.m.sub.
[0148]
[0149] Referring to
[0150] Further, according to one or more exemplary embodiments, at least one column of the set u.sub.FS.sup.i may be included in the SS layers or may belong to columns corresponding to ū.sub.n.sup.i because at least one of the column(s) of the set u.sub.FS,i.sup.i is added into the FS layers. Thus, the total number of FS layer may not be increased even though the column(s) of the set u.sub.FS,i.sup.i is added into the FS layers.
[0151]
[0152] Referring to
H.sub.2{circumflex over (F)}.sub.2=[
H.sub.2{circumflex over (F)}.sub.3=[
[0153] If the FSD ordering result of H.sub.i{circumflex over (F)}.sub.i for i=2, based on e.g., the Equation 8, is [
[0154] In operation S920, column(s) corresponding to the index set u.sub.FS,i.sup.i may be removed from
[0155] In operation S930, an FSD ordering is conducted based on
[0156] A) When, the k+1th index element of u, u.sub.k+1 is obtained through the previous repetition process and
is the matrix in which columns of u.sub.k+1 selected through the previous repetition process are replaced with zero column vector,
is calculated.
[0157] (B) When
with iε[1, M−|u.sub.FS,i.sup.i|]−{u.sub.k+1}, the index is calculated through Equation 22.
[0158] where
for initial condition.
[0159] Using the index vector u, the ordered channel matrix {umlaut over (H)}.sub.sub.sup.i by ordering columns of
[0160] where
[0161] From the ordered {umlaut over (H)}.sub.sub.sup.i, columns that are larger than ┌√{square root over (M)}−|u.sub.FS,i.sup.i|−1┐ from the rightmost column of {umlaut over (H)}.sub.sub.sup.i may be included in the full search (FS) layers. For example, if M is 12 and |u.sub.FS,i.sup.i|=1 (when only
and the index set u.sub.SS.sup.i of the SS layer may be obtained from the rest columns other than the at least two columns,
[0162] In another example, from the ordered {umlaut over (H)}.sub.sub.sup.i, columns that are larger than ┌√{square root over (M−|u.sub.FS,i.sup.i|)}−1┐ from the rightmost column of {umlaut over (H)}.sub.sub.sup.i may be included in the full search (FS) layers. If M is 12 and |u.sub.FS,i.sup.i=1 (when only
and the index set u.sub.SS.sup.i of the SS layer may be obtained from the rest columns other than the at least three columns,
[0163] In operation S940,
[0164] In operation S950 (similar to the operation S550), an M×M full rank matrix
[0165] In operation S960 (similar to the operation S560), an FSD decoding may be performed using a joint detection scheme to obtain
[0166] Fixed-Complexity Sphere Decoding Algorithm
[0167] In order to obtain ⊂X.sub.m.sup.M, around the received vector y. The transmitted vector sε
with the smallest Euclidean distance is then selected as the solution. The process can be written as
[0168] where R is an N×N upper triangular matrix, with entries denoted r.sub.ij, obtained through QR decomposition of the channel matrix H and ŝ.sub.ZF=R.sup.−1Q.sup.Ty, is the unconstrained ML estimate of s.
[0169] The (squared) Euclidean distance in the following equations can be obtained recursively starting from i=N and working backwards until i=1 using
[0170] The term w.sub.i+1 can be seen as an APED down to level j=i+1 and the term e as the PED contribution from level i.
[0171]
[0172] The RFC module in the access point RFC_AP may be connected to, or may have, M antennas 1010a through 1010b. The RFC module in the user station RFC_STA may be connected to, or may have, Ni antennas 1060a through 1060b. The access point is a transmitting device for the downlink and a receiving device for the uplink. The user station is a transmitting device for the uplink and a receiving device for the downlink. A plurality of user stations including the user station may be selected from simultaneous transmission on the downlink or uplink. The beam-steering or some other spatial processing technique may be employed at the access point and the user station.
[0173] According to one or more exemplary embodiments, on the uplink, at the user station selected for uplink transmission, a TX data processor 1088 receives traffic data from a data source TD1 and control data from a controller 1080. The controller 1080 may store data in a memory 1082. TX data processor 1088 processes (e.g., encodes, interleaves, and modulates) the traffic data for the user station based on the coding and modulation schemes associated with the rate selected for the user station and provides a data symbol stream. A TX spatial processor 1090 performs spatial processing on the data symbol stream and provides transmit symbol streams for the antennas. Each transmitter unit (TX) 1054 receives and processes (e.g., converts to analog, amplifies, filters, and frequency upconverts) a respective transmit symbol stream to generate an uplink signal. Transmitter units 1054 provide uplink signals for transmission from antennas 1060a through 1060b to the access point.
[0174] At access point, antennas 1010a through 1010b receive the uplink signals from all selected user stations transmitting on the uplink. Each antenna 1010a through 1010b provides a received signal to a respective receiver unit (RX) 1022. Each receiver unit 1022 performs processing complementary to that performed by transmitter unit 1054 and provides a received symbol stream. An RX spatial processor 1040 performs receiver spatial processing on the received symbol streams from receiver units 1022 and provides recovered uplink data symbol streams. The receiver spatial processing is performed in accordance with the channel correlation matrix inversion (CCMI), minimum mean square error (MMSE), successive interference cancellation (SIC), or some other technique. Each recovered uplink data symbol stream is an estimate of a data symbol stream transmitted by a respective user station. An RX data processor 1042 processes (e.g., demodulates, deinterleaves, and decodes) each recovered uplink data symbol stream in accordance with the rate used for that stream to obtain decoded data. The decoded data for each user station may be provided for storage and/or to a controller 1030 for further processing. The controller 1030 may store data in a memory 1032.
[0175] According to one or more exemplary embodiments, on the downlink, at access point, a TX data processor 1010 receives traffic data from a data source TD2 for user stations scheduled for downlink transmission, control data from a controller 1030 and possibly other data from a scheduler 1034. The various types of data may be sent on different transport channels. TX data processor 1010 processes (e.g., encodes, interleaves, and modulates) the traffic data for each user station based on the rate selected for that user station. TX data processor 1010 provides downlink data symbol streams for the user stations. A TX spatial processor 1020 performs spatial processing on the downlink data symbol streams, and provides transmit symbol streams for the antennas 1010a through 1010b. Each transmitter unit (TMTR) 1022 receives and processes a respective transmit symbol stream to generate a downlink signal. Transmitter units 1022 provide downlink signals for transmission from antennas 1010a through 1010b to the user stations.
[0176] At each user station, antennas 1060a through 1060b receive the downlink signals from access point. Each receiver unit (RX) 1054 processes a received signal from an associated antenna 1060a through 1060b and provides a received symbol stream. An RX spatial processor 1060 performs receiver spatial processing on received symbol streams from receiver units 1054 and provides a recovered downlink data symbol stream for the user station. The receiver spatial processing is performed in accordance with the CCMI, MMSE, or some other technique. An RX data processor 1070 processes (e.g., demodulates, deinterleaves, and decodes) the recovered downlink data symbol stream to obtain decoded data for the user station.
[0177] The user station includes a fixed-complexity sphere decoder (FSD) configured to modify FSD ordering described above and to perform FSD processing. At least part of the functions and operations of the FSD illustrated herein may be performed by the RX spatial processor 1060, the RX data processor 1070, or another hardware block associated with at least one of the RX spatial processor 1060 and the RX data processor 1070. Further, the FSD may be configured as one or more logic circuits configured in the RFC module in the user station RFC_STA.
[0178] A channel estimation between the access point and selected user stations may be performed based on a signal transmission from the access point to the selected user stations and feedbacks from the selected user stations to the access point. Based on the communication between the access point and the selected user stations, a channel estimator 1028 of the access point and a channel estimator 1078 of each of the selected user stations may perform channel estimation operations. For example, the compressed beamforming procedure may be performed and each station may transmit feedback information to the access point. Based on the feedbacks of the user stations, the access point determines the estimate of the steering matrix of the user stations {circumflex over (F)}=[{circumflex over (F)}.sub.i {circumflex over (F)}.sub.li]. Then, each station estimates
[0179]
[0180] Referring to
[0181] The CPU 1110 may include a microprocessor including hardware, software, and/or firmware, which processes various tasks and operates an operating system (OS) and application programs. The CPU 1110 may include a plurality of processing cores and a cache memory. The CPU 1110 may control the GPU 1120, the memory controller 1130, the system memory 1140, the radio frequency communication module 1150, and the first to k-th IP blocks IP.sub.1 to IP.sub.k through the bus 1160.
[0182] According to one or more exemplary embodiments, the GPU 1120 may be coupled to the bus 1160 and may be configured to handle one or more processes (e.g., a series and/or simultaneous processes) in response to a control signal of the CPU 1110. For example, the GPU 1120 performs a video data processing. The GPU 1120 may be configured to handle one or more threads associated with video data processing, which may be performed via single threading or multithreading configuration. The GPU 1120 may include a control unit configured to calculate (or otherwise determine) and compare graphic-related data. The GPU 1120 may also be configured to analyze and execute a command(s) on behalf of the CPU 1110.
[0183] The SoC 1100 may be coupled to the external memory 1131 through a memory controller 1130 included in the SoC 1100. However, the external memory 1131 may be integrated as part of the SoC 1100. Data processed by the CPU 1110, the GPU 1120, the radio frequency communication module 1150, and the first to k-th IP blocks IP.sub.1 to IP.sub.k may be stored in the external memory 1131. The external memory 1131 may be controlled through the memory controller 1130. The external memory 1131 may include at least one of Static RAM (SRAM), Dynamic RAM (DRAM), and Synchronous DRAM (SDRAM), etc.; however, aspects of external memory 1131 are not limited thereto. Any suitable storage device (or repository) may be coupled to the SoC 1100 through any suitable I/O interface (not shown), which may be wired or wireless.
[0184] The system memory 1140 may be used as a working memory of the SoC 1100. The system memory 1140 may include at least one of Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), etc.
[0185] The radio frequency communication module 1150 may modify FSD ordering described above and perform FSD processing. The radio frequency communication module 1150 may be provided as the radio frequency communication module RFC_STA described with reference to
[0186] The IP blocks IP.sub.1 to IP.sub.k may be SoC components configured to perform various functions and operations in the SoC 1100. Each of the IP blocks IP.sub.1 to IP.sub.k may be classified into a master IP block or a slave IP block. It is also contemplated that the master IP block may serve as the slave IP block, or vice versa.
[0187]
[0188] Referring to
[0189] The application processor 1100 may control overall operations of the external memory 1200, the nonvolatile storage 1300, the display 1500, and the user interface 1600. The application processor 1100 may be provided as the SoC 1100 shown in
[0190] The external memory 1200 can be used as the external memory 1131 shown in
[0191] The nonvolatile storage 1300 includes various types of storages which retain stored data regardless power on/off conditions. For example, the nonvolatile storage 1300 includes at least one of various types such as a flash memory, a hard disk, a Read Only Memory (ROM), a Programmable ROM (PROM), an Electrically Programmable ROM (EPROM), an Electrically Erasable and Programmable ROM (EEPROM), a Phase-change RAM (PRAM), a Magnetic RAM (MRAM), a Resistive RAM (RRAM), a Ferroelectric RAM (FRAM). The nonvolatile storage 1300 may operate in response to a control of the application processor 1100. The memory controller 1130 shown in
[0192] The display 1500 displays information which is processed in mobile station 1000, according to a control of application processor 1100. One of the IP blocks in the application processor (AP) 1100 may be provided as a display controller coupled to the display 1500.
[0193] User interface 1600 detects a user input for controlling an operation of mobile station 1000, and generates corresponding input data. User interface 1600 may include an input device detecting a command or information from the user, such as a key pad, a mouse, a finger scan sensor, a dome switch, a touchpad, a jog wheel, and the like.
[0194] The RFC module 1110 may include at least one of Wireless Local Area Network (WLAN) communication module conforming to e.g., IEEE 802.11, and a cellular mobile communication module conforming to e.g., LTE, LTE-A, etc.
[0195]
[0196] Referring to
[0197] The radio frequency communication RFC module 2600 may be provided as an external component for the application processor 2100, and may be implemented in an RFC chip. The radio frequency communication module 2600 may be coupled to the application processor 2100 through a communication interface for interfacing between the application processor 2100 and the radio frequency communication module 2600. One of the IP blocks in the AP 2100 may be provided as the communication interface. The RFC module 2600 may include at least one of Wireless Local Area Network (WLAN) communication module conforming to e.g., IEEE 802.11, and a cellular mobile communication module conforming to e.g., LTE, LTE-A, etc.
[0198] The RFC module 1110 included in the AP 1100 of
[0199] Although certain exemplary embodiments and implementations have been described herein, other embodiments and modifications will be apparent from this description. Accordingly, the inventive concept is not limited to such embodiments, but rather to the broader scope of the presented claims and various obvious modifications and equivalent arrangements.