ONLINE CONVEX OPTIMIZATION WITH PERIODIC UPDATES FOR DOWNLINK MULTI-CELL MIMO WIRELESS NETWORK VIRTUALIZATION
20230058926 · 2023-02-23
Inventors
Cpc classification
H04B7/024
ELECTRICITY
International classification
H04B7/024
ELECTRICITY
Abstract
A method and network node for online coordinated multi-cell precoding are provided. According to one aspect, a method includes receiving from each of the plurality of service providers a virtual precoder matrix determined by the corresponding service provider. The method also includes determining a precoder matrix by minimizing a precoding deviation from a virtualization demand of the network subject to at least one power constraint, the virtualization demand being based at least in part on a product of a channel state matrix and a virtual precoder matrix. The method further includes applying the determined precoder matrix to signals applied to a plurality of antennas to achieve a sum of throughputs for the plurality of service providers that is greater than a sum of throughputs for the plurality of service providers achievable when the virtual precoder matrices are applied to the signals.
Claims
1. A method of online coordinated multi-cell precoding for a network node configurable at least in part by an infrastructure provider, the network node being configured to facilitate sharing of wireless network infrastructure resources by a plurality of service providers, the method comprising: obtaining a global channel state, the global channel state being based at least in part on a channel state in each cell of a plurality of cells; communicating a channel state to a corresponding service provider; receiving a virtual precoding matrix from each service provider of the plurality of services providers, each virtual precoding matrix being based at least in part on the channel state communicated to the corresponding service provider and being further based at least in part on a condition that each service provider of the plurality of services providers is allowed to use all of a plurality of available antennas and available wireless spectrum resources; executing an optimization procedure to periodically determine a multiple input multiple output, MIMO, precoding matrix that minimizes an accumulated precoding deviation from a virtualization demand subject to constraints, the virtualization demand and the MIMO precoding matrix being based at least in part on the virtual precoding matrices received from the plurality of service providers; and applying the determined MIMO precoding matrix to signals applied to the plurality of available antennas to achieve a sum of throughputs for the plurality of service providers that is greater than a sum of the throughputs for the plurality of service providers achievable when the virtual precoder matrices are applied to the signals.
2. The method of claim 1, wherein the optimization procedure includes periodically determining a gradient of a convex loss function and providing the determined gradient as feedback to the determination of the MIMO precoding matrix.
3. The method of claim 2, wherein feedback in the optimization procedure is allowed to be delayed for multiple time slots, at least one of: received out of order; and partly missing within an update period.
4. The method of claim 1, wherein the optimization procedure includes an online projected gradient ascent algorithm that provides O(√{square root over (T)}) regret and O(1) long term constraint violation, where T is a total time horizon over which multiple updates of the determined MIMO precoding matrix occur.
5. The method of claim 1, wherein the accumulated precoding deviation is determined according to f.sub.t(x.sub.t)where T is a time horizon, x.sub.t is a decision in a sequence of decisions made by the network node, f.sub.t(x.sub.t) is a convex loss function,
f.sub.t(x.sub.t) is an accumulated loss and x.sup.o
arg min.sub.x∈X.sub.
f.sub.t(x) is the argument of
f.sub.t(x) that produces a minimum value of
f.sub.t(x).
6. The method of claim 1, further comprising dividing a total time horizon T into update periods, each update period having a duration of T.sub.o time slots, T.sub.o being at least one time slot, and updating the MIMO precoding matrix at a beginning or end of each update period.
7. The method of claim 6, wherein at a beginning of each update period, a decision is taken from a known convex decision space and a loss is determined by an end of the duration of T.sub.o time slots based at least in part on the decision, the loss being based at least in part on a convex loss function.
8. The method of claim 1, wherein the virtualization demand is further based at least in part on past channel states.
9. The method of claim 1, wherein the constraints include long term transmit power constraints and short term transmit power constraints.
10. The method of claim 1, wherein the MIMO precoding matrix is determined to provide sub-linear T-slot regret with partial feedback on the accumulated precoding deviation from the virtualization demand, where T is a total time horizon.
11. A network node configured for online coordinated multi-cell precoding, the network node configurable at least in part by an infrastructure provider, the network node being configured to facilitate sharing of wireless network infrastructure resources by a plurality of service providers, the network node including processing circuitry configured to: obtain a global channel state, the global channel state being based at least in part on a channel state in each cell of a plurality of cells; communicate a channel state to a corresponding service provider; receive a virtual precoding matrix from each service provider of the plurality of services providers, each virtual precoding matrix being based at least in part on the channel state communicated to the corresponding service provider and being further based at least in part on a condition that each service provider of the plurality of services providers is allowed to use all of a plurality of available antennas and available wireless spectrum resources; execute an optimization procedure to periodically determine a multiple input multiple output, MIMO, precoding matrix that minimizes an accumulated precoding deviation from a virtualization demand subject to constraints, the virtualization demand and the MIMO precoding matrix being based at least in part on the virtual precoding matrices received from the plurality of service providers; and apply the determined MIMO precoding matrix to signals applied to the plurality of available antennas to achieve a sum of throughputs for plurality of service providers that is greater than a sum of the throughputs for the plurality of service providers achievable when the virtual precoder matrices are applied to the signals.
12. The network node of claim 11, wherein the optimization procedure includes periodically determining a gradient of a convex loss function and providing the determined gradient as feedback to the determination of the MIMO precoding matrix.
13. The network node of claim 12, wherein feedback in the optimization procedure is allowed to be delayed for multiple time slots, at least one of: received out of order; and partly missing within an update period.
14. The network node of claim 11, wherein the optimization procedure includes an online projected gradient ascent algorithm that provides O(√{square root over (T)}) regret and O(1) long term constraint violation, where T is a total time horizon over which multiple updates of the determined MIMO precoding matrix occur.
15. The network node of claim 11, wherein the accumulated precoding deviation is determined according to f.sub.t(x.sub.t)where T is a time horizon, x.sub.t is a decision in a sequence of decisions made by an agent, f.sub.t(x.sub.t) is a convex loss function,
f.sub.t(x.sub.t) is an accumulated loss and x.sup.o
arg min.sub.x∈X.sub.
f.sub.t(x) is the argument of
f.sub.t(x) that produces a minimum value of
f.sub.t(x).
16. The network node of claim 11, wherein the processing circuitry is further configured to divide a total time horizon T into update periods, each update period having a duration of T.sub.o time slots, T.sub.o being at least one timeslot, and updating the MIMO precoding matrix at a beginning or end of each update period.
17. The network node of claim 16, wherein at a beginning of each update period, a decision is taken from a known convex decision space and a loss is determined by an end of the duration of T.sub.o time slots based at least in part on the decision, the loss being based at least in part on a convex loss function.
18. The network node of claim 11, wherein the virtualization demand is further based at least in part on past channel states.
19. The network node of claim 11, wherein the constraints include long term transmit power constraints and short term transmit power constraints.
20. The network node of claim 11, wherein the MIMO precoding matrix is determined to provide sub-linear T-slot regret with partial feedback on the accumulated precoding deviation from the virtualization demand, where T is a total time horizon.
21.-32. (canceled).
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0028] A more complete understanding of the present embodiments, and the attendant advantages and features thereof, will be more readily understood by reference to the following detailed description when considered in conjunction with the accompanying drawings wherein:
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
[0045]
[0046]
[0047]
[0048]
[0049]
[0050]
[0051]
DETAILED DESCRIPTION
[0052] Before describing in detail exemplary embodiments, it is noted that the embodiments reside primarily in combinations of apparatus components and processing steps related to online convex optimization with periodic updates for downlink multi-cell multiple input multiple output (MIMO) wireless network virtualization. Accordingly, components have been represented where appropriate by conventional symbols in the drawings, showing only those specific details that are pertinent to understanding the embodiments so as not to obscure the disclosure with details that will be readily apparent to those of ordinary skill in the art having the benefit of the description herein.
[0053] As used herein, relational terms, such as “first” and “second,” “top” and “bottom,” and the like, may be used solely to distinguish one entity or element from another entity or element without necessarily requiring or implying any physical or logical relationship or order between such entities or elements.
[0054] The term “network node” or “radio network node” used herein can be any kind of network node comprised in a radio network which may further comprise any of base station (BS), radio base station, base transceiver station (BTS), base station controller (BSC), radio network controller (RNC), g Node B (gNB), evolved Node B (eNB or eNodeB), Node B, multi-standard radio (MSR) radio node such as MSR BS, multi-cell/multicast coordination entity (MCE), relay node, donor node controlling relay, radio access point (AP), transmission points, transmission nodes, Remote Radio Unit (RRU) Remote Radio Head (RRH), a core network node (e.g., mobile management entity (MME), self-organizing network (SON) node, a coordinating node, positioning node, MDT node, etc.), an external node (e.g., 3rd party node, a node external to the current network), nodes in distributed antenna system (DAS) etc. The network node may also comprise test equipment. The term “radio node” used herein may be used to also denote a wireless device such as a WD or a radio network node.
[0055] A method and network node for online coordinated multi-cell precoding are provided. According to one aspect, a method includes receiving from each of the plurality of service providers a virtual precoder matrix determined by the corresponding service provider. The method also includes determining a precoder matrix by minimizing a precoding deviation from a virtualization demand of the network subject to at least one power constraint, the virtualization demand being based at least in part on a product of a channel state matrix and a virtual precoder matrix. The method further includes applying the determined precoder matrix to signals applied to a plurality of antennas to achieve a sum of throughputs for the plurality of service providers that is greater than a sum of throughputs for the plurality of service providers achievable when the virtual precoder matrices are applied to the signals.
Problem Formulation
[0056] In this subsection, a constrained OCO problem is formulated. Then, for a performance metric, a regret with partial feedback and long-term constraint violations is defined. Finally, an OCO algorithm with periodic updates is derived.
[0057] Under standard OCO settings, at the beginning of each time slot t, an agent makes a decision x.sub.t∈X.sub.0, where X.sub.0∈.sup.n is a known compact convex set. At the end of each time slot t, the system reveals the convex loss function f.sub.t(x):
.sup.n
to the agent, who then suffers a loss f.sub.t(x.sub.t). A goal of an OCO algorithm is to select a sequence {x.sub.t} such that the accumulated loss
f.sub.t(x.sub.t) is comparable with the one yielded by the optimal offline fixed strategy defined as x.sup.o
arg min.sub.x∈X.sub.
f.sub.t(x), with all convex loss functions known in hindsight, where
={0, . . . ,T−1}. A desired OCO algorithm should provide sub-linear regret defined as
f.sub.t(x.sub.t) −f.sub.t(x.sup.o). With the assumption of bounded gradient or sub-gradient, it is known that the online projected gradient ascent algorithm provides O(√{square root over (T)}) regret. Note that the compact convex set X.sub.0 can be expressed as a set of short-term constraints,
.sub.c(x.sub.t) ≤0, where
.sub.c(x.sub.t):
.sup.n
, c∈
with
={1, . . . , C}.
[0058] The standard OCO formulation is useful in many applications. However, it does not accommodate long-term constraints. OCO with long-term constraints was formulated that does not require .sub.c(x.sub.t)≤0 to be satisfied at each time slot t, but only requires the long-term constraint violation, defined as
.sub.c(x.sub.t), to grow sub-linearly. Recently, a virtual-queue-based OCO algorithm in provides O(√{square root over (T)}) regret and O(1) long-term constraint violation under the per-time-slot OCO settings.
[0059] Thus an OCO problem formulation with periodic updates as shown in ={0, . . . , I−1}, each having a duration of T.sub.0≥1 time slots. At the beginning of each update period i∈
, an agent is required to make a decision x.sub.i for the next T.sub.0 time slots, from a known compact convex set X.sub.0 that incorporates the short-term constraints. The variable t.sub.i denotes the first time slot of update period i. Let f.sub.t(x):
.sup.n
be a convex loss function at time slot t, which is determined by the system (e.g., sum rate of a wireless network determined by the wireless channel conditions). Within each update period i, the agent is sent feedback multiple times of the gradient or sub-gradient information ∇f.sub.t(x.sub.i) for t.sub.i≤t<t.sub.i+1. Let
.sub.i={1, . . . , S.sub.i} be the index set of such feedback. Let τ.sub.i.sup.s denote the time slot when feedback s, s∈S.sub.i is sent. Then, at the end of time slot τ.sub.i.sup.s+d.sub.i.sup.s−1,∇f.sub.τ.sub.
[0060] While the set X.sub.0 accommodates short-term constraints, long-term constraints in may also be considered in the problem formulation. Let g.sub.c(x): .sup.n
, c∈
be the known convex long-term constraint function. The agent tries to enforce the long-term constraint
T.sub.0
(x.sub.i)
0 over I periods, where
(x)
[
.sub.1(x), . . . ,
.sub.c(x)].sup.T and
denotes element-by-element inequality.
[0061] In some embodiments, the following two conventional goals of an OCO algorithm, with slight modification for the system model are achieved. One goal of an OCO algorithm is to select a sequence {x.sub.i}∈X.sub.0, such that the accumulated loss is competitive with an optimal offline fixed strategy defined with partial feedback
where X{x∈X.sub.0: g(x)
0}. To capture this, define the regret with partial feedback as follows:
Another goal of an OCO algorithm is to provide sub-linear long-term constraint violation defined for any c as follows:
V0.sub.c(T)T.sub.0
.sub.c(x.sub.i) (2)
Note that the long-term constraints introduce correlation on the decisions over time, and lead to a more complicated online problem when the decision is fixed for one update period that lasts for multiple time slots, while the underlying system evolves over time.
[0062] Assume that each feedback ∇f.sub.τ.sub.. Also assume that there is at least one feedback received by the agent within each update period, i.e., 1≤S.sub.i, ∀i∈
. Note that the feedback within each update period is not assumed to be received in order, i.e., even if r<s, the following statement is not necessarily true: τ.sub.i.sup.r+d.sub.i.sup.r≤τ.sub.i.sup.s+d.sub.i.sup.s, ∀r, ∈
.sub.i.
[0063] When the feedback received for the gradient or sub-gradient of the convex loss functions is incomplete, i.e., S.sub.i<T, it is unfair to compare the algorithm described herein with the standard optimal offline fixed strategy x.sup.o that knows all convex loss functions in hindsight. Let
={τ.sub.i.sup.1, . . . , τ.sub.i.sup.s.sup.
.sub.i={t: t.sub.i≤t<t.sub.i+1} be the set of time slots where decision x.sub.i is made. For the offline setting, assume the convex loss function at time slot t∈
.sub.i\
is the average of the convex loss functions received within update period i, i.e.,
As such, the OCO algorithm derived herein is compared with an optimal offline fixed strategy x.sub.p.sup.o that has the same partial feedback information. Note that when all gradient or sub-gradient feedback for all time slots in each period are received, i.e., S.sub.i=T.sub.0, ∀i∈, x.sub.p.sup.o becomes the standard optimal offline fixed strategy x.sup.0.
OCO Algorithm with Periodic Updates
[0064] The OCO algorithm described herein with periodic updates is presented as Algorithm 1, summarized below. It selects a decision sequence {x.sub.i} and a virtual queue sequence {Q.sub.i}, which can be viewed as the primal and dual variables in the saddle-point-typed OCO algorithms. The main difference is that Algorithm 1 uses the virtual queues as the dual variables to incorporate the violations of long-term constraints. The virtual queue sequence {Q .sub.i} serves as a queue backlog of the violations of long-term constraints, and an upper bound on the virtual queue vector can be readily translated to an upper bound on the long-term constraint violations. It has been shown that a virtual-queue-based algorithm has some advantages over the saddle-point-typed algorithms regarding performance guarantees under the standard OCO settings.
[0065] In some embodiments, algorithm 1 may be summarized as follows: [0066] A. Let γ, α>0 be constant algorithm parameters. At the beginning of period i=0, initialize Q.sub.0 [Q.sub.0.sup.1, . . . , Q.sub.0.sup.C].sup.T=0 and x.sub.0∈X.sub.0. By the end of period 0, receive ∇f.sub.τ.sub.
.sub.0. [0067] B. If i+1≥I, then End. [0068] C. If i+1≤I, then: [0069] D. At the beginning of each i+1-st update period t.sub.i+1, use ∇f.sub.τ.sub.
.sub.i to do the following: [0070] Update the virtual queue for all c∈
Q.sub.i+1.sup.c=max{−T.sub.0.sub.c(x.sub.i), Q.sub.i.sup.c+T.sub.0
.sub.c(x.sub.i)} (3)
where .sub.c(x.sub.i)
γ
.sub.c(x.sub.i). [0071] E. Solve P1 for x.sub.i+1 as
.sub.i+1. [0073] G. i=i+1. [0074] H. Return to B.
The following assumptions, which are common in the OCO literature, are made to facilitate the regret and long-term constraint violation bound analysis of Algorithm 1.
Assumption 1. There exists a constant D>0 such that for any x∈X.sub.0 and any t∈T, and
∥∇f.sub.t(x)∥.sub.2≤D (4)
Assumption 2. There exists a constant β>0 such that g(x) is Lipschitz continuous with modulus β for any x, y∈X.sub.0 as
∥g(x)−g(y)∥.sub.2≤β∥x−y∥.sub.2 (5)
Assumption 3. There exists a constant G>0 such that for any x∈X.sub.0, and
∥g(x)∥.sub.2≤G (6)
Assumption 4. There exists a constant R>0 such that for any x, y∈X.sub.0, and
∥x−y∥.sub.2≤R (7)
Assumption 5. There exists a constant ∈>0 and a {circumflex over (x)}∈X.sub.0 such for any c∈, and
g.sub.c({circumflex over (x)})≤−∈ (8)
Assumptions 1 and 2 are the standard assumptions of bounded gradient or sub-gradient for the convex loss functions and the Lipschitz continuity assumption for long-term constraints. Assumptions 3 and 4 bound the outputs of the long-term constraint functions and the decision set. Assumption 5 states that there is at least one interior point for any of the long-term constraint functions.
[0075] Note that the disclosed OCO formulation and Algorithm 1 account for the impact of multiple-time-slot update period on the regret bound and long-term constraint violation bound, which has not been studied in the literature before. Compared with the virtual-queue-based OCO algorithm under standard OCO settings, Algorithm 1 only uses the gradient or sub-gradient information instead of the loss function information. Furthermore, when the long-term and short-term constraint functions are separable with respect to blocks of x, P1 in Algorithm 1 can be equivalently decomposed into several sub-problems, each corresponding to a block of x.sub.i+1 and can be solved with lower computational complexity.
Performance Analysis
[0076] In this subsection, performance bounds are provided for the OCO algorithm described herein. Impacts of the update period on the regret and long-term constraint violation bounds are explicitly analyzed.
First, bounds are provided on the virtual queues in the following lemma.
Lemma 1. The virtual queues are bounded for any i∈ and any c∈
as follows
Q.sub.i.sup.c≥0, (9)
Q.sub.i+1.sup.c+T.sub.0{tilde over (g)}.sub.c(x.sub.i)≥0, (10)
∥Q.sub.i+1∥.sub.2.sup.2≥∥T.sub.0{tilde over (g)}.sub.c(x.sub.i)∥.sub.2.sup.2, (11)
∥Q.sub.i+1∥.sub.2≤∥Q.sub.i∥.sub.2+∥T.sub.0{tilde over (g)}.sub.c(x.sub.i)∥.sub.2 (12)
Define
[0077]
as the quadratic Lyapunov function and Δ.sub.iL.sub.i+1−L.sub.i as the corresponding Lyapunov drift. Leveraging results in Lemma 1, an upper bound on the Lyapunov drift is provided in the following lemma.
Lemma 2. The Lyapunov drift Δ.sub.i at each update period i∈ is upper bounded by
Δ.sub.i≤Q.sub.i.sup.T[T.sub.0{tilde over (g)}(x.sub.i)]+∥T.sub.0{tilde over (g)}(x.sub.i)∥.sub.2.sup.2 (13)
From the definition of strong convexity, function f(x) is said to be strongly convex on a convex set X∈.sup.n with modulus α, if there exists a constant α>0 such that
is convex on X. Based on the properties of strongly convex functions, the following lemma is commonly used in the literature of OCO for regret analysis.
Lemma 3. Let X∈.sup.n be a convex set. Let function f(x) be strongly convex on X with modulus α and x.sub.opt be a global minimum of f(x) on X. Then, for any x∈X:
Leveraging results in Lemmas 1-3 and OCO techniques, the following theorem provides regret bounds for Algorithm 1.
Theorem 4. If
[0078]
in Algorithm 1, then for all I>0:
Next, an upper bound on the long-term constraint violation in (2) is provided for Algorithm 1. The virtual queue is related to the long-term constraint violation in the following lemma.
Lemma 5. For the virtual queue Q.sub.i produced by Algorithm 1, for any c∈
From Lemma 5, the long-term constraint violation VO.sub.c(T) can be bound through an upper bound on the virtual queue Q.sub.i.sup.c. The following theorem provides an upper bound on the long-term constraint violation over any given I>0 update periods.
[0079] Theorem 6. If
in Algorithm 1, then for all I>0 and any c∈:
Theorems 4 and 6 show that Algorithm 1 provides O(√{square root over (T)}) regret defined with partial feedback in (1) and O(1) long-term constraint violation in (2). Performance analysis explicitly considers the impacts of periodic updates on the performance guarantees of OCO, which has not been studied in the literature before. The above results can be extended to a more general system model with time-varying update periods and unknown time horizon. This is explained in the following remarks.
[0080] Define the variation of update period as V.sub.T=(T.sub.i−T.sub.i+1).sup.2, where T.sub.i is the i-th update period. Under the assumptions that T.sub.i≤T.sub.max, ∀i∈
and V.sub.T≤O(1) , i.e., both the update period and the variation of update period are upper bounded, it can be shown that Algorithm 1 still yields O(√{square root over (T)}) regret and O(1) long-term constraint violation.
[0081] The parameters γ and α in Algorithm 1 depend on the time horizon T. When T is unknown, the standard doubling trick can be applied to preserve the O(√{square root over (T)}) regret bound of Algorithm 1 when T is known. It can be easily shown that applying the doubling trick to Algorithm 1 yields O(log.sub.2 T) long-term constraint violation.
Online Multi-Cell MIMO WNV
[0082] In this subsection, an online coordinated multi-cell precoding problem for downlink MIMO WNV with unknown CDI and delayed CSI is disclosed. Both long-term and short-term transmit power constraints are accommodated in the formulation. Leveraging the OCO algorithm described herein, an online downlink coordinated multi-cell MIMO WNV algorithm is disclosed, with a distributed closed-form precoding solution and guaranteed performance bounds.
Idealized MIMO WNV Model
[0083] In the system under consideration, an InP performs WNV to a MIMO cellular network that consists of C cells, each cell c∈ has one BS equipped with N.sub.c antennas, for a total of N=
N.sub.c antennas in the network. The InP serves M SPs, each SP m∈
has K.sub.c.sup.m users, for a total of K.sub.c=
K.sub.c.sup.m users in cell c and a total of K=
K.sub.c users in the network. Let
={1, . . . , C},
={1, . . . , M} and
={1, . . . , K}. Consider a time-slotted system with time indexed by t, and denote Ĥ.sub.t.sup.lcm∈
.sup.K.sup.
[0084] For ease of exposition, first consider an idealized WNV framework, with channel state feedback in each time slot without channel state feedback delay, as shown in .sup.K.sup.
P.sub.c.sup.m≤P.sub.max.sup.c with P.sub.max.sup.c being the maximum transmit power limit of cell c. Each SP m designs its own precoding matrix Ŵ.sub.t.sup.ccm∈
.sup.N.sup.
{tilde over (y)}.sub.t.sup.c′={tilde over (D)}.sub.t.sup.cx.sub.t.sup.c
where {tilde over (D)}.sub.t.sup.cblkdiag{
Ĥ.sub.t.sup.ccmŴ.sub.t.sup.ccm being the virtualization demand from SP m in cell c, and x.sub.t.sup.c=[x.sub.t.sup.c1H, . . . , x.sub.t.sup.cMH].sup.H is the transmitted signal vectors for the K.sub.c users in cell c, with x.sub.t.sup.cm being the transmitted signal vector for the K.sub.c.sup.m users of SP m in cell c. The desired received signal vector at all K users in the network is
y′.sub.t=D.sub.tx.sub.t
where D.sub.tblkdiag{{tilde over (D)}.sub.t.sup.1, . . . , {tilde over (D)}.sub.t.sup.c} is the virtualization demand from the network, x.sub.t=[x.sub.t.sup.1H, . . . , x.sub.t.sup.CH].sup.H, and y′.sub.t=[{tilde over (y)}.sub.t.sup.c′.sup.H].sup.H. Without loss of generality, assume the transmitted signal to each user is zero-mean with unit power and uncorrelated to each other at each time slot t, i.e.,
{x.sub.t}=0 and
{x.sub.tx.sub.t.sup.H}=I, ∀t∈
.
[0085] The InP coordinates the cells at the precoding level to serve all the users directly. In each cell c, with local channel state {tilde over (H)}.sub.t.sup.c=[.sup.K×N.sup.
.sup.N.sup.
.sup.K.sup.
y.sub.t=H.sub.tV.sub.tx.sub.t
where H.sub.t=[{tilde over (H)}.sub.t.sup.1, . . . , {tilde over (H)}.sub.t.sup.C]∈.sup.K×N is the global channel state, and V.sub.t=blkdiag{{tilde over (V)}.sub.t.sup.1, . . . , {tilde over (V)}.sub.t.sup.C}∈
.sup.N×K is the global precoding matrix designed by the InP. The expected deviation of the received signal vector at all K users via the InP's actual global precoding matrix from that via the SPs' local virtual precoding matrices is given by
{∥y.sub.t−y′.sub.t∥.sub.2.sup.2}=
{∥H.sub.tV.sub.t−D.sub.t∥.sub.F.sup.2}
where ∥.Math.∥.sub.F represents the Frobenius norm.
Problem Formulation
[0086] Define the convex set for InP' s global precoding matrix V∈.sup.N×K as
.sub.0
{V=blkdiag{{tilde over (V)}.sup.1, . . . , {tilde over (V)}.sup.c}: ∥{tilde over (V)}.sup.c∥.sub.F.sup.2≤P.sub.max.sup.c, ∀c ∈
}, where {tilde over (V)}.sup.c∈
.sup.N.sup.
where the first term is the precoding deviation from virtualization demand of cell c, and the second term is the leakage from cell c to all the other cells. Define the long-term transmit power constraint function for cell c∈ as
g.sub.c({tilde over (V)}.sup.c)∥{tilde over (V)}.sup.c∥.sub.F.sup.2−
where
[0087] Consider a periodic demand-response mechanism between the InP and the SPs. At the beginning of each time slot t.sub.i=iT.sub.0, i∈, the InP coordinates the C cells at the precoding level and designs a fixed global precoding matrix V.sub.i∈
.sup.N×K for the next T.sub.0 time slots. The InP then schedules channel state feedback to itself (with any channel estimation and feedback transmission method) at time slot τ.sub.i.sup.s, s∈
.sub.i, and receives the global channel state H.sub.τ.sub.
. In each cell c, the InP shares Ĥ.sub.τ.sub.
[0088] The InP designs {V.sub.i}∈.sub.0 to provide sub-linear T-slot regret defined with partial feedback on the precoding deviation from the virtualization demand
is the optimal offline fixed precoding strategy with partial feedbacks, and {V=diag{{tilde over (V)}.sup.1, . . . , {tilde over (V)}.sup.c}: ∥{tilde over (V)}.sup.c∥.sub.F.sup.2≤
}. Note that when all the channel state feedbacks are received by the InP, (19) becomes the standard regret definition in the literature of OCO. Another goal of the InP is to provide sub-linear long-term constraint violation on the long-term transmit power defined for any c∈
as
VO.sub.c(T)T.sub.0g.sub.c({tilde over (V)}.sub.i.sup.c) (20)
Online Downlink Coordinated Multi-Cell MIMO WNV Algorithm
[0089] The OCO algorithm described herein provides a solution to the afore-mentioned online coordinated multi-cell precoding problem for downlink MIMO WNV. From the first-order condition of a real-valued scalar convex function with respect to the complex-valued matrix variable, for any V, V.sub.i∈.sub.0
f.sub.t(V)≥f.sub.t(V.sub.i)+2Re{tr{∇.sub.V.sub.
where ∇.sub.V.sub.
[0090] Leveraging the OCO algorithm described herein, at the beginning of each demand-response period t.sub.i+1, the InP solves P2 for the global precoding matrix V.sub.i+1 as follows:
Since both the long-term and short-term transmit power constraints are on each cell, it then follows that P2 can be equivalently decomposed into C subproblems, each corresponding to the local precoding matrix {tilde over (V)}.sub.i+1.sup.c for cell c∈ as follows:
[0091] P3 is a convex optimization problem satisfying the Slater's condition, thus strong duality holds. P3 can be solved by studying the Karush-Kuhn-Tucker (KKT) conditions. The Lagrange function for P3 is
where λ.sup.c is the Lagrangian multiplier associated with constraint (22). Taking partial derivative of L({tilde over (V)}.sup.c, λ.sup.c) with respect to {tilde over (V)}.sup.c*:
The KKT conditions for ({tilde over (V)}.sup.co, λ.sup.co) being globally optimal are given by
where (23) follows by setting ∇.sub.{tilde over (V)}.sub.
Equation (27) can be categorized into two subcases: 2.1) If {tilde over (V)}.sup.co in (27) satisfies (24), then it is the optimal solution. 2.2) If {tilde over (V)}.sup.co in (27) cannot satisfy (24), which means the condition in Case 2) does not hold in optimality, i.e., λ.sup.co>0, and the optimal solution is given by Case 1).
[0094] An online downlink coordinated multi-cell MIMO WNV algorithm in accordance with the present disclosure and referred to herein as algorithm 2, is summarized as follows: [0095] A. For large T, set
For intermediate value of T, solve P4 for γ and α. [0096] B. At the beginning of period i=0, initialize Q.sub.0=0 and V.sub.0∈.sub.0. By the end of period 0, receive Hτ.sub.o.sup.s and Dτ.sub.o.sup.s, s∈
.sub.0. [0097] C. If, i+1≥I then end; [0098] D. If, i+1≤I then: [0099] E. At the beginning of each demand-response period i+1, use Hτ.sub.i.sup.s and D.sub.τ.sub.
.sub.i to do the following: [0100] Update virtual queue for all c∈
Q.sub.i+1.sup.c=max{−T.sub.0{tilde over (g)}.sub.c({tilde over (V)}.sub.i.sup.c), Q.sub.i.sup.c+T.sub.0{tilde over (g)}.sub.c({tilde over (V)}.sub.i.sup.c)} (28) [0101] where {tilde over (g)}.sub.c({tilde over (V)}.sub.i.sup.c)γg.sub.c({tilde over (V)}.sub.i.sup.c). [0102] Update {tilde over (V)}.sub.i+1.sup.c, ∀c∈
as
.sub.i+1. [0104] G. i=i+1 [0105] H. Return to C,
Performance Analysis
[0106] The following assumption is made that the channel gain is upper bounded for theoretical performance bound analysis.
Assumption 6. There exists a constant B>0 such that the channel gain is upper bounded for any t∈ as
∥H.sub.t∥.sub.F≤B. (29)
The following lemma shows that Assumptions 1-5 hold for the formulated online downlink MIMO WNV problem.
Lemma 7. Under Assumption 6, the formulated online downlink MIMO WNV problem with decision set .sub.0, loss function f.sub.t(V), and long-term constraint function g.sub.c({tilde over (V)}.sup.j), c∈
, satisfies Assumptions 1-5, with constants D, β, G, R, ∈ given as follows:
Leveraging the results in Lemma 7, Theorem 4, and Theorem 6, the following Theorem shows that Algorithm 2 provides O(√{square root over (T)}) regret on the precoding deviation from the virtualization demand and O(1) long-term transmit power constraint violation.
Theorem 8. If
[0107]
Algorithm 2 yields an O(√{square root over (T)}) upper bound given in (15) on the regret defined in (19) and an O(1) upper bound given in (16) on the long-term constraint violation defined in (20), with constants D, β, G, R, ∈ given in Lemma 7.
Furthermore, three remarks regarding the algorithm parameter selection, the optimal offline fixed precoding strategy, and the short-term per-antenna transmit power constraints are provided below.
[0108] Algorithm 2 provides O(√{square root over (T)}) regret and O(1) long-term constraint violation from Theorem 8 for a large T. However, the constants hidden behind the O(√{square root over (T)}) and O(1) upper bounds can be relatively large for an intermediate value of T. The following problem can be solved to obtain the algorithm parameters γ and α that yield the best regret bound, subject to a constraint on the long-term constraint violation upper bound:
where D, β, G, R, ∈ are given in Lemma 7. P4 is a geometric programming (GP) problem that can be easily solved by a standard convex program solver such as CVX.
[0109] Algorithm 2 provides an O(√{square root over (T)}) regret compared with the optimal offline fixed precoding solution V.sub.p.sup.0 defined with the same partial feedbacks for fair comparison. However, for algorithm performance evaluation in Subsection 6.4, the performance of Algorithm 2 is compared with the one yielded by the standard optimal offline fixed strategy V.sub.opt.sup.0, that solves the following optimization problem with complete feedback
P5: min.sub.v.sub.∥H.sub.tV.sub.opt−D.sub.t∥.sub.F.sup.2
s.t. ∥{tilde over (V)}.sub.opt.sup.c∥.sub.F.sup.2≤ (35)
where V.sub.opt=blkdiag{{tilde over (V)}.sub.opt.sup.1, . . . , {tilde over (V)}.sub.opt.sup.c}. Note that V.sub.opt.sup.o is a more knowledgeable fixed off-line strategy than V.sub.p.sup.o. Since P5 is a convex optimization problem satisfying the Slater's condition, the strong duality holds. P5 can be solved by studying the KKT conditions. The Lagrange function for P5 is
L(V.sub.opt, λ)=∥H.sub.tV.sub.opt−D.sub.t∥.sub.F.sup.2+
λ.sup.c(∥{tilde over (V)}.sub.opt.sup.c∥.sub.F.sup.2−
where λ.sup.c, c∈ is the Lagrangian multiplier associated with constraint (35). Taking partial derivative of L(V.sub.opt, λ) with respect to {tilde over (V)}.sub.opt.sup.c*:
∇.sub.{tilde over (V)}.sub.{tilde over (H)}.sub.t.sup.cH({tilde over (H)}.sub.t.sup.c{tilde over (V)}.sub.opt.sup.c−{tilde over (D)}.sub.t.sup.c′)+λ.sup.c{tilde over (V)}.sub.opt.sup.c
where {tilde over (D)}.sub.t.sup.c′=[0, . . . , {tilde over (D)}.sub.t.sup.cH, . . . , 0].sup.H. The KKT conditions for ({tilde over (V)}.sub.opt.sup.coλ.sup.co), c∈ being globally optimal are given by
(A.sub.c+λ.sup.coI){tilde over (V)}.sub.opt.sup.co=B.sub.c (36)
∥{tilde over (V)}.sub.opt.sup.co∥.sub.F.sup.2−
λ.sup.co≥0 (38)
λ.sup.co(∥{tilde over (V)}.sub.opt.sup.co∥.sub.F.sup.2−
where (36) follows by setting ∇.sub.{tilde over (V)}.sub.{tilde over (H)}.sub.t.sup.cH{tilde over (D)}.sub.t.sup.c′. Consider the following cases. [0110] 1) λ.sup.co>0: From (36), A.sub.c+λ.sup.coI >0, which implies that it is invertible, and:
{tilde over (V)}.sub.opt.sup.co=(A.sub.c+λ.sup.coI).sup.−1B.sub.c (40)
in which λ.sup.co>0 can be found by bisection search such that ∥(A.sub.c+λ.sup.coI).sup.−1B.sub.c∥.sub.F.sup.2=
A.sub.c{tilde over (V)}.sub.opt .sup.co=B.sub.c (41)
categorize (41) into two subcases: 2.1) If A.sub.c is a rank deficient matrix, there are infinitely many solutions for {tilde over (V)}.sub.opt.sup.co. {tilde over (V)}.sub.opt.sup.co is chosen to minimize ∥{tilde over (V)}.sub.opt.sup.co∥.sub.F.sup.2 subject to (41), which is an under-determined least-square problem with a closed-form solution:
{tilde over (V)}.sub.opt.sup.co=A.sub.c.sup.H(A.sub.cA.sub.c.sup.H).sup.−1B.sub.c (42)
By (37), if ∥A.sub.c.sup.H(A.sub.cA.sub.c.sup.H).sup.−1B.sub.c∥.sub.F.sup.2≤
{tilde over (V)}.sub.opt.sup.co=A.sub.c.sup.−1B.sub.c (43)
Again, if ∥A.sub.c.sup.−1B.sub.c∥.sub.F.sup.2≤
[0112] Short-term per-antenna transmit power constraints can be incorporated into the problem formulation, by redefining the convex decision set as .sub.0
{V=blkdiag{{tilde over (V)}.sup.1, . . . , {tilde over (V)}.sup.c}: ∥v.sup.cn∥.sub.F.sup.2≤P.sub.max.sup.cn,
∀n∈
}, where {tilde over (V)}.sup.c=[v.sup.c1, . . . , v.sup.cN.sup.
.sup.N.sup.
.sup.K.sup.
P6 can be equivalently decomposed into N.sub.c sub-problems, each corresponding to the precoding vector v.sup.cn for the n-th antenna in cell c as
where ∇.sub.v.sub.
Numerical Performance Evaluation
[0113] Consider an InP that owns a cellular network consisting of C=7 urban hexagon micro-cells, each cell c is of 500 m radius and has a BS at the center equipped with N.sub.c=32 antennas by default. The InP serves M=4 SPs, each SP m serves K.sub.c.sup.m=2 users, for a total of K.sub.c=8 users in each cell c. Set maximum transmit power limit P.sub.max.sup.c=33 dBm, long-term transmit power limit
[0114] The fading channel is modeled as a first-order Gauss-Markov process
h.sub.t+1.sup.ck=α.sub.hh.sub.t.sup.ck+z.sub.t.sup.ck
where h.sub.t.sup.ck ˜.sup.(0, β.sub.ckI) is the channel state between user k and BS antennas in cell c, β.sub.ck[dB]=−31.54−33 log.sub.10d.sub.ck−ψ.sub.ck captures path-loss and shadowing, d.sub.ck is the distance in kilometers from the BS in cell c to user k, ψ.sub.ck˜
(0, σ.sub.Ø.sup.2) accounts for shadowing with a σ.sub.ø=8 dB, α.sub.h is the channel correlation coefficient, and z.sub.t.sup.ck˜
(0, (1−α.sub.h.sup.2)β.sub.ckI) is independent of h.sub.t.sup.ck. For a coherence time of
and transmission channel bandwidth B.sub.w, α.sub.h is determined by
is the Doppler spread, v is the user speed assumed to be the same for all K users, f.sub.c=2.5 GHz is the central frequency, v.sub.c is the speed of light, and Ø=0.1 is the level of de-correlation. For α.sub.h=0.995 as default in the simulation, v≈3 km/h.
Define Δt as one time slot and set the demand-response period T.sub.0=8Δt (533.6 us) as default in the simulation, such that T.sub.0 can be seen as one slot (500 us) that consists of 6 or 7 symbol durations plus cyclic prefix under the standard LTE transmission frame settings. An overview of the default simulation parameters is shown in Table 1.
TABLE-US-00001 TABLE 1 Parameter Value Number of cells C 7 Number of SPs M 4 Number of users per cell K.sub.c 8 Number of users per cell N.sub.c 32 Transmission channel bandwidth B.sub.W 15 kHz Central frequency f.sub.c 2.5 GHz Noise spectral density N.sub.0 −174 dBm/Hz Noise figure N.sub.F 10 dB User speed v 3 km/h Maximum transmit power per cell P.sub.max.sup.c 33 dBm Long-term transmit power limit per cell
[0115] For performance study, consider the cases where each SP m uses MRT or ZF precoding, which are two commonly used precoding schemes in current MIMO systems, to design their own virtual precoding matrices given by
where a.sub.t.sup.ccm is a power normalizing factor such that
i.e., all SPs are allocated equal proportions of the maximum transmit power in each cell. Based on {V.sub.i} produced by Algorithm 2, define the normalized time-averaged precoding deviation from the virtualization demand as
the time-averaged transmit power as
and the time-averaged rate per user as
h.sub.t.sup.k and v.sub.i.sup.k are the channel state at time slot t and the precoding vector for the i-th demand-response period between user k and the N antennas, and σ.sub.n.sup.2=N.sub.0B.sub.W+N.sub.F is the noise power. For the upper bound B of the channel gain in Assumption 6, B is set as B=1.645√{square root over (N.sub.c
β.sub.ck)}, which gives a Chernoff upper bound of 9.07×10.sup.−14 for the probability of bound violation
{∥H.sub.t∥.sub.F>B}.
Performance vs. Demand-Response Period T.SUB.o
[0116] First evaluate the performance of the disclosed algorithm described herein with different values of the demand-response period T.sub.0. Consider the case where there is only one channel state feedback at the beginning of each demand-response period, i.e., ={iT.sub.0}, ∀i∈
, which is the same as the standard LTE transmission frame structure.
Performance Comparison
[0117] A performance comparison between the disclosed online downlink coordinated MIMO WNV algorithm is provided herein with a different, but generally related arrangement that discloses standard per-time-slot OCO settings and does not provide performance guarantees under the periodic update scenario. However, the information of the update period T.sub.0 may be ignored. The steady-state precoding deviation
Performance vs. Optimal Offline Fixed Precoding Strategy
[0118]
Performance vs. Long-term Transmit Power Limit
[0119] The impact of the long-term transmit power limit
Performance vs. Number of Cells C
[0120] The impact of the number of cells C on the performance of the algorithm provided herein is described with reference to
Per-Antenna Transmit Power Constraint
[0121] The impact of short-term per-antenna transmit power constraints on the system performance of the disclosed algorithm has been studied. As shown in
and long-term transmit power limit .sub.0 for the sequence of global precoding matrices {V.sub.i}. The per-antenna constraint is observed to have negligible impact on the algorithm performance.
[0122] Thus, an OCO algorithm with periodic updates is provided herein. In existing standard OCO settings, decision making and convex loss function feedback are in a strict per-time-slot fashion. In contrast, the OCO algorithm described herein makes a decision at the beginning of each update period that can last for multiple time slots. The convex loss function can change arbitrarily at each time slot with unknown statistics. The gradient or sub-gradient feedbacks are allowed to be delayed for multiple time slots, received out of order, and missing for some time slots within each update period. Based only on the past gradient or sub-gradient information, the OCO algorithm described herein provides O(√{square root over (T)}) regret defined with partial feedbacks and O(1) long-term constraint violation.
[0123] An online downlink coordinated multi-cell MIMO WNV algorithm is disclosed, with unknown CDI and delayed CSI under standard LTE transmission frame settings. In the WNV framework, each SP is allowed to utilize all antennas and wireless spectrum resources simultaneously and design its own virtual precoding matrix in a cell without the need to be aware of the other SPs, or to consider the inter-cell interference created by its own users. The InP may coordinate the cells at the precoding level to minimize the current and future precoding deviations from the virtualization demands, based only on the past channel states and virtual precoding matrices. The sequence of global precoding matrices designed by the InP may subject to both long-term and short-term transmit power constraints. In such a case, the online coordinated multi-cell MIMO precoding problem has a distributed closed-form solution with low computational complexity. The disclosed algorithm provides theoretically guaranteed performance, as it yields O(√{square root over (T)}) regret on the precoding deviation from the virtualization demand, and O(1) long-term transmit power constraint violation.
[0124] The performance of the disclosed online downlink coordinated multi-cell MIMO WNV algorithm under typical LTE cellular network settings has been numerically validated. Simulation studies show fast convergence of the time-averaged performance. The disclosed algorithm shows substantial system performance advantage over an optimal offline fixed precoding strategy, and over the existing standard OCO algorithm with long-term constraints. Extensive simulation results are provided to demonstrate the impact of different values of demand-response period, precoding strategies adopted by the SPs, long-term transmit power limits, numbers of cells and antennas, and per-antenna transmit power constraint.
[0125] Referring to
[0126] The radio interface 4 may be formed as or may include, for example, one or more RF transmitters, one or more RF receivers, and/or one or more RF transceivers. In particular, in addition to or instead of a processor, such as a central processing unit, and memory, the processing circuitry 10 may comprise integrated circuitry for processing and/or control, e.g., one or more processors and/or processor cores and/or FPGAs (Field Programmable Gate Array) and/or ASICs (Application Specific Integrated Circuitry) adapted to execute instructions. The processor 8 may be configured to access (e.g., write to and/or read from) the memory 6, which may comprise any kind of volatile and/or nonvolatile memory, e.g., cache and/or buffer memory and/or RAM (Random Access Memory) and/or ROM (Read-Only Memory) and/or optical memory and/or EPROM (Erasable Programmable Read-Only Memory). The processing circuitry 10 may be configured to control any of the methods and/or processes described herein and/or to cause such methods, and/or processes to be performed, e.g., by network node 2. Processor 8 corresponds to one or more processors for performing network node 2 functions described herein. The memory 6 is configured to store data, programmatic software code and/or other information described herein. In some embodiments, the OCO algorithm 12 may include instructions that, when executed by the processor 8 and/or processing circuitry 10, causes the processor 8 and/or processing circuitry 10 to perform the processes described herein with respect to network node 2. The network node 2, configured to execute the algorithm disclosed herein, may be operated by the infrastructure provider. The network node 2 may be in radio communication with one or more other network nodes operated or used by the service providers.
[0127]
[0128]
[0129] According to one aspect, a method of online coordinated multi-cell precoding for a network node 2 configurable at least in part by an infrastructure provider is provided. The network node 2 is configured to facilitate sharing of wireless network infrastructure resources by a plurality of service providers. The method includes obtaining, via the processing circuitry 10 a global channel state, the global channel state being based at least in part on a channel state in each cell of a plurality of cells and communicating a channel state to a corresponding service provider. The method also includes receiving, via the radio interface 4, a virtual precoding matrix from each service provider of the plurality of services providers, each virtual precoding matrix being based at least in part on the channel state communicated to the corresponding service provider and being further based at least in part on a condition that each service provider of the plurality of services providers is allowed to use all of a plurality of available antennas and available wireless spectrum resources. The method further includes executing, via the processing circuitry 10, an optimization procedure to periodically determine a multiple input multiple output, MIMO, precoding matrix that minimizes an accumulated precoding deviation from a virtualization demand subject to constraints, the virtualization demand and the MIMO precoding matrix being based at least in part on the virtual precoding matrices received from the plurality of service providers. The method includes applying, via the processing circuitry 10, the determined MIMO precoding matrix to signals applied to the plurality of available antennas to achieve a sum of throughputs for the plurality of service providers that is greater than a sum of the throughputs for the plurality of service providers achievable when the virtual precoder matrices are applied to the signals.
[0130] According to this aspect, in some embodiments, the optimization procedure includes periodically determining a gradient of a convex loss function and providing the determined gradient as feedback to the determination of the MIMO precoding matrix. In some embodiments, feedback in the optimization procedure is allowed to be delayed for multiple time slots, received out of order and/or partly missing within an update period. In some embodiments, the optimization procedure includes an online projected gradient ascent algorithm that provides O(√{square root over (T)}) regret and O(1) long term constraint violation, where T is a total time horizon over which multiple updates of the determined MIMO precoding matrix occur. In some embodiments, the accumulated precoding deviation is determined, via the processing circuitry 10, according to f.sub.t(x.sub.t) where T is a time horizon, x.sub.t is a decision in a sequence of decisions made by the network node 2, f.sub.t(x.sub.t) is a convex loss function,
f.sub.t(x.sub.t) is an accumulated loss and x.sup.0
arg min.sub.x∈ X.sub.
f.sub.t(x) is the argument of
f.sub.t(x) that produces a minimum value of
f.sub.t(x). In some embodiments, the method further includes dividing, via the processing circuitry 10, a total time horizon T into update periods, each update period having a duration of T.sub.o time slots, T.sub.o being at least one time slot, and updating the MIMO precoding matrix at a beginning or end of each update period. In some embodiments, at a beginning of each update period, a decision is taken, via the processing circuitry 10, from a known convex decision space and a loss is determined by an end of the duration of T.sub.o time slots based at least in part on the decision, the loss being based at least in part on a convex loss function. In some embodiments, the virtualization demand is further based at least in part on past channel states. In some embodiments, the constraints include long term transmit power constraints and short term transmit power constraints. In some embodiments, the MIMO precoding matrix is determined to provide sub-linear T-slot regret with partial feedback on the accumulated precoding deviation from the virtualization demand, where T is a total time horizon.
[0131] According to another aspect, a network node 2 configured for online coordinated multi-cell precoding, the network node 2 configurable at least in part by an infrastructure provider is provided. The network node 2 is configured to facilitate sharing of wireless network infrastructure resources by a plurality of service providers. The network node 2 includes processing circuitry 10 configured to: obtain a global channel state, the global channel state being based at least in part on a channel state in each cell of a plurality of cells and communicate a channel state to a corresponding service provider. The processing circuitry 10 is further configured to receive a virtual precoding matrix from each service provider of the plurality of services providers, each virtual precoding matrix being based at least in part on the channel state communicated to the corresponding service provider and being further based at least in part on a condition that each service provider of the plurality of services providers is allowed to use all of a plurality of available antennas and available wireless spectrum resources. The processing circuitry 10 is further configured to execute an optimization procedure to periodically determine a multiple input multiple output, MIMO, precoding matrix that minimizes an accumulated precoding deviation from a virtualization demand subject to constraints, the virtualization demand and the MIMO precoding matrix being based at least in part on the virtual precoding matrices received from the plurality of service providers. The processing circuitry 10 is further configured to apply the determined MIMO precoding matrix to signals applied to the plurality of available antennas to achieve a sum of throughputs for plurality of service providers that is greater than a sum of the throughputs for the plurality of service providers achievable when the virtual precoder matrices are applied to the signals.
[0132] According to this aspect, in some embodiments, the optimization procedure includes periodically determining a gradient of a convex loss function and providing the determined gradient as feedback to the determination of the MIMO precoding matrix. In some embodiments, feedback in the optimization procedure is allowed to be delayed for multiple time slots, received out of order and/or partly missing within an update period. In some embodiments, the optimization procedure includes an online projected gradient ascent algorithm that provides O(√{square root over (T)}) regret and O(1) long term constraint violation, where T is a total time horizon over which multiple updates of the determined MIMO precoding matrix occur. In some embodiments, the accumulated precoding deviation is determined, via the processing circuitry 10, according to f.sub.t(x.sub.t)where T is a time horizon, x.sub.t is a decision in a sequence of decisions made by an agent, f.sub.t(x.sub.t) is a convex loss function,
f.sub.t(x.sub.t) is an accumulated loss and x.sup.0
arg min.sub.x∈X.sub.
f.sub.t(x) is the argument of
f.sub.t(x) that produces a minimum value of
f.sub.t(x). In some embodiments, the processing circuitry 10 is further configured to divide a total time horizon T into update periods, each update period having a duration of T.sub.o time slots, T.sub.o being at least one timeslot, and updating the MIMO precoding matrix at a beginning or end of each update period. In some embodiments, at a beginning of each update period, a decision is taken, via the processing circuitry 10, from a known convex decision space and a loss is determined by an end of the duration of T.sub.o time slots based at least in part on the decision, the loss being based at least in part on a convex loss function. In some embodiments, the virtualization demand is further based at least in part on past channel states. In some embodiments, the constraints include long term transmit power constraints and short term transmit power constraints. In some embodiments, the MIMO precoding matrix is determined to provide sub-linear T-slot regret with partial feedback on the accumulated precoding deviation from the virtualization demand, where T is a total time horizon.
[0133] According to yet another aspect, a network node 2 configured for sharing of wireless network resources among a plurality of service providers is provided. The network node 2 includes processing circuitry 10 configured to receive from each of the plurality of service providers a virtual precoder matrix determined by the corresponding service provider. The processing circuitry 10 is further configured to determine a precoder matrix by minimizing a precoding deviation from a virtualization demand of the network subject to at least one power constraint, the virtualization demand being based at least in part on a product of a channel state matrix and a virtual precoder matrix. The processing circuitry 10 is further configured to apply the determined precoder matrix to signals applied to a plurality of antennas to achieve a sum of throughputs for the plurality of service providers that is greater than a sum of throughputs for the plurality of service providers achievable when the virtual precoder matrices are applied to the signals.
[0134] According to this aspect, in some embodiments, determining the precoder matrix includes solving a minimization problem that includes determining a gradient of a convex loss function. In some embodiments, determining the precoder matrix includes decomposing the minimization problem into a number of subproblems, each subproblem involving a gradient of a convex loss function of a local precoding matrix. In some embodiments, the minimization problem includes comparing a function of the determined precoder matrix to an offline fixed precoding strategy. In some embodiments, the virtual precoder matrices received from the plurality of service providers are each based at least in part on a condition that each service provider is allowed to use all of a plurality of available antennas and wireless spectrum resources. In some embodiments, the precoding deviation from virtualization demand is determined, via the processing circuitry 10, at least in part by:
f.sub.t(V)∥H.sub.tV−D.sub.t∥.sub.F.sup.2
where V is a past precoding matrix, H.sub.t is the channel state and D.sub.t is the virtualization demand.
[0135] According to another aspect, a method of online coordinated multi-cell precoding for a network node 2 configurable at least in part by an infrastructure provider is provided, where the network node 2 is configured to facilitate sharing of wireless network infrastructure resources by a plurality of service providers. The method includes receiving, via the radio interface 4, from each of the plurality of service providers a virtual precoder matrix determined by the corresponding service provider. The method includes determining, via the processing circuitry 10, a precoder matrix by minimizing a precoding deviation from a virtualization demand of the network subject to at least one power constraint, the virtualization demand being based at least in part on a product of a channel state matrix and a virtual precoder matrix. The method also includes applying, via the processing circuitry 10, the determined precoder matrix to signals applied to a plurality of antennas to achieve a sum of throughputs for the plurality of service providers that is greater than a sum of throughputs for the plurality of service providers achievable when the virtual precoder matrices are applied to the signals.
[0136] According to this aspect, in some embodiments, determining the precoder matrix includes solving a minimization problem that includes determining a gradient of a convex loss function. In some embodiments, determining the precoder matrix includes decomposing the minimization problem into a number of subproblems, each subproblem involving a gradient of a convex loss function of a local precoding matrix. In some embodiments, the minimization problem includes comparing a function of the determined precoder matrix to an offline fixed precoding strategy. In some embodiments, the virtual precoder matrices received from the plurality of service providers are each based at least in part on a condition that each service provider is allowed to use all of a plurality of available antennas and wireless spectrum resources. In some embodiments, the precoding deviation from virtualization demand is determined, via the processing circuitry 10, at least in part by:
f.sub.t(V)∥H.sub.tV−D.sub.t∥.sub.F.sup.2
where V is a past precoding matrix, H.sub.t is the channel state and D.sub.t is the virtualization demand.
ABBREVIATIONS
[0137] 5G: Fifth Generation
[0138] CDI: Channel Distribution Information
[0139] CSI: Channel State Information
[0140] C-RAN: Cloud Radio Networks
[0141] GP: Geometric Programming
[0142] InP: Infrastructure Provider
[0143] KKT: Karush-Kuhn-Tucker
[0144] LTE: Long-Term Evolution
[0145] MIMO: Multiple Input Multiple Output
[0146] NOMA: Non-orthogonal Multiple Access
[0147] OCO: Online Convex Optimization
[0148] OFDM: Orthogonal Frequency Division Multiplexing
[0149] SP: Service Provider
[0150] WNV: Wireless Network Virtualization
[0151] As will be appreciated by one of skill in the art, the concepts described herein may be embodied as a method, data processing system, and/or computer program product. Accordingly, the concepts described herein may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects all generally referred to herein as a “circuit” or “module.” Furthermore, the disclosure may take the form of a computer program product on a tangible computer usable storage medium having computer program code embodied in the medium that can be executed by a computer. Any suitable tangible computer readable medium may be utilized including hard disks, CD-ROMs, electronic storage devices, optical storage devices, or magnetic storage devices.
[0152] Some embodiments are described herein with reference to flowchart illustrations and/or block diagrams of methods, systems and computer program products. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
[0153] These computer program instructions may also be stored in a computer readable memory or storage medium that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer readable memory produce an article of manufacture including instruction means which implement the function/act specified in the flowchart and/or block diagram block or blocks.
[0154] The computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. It is to be understood that the functions/acts noted in the blocks may occur out of the order noted in the operational illustrations. For example, two blocks shown in succession may in fact be executed substantially concurrently or the blocks may sometimes be executed in the reverse order, depending upon the functionality/acts involved. Although some of the diagrams include arrows on communication paths to show a primary direction of communication, it is to be understood that communication may occur in the opposite direction to the depicted arrows. Computer program code for carrying out operations of the concepts described herein may be written in an object oriented programming language such as Java® or C++. However, the computer program code for carrying out operations of the disclosure may also be written in conventional procedural programming languages, such as the “C” programming language. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer. In the latter scenario, the remote computer may be connected to the user's computer through a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
[0155] Many different embodiments have been disclosed herein, in connection with the above description and the drawings. It will be understood that it would be unduly repetitious and obfuscating to literally describe and illustrate every combination and subcombination of these embodiments. Accordingly, all embodiments can be combined in any way and/or combination, and the present specification, including the drawings, shall be construed to constitute a complete written description of all combinations and subcombinations of the embodiments described herein, and of the manner and process of making and using them, and shall support claims to any such combination or subcombination.
[0156] It will be appreciated by persons skilled in the art that the embodiments described herein are not limited to what has been particularly shown and described herein above. In addition, unless mention was made above to the contrary, it should be noted that all of the accompanying drawings are not to scale. A variety of modifications and variations are possible in light of the above teachings without departing from the scope of the following claims.