COOPERATIVE LEARNING METHOD AND APPARATUS FOR POWER ALLOCATION IN DISTRIBUTED MULTIPLE INPUT AND MULTIPLE OUTPUT SYSTEM
20240297685 ยท 2024-09-05
Assignee
Inventors
- Seung Eun Hong (Daejeon, KR)
- Seok Hwan Park (Jeonju-si, KR)
- Dae Sung YU (Jeonju-si, KR)
- Hoon LEE (Busan, KR)
Cpc classification
H04B7/024
ELECTRICITY
International classification
H04W52/28
ELECTRICITY
H04W52/24
ELECTRICITY
Abstract
A method for power allocation in a dAP may comprise: when a change cycle of a transmit power determination vector arrives, generating an uplink message including long-term CSI, the uplink message being normalized such that the long-term local CSI becomes a value within a preconfigured limit range; transmitting the uplink message to a central processing unit through a fronthaul; receiving a downlink message vector for power allocation from the central processing unit through the fronthaul; generating decentralized determination information using the downlink message vector; and extracting a transmit power determination vector based on the decentralized determination information.
Claims
1. A method for power allocation in a distributed access point (dAP), comprising: when a change cycle of a transmit power determination vector arrives, generating an uplink message including long-term local channel state information (CSI), the uplink message being normalized such that the long-term local CSI becomes a value within a preconfigured limit range; transmitting the uplink message to a central processing unit through a fronthaul; receiving a downlink message vector for power allocation from the central processing unit through the fronthaul; generating decentralized determination information using the downlink message vector; and extracting a transmit power determination vector based on the decentralized determination information, wherein the decentralized determination information includes an output vector for generating a local power allocation value and a variable for the dAP.
2. The method according to claim 1, further comprising: extracting power allocation information corresponding to each of terminals based on the decentralized determination information; determining a transmit power for a channel transmitted to each of the terminals based on the power allocation information; and communicating with each of the terminals by using the determined transmit power.
3. The method according to claim 2, wherein the transmit power for the channel transmitted to each of the terminals is determined by a third preconfigured deep neural network (DNN).
4. The method according to claim 1, wherein the long-term local CSI is calculated based on channel state information and a long-term path loss with each of communicating terminals.
5. The method according to claim 1, wherein the normalized uplink message has a length preset by the central processing unit.
6. The method according to claim 1, wherein the normalized uplink message is generated by a first preconfigured DNN.
7. The method according to claim 1, wherein the change cycle of the transmit power determination vector is determined based on a channel change cycle between a terminal and the dAP.
8. The method according to claim 1, wherein the change cycle of the transmit power determination vector is preset by the central processing unit.
9. The method according to claim 1, wherein the change cycle of the transmit power determination vector is determined differently for each group based on a movement speed of terminals communicating within the dAP.
10. A method of a central processing unit, comprising: when an update cycle of a downlink message arrives, receiving uplink messages corresponding to long-term local channel state information (CSI) respectively from two or more distributed access points (dAPs) communicating with terminals through a fronthaul; generating one downlink message based on a pooling operation on the received uplink messages; and transmitting the downlink message to the dAPs, wherein each of the uplink messages is information normalized to a value within a preconfigured limit range.
11. The method according to claim 10, wherein the one downlink message is generated by a preconfigured second deep neural network (DNN).
12. The method according to claim 10, further comprising: configuring length information of the uplink message to each of the dAPs.
13. The method according to claim 10, wherein the central processing unit is an open-radio access network (O-RAN) central unit (CU) of an O-RAN system.
14. The method according to claim 10, wherein the update cycle of the downlink message is determined based on channel state change information received from each of the dAPs.
15. The method according to claim 14, further comprising: transmitting information on the update cycle of the downlink message to each of the dAPs.
16. A distributed access point (dAP) comprising a processor, wherein the processor causes the dAP to perform: when a change cycle of a transmit power determination vector arrives, generating an uplink message including long-term local channel state information (CSI), the uplink message being normalized such that the long-term local CSI becomes a value within a preconfigured limit range; transmitting the uplink message to a central processing unit through a fronthaul; receiving a downlink message vector for power allocation from the central processing unit through the fronthaul; generating decentralized determination information using the downlink message vector; and extracting a transmit power determination vector based on the decentralized determination information, wherein the decentralized determination information includes an output vector for generating a local power allocation value and a variable for the dAP.
17. The dAP according to claim 16, wherein the processor further causes the dAP to perform: extracting power allocation information corresponding to each of terminals based on the decentralized determination information; determining a transmit power for a channel transmitted to each of the terminals based on the power allocation information; and communicating with each of the terminals by using the determined transmit power.
18. The dAP according to claim 17, wherein the transmit power for the channel transmitted to each of the terminals is determined by a third preconfigured deep neural network (DNN).
19. The dAP according to claim 16, wherein the long-term local CSI is calculated based on channel state information and a long-term path loss with each of communicating terminals.
20. The dAP according to claim 11, wherein the normalized uplink message has a length preset by the central processing unit.
Description
BRIEF DESCRIPTION OF DRAWINGS
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
DETAILED DESCRIPTION OF THE EMBODIMENTS
[0039] Since the present disclosure may be variously modified and have several forms, specific exemplary embodiments will be shown in the accompanying drawings and be described in detail in the detailed description. It should be understood, however, that it is not intended to limit the present disclosure to the specific exemplary embodiments but, on the contrary, the present disclosure is to cover all modifications and alternatives falling within the spirit and scope of the present disclosure.
[0040] Relational terms such as first, second, and the like may be used for describing various elements, but the elements should not be limited by the terms. These terms are only used to distinguish one element from another. For example, a first component may be named a second component without departing from the scope of the present disclosure, and the second component may also be similarly named the first component. The term and/or means any one or a combination of a plurality of related and described items.
[0041] When it is mentioned that a certain component is coupled with or connected with another component, it should be understood that the certain component is directly coupled with or connected with to the other component or a further component may be disposed therebetween. In contrast, when it is mentioned that a certain component is directly coupled with or directly connected with another component, it will be understood that a further component is not disposed therebetween.
[0042] The terms used in the present disclosure are only used to describe specific exemplary embodiments, and are not intended to limit the present disclosure. The singular expression includes the plural expression unless the context clearly dictates otherwise. In the present disclosure, terms such as comprise or have are intended to designate that a feature, number, step, operation, component, part, or combination thereof described in the specification exists, but it should be understood that the terms do not preclude existence or addition of one or more features, numbers, steps, operations, components, parts, or combinations thereof.
[0043] Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. Terms that are generally used and have been in dictionaries should be construed as having meanings matched with contextual meanings in the art. In this description, unless defined clearly, terms are not necessarily construed as having formal meanings.
[0044] A communication system to which exemplary embodiments according to the present disclosure are applied will be described. The communication system to which the exemplary embodiments according to the present disclosure are applied is not limited to the contents described below, and the exemplary embodiments according to the present disclosure may be applied to various communication systems. Here, the communication system may have the same meaning as a communication network.
[0045] Throughout the present disclosure, a network may include, for example, a wireless Internet such as wireless fidelity (WiFi), mobile Internet such as a wireless broadband Internet (WiBro) or a world interoperability for microwave access (WiMax), 2G mobile communication network such as a global system for mobile communication (GSM) or a code division multiple access (CDMA), 3G mobile communication network such as a wideband code division multiple access (WCDMA) or a CDMA2000, 3.5G mobile communication network such as a high speed downlink packet access (HSDPA) or a high speed uplink packet access (HSUPA), 4G mobile communication network such as a long term evolution (LTE) network or an LTE-Advanced network, 5G mobile communication network, or the like.
[0046] Throughout the present disclosure, a terminal may refer to a mobile station, mobile terminal, subscriber station, portable subscriber station, user equipment, access terminal, or the like, and may include all or a part of functions of the terminal, mobile station, mobile terminal, subscriber station, mobile subscriber station, user equipment, access terminal, or the like.
[0047] Here, a desktop computer, laptop computer, tablet PC, wireless phone, mobile phone, smart phone, smart watch, smart glass, e-book reader, portable multimedia player (PMP), portable game console, navigation device, digital camera, digital multimedia broadcasting (DMB) player, digital audio recorder, digital audio player, digital picture recorder, digital picture player, digital video recorder, digital video player, or the like having communication capability may be used as the terminal.
[0048] Throughout the present disclosure, the base station may refer to an access point, radio access station, node B (NB), evolved node B (eNB), base transceiver station, mobile multihop relay (MMR)-BS, or the like, and may include all or part of functions of the base station, access point, radio access station, NB, eNB, base transceiver station, MMR-BS, or the like.
[0049] Hereinafter, preferred exemplary embodiments of the present disclosure will be described in more detail with reference to the accompanying drawings. In describing the present disclosure, in order to facilitate an overall understanding, the same reference numerals are used for the same elements in the drawings, and redundant descriptions for the same elements are omitted.
[0050]
[0051] Referring to
[0052] It is assumed that the distributed MIMO system illustrated in
[0053] In the present disclosure, for convenience of description, it is assumed that each of the dAPs 111, 112, . . . , and 113 has a single antenna. However, each of the M dAPs 111, 112, . . . , and 113 and the K terminals 101, 102, . . . , and 103 may have a plurality of antennas. In this case, the number of antennas or antenna panels may be two or more, and all of the M dAPs 111, 112, . . . , and 113 may have the same number of antennas or antenna panels. As another example, each of the M dAPs 111, 112, . . . , and 113 may have a different number of antennas or a different number of antenna panels. The K terminals 101, 102, . . . , and 103 may all have the same number of antennas, or each of the K terminals 101, 102, . . . , and 103 may have a different number of antennas.
[0054] In addition, for convenience of description, it is assumed that the maximum transmit power of each of the dAPs 111, 112, . . . , and 113 has the same value of P, and a fronthaul link between the CPU 121 and each of the dAPs 111, 112, . . . , and 113 also has the same limited capacity. However, the present disclosure is not limited thereto, and based on the description below, a transmit power of each of the dAPs 111, 112, . . . , and 113 may have a different value. In addition, the fronthaul link capacity between each the dAPs 111, 112, . . . , and 113 and the CPU 121 may be configured to a different value. For example, the fronthaul link capacities configured to different values may mean that a fronthaul link capacity between the first dAP 111 and the CPU 121 is configured to a first value, and the fronthaul link capacity between the second dAP 112 and the CPU 121 is configured to a second value different from the first value.
[0055] In the configuration of
[0056] In Equation 1 and Equation 2, M may correspond to the number of dAPs, and K may correspond to the number of terminals.
[0057] When channel coefficients between the m-th dAP and the k-th terminal are h.sub.k,m, the channel coefficients may usually follow a distribution of h.sub.k,m?(0, ?.sub.k,m) based on Gaussian noises, and a long-term path loss of a link between the m-th dAP and the k-th terminal may be expressed as Equation 3 below.
[0058] Using a standard channel acquisition process, actual local channel state information (CSI) for each of the dAPs 111, 112, . . . , and 113 may be obtained as Equation 4 below, and an estimate of the local CSI may be obtained as Equation 5 below. A value of Equation 5 may be a short-term local CSI estimate or a short-term CSI estimate.
[0059] In Equation 5, ?.sub.k,m may be modeled as in Equation 6 below.
[0060] In Equation 6, e.sub.k,m is a channel estimation error. When using a linear MMSE estimator, e.sub.k,m is independent of ?.sub.k,m. ?.sub.k,m and e.sub.k,m follow distributions shown in Equation 7 below, respectively.
[0061] In Equation 7, ? has a value of [0,1] and represents an error rate. The error rate may depend on a signal to noise ratio (SNR) of a pilot symbol. Therefore, the error rate may be regarded as an arbitrary value that changes dynamically depending on a propagation environment. In addition, statistics on the channel coefficients may be obtained through mathematical channel modeling or may be obtained from channel big data obtained from an actual system.
[0062] In case of centralized interference management, the local CSI estimate needs to be shared with the CPU 121 through fronthaul coordination. However, these frequent updates of short-term CSIs result in significant fronthauling overhead. As an example of solutions to reduce the fronthaul overhead may be that each of the dAPs 111, 112, . . . , and 113 delivers its local long-term CSI to the CPU 121. Here, the local long-term CSI may be expressed as Equation 8 below.
[0063] By having each of the dAPs 111, 112, . . . , and 113 deliver its local long-term CSI to the CPU 121, the CPU 121 can reduce signaling overhead in fronthaul coordination. In addition, the CPU 121 can mitigate interference between users by using long-term fading.
[0064] The m-th dAP may calculate a beam-direction setting precoding w.sub.k,m for the k-th terminal using only its local CSI. In general, as a precoding scheme using local CSI, a conjugate beamforming (hereinafter CB) scheme and a local regularized zero forcing (hereinafter L-RZF) scheme may be used, and these are calculated using Equation 9 below.
[0065] A transmit signal x.sub.m of the m-th dAP may be expressed as Equation 10 below.
[0066] In Equation 10, s.sub.k may represent a data symbol for the k-th terminal, and p.sub.k,m may represent a transmit power allocated to transmit s.sub.k by the m-th AP. A total transmit power of the m-th dAP may be defined as Equation 11 below.
[0067] This may depend on the maximum power P per dAP, such as ?{square root over (p.sub.k,m)}?P, m?
. An achievable data rate R.sub.k of the k-th terminal may be expressed as Equation 12 below.
[0068] In Equation 12, an index set of the local CSI estimates for the m-th dAP may be ??, an index set of channel estimation errors for the m-th dAP may be e?
, and an index set of the maximum powers for the m-th dAP may be p?
. A signal to interference plus noise ratio (SNIR) for the k-th terminal may be defined as in Equation 13 below.
[0069] In the present disclosure, a network utility function U(?, e, p) needs to be maximized by optimizing the transmit power p with respect to channel statistics (?, e, ?). Popular choices for the network utility function U() may be a sum-rate (SR), minimum-rate (MR), or proportional-fairness (PF), each of which may be expressed as Equation 14 to Equation 16.
[0070] In other words, Equation 14 represents a case of maximizing the network utility function U() using the sum-rate (SR), Equation 15 represents a case of maximizing the network utility function U() using the minimum rate (MR), and Equation 16 represents a case of maximizing the network utility function U() using the proportional-fairness (PF).
[0071] Accordingly, the optimization problem for maximizing the network utility may be expressed as Equation 17 below.
[0072] In the following description, for convenience of description, the first row (top line) of Equation 17 will be described as Equation 17a, and the second row (bottom line) of Equation 17 will be described as Equation 17b.
[0073] Equation 17 is generally nonconvex. Therefore, it is not easy to obtain a globally optimal solution therefor. An expected value for a randomly distributed CSI (?, e, ?) has no analytical formula. This makes it difficult to apply traditional nonconvex optimization techniques. Methods known to date propose a traceable closed-form approximation for an utility based on an average transmission rate. According to the approximation method, all short-term fading coefficients may be simply removed using Jensen's inequality, which leads to model mismatch between a transmission rate and its approximated value. In addition, since the representation of the approximated rate relies only on long-term channel statistics, there is no room to utilize short-term CSI in optimizing power control parameters. Moreover, the individually deployed dAPs 111, 112, . . . , and 113 require a new decentralized calculation structure.
[0074] Each of the dAP 111, 112, . . . , and 113 may need to infer its local power allocation solution p.sub.m based only on partial network knowledge, that is, the local CSI vectors ?.sub.m and ?.sub.m. Such partial observations are insufficient to individually recover the optimal solution of Equation 17. Therefore, interaction between the dAPs 111, 112, . . . , and 113 may be essential to configure effective power control schemes.
[0075] The present disclosure proposes a low-complexity solution to Equation 17 described above using deep learning technology. In addition, as described above, there is no optimal solution to Equation 17. Therefore, in the present disclosure, instead of adopting supervised learning methods, a method for identifying an unsupervised deep learning framework is used. This can be implemented even without knowledge of the optimal solution to Equation 17.
[0076] In the present disclosure, the original problem presented in Equation 17 is transformed into a functional optimization problem to be suitable for generalized learning. According to this transformation, the targets of optimization may be transformed into a function representative of the optimization procedure. An arbitrary problem with specified inputs and outputs can be refined into functional optimization tasks.
[0077] In addition, Equation 17 may be regarded as a procedure for identifying a solution p for arbitrarily given channel statistics (?, e, ?) and system parameter P. This input-output relationship may be captured by a functional operator p=(?, ?, P). By applying the functional operator to Equation 17 described above, functional optimization expressed as Equation 18 below may be obtained.
[0078] In the following description, for convenience of description, the first row (top line) of Equation 18 will be described as Equation 18a, and the second row (bottom line) of Equation 18 will be described as Equation 18b. In addition, it can be seen that Equation 18b is the same as Equation 17b described above.
[0079] As a result, by solving Equation 18, a general mapping rule (?) for an arbitrarily given input {?, e, ?, P} may be obtained.
[0080] In the present disclosure, the operator (?), which is the mapping rule, may be designed through cooperation between the CPU 121 and the dAPs 111, 112, . . . , and 113, so that computing powers and short-term CSIs of the dAPs 111, 112, . . . , and 113 can be utilized maximally while minimizing the fronthaul overhead.
[0081] For this purpose, the operator (?), which is the mapping rule, may be divided into an uplink fronthaul cooperation message generation operator
(?) and a distributed power allocation determination operator
(?) performed in each dAP, and a downlink fronthaul cooperation message generation operator
(?) performed in the CPU. Each of these operators may refer to processing of a deep neural network (DNN) illustrated in
[0082]
[0083] Operations of
[0084] In the following description, when describing operations of the dAP with reference to
[0085] In the following description, the dAP will be described as representing a specific dAP. However, it should be noted that the dAP described below and all dAPs illustrated in
[0086] The CPU 121 may need to collect local information from the dAPs for uplink fronthaul cooperation. Therefore, the CPU 121 may instruct the dAPs 111, 112, . . . , and 113 to perform the operation of
[0087] In describing with reference to
[0088] Referring to
[0089] First, in Equation 19, ?.sub.m may be input characteristics for the m-th dAP, and may mean information (or value) on a path loss between the k-th terminal and the m-th dAP.
[0090] In Equation 20, data preprocessing may be performed so that the input characteristics, a result of normalizing the long-term local CSI ?.sub.m, are located within a limited region or have a value within a limited range as shown in Equation 21 below.
[0091] In step S212, the m-th dAP may generate an uplink message having a length .sub.U as shown in Equation 22 below by using the input characteristics on which the preprocessing operation of step S210 has been performed as shown in Equation 19. In this case, the length of the uplink message may be a predetermined length. The predetermined length value may be a length agreed with the CPU 121 or a length indicated (or set) by the CPU 121. By setting the length of the uplink message to a specific value, learning may be performed without changing the size of DNNs described in
[0092] The uplink message um.sub.m in Equation 22 may have a relationship shown in Equation 23.
[0093] In Equation 22, may be implemented using parameters
trainable in the m-th dAP. Here, when the operation of the dAP is implemented using DNN(s), the trainable parameters
may mean connection weights between nodes constituting the respective layers described below. As in Equation 22, the m-th dAP belonging to the total M dAPs may use a dedicated individual operator
. However, this scheme lacks flexibility for the number M of dAPs. In other words, there is a problem that a group of operators
implemented based on the specific total number M of the dAPs cannot be applied equally to networks with different number of dAPs.
[0094] Due to this problem, networks with a variable number of dAPs may need to implement multiple operators for all possible distributed MIMO configurations. In other words, there is a problem of having to implement a plurality of operators in advance in various forms to determine which operator to use based on the number of dAPs in the distributed MIMO network where the dAPs are deployed.
[0095] To solve this problem, the present disclosure proposes to adopt a scalable architecture in which operator implementation is independent of the number M of APs. In other words, all dAPs reuse the same operator as shown in Equation 24 below to realize the corresponding uplink message generation inference.
[0096] When using the operator of Equation 24, the uplink message um.sub.m generated by the dAP may be modified as Equation 25 below instead of Equation 22.
[0097] Accordingly, the m-th dAP according to the present disclosure may generate the uplink message as exemplified in Equation 25 using trainable parameters that can be used regardless of the number of dAPs, as shown in Equation 25 in step S212. In this case, the length of the uplink message may be set to the length described above. From Equation 25, it can be seen that the operator has no dependence on m. This allows the same operator
to be used in all dAPs, and the output uplink message may vary depending on the input of the operator. As a result, since the operator is replaced by a neural network, there is an advantage in that the same neural network can be used for all dAPs regardless of the number of dAPs.
[0098] In step S214, the m-th dAP may deliver the uplink message generated as shown in Equation 25 to the CPU 121 through a fronthaul link.
[0099] Steps S210 to S214 described above may be performed in all dAPs as described above.
[0100] Then, the operations performed by the CPU 121 will be described below with reference to
[0101] In step S240, the CPU 121 may receive the uplink messages um.sub.m from the M dAPs 111, 112, . . . , and 113. In this case, the CPU 121 may calculate the uplink messages um.sub.m received from all dAPs as one uplink message as shown in Equation 26 below based on pooling.
[0102] The operation of Equation 26 may use a superposition coding concept of a non-orthogonal multiple access system. Through this, unnecessary statistics may be removed and important uplink message characteristics um may be extracted from individual dAP message vectors without changing the message length. As a result, dimension-independent fronthaul cooperation can be effectively utilized.
[0103] In step S242, the CPU 121 may use the operator of the CPU 121 with the parameter set
to convert the pooled information vector into an output (i.e. downlink message with a length of
.sub.dn). Here, the parameter set may mean weights for the connections between nodes included in the respective layers constituting the DNN of the CPU 121. Therefore, the parameter set may be updated when the DNN is trained. In the present disclosure, further description on a learning procedure for the DNNs will be omitted. The configuration (structure) of the DNNs will be described with reference to
[0104] In addition, since the length of the uplink message is determined to be a specific value as described above, the length of the downlink message may also be determined to be a specific value. In other words, since one downlink message is generated by performing a pooling operation on the uplink messages, the downlink message may also have a specific length. For example, the downlink message may have the same length as the uplink message.
[0105] In addition, the pooled information vector may be exemplified as shown in Equation 27 below, and the downlink message may have a relationship as shown in Equation 28 below.
[0106] The CPU 121 may generate the downlink message in form of Equation 29 below based on the operators of the CPU 121 and Equations 27 and 28.
[0107] The downlink message calculated as in Equation 29 may be a downlink communication message to be broadcast to all dAPs.
[0108] Therefore, the CPU 121 may transmit the downlink message to all dAPs through the fronthaul link in step S244. In step S246, the CPU 121 may identify whether an update cycle of the downlink message arrives. When the update cycle of the downlink message does not arrive, the CPU 121 may wait until the update cycle of the downlink message arrives. On the other hand, when the update cycle of the downlink message arrives, the CPU 121 may repeatedly perform steps S240 to S244 described above.
[0109] The procedures of steps S240 to S244 described above may be a downlink message generation operation using long-term CSI. Therefore, the update cycle of the downlink message in step S246 may be set to a cycle at which the long-term CSI statistics change.
[0110] As described in
[0111] Meanwhile, in the operation described above, the CPU 121 performs a pooling operation on all dAPs as in Equation 26 in step S240 and then generates the downlink communication message as in Equation 30 in step S242. However, another method is also possible. For example, the CPU 121 may change the order of the pooling operation and the downlink message generation operation. In other words, if the CPU 121 defines the latent characteristics of the uplink message um.sub.m as in Equation 30 below, the latent characteristics of the uplink message may be extracted as in Equation 31 below.
[0112] The unique operator for the uplink messages in Equation 25 may parallelly generate a group of information vectors expressed as Equation 32 below.
[0113] The CPU 121 may use the concept of superposition coding of a non-orthogonal multiple access system, thereby generating a downlink message vector dm as shown in Equation 33 below as an average for the m-th dAP, which is an element of the M dAPs.
[0114] Based on one of the two schemes described above, the CPU 121 may transmit the downlink message to all dAPs in step S244. Accordingly, the m-th dAP in
[0115] Referring again to
[0116] In step S218, the m-th dAP may generate decentralized determination information. Hereinafter, generation of the decentralized determination information will be described. The m-th dAP may determine a local power allocation value (i.e. total transmit power of the m-th dAP) using the local CSI, which is its input characteristics expressed as Equation 19, and an estimate of the short-term CSI defined as Equation 5. Since the total transmit power of the m-th dAP needs to satisfy Equation 17b described above, one operator with parameters trainable in all dAPs may be implemented as shown in Equation 34 below. Here, the parameters trainable in each of all dAPs may be the same parameters.
[0117] If an output result by the operator of Equation 34 is d.sub.m, the local power allocation value p.sub.m may be determined using the output result. In other words, the m-th dAP may implement calculation of the operator of Equation 34 as shown in Equation 35 below.
[0118] The output vector d.sub.m of the operator exemplified in Equation 34 may be defined as shown in Equation 36 below.
[0119] Then, the remaining elements d.sub.k,m?0, ?k? excluding the last element exemplified in Equation 36 may control a ratio between transmit power variables defined as in Equation 37 below. The information described above may be the decentralized determination information. In other words, the output vector d.sub.m of the operator shown in Equation 34 and the last element of Equation 36 may be used as the decentralized determination information. In the following description, the last element ?.sub.m of Equation 36 will be referred to as first information for decentralization decision, and ?.sub.m may be a variable for the m-th dAP.
[0120] The m-th dAP may extract a power allocation variable for each terminal in step S220. The power allocation variable for each terminal may correspond to a postprocessing operation. Therefore, a ratio between transmit power variables defined by Equation 37 below may be a power allocation variable for each terminal.
[0121] On the other hand, the last element of Equation 36, the first information for decentralized determination, may determine the total transmit power to be consumed by the m-th dAP. In order to limit a possible range of the first information for decentralized determination, which is the last element of Equation 36, to [0,P], the first information for decentralized determination may be normalized as in Equation 38 below. Here, P may be the maximum power value that can be transmitted by the m-th dAP, as described above.
[0122] In addition, the power allocation variable p.sub.k,m may be recovered from the output vector d.sub.m of the operator in Equation 34 as shown in Equation 39 below.
[0123] The results according to Equation 38 and Equation 39 may always lead to a solution that satisfies the power constraints of Equation 17b described above, as shown in Equation 40 below.
[0124] The generation cycle of the downlink message d.sub.m received from the CPU 121 and the update cycle of the uplink message of the m-th dAP may be determined according to the long-term CSI change cycle. The long-term CSI change cycle has a relatively much larger value than a short-term CSI change cycle. Therefore, the fronthaul overhead caused by exchanging two messages is much smaller than the overhead caused by short-term CSI exchange.
[0125] In addition, considering that short-term CSI is used as an input to the operator of Equation 35 as shown in Equation 36, the m-th dAP may repeat the process of deriving power allocation variables for the respective terminals with a short-term CSI change cycle using the same downlink message d.sub.m. The power allocation variable for each terminal may be expressed as Equation 39 described above.
[0126] Finally, in step S222, the dAP may identify whether the update cycle of the output vector d.sub.m arrives. When the update cycle of the output vector d.sub.m arrives, the dAP may proceed to step S210, and when the update cycle of the output vector d.sub.m does not arrive, the dAP may proceed to step S218.
[0127] The output vector may be the downlink message d.sub.m as described above, and the output vector may be a vector that determines the transmit power. The update cycle of the output vector may be set in various manners.
[0128] For example, the update cycle of the output vector may be set in advance by the CPU 121. When it is preset by the CPU 121, the CPU 121 may transmit the set output vector update cycle to each of the dAPs. As another example, the update cycle of the output vector may be set independently by each dAP.
[0129] When the CPU 121 or each dAP determines the update cycle of the output vector, the following methods may be used.
[0130] The update cycle of the output vector may be determined based on channel variability. For example, when a dAP is installed in an area where many high-speed vehicles move, such as near a highway, the channel may change very quickly. In cases where the channel change speed is fast, the update cycle of the output vector may be set to a short value. On the other hand, in cases where the movement speed of most users is slow, such as in schools, factories, large buildings, etc., the update cycle of the output vector may be set to a long value. In addition, in areas where vehicle movement and human movement are mixed, the update cycle of the output vector may be determined based on an average channel change speed. As another example, a channel change cycle may be individually set for each individual terminal. As another example, a channel change cycle may be set for each specific group.
[0131] Setting the channel change cycle for each individual terminal or specific group may be necessary in the following cases. For example, assuming a highway rest area, vehicles that do not stop at the rest area may move at high speeds. On the other hand, users moving within the highway rest area may move at a very slow speed compared to vehicles. Therefore, in this case, if an average of the two values is used, both users in the rest area and high-speed vehicles may experience unsatisfactory channel environments. Therefore, in the above-described environment, individual users may be divided into groups of high-speed moving objects and low-speed users, and the channel change cycle may be set for each group.
[0132] When the channel change cycle described above is determined by the CPU 121 and transmitted to each dAP, each of the dAPs may receive and use it. On the other hand, when each dAP determines the channel change cycle, information on the channel change cycle determined by each dAP may be reported to the CPU 121. It should be noted that
[0133] To summarize the operations described above with reference to (?; ?) of the cooperative operation according to the present disclosure as p=
(?, ?; ?). Here, the end-to-end forward pass mapping factor may be defined as Equation 41 below.
[0134] The end-to-end forward pass mapping factor expressed as Equation 41 may represent collection of all trainable parameters.
[0135] The remaining task is to design correct DNNs that successfully approximate the intractable operator (?). In general, it has been theoretically shown that DNN can approximate arbitrary functions within a small error.
[0136] Based on the methods described above, each dAP can communicate with at least one terminal that communicates with it through beamforming.
[0137] In the present disclosure, the operator of Equation 24 expressed as Equation 25, the operator defined as Equation 31 that calculates the latent characteristics of the uplink message defined as Equation 30, and the operator
defined as Equation 35 may be modeled as DNNs that perform basic computational functions to approximate the operator
(?). Hereinafter, a method of modeling such the DNNs will be described with reference to
[0138]
[0139] In the present disclosure, cooperation may mean cooperation between computational operations in a processor included in the dAP or a DNN driven by the processor and computational operations in a processor included in the CPU 121 or a DNN driven by the processor. In other words, this may refer to a procedure in which, in order to obtain a final result, a result of a first operation (or processing) performed in the dAP is received by the CPU 121, a second operation (or processing) is performed by the CPU 121, and a third operation (processing) is performed by the CPU 121 on a result of the second operation (or processing).
[0140] In addition, parameters of the DNN may be specified by a learning procedure. Therefore, in the present disclosure, cooperative learning may refer to a process of training the DNNs provided in each of the dAP and the CPU 121 through cooperation between the dAP and the CPU 121, or a procedure performed by the DNN provided in each of the dAP and the CPU 121 using the trained parameters.
[0141]
[0142] For the input vector defined as Equation 42, calculations of an L-layer DNN with a trainable parameter set ? may be given as Equation 43 below.
[0143] In Equation 43, a.sub.l(?), l=1, . . . , L may be an activation function of an l-th layer, and when N.sub.l represents an output resource of the l-th layer, a weight matrix may be expressed as Equation 44 below, and a bias vector may be expressed as Equation 45 below.
[0144] These may constitute the trainable parameter set described above, and the trainable parameter set may be expressed as Equation 46 below.
[0145] The operators (?; ),
, and
for calculating the end-to-end forward pass mapping factors expressed in Equation 41 may be respectively modeled as DNNs as shown in Equations 47 to 49 below.
[0146] In this case, the input vector of the uplink fronthaul cooperation message generation operator DNN illustrated in
[0147] In
[0148] Information input to each node of the input layer 311 may be a normalized value of the long-term local CSI, as previously described in
[0149] In
[0150] Referring to
[0151] In
[0152] Referring to
[0153] In
[0154] The m-th dAP may allocate power to a channel (or signal) transmitted to each of the terminals communicating within the m-th dAP based on the output of the distributed power allocation determination operator DNN 330 illustrated in
[0155]
[0156] Referring to
[0157] Device local CSI may be output by an estimate calculation unit 410 calculating the short-term CSI estimate. The short-term CSI estimate may be input to an uplink fronthaul cooperation message generation operator DNN 420. The uplink fronthaul cooperation message generation operator DNN may perform the operation as previously described in
[0158] The distributed power allocation determination operator DNN 450 may generate a downlink communication message to be broadcast to all dAPs, and calculate and output the first information for decentralized determination as previously described in Equation 36.
[0159] The downlink communication message and the first information for decentralized determination that are the output of the distributed power allocation determination operator DNN 450 may be input to a transmit power determination unit 460. The transmit power determination unit 460 may use each input to generate power allocation variables through calculations such as Equation 39 described above.
[0160] The power allocation variables may be used as an output of the power allocation deep neural network 400, and simultaneously input to a loss calculation unit 470.
[0161] The loss calculation unit 470 may calculate a loss value using the power allocation variables, channel estimation error, and short-term local CSI estimate as inputs. The loss value calculated may be input to the uplink fronthaul cooperation message generation operator DNN 420, the downlink fronthaul cooperation message generation operator DNN 440, and the distributed power allocation determination operator DNN 450.
[0162] Hereinafter, a derivation process by which the power allocation deep neural network 400 as shown in
[0163] As previously described with reference to
[0164] By substituting the value of Equation 53 into Equation 17 described above, a DNN training problem such as Equation 54 below may be established.
[0165] Equation 17b, which is the power limitation, may be eliminated from Equation 54. The reason is that the power limit is always satisfied by Equation 38 and Equation 39 described above. Therefore, the training problem of Equation 54 may be directly handled by mini-batch stochastic gradient descent (SGD) algorithms such as an Adam optimizer. A loss function used in SGD algorithms may be defined as Equation 55 below.
[0166] A training data set may include numerous realizations of long-term CSI ?. At each training epoch, one mini-batch set comprising long-term CSIs may be arbitrarily selected. The long-term CSIs may be collected in advance by experiments or generated based on well-known dAP-UE deployment scenarios.
[0167] Then, short-term CSI estimates and error vectors may be generated using known distributions as shown in Equation 7. Since an error rate ? in Equation 7 randomly changes in real situations, it is necessary to construct multipurpose DNNs that are adaptive to the randomly changing ?. To this end, in the present disclosure, an error rate factor may be randomly generated in the training step. In other words, it may be generated from a uniform distribution ??(0,1).
[0168] As a result, the cooperative learning proposed in the present disclosure may be universally adapted to the arbitrary CSI error statistics ?. These may be utilized to calculate a gradient of Equation 54, which is a training target averaged over the mini-batch set. As a result, according to the cooperative learning proposed in the present disclosure, several artificially generated CSI error samples may be observed and trained. By observing and training CSI error samples as described above, the DNN may support a powerful power allocation mechanism by learning an unknown distribution of actual CSIs based on the estimates.
[0169] The proposed cooperative training process as shown in
[0170] At this implementation stage, CSI errors are no longer needed, since the proposed cooperative learning only uses long-term CSIs and short-term CSI estimates, as defined by Equation 53.
[0171] The number M of dAPs may be considered as a hyper-parameter of the proposed cooperative learning strategy. When the number of dAPs considered in the training phase is assumed to be M.sub.train, in order to further improve scalability, it needs to be carefully selected so that a result of the proposed cooperative learning based on a specific M.sub.train works well universally over a wide range of test dAP numbers M.sub.test. Small or large M.sub.train values may cause overfitting problems in which the result of cooperative learning only works in a specific network configuration. Therefore, the optimal choice for M.sub.train may not be equal to the test dAP number M.sub.test.
[0172]
[0173]
[0174] According to the O-RAN architecture, a RAN 520 may be configured with three types of logical functional unitsan O-RAN central unit (O-CU) 521, O-RAN distributed units (O-DUs) 531, 532, and 533, and O-RAN radio units (O-RUs) 541, 542, and 543. The O-RUs 541, 542, and 543 may communicate with terminals 551, 552, and 553, respectively. Here, the terminals 551, 552, and 553 may correspond to the terminals 101, 102, and 103 previously described in
[0175] As illustrated in
[0176] More specifically, the SMO/RIC 510 may include a non-real time RIC and a near-real time RIC therein. The SMO/RIC 510 proposed to date may automatically manage life-cycles of AI/ML models. However, the SMO/RIC 510 proposed to date does not consider deployment of AI/ML on the O-CU 521, O-DUs 531, 532, and 533, and O-RUs 541, 542, and 543. Therefore, in the present disclosure, the AI/ML components described in
[0177] Meanwhile, the dAP of the distributed MIMO system shown in
[0178] The cooperative learning model illustrated in
[0179] As another example, the SMO/RIC 510 may deploy all individual cooperative learning models in advance on the O-CU 521 and O-DUs 531, 532, and 533, and select a suitable cooperative learning model according to a situation based on policy information.
[0180] The method of deploying the cooperative learning models described above on the O-CU/O-DUs may be performed using the existing interfaces of O-RAN or using newly-defined interfaces.
[0181]
[0182] The configuration of
[0183] In other words, the configuration of
[0184] Referring to
[0185] A memory 612 may store control information for the operations of the DNNs according to the present disclosure and various information for operations in the corresponding communication node.
[0186] A receiver 613 may be configured to receive signals from other communication nodes. For example, if a received signal is a radio frequency (RF) signal, the receiver 613 may be configured to receive and process the RF signal. As another example, if a received signal is received through a wired line, the receiver 613 may be configured to process the signal received through the wired line.
[0187] A transmitter 614 may be configured to transmit signals to other communication nodes. For example, if an RF signal is transmitted to another communication node, the transmitter 614 may be configured to transmit the RF signal. As another example, if a signal is transmitted through a wired line, the transmitter 614 may be configured to transmit the signal through the wired line.
[0188] An interface 615 may provide various interfaces for connection with operators or other devices. For example, when the configuration in
[0189] A bus 601 may provide a path for data and/or control signals between the respective components illustrated in
[0190] The operations of the method according to the exemplary embodiment of the present disclosure can be implemented as a computer readable program or code in a computer readable recording medium. The computer readable recording medium may include all kinds of recording apparatus for storing data which can be read by a computer system. Furthermore, the computer readable recording medium may store and execute programs or codes which can be distributed in computer systems connected through a network and read through computers in a distributed manner.
[0191] The computer readable recording medium may include a hardware apparatus which is specifically configured to store and execute a program command, such as a ROM, RAM or flash memory. The program command may include not only machine language codes created by a compiler, but also high-level language codes which can be executed by a computer using an interpreter.
[0192] Although some aspects of the present disclosure have been described in the context of the apparatus, the aspects may indicate the corresponding descriptions according to the method, and the blocks or apparatus may correspond to the steps of the method or the features of the steps. Similarly, the aspects described in the context of the method may be expressed as the features of the corresponding blocks or items or the corresponding apparatus. Some or all of the steps of the method may be executed by (or using) a hardware apparatus such as a microprocessor, a programmable computer or an electronic circuit. In some embodiments, one or more of the most important steps of the method may be executed by such an apparatus.
[0193] In some exemplary embodiments, a programmable logic device such as a field-programmable gate array may be used to perform some or all of functions of the methods described herein. In some exemplary embodiments, the field-programmable gate array may be operated with a microprocessor to perform one of the methods described herein. In general, the methods are preferably performed by a certain hardware device.
[0194] The description of the disclosure is merely exemplary in nature and, thus, variations that do not depart from the substance of the disclosure are intended to be within the scope of the disclosure. Such variations are not to be regarded as a departure from the spirit and scope of the disclosure. Thus, it will be understood by those of ordinary skill in the art that various changes in form and details may be made without departing from the spirit and scope as defined by the following claims.