ADJUSTING ALIGNMENT FOR MICROWAVE TRANSMISSIONS BASED ON AN RL MODEL
20220255642 · 2022-08-11
Assignee
Inventors
- Kaushik Dey (Kolkata, IN)
- Abhishek Sarkar (Bengaluru, IN)
- Ashis Kurmar ROY (Durgapur, IN)
- Dairaj Nagaraja HEGDE (Bengalore, IN)
Cpc classification
International classification
Abstract
It is provided a method for adjusting alignment for microwave transmissions from a microwave transmitter to a microwave receiver based on a reinforcement learning, RL, model. The method comprises the steps of: obtaining state space comprising external state space and internal state space, the external state space comprising at least one value of a parameter related to environmental conditions, and the internal state space relates to alignment of the microwave transmitter; determining an action in an action space, the action space comprising actions to adjust alignment of the microwave transmitter; obtaining a measurement of path loss for a transmission from the microwave transmitter to the microwave receiver; determining a reward value based on the path loss, wherein an increase in path loss results in a reduced reward value; and adjusting the RL model based on the obtained state space, the determined action and the determined reward value.
Claims
1. A method for adjusting alignment for microwave transmissions from a microwave transmitter to a microwave receiver based on a reinforcement learning (RL) model, the method being performed in an alignment adjuster and comprising the steps of: obtaining state space comprising external state space and internal state space, the external state space comprising at least one value of a parameter related to environmental conditions at the microwave transmitter or microwave receiver, and the internal state space comprising at least one value of a parameter related to alignment of the microwave transmitter; determining, based on the RL model and the state space, an action in an action space, the action space comprising actions to adjust alignment of the microwave transmitter; obtaining a measurement of path loss for a transmission from the microwave transmitter to the microwave receiver; determining a reward value based on the path loss, wherein an increase in path loss results in a reduced reward value; and adjusting the RL model based on the obtained state space, the determined action and the determined reward value.
2. The method of claim 1, wherein the external state space is based also on captured images passed through a convolutional neural network model.
3. The method of claim 1, wherein the external state space comprises at least one measurement of the following parameters: ambient temperature, rate of precipitation, humidity, wind speed, wind direction, air pressure.
4. (canceled)
5. (canceled)
6. The method of claim 1, wherein the internal state space comprises transmission power of the microwave transmitter.
7. The method of claim 6, wherein the step of determining a reward value comprises determining the reward value also based on the transmission power, wherein, all else being equal, an increase in transmission power results in a reduced reward value.
8. The method of claim 1, wherein the internal state space comprises at least one of vertical angle of the microwave receiver and horizontal angle of the microwave receiver.
9. The method of claim 1, further comprising the step of: triggering the collection of additional parameters related to environmental conditions when a sequence of successive reward values indicate a performance less than a threshold performance.
10. (canceled)
11. An alignment adjuster for adjusting alignment for microwave transmissions from a microwave transmitter to a microwave receiver based on a reinforcement learning, (RL) model, the alignment adjuster comprising: a processor; and a memory wherein the alignment adjuster is configured to: obtain state space comprising external state space and internal state space, the external state space comprising at least one value of a parameter related to environmental conditions at the microwave transmitter or microwave receiver, and the internal state space comprising at least one value of a parameter related to alignment of the microwave transmitter; determine, based on the RL model and the state space, an action in an action space, the action space comprising actions to adjust alignment of the microwave transmitter; obtain a measurement of path loss for a transmission from the microwave transmitter to the microwave receiver; determine a reward value based on the path loss, wherein an increase in path loss results in a reduced reward value; and adjust the RL model based on the obtained state space, the determined action and the determined reward value.
12. The alignment adjuster of claim 11, wherein the external state space is based also on captured images passed through a convolutional neural network model.
13. The alignment adjuster of claim 11, wherein the external state space comprises at least one measurement of the following parameters: ambient temperature, rate of precipitation, humidity, wind speed, wind direction, air pressure.
14. The alignment adjuster of claim 11, wherein the internal state space comprises vertical angle of the microwave transmitter.
15. The alignment adjuster of claim 11, wherein the internal state space comprises horizontal angle of the microwave transmitter.
16. The alignment adjuster of claim 11, wherein the internal state space comprises transmission power of the microwave transmitter.
17. The alignment adjuster of claim 16, wherein the instructions to determine a reward value comprise instructions that, when executed by the processor, cause the alignment adjuster to determine the reward value also based on the transmission power, wherein, all else equal, an increase in transmission power results in a reduced reward value.
18. The alignment adjuster of claim 11, wherein the internal state space comprises at least one of vertical angle of the microwave receiver and horizontal angle of the microwave receiver.
19. The alignment adjuster of claim 11, further comprising instructions that, when executed by the processor, cause the alignment adjuster to trigger the collection of additional parameters related to environmental conditions when a sequence of successive reward values indicate a performance less than a threshold performance.
20. The alignment adjuster of claim 19, wherein the instructions triggering comprises triggering at least one unmanned aerial vehicle to collect environmental data between the microwave transmitter and the microwave receiver.
21. A non-transitory computer readable medium storing a computer program for adjusting alignment for microwave transmissions from a microwave transmitter to a microwave receiver based on a reinforcement learning, (RL) model, the computer program comprising computer program code which, when run on an alignment adjuster, causes the alignment adjuster to: obtain state space comprising external state space and internal state space, the external state space comprising at least one value of a parameter related to environmental conditions at the microwave transmitter or microwave receiver, and the internal state space comprising at least one value of a parameter related to alignment of the microwave transmitter; determine, based on the RL model and the state space, an action in an action space, the action space comprising actions to adjust alignment of the microwave transmitter; obtain a measurement of path loss for a transmission from the microwave transmitter to the microwave receiver; determine a reward value based on the path loss, wherein an increase in path loss results in a reduced reward value; and adjust the RL model based on the obtained state space, the determined action and the determined reward value.
22. (canceled)
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0030] Aspects and embodiments are now described, by way of example, with reference to the accompanying drawings, in which:
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
DETAILED DESCRIPTION
[0039] The aspects of the present disclosure will now be described more fully hereinafter with reference to the accompanying drawings, in which certain embodiments of the invention are shown. These aspects may, however, be embodied in many different forms and should not be construed as limiting; rather, these embodiments are provided by way of example so that this disclosure will be thorough and complete, and to fully convey the scope of all aspects of invention to those skilled in the art. Like numbers refer to like elements throughout the description.
[0040]
[0041] In the scenario of
[0042]
[0043]
[0044] An issue with such a compensation is that the alignment adjustment depends on current environmental conditions. Hence, the alignment should to be dynamically adjusted, to adapt to changing conditions.
[0045]
[0046] Environmental conditions are captured at the microwave transmitter 1 and/or at the microwave receiver 2. For this purpose, at the site of the microwave transmitter 1, there are one or more weather sensors 7 and/or one or more cameras 8. Analogously, at the site of the microwave receiver 2 there can be weather sensor(s) 7 and/or a camera(s) 8. Images from the cameras(s) 8 can be used to determine current environmental conditions, e.g. by determining the presence or absence of rain, snow or fog. The weather sensor(s) can e.g. be used to measure humidity, temperature, wind direction and strength, precipitation etc. A path loss meter of the microwave receiver 2 is used to record the strength of the received signal.
[0047] The environmental conditions, captured by the one or more weather sensors 7 and one or more cameras 8, are provided to an alignment adjuster 10. In this embodiment, the alignment adjuster 10 is provided on the microwave transmitter 1 side. Also the path loss is fed to the alignment adjuster 10. The alignment adjuster 10 determines, as described in more detail below, any adjustment of alignment that is to occur, based on an RL model.
[0048] Alignment of the microwave transmitter 1 is then adjusted by the alignment adjuster 10 sending a signal to control a motor to adjust a horizontal angle 5, i.e. azimuth, of the transmission direction. Alternatively or additionally, alignment of the microwave transmitter 1 can be adjusted by the alignment adjuster 10 sending a signal to control a motor to adjust a vertical angle 6 of the transmission direction.
[0049] Optionally, alignment on the receiver side is also adjusted. In such a case, alignment of the microwave receiver 2 is adjusted by the alignment adjuster 10 sending a signal to control a motor to adjust a horizontal angle 5, i.e. azimuth, of the direction of receiving. Alternatively or additionally, alignment of the microwave receiver 2 can be adjusted by the alignment adjuster 10 sending a signal to control a motor to adjust a vertical horizontal angle 6 of the direction of receiving.
[0050] As described in more detail below, the alignment is here adjusted based on a reinforcement learning (RL) model. In the RL model, parameters describing environmental conditions form part of the current state. The vertical and/or horizontal angle of the microwave transmitter (and optionally of the microwave receiver) is the action of RL model.
[0051] The path loss is used to calculate the reward. The better the alignment adjustment for the given state, the lower the path loss is, which is results in a higher reward.
[0052] The RL model is trained by modelling a value function which assigns a negative reward for increase in misalignment. The magnitude of negative reward can be proportional to degree of reduction in transmission. When initially started, the RL model can be trained using simulation data. The training of the RL model can also involve assigning weights for combining the sensors data and the data for the pictures. Once the RL model is active, the weights are automatically determined by the RL model in the process of the maximising of the reward.
[0053]
[0054] In
[0055] In
[0056] In
[0057]
[0058] In an obtain state space step 40, the alignment adjuster obtains state space. The state space comprises external state space and internal state space. The external state space comprises at least one value of a parameter related to environmental conditions at the microwave transmitter or microwave receiver. The internal state space comprises at least one value of a parameter related to alignment of the microwave transmitter.
[0059] In one embodiment, the external state space comprises at least one measurement of the following parameters: ambient temperature, rate of precipitation, humidity, wind speed, wind direction, and air pressure.
[0060] In one embodiment, the internal state space comprises vertical angle of the microwave transmitter and/or horizontal angle of the microwave transmitter. The internal state space can also comprise vertical angle of the microwave receiver and/or horizontal angle of the microwave receiver.
[0061] In one embodiment, the internal state space comprises transmission power of the microwave transmitter.
[0062] The state space varies over time and each set of state space is associated with a particular time t. Hence, the state space can be expressed as S.sub.t=[S.sub.t.sup.I,S.sub.t.sup.E], where S.sub.t.sup.I represents the internal state space and S.sub.t.sup.E represents the external state space.
[0063] The internal state space is given by S.sub.t.sup.I=[θ.sub.vertical,θ.sub.horizontal,ω], that is the vertical and horizontal angles θ of the transmitter and other antenna parameters ω, like signal strength. Optionally, parameters for the angles of the receiver can also form part of the internal state space.
[0064] The external state space is expressed as S.sub.t.sup.E.
[0065] In one embodiment, the external state space is based also on captured images passed through a neural network comprising a convolutional neural network model. The input to the neural network comprises a captured image (which could be from the transmitter or the receiver end), optionally along with weather parameter values and a current reward value. The first layers of the network are composed of a convolutional neural network (CNN) model to enable processing the images. The final layers of the neural network are composed of fully connected networks to enable choosing the appropriate action based on the image and weather readings.
[0066] The neural network converts the captured images to weather information between the microwave transmitter and the microwave receiver. The use of the neural network for the images can be useful for improving performance of the RL model when it is possible to obtain more information from the images than what the sensor can capture about the weather or environmental conditions between the microwave transmitter and the microwave receiver. One such example is when it is not raining either at the receiver or transmitter end (where the sensors are placed) but it is raining at some point in between the two, and this phenomenon can be identified from captured images.
[0067] In a determine action step 42, the alignment adjuster determines, based on the RL model and the state space, an action in an action space. The action space comprises actions to adjust alignment of the microwave transmitter.
[0068] More formally, the action space A.sub.t can be defined according to the following, where all mentioned parameters are not necessary:
A.sub.t={Δθ.sub.horizontal.sup.T+,Δθ.sub.horizontal.sup.T−,Δθ.sub.horizontal.sup.T0}×{Δθ.sub.vertical.sup.T+Δθ.sub.vertical.sup.T−,Δθ.sub.vertical.sup.T0}×{ΔP.sup.+,ΔP.sup.−,ΔP.sup.0}×{Δθ.sub.horizontal.sup.R+,Δθ.sub.horizontal.sup.R−,Δθ.sub.horizontal.sup.R0}×{Δθ.sub.vertical.sup.R+Δθ.sub.vertical.sup.R−,Δθ.sub.vertical.sup.R0}
[0069] For any given state S.sub.t, we are given at least three possible actions of rotations of the transmitter alignment in the horizontal (azimuth) direction (no change, clockwise, anticlockwise). Analogously, there are actions of rotations in the vertical direction. The same set of actions are possible for the receiving antenna as well. Apart from the rotation of the antenna we have the possibility of increasing, decreasing or making no change in the signal strength of the antenna.
[0070] In an obtain path loss step 44, the alignment adjuster obtains a measurement of path loss for a transmission from the microwave transmitter to the microwave receiver.
[0071] In a determine reward value step 46, the alignment adjuster determines a reward value based on the path loss. An increase in path loss results in a reduced reward value, and vice versa. Additionally, the reward value can be determined based on the transmission power, wherein, all else equal, an increase in transmission power results in a reduced reward value, and vice versa. In other words, the reward function can be expressed as r=f(pathloss, TransmissionPower).
[0072] As explained above, there is a path loss meter which measures the quality of signal received by the receiver, i.e. the path loss. The lower the path loss is, the higher is the reward. Optionally, the reward function is defined such that when the path loss is higher than a threshold (threshold when the signal is no longer useful for the transmitter), then a very high negative reward is given, in order to prevent signal loss. At the same time, all else equal, the higher the transmission power of the signal being used is, the lower is the reward.
[0073] Using such reward function, lower path loss is achieved with low energy expenditure at the transmitter end.
[0074] In an adjust model step 48, the alignment adjuster adjusts the RL model based on the obtained state space, the determined action and the determined reward value. In this way, the RL model adapts over time to improve its performance.
[0075] Looking now to
[0076] In an optional trigger additional parameter collection step 49, the alignment adjuster triggers the collection of additional parameters related to environmental conditions when a sequence of successive reward values indicate a performance less than a threshold performance. This can comprise triggering at least one unmanned aerial vehicle (UAV) to collect environmental data between the microwave transmitter and the microwave receiver which can then be used for determining alignment adjustments. In other words, when performance is substandard, UAVs are used to obtain more data which is then used to improve the performance of the RL model.
[0077] Regardless of whether step 49 is performed or note, the method is repeated, to again adjust alignment in response to changed environmental conditions. This allows the alignment to dynamically adapt to changing precipitation or other changing conditions.
[0078] Embodiments presented herein are useful for any type of communication where environmental conditions affect transmissions, and as such is particularly useful for 5G (fifth generation) technology since the number of microwave transmitters is greatly increased in 5G, leading to increased number of misalignments due to rain or strong wind.
[0079] The embodiments presented herein improve the quality of microwave transmission without any need for any manual operations. This allows dynamic and automatic adjustment to changing conditions, e.g. change in weather or interfering objects entering the Fresnel zone.
[0080]
[0081] The memory 64 can be any combination of random access memory (RAM) and/or read only memory (ROM). The memory 64 also comprises persistent storage, which, for example, can be any single one or combination of magnetic memory, optical memory, solid-state memory or even remotely mounted memory.
[0082] A data memory 66 is also provided for reading and/or storing data during execution of software instructions in the processor 60. The data memory 66 can be any combination of RAM and/or ROM.
[0083] The alignment adjuster to further comprises an I/O interface 62 for communicating with external and/or internal entities. Optionally, the I/O interface 62 also includes a user interface.
[0084] Other components of the alignment adjuster to are omitted in order not to obscure the concepts presented herein.
[0085]
[0086] A state space obtainer 70 corresponds to step 40. An action determiner 72 corresponds to step 42. A path loss determiner 74 corresponds to step 44. A reward value determiner 76 corresponds to step 46. A model adjuster 78 corresponds to step 48. An additional parameter collection determiner 79 corresponds to step 49.
[0087]
[0088] The aspects of the present disclosure have mainly been described above with reference to a few embodiments. However, as is readily appreciated by a person skilled in the art, other embodiments than the ones disclosed above are equally possible within the scope of the invention, as defined by the appended patent claims. Thus, while various aspects and embodiments have been disclosed herein, other aspects and embodiments will be apparent to those skilled in the art. The various aspects and embodiments disclosed herein are for purposes of illustration and are not intended to be limiting, with the true scope and spirit being indicated by the following claims.