Signalling congestion

09742705 · 2017-08-22

Abstract

Congestion is signalled in respect of a network element operable to forward data items in a telecommunications network, and in respect of a processing element operable to process requests for service. In either case, the element is operable to perform its processing function at up to a processing rate which is subject to variation, and has a queue for items awaiting processing having a counter associated therewith which maintains a count from which a queue metric is derivable. A method comprises: updating the count at a rate dependent on the processing rate; further updating the count in response to receipt of items awaiting processing; and signalling a measure of congestion in respect of the element in dependence on the queue metric; then altering the rate at which the count is being updated and adjusting the counter whereby to cause a change in the queue metric if the processing rate has changed.

Claims

1. A method of processing data items received at a network element in a communications network, the network element being operable to forward data items at a rate up to a forwarding rate, the forwarding rate being subject to variation; the network element having a queue for data items awaiting forwarding and having a counter associated therewith, the counter maintaining a count in dependence on which a queue metric is derivable according to a predetermined algorithm; the method comprising: updating the count at a rate dependent on the forwarding rate; further updating the count in response to receipt of data items; and signalling a measure of congestion in respect of the network element in dependence on said queue metric; the method being characterised by further comprising: determining if there has been a variation in the forwarding rate, and in response to a determination that there has been a variation in the forwarding rate: altering the rate at which the count is being updated in accordance with the variation in the forwarding rate; and adjusting the counter in dependence on the variation in the forwarding rate whereby to cause a change in the queue metric.

2. A method according to claim 1 wherein the forwarding rate signifies the number of data items that the network element is operable to forward in a unit of time.

3. A method according to claim 1 wherein the forwarding rate signifies the volume of data that the network element is operable to forward in a unit of time.

4. A method according to claim 1 wherein the step of further updating the count is performed in dependence on the number of data items received and/or the rate at which data items are received.

5. A method according to claim 1 wherein the step of further updating the count is performed in dependence on the volume of data received and/or the rate at which data is received.

6. A method according to claim 1 wherein the queue metric is a measure of the level of the count.

7. A method according to claim 1 wherein the queue metric is an averaged measure dependent on measures of the level of the count over a period of time.

8. A method according to claim 1 wherein the queue metric is a measure of disparity between the count and a counter threshold.

9. A method according to claim 8, wherein adjusting the counter in response to a determination that there has been a variation in the forwarding rate comprises updating the count.

10. A method according to claim 8, wherein adjusting the counter in response to a determination that there has been a variation in the forwarding rate comprises updating the counter threshold.

11. A method according to claim 1 wherein the step of signalling a measure of congestion comprises determining whether or not to perform one or more signalling actions in respect of said data items in dependence on the state of said queue metric and performing or not performing said one or more signalling actions in dependence on said determination.

12. A method according to claim 1 wherein the step of signalling a measure of congestion comprises performing one or more signalling actions in respect of said data items with a probability dependent on said queue metric.

13. A method according to claim 1 wherein the step of signalling a measure of congestion comprises performing one or more signalling actions in respect of said data items to an extent dependent on said queue metric.

14. A method according to claim 1 wherein the step of signalling a measure of congestion comprises performing one or more of the following signalling actions in respect of one or more of said data items in dependence on said queue metric: marking; dropping; truncating; delaying; de-prioritising; re-routing; forwarding to a destination other than an intended destination; issuing an out-of-band congestion notification.

15. A method of processing requests for service received at a processing element, the processing element being operable to process requests for service at a rate up to a processing rate, the processing rate being subject to variation; the processing element having a queue for requests awaiting processing and having a counter associated therewith, the counter maintaining a count in dependence on which a queue metric is derivable according to a predetermined algorithm; the method comprising: updating the count at a rate dependent on the processing rate; further updating the count in response to receipt of requests for service; and signalling a measure of congestion in respect of the processing element in dependence on said queue metric; the method being characterised by further comprising: determining if there has been a variation in the processing rate, and in response to a determination that there has been a variation in the processing rate: altering the rate at which the count is being updated in accordance with the variation in the processing rate; and adjusting the counter in dependence on the variation in the processing rate whereby to cause a change in the queue metric.

16. Apparatus operable to perform processing of data items, the apparatus comprising: a network element in a communications network, the network element including at least a first interface via which data items arrive at the network element and at least a second interface via which data items are forwarded from the network element so as to forward the data items at a rate up to a forwarding rate, the forwarding rate being subject to variation; the network element having a queue for data items awaiting forwarding and having a counter associated therewith, the counter maintaining a count in dependence on which a queue metric is derivable according to a predetermined algorithm; and a processing system, including a processor at least being configured to: update the count at a rate dependent on the forwarding rate; further update the count in response to receipt of data items; and signal a measure of congestion in respect of the network element in dependence on said queue metric; and determine if there has been a variation in the forwarding rate, and in response to a determination that there has been a variation in the forwarding rate: alter the rate at which the count is being updated in accordance with the variation in the forwarding rate; and adjust the counter in dependence on the variation in the forwarding rate whereby to cause a change in the queue metric.

17. An apparatus for processing requests for service, the apparatus comprising: a processing element including at least a first interface via which data items arrive at the processing element and at least a second interface via which data items are forwarded from the processing element, the processing element being operable to process requests for service at a rate up to a processing rate, the processing rate being subject to variation; the processing element having a queue for requests awaiting processing and having a counter associated therewith, the counter maintaining a count in dependence on which a queue metric is derivable according to a predetermined algorithm; and a processing system, including a processor at least being configured to: update the count at a rate dependent on the processing rate; further update the count in response to receipt of requests for service; and signal a measure of congestion in respect of the processing element in dependence on said queue metric; and determine if there has been a variation in the processing rate, and in response to a determination that there has been a variation in the processing rate: alter the rate at which the count is being updated in accordance with the variation in the processing rate; and adjust the counter in dependence on the variation in the processing rate whereby to cause a change in the queue metric.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

(1) A preferred embodiment of the present invention will now be described with reference to the appended drawings, in which:

(2) FIG. 1 shows an overview of a network element, such as a router or switch;

(3) FIG. 2 shows a model illustrating a Virtual Queue Marking (VQM) process;

(4) FIG. 3 illustrates the normal operation of a Virtual Queue;

(5) FIG. 4 illustrates an initialisation stage in the operation of a Virtual Queue; and

(6) FIG. 5 illustrates a modification to the normal operation of a Virtual Queue according to a preferred embodiment.

DESCRIPTION OF PREFERRED EMBODIMENTS OF THE INVENTION

(7) With reference to the accompanying figures, methods and apparatus according to preferred embodiments for processing data items received at a network element in a communications network, and for signalling a measure of congestion in respect of the network element, will be described particularly with reference to FIG. 5. Firstly, however, a technique using what may be thought of as the “normal” operation of a Virtual Queue technique will be described with reference to FIGS. 3 and 4.

(8) Referring to FIG. 3, a network element 10 such as that shown in FIG. 1 remains in a “Wait” state 30 until a packet arrives in the real queue of the network element 10 (or of an individual interface 14 thereof). It then proceeds to a “Calculation” step 32, in which the time “T” since the arrival of the previous packet is calculated. From this, the new level of the virtual queue “new_VQ_level” is obtained from the previous level of the virtual queue “previous_VQ_level” by adding an amount corresponding to the size of the newly-received packet “pkt_size” and subtracting an amount corresponding to the product of the virtual queue's drain-rate and the time “T” since the previous arrival, i.e. “VQ_rate*T”. Thus:
new_VQ_level = previous_VQ_level + pkt_size − (VQ_rate*T)
(NB It will be appreciated that the updating of the virtual queue level based on the drain-rate may be done continually, or on an ongoing basis, rather than simply each time a new packet arrives—the effect can be the same.)

(9) The process then proceeds to a “Comparison” step 34, in which the (new) level of the virtual queue is compared with the virtual queue's threshold (i.e. a predetermined length of the virtual queue at which packets are to be dropped or marked, or at which a particular sanction or other such action is to be taken). If the (new) level of the virtual queue is found to be greater than the virtual queue's threshold (i.e. if VQ_level>VQ_threshold), the process continues to step 36 (discussed below), otherwise it returns to the “Wait” state 30, and remains there until another packet arrives in the real queue.

(10) In step 36, an instruction is sent to the real queue to drop or mark the packet (or another packet—it need not be the specific packet that has just arrived), or otherwise take the predetermined type of action to signal imminent congestion, or the onset thereof. In addition to this, the level of the virtual queue may be reduced by an amount corresponding to the size of the packet the processing of which has resulted in the level passing the threshold. (This may be done in order to clear the virtual queue sufficiently to allow a smaller packet to be received subsequently without a further “overflow” indication being sent, but in some embodiments, this reduction need not be done.) The process then returns to the “Wait” state 30, and remains there until another packet arrives in the real queue.
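The FIG. 3 loop described in paragraphs (8) to (10) can be sketched as follows (a minimal illustration in Python; the names VQ_rate, VQ_level and VQ_threshold follow the text, while the lazy per-arrival drain and the clamping at zero are implementation assumptions):

```python
class VirtualQueue:
    """Minimal sketch of the FIG. 3 "normal operation" loop.

    The virtual queue is just a counter: it grows by the size of each
    arriving packet and drains at VQ_rate (bytes per second), with the
    drain applied lazily at each arrival based on the elapsed time T.
    """

    def __init__(self, vq_rate, vq_threshold):
        self.vq_rate = vq_rate            # drain-rate, bytes/s (theta * line-rate)
        self.vq_threshold = vq_threshold  # level at which packets are marked/dropped
        self.vq_level = 0.0               # current count ("zero" level initially)
        self.last_arrival = None          # time of the previous packet arrival

    def on_packet(self, pkt_size, now):
        """Update the count for one arrival; return True if the packet
        (or another packet) should be marked/dropped."""
        t = 0.0 if self.last_arrival is None else now - self.last_arrival
        self.last_arrival = now
        # new_VQ_level = previous_VQ_level + pkt_size - (VQ_rate * T)
        self.vq_level = max(0.0, self.vq_level + pkt_size - self.vq_rate * t)
        if self.vq_level > self.vq_threshold:
            # optionally clear this packet's contribution, so a smaller
            # packet can follow without a further "overflow" indication
            self.vq_level -= pkt_size
            return True   # step 36: signal congestion (mark/drop/etc.)
        return False      # back to the "Wait" state
```

With VQ_rate = 1000 bytes/s and 500-byte packets arriving every 0.1 s, the count grows by a net 400 bytes per arrival and would cross a 1500-byte threshold on the fourth packet.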

(11) As will be appreciated, a process such as that described above with reference to FIG. 3 may continue indefinitely, but for the sake of completeness, it should be noted that there are various ways in which it could be initiated, either for the first time in the case of a particular network element or interface, or in the event of a re-set.

(12) Referring to FIG. 4, a possible initialisation stage in the normal operation of a Virtual Queue will therefore be described.

(13) In step 40, a value for the real/actual line-rate is obtained. The real line-rate may be discovered, e.g. by the network element performing a measurement of the maximum rate that is possible in current conditions, or it may be imposed (due to a contract, or otherwise), possibly on the instruction of a separate control entity.

(14) In step 42, various values may be set, including the following, for example: The virtual queue's drain-rate “VQ_rate” may be set as a suitably chosen multiple “θ” (generally slightly less than 1, as explained earlier) of the real line-rate. The virtual queue's level “VQ_level” may be set at an initial level, such as its “zero” level (which may be appropriate prior to any packets arriving, let alone causing any congestion). The virtual queue's threshold “VQ_threshold” (i.e. for dropping, marking, or otherwise taking action) may be set at a predetermined level or otherwise. The virtual queue's maximum level “VQ_maximum_level” may be set as a suitably chosen multiple (possibly θ) of the maximum buffer size of the real queue, or otherwise.

(15) The operation of the virtual queue may then proceed (via step 44) to the ‘Wait’ state 30 of its ongoing process.
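The initialisation of steps 40 to 44 might be sketched as below (θ = 0.98 and a threshold at half the real buffer size are purely illustrative values; the text only requires θ to be slightly less than 1 and the threshold to be predetermined):

```python
def init_virtual_queue(line_rate, real_buffer_size, theta=0.98):
    """Sketch of FIG. 4's initialisation (steps 40-44).

    line_rate: the real/actual line-rate (e.g. bytes per second),
    measured or imposed as described in step 40.
    theta: a multiple slightly below 1, so the virtual queue drains a
    little slower than the real line and hence fills slightly earlier.
    """
    return {
        "VQ_rate": theta * line_rate,               # virtual drain-rate
        "VQ_level": 0,                              # start at the "zero" level
        "VQ_threshold": 0.5 * real_buffer_size,     # illustrative marking threshold
        "VQ_maximum_level": theta * real_buffer_size,
    }
```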

(16) Referring now to FIG. 5, a preferred embodiment, in which further adjustment is made while performing a virtual-queue-based process such as that explained with reference to FIG. 3, will now be explained. By virtue of this, congestion signalling (by marking, dropping or otherwise) can more accurately reflect the near-term danger of packets (or other data units) being dropped by or from the real queue, or otherwise failing to be forwarded as intended.

(17) During normal operation (and while the real line-rate is static or stable), the network element may operate in its normal fashion (i.e. according to the process of FIG. 3, having been initialised according to the process of FIG. 4, for example). This normal operation during periods when the actual line-rate is static or stable is symbolised by step 50 in FIG. 5. The length of the virtual queue is therefore increased when a packet arrives, by the size of the packet in bytes (in the presently-described embodiment). In addition, its length is decreased at the rate θX, typically with the rate-based decreases also being imposed at packet arrival, calculated according to the time since the previous packet arrival (although as explained earlier, the rate-based decreasing of the virtual queue length may be done on a continuous or ongoing basis). The packet (or one or more other packets) is marked if the length of the virtual queue is greater than a threshold value, and otherwise not. This may be done in a manner such as or similar to the operation of the threshold-meter described in RFC 5670 (that description is written in terms of a “token bucket”, but this is functionally equivalent to a description in terms of a virtual queue, essentially described the other way up; see Appendix A1 of RFC 5670 for an example algorithm that implements the defined behaviour).

(18) In the event that there is a variation in the actual line-rate, however (e.g. any step-change, a change above a predetermined amount, or a change in the actual line-rate that is occurring at a higher rate than a predetermined rate of change, for example), the process of FIG. 5 is triggered, and the network element proceeds from step 50 to step 52, as will be discussed later.

(19) Firstly, it should be appreciated that there are various ways in which the network element may determine that there has been a variation in the actual line-rate. Typically this may be via the Application Programming Interface (API) of the network card. The API could regularly poll the network card, making regular polling Requests expressible in SNMP terms in the form: “What is the current line-rate?”, and receiving Responses of the form: “Line-rate is <value>”. Alternatively, the API could be interrupt-driven (e.g. by setting an SNMP “trap”), i.e. submitting a Request expressible in the form: “Inform me if the line-rate changes, otherwise keep quiet”, and occasionally receiving Responses of the form: “Line-rate has just changed and is now <value>”. The polled mechanism may be preferable where the line-rate changes frequently, whilst the interrupt method may be preferable where the line-rate only changes occasionally.
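As an illustration, the two discovery styles might look as follows in code (InterruptStyleCard, get_line_rate and on_rate_change are hypothetical stand-ins for whatever the network card's API actually offers, not real interfaces):

```python
def poll_line_rate(card, current_rate, on_change):
    """Polled style: ask "what is the current line-rate?" and compare
    with the value seen last time round."""
    new_rate = card.get_line_rate()
    if new_rate != current_rate:
        on_change(new_rate)   # trigger the FIG. 5 adjustment
    return new_rate


class InterruptStyleCard:
    """Interrupt style: register a callback once and be told only when
    the rate changes (analogous to setting an SNMP "trap")."""

    def __init__(self, rate):
        self.rate = rate
        self._callback = None

    def on_rate_change(self, callback):
        self._callback = callback

    def set_rate(self, new_rate):       # called by the driver/hardware
        changed = new_rate != self.rate
        self.rate = new_rate
        if changed and self._callback:
            self._callback(new_rate)
```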

(20) If it is determined (by either of the techniques set out above, or otherwise) that there has been a variation in the actual line-rate (i.e. the rate up to which the network element is able or operable to forward data items), e.g. from X bps to X_new bps, two changes are made in respect of the virtual-queue-based process, the order of which is not generally important; they may be made in either order or simultaneously.

(21) (i) The virtual queue drain-rate is updated to be θX_new.

(22) (ii) In the presently-described embodiment, the count representing the length of the virtual queue is adjusted (although as will be explained later, according to other embodiments, other parameters of or associated with the virtual queue may be adjusted instead to cause the same, a similar, or a corresponding effect).

(23) In the presently-described embodiment, the adjustment to the count representing the length of the virtual queue is done as follows: If the new actual line-rate is greater than the previous actual line-rate, the virtual queue length is reduced (in order to reflect that there is a lower danger that the real queue will start to grow). A preferred option here, on account of being simple to implement, is to set the virtual queue length count to some predetermined value that is significantly below the marking threshold, such as its “zero” level, so decreasing the probability of marking (or dropping, etc.) the next few packets. If, on the other hand, the new actual line-rate is less than the previous actual line-rate, the virtual queue length is increased (in order to reflect that there is a greater danger that the real queue will start to grow). A preferred option here, again on account of being simple to implement, is to set the virtual queue length count equal to some predetermined value that is significantly larger than the marking threshold, such as its “maximum” level, so increasing the probability of marking (or dropping, etc.) the next few packets.
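Steps (i) and (ii) can be sketched together as below, using the simple “zero or maximum” option for the count adjustment (the dict keys follow the names used in the text; θ is the same multiple as at initialisation):

```python
def on_line_rate_change(vq, new_line_rate, theta=0.98):
    """FIG. 5 adjustment: apply steps (i) and (ii) after a determined
    change in the actual line-rate.

    vq: dict holding the virtual-queue state (as set up at initialisation).
    """
    old_drain = vq["VQ_rate"]
    new_drain = theta * new_line_rate
    # (i) update the drain-rate to theta * X_new
    vq["VQ_rate"] = new_drain
    # (ii) adjust the count: the simple option preferred in the text
    if new_drain > old_drain:
        vq["VQ_level"] = 0                        # rate up: lower danger, clear the queue
    elif new_drain < old_drain:
        vq["VQ_level"] = vq["VQ_maximum_level"]   # rate down: higher danger, fill it
    return vq
```

Clearing the count decreases the probability of marking the next few packets; filling it increases that probability, exactly as paragraph (23) describes.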

(24) The virtual queue then returns to operating according to its “Normal Operation” process (such as that illustrated in FIG. 3), and may continue operating in its normal fashion until it is determined that there has been another change in the actual line-rate.

(25) As indicated by the above, while the presently-described embodiment responds to a determination that there has been a change in the actual line-rate by adjusting the count representing the length of the virtual queue (in order to cause—at least in the short-term immediately after the determination of a change in the actual line-rate—a change in the likelihood or amount of marking, dropping etc.), it will be understood that with other embodiments, other parameters of or associated with the virtual queue may be adjusted instead to cause the same, a similar, or a corresponding effect, i.e. an at least short-term change in the likelihood or amount of marking, dropping etc. For example, instead of adjusting the count representing the length of the virtual queue, a corresponding (i.e. mirror-image) adjustment may be made to the marking threshold, whereby to bring the threshold closer to the current level if a decrease in the actual line-rate has been identified (which would generally increase the chance that it will be appropriate to mark packets), and to move the threshold further from the current level if an increase in the actual line-rate has been identified (which would generally decrease the chance that it will be appropriate to mark packets). Such options are less easy to implement, as they may require a facility for storing an extra variable (a dynamic threshold) and a process for returning this to a predetermined threshold over the course of time.

(26) This and some other possible implementation options will be discussed in brief below.

(27) Some Implementation Options

(28) (a) Some Options for how to Update the Counter:

(29) In situations where it is determined that the actual line-rate has increased, generally causing the danger of congestion to become (at least temporarily) lower, in preferred embodiments the virtual queue level should generally be reduced to reflect this. Some possibilities for achieving this include the following:

Clearing the virtual queue (i.e. setting the counter level to 0). (As explained earlier, this option may be used on the grounds that it is relatively simple to implement.)

Calculating how much shorter the virtual queue would have been if it had emptied at the new rate for a round trip time (RTT), then adjusting the level of the virtual queue from its previous level VQ_old to a new level VQ_new as follows:
VQ_new = VQ_old − (change of VQ rate * RTT) [min VQ_new = 0]
(The square bracket signifies that this adjustment may be made under the condition that VQ_new cannot be less than 0, or some other pre-decided value.) The reasoning for this is that it generally takes approximately one RTT for sources to react to marks. The RTT may be measured directly (noting that this may be non-trivial, as the RTT is generally different for each source-destination pair), estimated (e.g. if the network topology is such that all paths have, or may be regarded as having, similar RTTs) or assumed (e.g. a typical value for the RTT may be assumed in some broadband scenarios; the ‘typical value’ could even be dependent on the technology: in the provision of broadband Internet access using Digital Subscriber Line (DSL) technology, for example, interleaving is typically used more than for cable, so the RTT will be higher).

Setting the length of the virtual queue to be equal to the length of the real queue (which will generally be less than the length of the virtual queue).

Deflating the length of the virtual queue by some arbitrary multiplier, e.g.:
VQ_new = VQ_old * (old VQ rate / new VQ rate)

Variants of the above, such as: if new VQ rate > (1/θ) * old VQ rate then VQ_new = 0, otherwise VQ_new = VQ_old (i.e. clear the virtual queue if the rate has increased significantly, otherwise do nothing).

Combinations of the above, e.g. calculating the value according to the RTT method above, but with a minimum value equal to the length of the real queue.

(30) Correspondingly, in situations where it is determined that the actual line-rate has decreased, generally causing the danger of congestion to become (at least temporarily) higher, in preferred embodiments the virtual queue level should generally be increased to reflect this. Some possibilities for achieving this include the following:

Filling the virtual queue (i.e. setting the counter level to its maximum value). (This option may be used on the grounds that it is relatively simple to implement.)

Calculating how much longer the virtual queue would have been if it had emptied at the new rate for a round trip time, then adjusting the level of the virtual queue from its previous level VQ_old to a new level VQ_new as follows:
VQ_new = VQ_old + (change of VQ rate * RTT) [max VQ_new = max value]

Inflating the virtual queue by some arbitrary multiplier, e.g.:
VQ_new = VQ_old * (old VQ rate / new VQ rate)

Variants of the above, such as: if new VQ rate < θ * old VQ rate then VQ_new = max, otherwise VQ_new = VQ_old.

Combinations of the above.
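The RTT-based option described in paragraphs (29) and (30) can be expressed as a single helper covering both directions (the clamping to [0, max] corresponds to the square-bracketed conditions in the text; the RTT value itself would be measured, estimated or assumed as discussed above):

```python
def adjust_level_for_rate_change(vq_level, old_vq_rate, new_vq_rate,
                                 rtt, vq_max):
    """RTT-based counter adjustment from paragraphs (29)/(30):
    VQ_new = VQ_old -/+ (change of VQ rate * RTT), clamped to [0, vq_max].

    The idea: it takes roughly one RTT for sources to react to marks,
    so the level is shifted by how much shorter (or longer) the virtual
    queue would have been had it drained at the new rate for one RTT.
    """
    delta = (new_vq_rate - old_vq_rate) * rtt
    # rate increase -> delta > 0 -> level reduced;
    # rate decrease -> delta < 0 -> level increased
    new_level = vq_level - delta
    return min(max(new_level, 0.0), vq_max)
```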

(31) As referred to briefly earlier, an alternative to adjusting the virtual queue's length (or counter level) is to adjust the virtual queue's threshold, “VQ_threshold” (i.e. the length of the virtual queue at which packets are dropped/marked/etc.), i.e.: If the real line-rate has decreased, then the threshold may be lowered (generally to some minimum value greater than its zero level). If the real line-rate has increased, then the threshold may be increased (generally to some value less than its maximum value, VQ_maximum_level).

(32) The change in threshold can be calculated in a manner corresponding to any of the possibilities mentioned above. This may be suitable because the behaviour of a communications system with a virtual queue is generally not very sensitive to such threshold values [as indicated with reference to the Zhang & Charny paper discussed earlier]. It may be less preferable, however, because if the threshold is lowered too far, the virtual queue may trigger marking/dropping/etc. too often, and if it is increased too far, the virtual queue may not trigger marking/dropping/etc. early enough, i.e. the real queue will fill significantly or even overflow before marking/dropping triggers the source to slow down enough.

(33) (b) Some Options for when to Update the Counter:

(34) In “normal operation”, the virtual queue length generally increases as packets arrive and decreases as packets depart—FIG. 3 shows the subcase where packets are only marked/dropped/etc. as they arrive, so it is essentially equivalent to calculating the virtual queue length only on packet arrival.

(35) As explained earlier, according to preferred embodiments, the virtual queue length is adjusted in response to a determined change in the actual line-rate. This re-setting could be done as soon as it is determined that the line-rate has changed. An alternative may be to do the re-setting at the same time as the next time that a packet could be marked/dropped/etc. For example, in the subcase where packets are only marked/dropped/etc. as they arrive, this is equivalent to re-setting the virtual queue length only on packet arrival (when the virtual queue length would be re-set according to the combined effect of the calculations in FIG. 3 and FIG. 5, i.e. “normal operation” with the adjustment contributed by the modified technique).

(36) (c) Alternatives for how and when Data Items May be Marked/Dropped/Etc.:

(37) In the preferred embodiments described above, packets are generally marked or dropped (or other types of action may be taken) on a packet-by-packet basis in dependence on whether the length of the virtual queue is above or below a threshold (or according to the state of the applicable queue metric). Such embodiments may be thought of as “deterministic”, in the sense that the state of the applicable queue metric at a particular time determines whether the applicable action will be taken in respect of a particular packet.

(38) In alternative embodiments, which may be thought of as “probabilistic” embodiments, there need not be a threshold determining whether or not marking, dropping or another action is taken in respect of a particular packet. Instead, action (marking/dropping/etc.) may be taken on a probabilistic basis, with the probability of action being taken in respect of each packet being dependent on the state of the applicable (virtual) queue metric at a particular time. In general, with such embodiments, the state of the (virtual) queue metric during a particular time period will result (on average) in a higher or lower percentage of the packets received during that period being marked/dropped/etc., and this percentage may therefore signal the desired measure of congestion (in a manner analogous to RED, discussed earlier, for example, but in a manner that can reflect the likelihood of congestion more quickly and/or more accurately than is possible with RED and other prior techniques).
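One way such a probabilistic scheme might be sketched is below, with a marking probability that ramps linearly between two levels of the virtual queue (the linear ramp, reminiscent of RED's drop curve, is an illustrative choice; the text does not prescribe a particular probability function):

```python
import random

def mark_probabilistically(vq_level, low, high, rng=random.random):
    """Mark a packet with a probability that rises linearly from 0
    (at vq_level <= low) to 1 (at vq_level >= high), so the average
    marking percentage over a period signals the measure of congestion.
    """
    if vq_level <= low:
        p = 0.0
    elif vq_level >= high:
        p = 1.0
    else:
        p = (vq_level - low) / (high - low)
    return rng() < p
```

Passing a deterministic `rng` makes the behaviour testable; in operation the default `random.random` gives the on-average marking percentage described in paragraph (38).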

(39) Some Possible Causes of Changes to the Actual Line-Rate

(40) The real line-rate might change for a variety of reasons, including, for example: A wireless interface may adapt its rate because wireless conditions have changed. The DSL rate may be adapted, using a Digital Line Management (DLM) technique, for example. The DSL rate may be unchanged, but multicast may start or stop. On an optical link, another wavelength may be used, or its use may be stopped. A bonded link may add another bond. More generally, a virtual link may add another underlying path. A customer's upstream rate in a broadband network (DSL or cable) may be limited by a ‘fair usage’ policy (which prevents one user using too much of an amount of shared bandwidth, for example), where the cap depends on whether the consumer's recent traffic is above or below their policy limit, for example.

(41) In the case of DSL (where a Broadband Remote Access Server (BRAS or B-RAS) routes traffic to and from broadband remote access devices such as Digital Subscriber Line Access Multiplexers (DSLAMs) on an Internet service provider's (ISP's) network), on the BRAS side the DSL rate can change frequently with DLM, and even more often in the case of a network where multicast alters the speed of the line when a multicast session is active downstream of the BRAS. With multicast, the speed of a Virtual Circuit (VC) at the BRAS may have to be altered to:
NEW_VC_Speed=Original_VC_Speed−Multicast Rate.

(42) This may be required to avoid packet loss of prioritised traffic on the BRAS as a Multi-Service Access Node (MSAN)—which is a device typically installed in a telephone exchange or in a cabinet, and which connects customers' telephone lines to the core network to provide telephone, ISDN, and broadband such as DSL all from a single platform—is generally not QoS aware.

(43) In alternative implementations, the network element (or a module thereof responsible for determining that there has been a change in the actual line-rate) may be informed of (or may otherwise determine) the absolute value of the new or current line-rate, or may be told of (or may otherwise determine) changes in the line-rate.

(44) Virtual queues are normally considered in the context of a node that forwards packets. However they can also be used for nodes that perform general packet processing or other types of processing of “requests for service”, for example communications functions such as deep packet inspection, firewalling and other middle-box functions, or a generic server blade such as used in Software-Defined Networking (SDN).

(45) In such cases, the method may apply in the same or a similar way, where ‘real (or virtual) line-rate’ is replaced (respectively) by ‘real (or virtual) rate of processing packets’.

(46) So there could be a step change in the rate at which the node can process packets. For example, a node with a server blade could switch off one of its blade's cores (perhaps to save energy) or activate another core, or the node could realise that it was having to do more processing per packet (and therefore the ‘real rate of processing packets’ was lower). Reasons could be that the node is a firewall protecting an enterprise that is under a sudden attack, so more rigorous screening of each packet is needed, or because more complex header compression or DPI operations are suddenly needed.

(47) In all these cases a node probably has a control and/or management system that controls and/or monitors the processing function, and which will know that more processing per packet is needed, or that a new core has been activated. The virtual queue can (as before) explicitly request, or be informed of, or itself know, the new ‘real rate of processing packets’.