Cognitive power management for communication systems
09785220 · 2017-10-10
Assignee
Inventors
- Ahmed M. Eltawil (Irvine, CA, US)
- Fadi J. Kurdahi (Irvine, CA, US)
- Muhammad Abdelghaffar (Irvine, CA, US)
- Amr M. A. Hussein (Irvine, CA, US)
- Amin Khajeh (Hillsboro, OR, US)
US classification
- 1/1
Cpc classification
Y02D10/00
GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
International classification
Abstract
A power management technique utilizing a method for accurately and rapidly estimating the change in the statistical distribution of data at each block in a communication system leading to or originating from a memory that is experiencing voltage scaling induced errors is disclosed. An appropriate memory supply voltage that maximizes power savings is found by exploiting the available SNR slack while keeping system performance within a required margin.
Claims
1. A method to achieve a target bit error rate for a communications system, comprising: estimating channel parameters of a channel used by the communications system, wherein the channel parameters include a channel power and a channel noise variance; calculating an average received signal to noise ratio for the communications system based on the channel parameters; obtaining a target signal to noise ratio for the communications system based on the target bit error rate; iterating through a plurality of supply voltages for a memory device of the communications system and a corresponding plurality of effective signal to noise ratios for the communications system, wherein each of the effective signal to noise ratios is characterized based on a memory error rate of the memory device at one of the plurality of supply voltages and the average received signal to noise ratio; and selecting one of the plurality of supply voltages to supply power to the memory device based on the plurality of effective signal to noise ratios and a difference between the average received signal to noise ratio and the target signal to noise ratio, the one of the plurality of supply voltages being selected for power savings while achieving the target bit error rate for the communications system.
2. The method of claim 1, further comprising determining the plurality of effective signal to noise ratios by: reading a bit error rate of the memory device for each of the plurality of supply voltages; based on a derived probability mass functions distribution, calculating an equivalent noise model that achieves a same bit error rate under each of the plurality of supply voltages; and tabulating an effective signal to noise ratio value for each equivalent noise model.
3. The method of claim 1, wherein: the communications system comprises a wireless communications system; and the memory device is a buffering memory in the wireless communication system.
4. The method of claim 3, wherein the buffering memory is in a receiver front end of the wireless communications system.
5. The method of claim 3, wherein the buffering memory is used for storing one of data before a Fast Fourier Transform (FFT) block, data after the FFT block, data before channel estimation, data after channel estimation, data before an interleaver block, data after an interleaver block, data before equalization, or data after equalization.
6. The method of claim 3, wherein the communication system is an orthogonal frequency division multiplexing (OFDM) based communication system.
7. The method of claim 6, wherein the channel used by the communication system comprises additive white Gaussian noise (AWGN).
8. The method of claim 1, wherein the one of the plurality of supply voltages is selected dynamically over time.
9. The method of claim 1, wherein the plurality of supply voltages for the memory device of the communications system and the corresponding plurality of effective signal to noise ratios are stored in a look up table for reference by the communications system.
10. The method of claim 1, further comprising calculating an available signal to noise ratio slack as the difference between the average received signal to noise ratio and the target signal to noise ratio.
11. A method to achieve a target bit error rate for a communication system, comprising: estimating at least one channel parameter of a channel used by the communications system; calculating an average received signal to noise ratio for the communications system based on the at least one channel parameter; calculating an available signal to noise ratio slack for the communications system based on a difference between the average received signal to noise ratio and a target signal to noise ratio; iterating through a plurality of supply voltages for a memory device of the communications system and a corresponding plurality of effective signal to noise ratios for the communications system, wherein each of the effective signal to noise ratios is characterized based on a memory error rate of the memory device at one of the plurality of supply voltages and the average received signal to noise ratio; and selecting one of the plurality of supply voltages to supply power to the memory device based on the plurality of effective signal to noise ratios and a difference between the average received signal to noise ratio and a target signal to noise ratio, the one of the plurality of supply voltages being selected for power savings while achieving the target bit error rate for the communications system.
12. The method of claim 11, further comprising determining the plurality of effective signal to noise ratios by: reading a bit error rate of the memory device for each of the plurality of supply voltages; based on a derived probability mass functions distribution, calculating an equivalent noise model that achieves a same bit error rate under each of the plurality of supply voltages; and tabulating an effective signal to noise ratio value for each equivalent noise model.
13. The method of claim 11, wherein the one of the plurality of supply voltages is selected dynamically over time.
14. A method of dynamically selecting a supply voltage for a memory device of a communications system to achieve a target bit error rate (BER), comprising: calculating a signal to noise ratio for communications over a channel used by the communications system; calculating an available signal to noise ratio slack for the communications system based on a difference between the signal to noise ratio and a target signal to noise ratio; iterating through a plurality of supply voltages for a memory device of the communications system and a corresponding plurality of effective signal to noise ratios for the communications system, wherein each of the effective signal to noise ratios is characterized based on a memory error rate of the memory device at one of the plurality of supply voltages and the signal to noise ratio; and selecting one of the plurality of supply voltages to supply power to the memory device based on the plurality of effective signal to noise ratios and the available signal to noise ratio slack, the one of the plurality of supply voltages being selected for power savings while achieving the target bit error rate for the communications system.
15. The method of claim 14, further comprising determining the plurality of effective signal to noise ratios by: reading a bit error rate of the memory device for each of the plurality of supply voltages; based on a derived probability mass functions distribution, calculating an equivalent noise model that achieves a same bit error rate under each of the plurality of supply voltages; and tabulating an effective signal to noise ratio value for each equivalent noise model.
16. The method of claim 14, wherein: the communications system comprises a wireless communications system; and the memory device is a buffering memory in the wireless communication system.
17. The method of claim 16, wherein the buffering memory is in a receiver front end of the wireless communications system preceding analog to digital conversion.
18. The method of claim 16, wherein the buffering memory is used for storing one of data before a Fast Fourier Transform (FFT) block, data after the FFT block, data before channel estimation, data after channel estimation, data before an interleaver block, data after an interleaver block, data before equalization, or data after equalization.
19. The method of claim 14, wherein the communication system is an orthogonal frequency division multiplexing (OFDM) based communication system.
20. The method of claim 19, wherein the channel used by the communication system comprises additive white Gaussian noise (AWGN).
21. The method of claim 14, wherein the one of the plurality of supply voltages is selected dynamically over time.
Description
BRIEF DESCRIPTION
(1) The accompanying drawings, which are included as part of the present specification, illustrate the presently preferred embodiment and, together with the general description given above and the detailed description of the preferred embodiment given below, serve to explain and teach the principles of the present invention.
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
(22)
(23)
(24)
(25)
(26) It should be noted that the figures are not necessarily drawn to scale and that elements of similar structures or functions are generally represented by like reference numerals for illustrative purposes throughout the figures. It also should be noted that the figures are only intended to facilitate the description of the various embodiments described herein. The figures do not necessarily describe every aspect of the teachings disclosed herein and do not limit the scope of the claims.
DETAILED DESCRIPTION
(27) The embodiments provided herein are directed to systems and methods utilizing a power management technique involving a method for accurately and rapidly estimating the change in the statistical distribution of data at each block in a communication system leading to or originating from a memory that is experiencing voltage scaling induced errors is disclosed. An appropriate memory supply voltage that maximizes power savings is determined by exploiting an available SNR slack while keeping the system performance within the required margin.
(28) Embodiments disclosed herein are directed to an efficient power management methodology based on a joint statistical modeling for both channel and hardware dynamics, which is referred to herein, in a non-limiting manner, as an equivalent noise technique. The methodology disclosed herein enables engineers and system designers to apply different power management parameters on embedded memories and easily trade-off the degradation in the system performance with the obtained gain in power savings. The present method is illustrated herein by considering the operation of a typical OFDM-based communication system. However, the same method can be applied to any other wireless communication system.
(29) Embodiments disclosed herein address the challenge of accurately and rapidly estimating the change in the statistical distribution of data at each block in the communication system leading to or originating from a memory that is experiencing voltage scaling induced errors. By replacing the traditional noise model in communication systems with an equivalent noise model presented herein, one can investigate different power management parameters, where the faulty hardware can be treated as error-free hardware.
(30) In one embodiment, in a system or device comprising a processor and a memory device operating at a supply voltage, the supply voltage dynamically selected from a plurality of supply voltages to achieve a target bit error rate (BER), the supply voltage is dynamically selected by estimating channel parameters, calculating an average of received signal to noise ratio (SNRrec), based on the target bit error rate (BER), obtaining a target SNRreq, calculating available slack in the SNR, reading an equivalent effective SNReff from a look up table (LUT), and selecting the supply voltage from the plurality of supply voltages, the supply voltage enabling desired power savings.
(31) Turning now to the figures,
(32) As noted above, the present disclosure is directed to providing a rapid means of identifying and propagating the impact of embedded memory failures through the communication system, thus allowing the designer to opportunistically increase the noise contribution of the hardware channel based on the observed statistics of the actual communication channel to meet a certain metric of quality, such as target BER for communication devices.
(33)
(34)
(35) Generally speaking, the performance of SRAM circuits under supply scaling and process variation is well understood. Prior analyses confirm that the access time follows a Gaussian distribution that can be related to the applied supply voltage and the underlying variations in threshold voltage. The explanation that follows begins by discussing a mathematical model for memory errors, followed by the propagation of the distribution through communication building blocks such as filters and FFT units, a zero forcing receiver (as an example), culminating with an entire receiver.
(36)
(37) TABLE-US-00001 TABLE I Notations and Parameters Description Notation Description V.sub.1, V.sub.2 Supply voltage for memory 1 and memory 2 N Number of bits per memory word N.sub.FFT FFT size f.sub.γ (γ) Data distribution after first buffering memory φ.sub.v (iω) Characteristics function of filtered data f.sub.R|s (r|s) Data distribution after equalization
(38)
(39)
(40) Where P(k) is the probability of having k bit flips simultaneously and can be calculated using:
P(k)=P.sub.e.sup.k(1−P.sub.e).sup.N−k (2)
f.sub.Y.sup.k(y) is the distribution of data when k bit flips occur at one word, where, k (number of bit flips) can be a number from 0 up to N.
(41)
(42)
(43) In general, as shown in
(44)
(45) In most modern communication systems orthogonal frequency division multiplexing is extensively used to combat channel fading, where the FFT is an integral block. Therefore, it is important to quantify how FFT affects the statistics of a complex sequence of data V for which the real and imaginary parts are independent with the same distribution. This may be the case when demodulating the received OFDM symbols, where the subcarriers have a certain distribution due to the effect on channel noise, memory errors and interference. The real and the imaginary parts of the N-point FFT are given by:
(46)
Similarly,
(47)
Both the real and imaginary parts of the output can be expressed as:
Z.sub.i(n)=Σ.sub.k=0.sup.N.sup.
Z.sub.r(n)=Σ.sub.k=0.sup.N.sup.
(48) Since the variables S.sub.n(k), C.sub.n(k), P.sub.n(k) and Q.sub.n(k) and can be considered as random variables, the real and imaginary parts of the output of the FFT (Z.sub.r and Z.sub.i) can be derived as a sum of a large number of random variables, which by the central limit theory approaches an asymptotic Gaussian distribution. This means that the distribution of the data after the FFT can be approximated as a Gaussian distribution (with sufficiently large N.sub.FFT).
(49) Prior work has validated the Gaussian distribution of the data after the FFT and derived an expression of the mean and the variance of the real and imaginary parts as follow:
(50)
(51) If the distribution of the input data V has a zero mean (μ.sub.v=0) which is the typical case for any wireless channel noise, then:
μ.sub.Z.sub.
(52) Hence, one can express the distribution of the data after the FFT in the system as a normal distribution N (0, N.sub.FFTσ.sub.v.sup.2). In which the variance of the data after the filtering can be obtained using the distribution in (4) as follows:
σ.sub.v.sup.2=E{(v−μ.sub.v).sup.2}=E{v.sup.2}=Σ.sub.vv.sup.2×f.sub.v(v) (12)
(53) For an OFDM system with a Rayleigh fading channel, the FFT stage converts the data distribution into Gaussian as discussed previously. Hence, the received signal for subcarrier k could be expressed as:
z.sub.k=h.sub.k8k+ñ.sub.k (13)
where ñ.sub.k is a complex Gaussian noise of zero mean and variance a which can be calculated by using (10) and average channel power
(54) The distribution of the equalized signal r.sub.k is determined as follows. Without loss of generality and for mathematical tractability, assume a least squares equalization where one can express the equalized signal as
(55)
(56) The PMF of the equalized signal can be obtained using given probability concept as described in the following equation
(57)
(58) By using integration tables, the distribution of the equalized data can be given by:
(59)
(60) Then, by storing the equalized data r into another faulty buffering memory (memory 2) as shown in
(61) As mentioned above, one can model the hardware as an extension of the wireless channel in communication systems where quality is controlled by operating conditions such as frequency and supply voltage. By propagating data statistics through various communication blocks, the area under the tail of the resultant distribution (after a certain threshold depending on the modulation) represents the bit error rate (BER). The key idea is to find an equivalent Gaussian noise distribution that has the same area under the tail of the distribution (or equivalently, the same BER) as the corrupted data statistics.
(62) As shown in
(63) For analysis, binary phase shift keying (BPSK) modulation is assumed for simplicity. However, the same methodology could be applied to any other modulation scheme without any loss of generality. Considering the system with faulty memories shown in
(64)
Where P(s.sub.1) and P(s.sub.2) represents the probability of transmitting the BPSK symbols (S.sub.i=±1, I=1,2) and f.sub.U|si(u|s.sub.i) is the distribution of the data before the slicer. Due to symmetry of the tails of the distributions f.sub.U|s1(u|s.sub.1) and f.sub.U|s2(u|s.sub.2) and assuming equally likely symbols, the BER is expressed in (18).
(65)
where i=1,2.
(66) The BER is calculated mathematically based on propagating the data distribution after the memory, f.sub.x(x), through the communication system blocks as presented in the previous section. Consequently, the modulation BER after the slicer is mathematically calculated based on the derived distribution.
(67) Once the BER of the system with faulty memory is calculated, an equivalent noise, n.sub.eq with zero mean and variance σ.sub.eq.sup.2 is determined such that the equivalent system with ideal memories achieves the same BER performance of the original system with faulty memories. The Gaussian distribution of the equivalent noise can be written as:
(68)
(69)
(70) A mathematical formula of the BER.sub.eq is calculated using the probability mass functions (PMF) f.sub.U′(u′) of the data after the equalizer as shown in
(71)
where f.sub.U′/s.sub.
(72)
(73) Since the equivalent noise after the filtering n.sub.eq.sub.
σ.sub.eq.sub._.sub.FFT.sup.2=Nσ.sub.eq.sub._.sub.filter.sup.2 (22)
(74) This Gaussian noise will propagate through another ideal memory, and hence the statistics of the output data will be the same. The received signal after the FFT for each subcarrier can be expressed as:
z′.sub.k=h.sub.ks.sub.k+ñ.sub.FFT,k (23)
where
ñ.sub.eq.sub._.sub.FFT,k˜(0,σ.sub.eq.sub._.sub.FFT.sup.2)
After the ZF equalization:
{circumflex over (r)}.sub.k=s.sub.k+ñ.sub.eq.sub._.sub.FFTk/h.sub.k (24)
(75) The distribution of {circumflex over (r)}.sub.k can be written as
(76)
Hence the BER is calculated as follow:
(77)
Finally, after using integration table formula, the BER of the equivalent system is written as
(78)
(79) Using the mathematical formula of the BER in (26) for the equivalent system and that of the original system with faulty memories in (18), the variance of the equivalent noise n.sub.eq can be calculated.
(80)
where the BER is obtained by integrating the distribution f.sub.v(v) as described in Eq. (18).
(81) Forward error correction (FEC) decoders can be employed at the receiver to detect and correct errors. In one embodiment, convolutional codes and Viterbi decoding algorithm are employed at the receiver to decode transmitted bits.
(82) Hard-input FEC decoders employ hamming distance to find the branch metric distance. Since both the original and equivalent systems have the same BER before the decoder, both achieve the same coded-BER.
(83)
(84) The conventional soft-input Viterbi algorithm is based on the Maximum Likelihood (ML) criteria assuming a Gaussian noise. However, incorporating faulty memories results in a distribution that is slightly non-Gaussian. Due to the large coding gain associated with FEC, the equivalence of the BER after the conventional FEC decoder is not achieved as shown in
(85)
(86) In typical communication systems, buffering memories are needed to store the data before and after processing by basic blocks such as FFT, channel estimation, interleaver and equalization. These memories differ in size and the level of the data redundancy. Generally speaking, one would expect that the closer the buffering memory is to the slicer, the lower the data redundancy level, however, as will be discussed later, this is not necessarily always the case. Depending on its location in the processing chain, each block affects system performance in different ways. It is therefore interesting to evaluate the impact of each buffering memory on the system quality of service (QoS) measured by the BER. A straightforward way to address this problem is by performing a sensitivity analysis of the system's BER with respect to the amount of error rate applied at each memory.
(87) Two buffering memories are under consideration. The first one (M.sub.1) is the buffering memory at the receiver front end immediately before the analog to digital conversion. While the second memory (M.sub.2), is the memory preceding the FEC decoder. The sensitivity of the BER to the probability of error is defined as:
(88)
where the BER is given by
(89)
and f.sub.v(v, P.sub.e) is the data distribution before the slicer which depends on the error rate applied at the buffering memory under consideration.
(90)
where the derivative can be approximated by
(91)
(92)
(93) The second observation is that, for the same error rate, M1 and M2 impact the system differently. Interestingly, contrary to what was expected, system performance is more sensitive to errors in M1 than M2 although M2 is closer to the slicer as compared to M1. The reason behind that is the FFT, since it is not a one-to-one mapping process. Any error at one of the inputs of the FFT will affect all the N-data at the output of the FFT. Therefore, errors in M1 have a more severe effect on system performance, especially for larger FFT sizes.
(94) Based on the equivalent noise modeling presented herein, the effect of memory supply voltage over-scaling (VOS) on the final metric of the system (BER in this case) can be mathematically derived. For the different combinations of the supply voltages, the present equivalent noise technique provides a mathematical model that precisely estimates system performance at any given SNR as a result of the applied power management technique. A system with two buffering memories is considered. The purpose of the algorithms is to find the appropriate memory supply voltage that maximizes power savings by exploiting the available SNR slack while keeping the system performance within the required margin.
(95) The supply voltages and the equivalent memory error rates shown in Table II are based on HSPICE circuit simulations of a 6T SRAM cells using 65 nm CMOS predictive. Different SRAM memories could have different memory error rates due to process variations. Furthermore, due to aging and temperature variations, memory error rates may vary. Hence, Built-in self-test (BIST) mechanisms could be applied to measure and characterize memory error rates under VoS. The power manager algorithm will then run the BIST technique for each memory to update the entries of the memory error rates for different supply voltages. Since temperature variations and other factors that affect the memories are slow processes, once the table is constructed, it will need infrequent updates, with negligible overhead and the throughput degradation.
(96) TABLE-US-00002 TABLE II Supply Voltage and Corresponding Memory Error Rate V.sub.dd 1.0 0.85 0.75 0.65 P.sub.e 1.69e.sup.−15* 5.21e.sup.−12 1.8e.sup.−6 2.7e.sup.−3 *extrapolated
(97) The power management methodology presented herein is composed of two parts. The first part is an offline algorithm that characterizes the effective SNR of the system based on the memory error rates and received SNR. In more details, for every pair of memory supply voltages (V.sub.Mem1, V.sub.Mem2), the algorithm reads the corresponding memory error rates (P.sub.e1, P.sub.e2). Then based on the derived PMF distribution under VoS, the equivalent noise that achieves the same BER under these supply voltages is calculated. Hence, the value of the effective SNR is tabulated for the tuple of (SNR.sub.rec, V.sub.Mem1, V.sub.Mem2). The details of this algorithm are explained in Algorithm 1 below, which is also shown in
(98) TABLE-US-00003 Algorithm 1: Offline Power Manager Algorithm 1: for each quantized SNR do 2: V.sub.dd = {1.0, 0.85, 0.75, 0.65} (1402) 3: for V.sub.Mem1 = {1.0, 0.85, 0.75, 0.65} 4: for V.sub.Mem2 = {1.0, 0.85, 0.75, 0.65} 5: P.sub.e1 = LUT(V.sub.Mem1): (1405) 6: P.sub.e2 = LUT(V.sub.Mem2); (1406) 7: Calculate BER.sub.uncoded(P.sub.e1, P.sub.e2, SNR.sub.q); (1407) 8: Calculate σ.sub.n_eq.sup.2(equivalent noise model); (1408) 9: Calculate the effective SNR.sub.eff; (1409) 10: Calculate Power Savings P.sub.s(P.sub.e1,P.sub.e2); (1410) 11: end for 12: end for 13: end for.
(99) The size of the look up table (LUT) depends on the quantization resolution of the received SNR and the number of the allowed memory supply voltages. Assuming a linear quantization of the received SNR with a step ΔSNR and 12-bit precision to store the effective SNR, the size of one LUT for a certain combination of (V.sub.Mem1, V.sub.Mem2) is given by
(100)
(101) Hence, the total required storage for the different combination of the two memories supply voltages can be expressed as:
N.sub.mem.sub._.sub.volt.sup.2×LUT.sub.size
(102)
(103) The second part of the algorithm is the online part, which updates the memory supply voltages to track channel SNR variations. The online algorithm shown below and in Algorithm 2 (see
(104) TABLE-US-00004 Algorithm 2: Online Power Manager Algorithm 1: for each time step do 2: Initialize V.sub.Mem1 = V.sub.Mem2 = V.sub.dd,nominal, P.sub.s,Mem = 0 (1502) 3: Estimate channel parameters (σ.sub.n.sup.2,γ′) (1503) 4: Quantize received SNR, SNR.sub.rec,q= Q(SNR.sub.rec,) (1504) 5: Calculate available slack ΔSNR.sub.Av=SNR.sub.rec,q− SNR.sub.rec (1505) 6: If ΔSNR.sub.Av>0 7: for V.sub.Mem1 = {1.0, 0.85, 0.75, 0.65} 8: for V.sub.Mem2 = {1.0, 0.85, 0.75, 0.65} 9: SNR.sub.eff(v.sub.1,v.sub.2) = LUT(v.sub.1,v.sub.2,SNR.sub.rec,q); (1509) 10: P.sub.s,Mem = P.sub.s(V.sub.1,V.sub.2); (1510) 11: If (SNR.sub.eff>SNR.sub.rec,q)&(P.sub.s>P.sub.s,Mem) (1511) 12: P.sub.s,mem=P.sub.s; (1512) 13: Update candidate V.sub.dd=(v.sub.1,v.sub.2); (1513) 14: end if; 15: end for; 16: end for; 17: end for.
EXAMPLE
(105)
(106) Two of the largest memories in the system are targeted, the buffering and FFT memory after the ADC and the buffering memory after the FFT which is used for the channel equalization. As discussed above, memories before the FFT have a higher impact on the system overall BER. In previous work, a COFDM Baseband Receiver for DVB-T/H applications was implemented in 0.18 um technology. In this implementation, 158 Kbytes of embedded buffering memories were required for FFT processing, channel estimation and equalization. These memories occupy approximately around 35% of the chip area and consume 25% of the total chip power. The area and power consumption for both the memories and logic were scaled down to 65 nm as shown in Table III to quantify for process variation at advanced CMOM technology. Note that in other more advanced schemes such as LTE etc., the memory share of area and power are more pronounced due to buffering requirements of techniques such as HARQ and MIMO etc. It is also important to note that in such advanced systems, typically advanced modulation and coding are used (AMC), thus two loops will need to be jointly optimized. The slower outer AMC loop controls the slack seen by the receiver, and the faster inner loop of the power manager minimizes power based on available slack.
(107) TABLE-US-00005 TABLE III Area and Power Shares for Memory and Logic Power Power Count Area 0.18 um 65 nm Memory 158K Bytes 40.10% 65.6225 mW 7.067 mW Logic 317K Gates 59.90% 187.87 mW 21.2 mW
(108) To verify the accuracy of the disclosed methodology, a full MATLAB simulation of the system depicted in
(109)
(110)
(111) TABLE-US-00006 TABLE I Memory Power Savings Versus Supply Voltage Power Savings % V.sub.1 = 1.0 V.sub.1 = 0.85 V.sub.1 = 0.75 V.sub.1 = 0.65 V.sub.2 = 1.00 0% 16.68% 28.82% 39.4% V.sub.2 = 0.85 5.27% 22.15% 34.09% 44.67% V.sub.2 = 0.75 9.101% 25.78% 37.92% 48.50% V.sub.2 = 0.65 12.45% 29.14% 41.27% 51.85%
(112)
(113)
(114)
(115)
(116) A data storage device 2425 such as a magnetic disk or optical disc and its corresponding drive may also be coupled to architecture 2400 for storing information and instructions. Architecture 1100 can also be coupled to a second I/O bus 2450 via an I/O interface 2430. A plurality of I/O devices may be coupled to I/O bus 2450, including a display device 2443, an input device (e.g., an alphanumeric input device 2442 and/or a cursor control device 2441).
(117) The communication device 2440 allows for access to other computers (e.g., servers or clients) via a network. The communication device 2440 may comprise one or more modems, network interface cards, wireless network interfaces or other interface devices, such as those used for coupling to Ethernet, token ring, or other types of networks.
(118) The proposed model described herein enables system designers to rapidly and accurately design a more efficient power management policy as compared to the traditional power management policies such as DVFS. The embodiments disclosed herein enable a statistical model to accurately and rapidly evaluate the impact of memory failures due to voltage over-scaling as a means of power management. The effect on the data distribution at each block in the communication system leading to and originating from the memory in question is quantified in a closed form solution. By replacing the traditional noise model in communication systems with the developed equivalent noise model, one can investigate different power management policies, where the faulty hardware can be treated as error-free hardware. The accuracy of the model has been verified by performing a full simulation of a DVB system, which demonstrates that results from the simulation are in close agreement with those obtained by the proposed analytical methods.
(119) A power management technique utilizing a method for accurately and rapidly estimating the change in the statistical distribution of data at each block in a communication system leading to or originating from a memory that is experiencing voltage scaling induced errors is disclosed. An appropriate memory supply voltage that maximizes power savings can be found by exploiting the available SNR slack while keeping the system performance within the required margin.
(120) While the invention is susceptible to various modifications, and alternative forms, specific examples thereof have been shown in the drawings and are herein described in detail. It should be understood, however, that the invention is not to be limited to the particular forms or methods disclosed, but to the contrary, the invention is to cover all modifications, equivalents and alternatives falling within the spirit and scope of the appended claims.
(121) In the description above, for purposes of explanation only, specific nomenclature is set forth to provide a thorough understanding of the present disclosure. However, it will be apparent to one skilled in the art that these specific details are not required to practice the teachings of the present disclosure.
(122) The various features of the representative examples and the dependent claims may be combined in ways that are not specifically and explicitly enumerated in order to provide additional useful embodiments of the present teachings. It is also expressly noted that all value ranges or indications of groups of entities disclose every possible intermediate value or intermediate entity for the purpose of original disclosure, as well as for the purpose of restricting the claimed subject matter.
(123) It is understood that the embodiments described herein are for the purpose of elucidation and should not be considered limiting the subject matter of the disclosure. Various modifications, uses, substitutions, combinations, improvements, methods of productions without departing from the scope or spirit of the present invention would be evident to a person skilled in the art. For example, the reader is to understand that the specific ordering and combination of process actions described herein is merely illustrative, unless otherwise stated, and the invention can be performed using different or additional process actions, or a different combination or ordering of process actions. As another example, each feature of one embodiment can be mixed and matched with other features shown in other embodiments. Features and processes known to those of ordinary skill may similarly be incorporated as desired. Additionally and obviously, features may be added or subtracted as desired. Accordingly, the invention is not to be restricted except in light of the attached claims and their equivalents.
REFERENCES
(124) 1. International Technology Roadmap for Semiconductors [Online] Available: http://www.itrs.net 2. B. H. Calhounand, A. P. Chandrakasan, “Ultra-dynamic voltage scaling (UDVS) using sub-threshold operation and local voltage dithering,” IEEE J. Solid-State Circuits, vol. 41, no. 1, pp. 238-245, January 2006. 3. S. Mukhopadhyay, H. Mahmoodi, and K. Roy, “Statistical design and optimization of SRAM cell for yield enhancement,” in Proc. IEEE/ACM Int. Conf. Computer Aided Design, November 2004, pp. 10-13. 4. S. Das, D. Roberts, L. Seokwoo, S. Pant, D. Blaauw, T. Austin, K. Flautner, and T. Mudge, “A self-tuning DVS processor using delay-error detection and correction,” IEEE J. Solid-State Circuits, vol. 41, no. 4, pp. 792-804, April 2006. 5. F. J. Kurdahi, A. Eltawil, K. Yi, S. Cheng, and A. Khajeh, “Low-power multimedia system design by aggressive voltage scaling,” IEEE Trans. Very Large Scale Integr. Syst., vol. 18, pp. 852-856, 2010. 6. G. Karakonstantis, C. Roth, C. Benkeser, and A. Burg, “On the exploitation of the inherent error resilience of wireless systems under un-reliable silicon,” in Proc. 49th Annual Design Automation Conf., 2012, pp. 510-515, ACM. 7. C. Brehm, M. May, C. Gimmler, and N. Wehn, “A case study on error resilient architectures for wireless communication,” in Architecture of Computing Systems. New York, N.Y., USA: Springer, 2012, pp. 13-24. 8. F. Kurdahi, A. Eltawil, A. K. Djahromi, M. Makhzan, and S. Cheng, “Error-aware design,” in Proc. 10th Euromicro Conf. Digital System Design Architectures, Meth. Tools, Aug. 29-31, 2007, pp. 8-15. 9. Y. Liu, T. Zhang, and K. K. Parham, “Analysis of voltage over scaled computer arithmetic's in low power signal processing systems,” in Proc. Asilomar Conf. Signals, Syst. Comput., Oct. 26-29, 2008, pp. 2093-2097. A. K. Djahromi, S. Cheng, A. M. Eltawil, and F. J. Kurdahi, “Power management for cognitive radio platforms,” in IEEE Global Telecommun. Conf., Nov. 26-30, 2007, pp. 4066-4070. 10. Digital Video Broadcasting (DVB): Frame Structure, Channel Coding and Modulation for Digital Terrestrial Television (DVB-T) ETSI, 2004, Tech. Rep. ETSI EN 300 744. 11. C. Chi-Chie, S. Chi-Hong, and W. Jen-Ming, “A low power baseband OFDM receiver IC for fixed WiMAX communication,” in Proc. IEEE Asian Solid-State Circuits Conf., Nov. 12-14, 2007, pp. 292-295. 12. C. Lei-Fone, C. Yuan, C. Lu-Chung, M. Ying-Hao, L. Chia-Hao, L. Yu-Wei, L. Chien-Ching, L. Hsuan-Yu, H. Terng-Yin, and L. Chen-Yi, “A 1.8 V 250 mW COFDM baseband receiver for DVB-T/H applications,” in Proc. IEEE Int. Solid-State Circuits Conf., Feb. 6-9, 2006, pp. 1002-1011. 13. IEEE Standard for Local and Metropolitan Area Networks Part 16: Air Interface for Fixed Broadband Wireless Access Systems, IEEE 802.16, 2004. A. Nilsson, E. Tell, and D. Liu, “An 11 mm.sup.2, 70 mW fully programmable baseband processor for mobile WiMAX and DVB-T/H in 0.12 um CMOS,” IEEE J. Solid-State Circuits, vol. 44, no. 1, pp. 90-97, January 2009. A. B. Ericsson, Long term evolution (LTE): An introduction White Paper, October 2007. 14. R. Hegdeand, N. R. Shanghai, “A voltage overscaled low-power digital filter IC,” IEEE J. Solid-State Circuits, vol. 39, no. 2, pp. 388-391, February 2004. A. K. Djahromi, A. M. Eltawil, and F. J. Kurdahi, “Fault tolerant approaches targeting ultra low power communications system de-sign,” in Proc. IEEE Vehicular Technol. Conf., Apr. 22-25, 2007, pp. 2600-2604. A. Khajeh, K. Amir, M. S. Khairy, A. M. Eltawil, and F. J. Kurdahi, “A unified hardware and channel noise model for communication systems,” in Proc. IEEE Global Telecommun. Conf., Dec. 6-10, 2010, pp. 1-5. 15. M. S. Khairy, A. Khajeh, A. M. Eltawil, and F. J. Kurdahi, “FFT processing through faulty memories in OFDM based systems,” in Proc. IEEE GLOBECOM Workshops Appl. Commun. Theory Emerging Memory Technol., Dec. 6-10, 2010, pp. 1946-1951. B. Roth, C. Benkeser, C. Studer, G. Karakonstantis, and A. Burg, Data Mapping for Unreliable Memories arXiv preprint arXiv:1212.4950, 2012. 16. A. K. Djahromi, A. M. Eltawil, F. J. Kurdahi, and R. Kanj, “Crosslayer error exploitation for aggressive voltage scaling,” in Proc. Int. Symp. Quality Electronic Design, Mar. 26-28, 2007, pp. 192-197. 17. S. Mukhopadhyay, H. Mahmoodi, and K. Roy, “Modeling of failure probability and statistical design of SRAM array for yield enhancement in nanoscaled CMOS,” IEEE Trans. Computer-Aided Des. Integr. Circuits Syst., vol. 24, no. 12, pp. 1859-1880, December 2005. A. Khajeh, A. M. Eltawil, and F. J. Kurdahi, “Embedded memories fault-tolerant pre and post silicon optimization,” IEEE Trans. Very Large Scale Integr. Syst., vol. 19, no. 10, pp. 1916-1921, October 2011. A. Papulis, Probability, Random Variables and Stochastic Processes. New York, N.Y., USA: McGraw-Hill, 1965. 18. J. Brown and H. Piper, “Output characteristic function for an analog crosscorrelator with band pass inputs,” IEEE Trans. Inf. Theory, pp. 6-10, January 1967. 19. J. Schoukens and J. Renneboog, “Modeling the noise influence on the Fourier coefficients after a discrete Fourier transform,” IEEE Trans. Instrum. Meas., vol. IM-35, pp. 278-286, 1986. A. Hussien, M. S. Khairy, A. Khajeh, A. M. Eltawil, and F. J. Kurdahi, “Combined channel and hardware noise resilient viterbi decoder,” in Proc. Asilomar Conf. Signals, Systems, Comput., pp. 395-399, 7-10 November A. M. A. Hussien, M. S. Khairy, A. Khajeh, A. M. Eltawil, and F. J. Kurdahi, “A class of low power error compensation iterative decoders,” in Proc. IEEE Global Telecommun. Conf., Dec. 5-9, 2011, pp. 1-6. 20. J. Geldmacher and J. Gotze, “On fault tolerant decoding of Turbo codes,” in Proc. 7th Int. Symp. Turbo Codes and Iterative Information Processing (ISTC), August 2012, pp. 245-249. 21. Predictive Technology Model (PTM) [Online]. Available: http://www.eas.asu.edu/˜ptm