Complexity-reduced simulation of circuit reliability
11704462 · 2023-07-18
Assignee
Inventors
- Pieter Weckx (Vaalbeek, BE)
- Dimitrios Rodopoulos (Athens, GR)
- Benjamin Kaczer (Leuven, BE)
- Francky Catthoor (Temse, BE)
Cpc classification
G06F30/367
PHYSICS
G06F30/3323
PHYSICS
International classification
Abstract
A system and method for simulating an electronic circuit is disclosed. The method includes creating a finite set of circuit or device parameter points selected from within an n-dimensional parameter space. The method includes determining, for each circuit or device parameter point of the set, a corresponding response value of the performance metric and a corresponding probability of occurrence. The method includes determining, for a predetermined value of the performance metric, the total probability of occurrence.
Claims
1. A method comprising: providing a frequency error margin value and a duty cycle error margin value associated with processing a digital signal having a transient waveform that comprises a sequence of pulses, each pulse having a pulse frequency, a pulse duty cycle, and a pulse duration; identifying a buffer frequency, a buffer duty cycle, and a buffer duration of one of a plurality of sub-sequences of pulses, wherein each sub-sequence of pulses comprises one or more pulses, and wherein for the one of the sub-sequences of pulses: the buffer frequency is a first function of the pulse frequencies of the sub-sequence of pulses, such that the buffer frequency and each of the pulse frequencies of the sub-sequence of pulses differ by less than the frequency error margin, the buffer duty cycle is a second function of the pulse duty cycles of the sub-sequence of pulses, such that the buffer duty cycle and each of the pulse duty cycles of the sub-sequence of pulses differ by less than the duty cycle error margin, and the buffer duration is the sum of the pulse durations over the sub-sequence of pulses; representing the sub-sequence of pulses as a point comprising the buffer frequency, the buffer duty cycle, and the buffer duration; determining a response value of a performance metric and a probability of occurrence corresponding to the response value, based on the point; and designing an electronic circuit based on the probability of occurrence, or driving an electronic circuit with a workload selected based on the probability of occurrence.
2. The method of claim 1, further comprising: identifying a next buffer frequency, a next buffer duty cycle, and a next buffer duration of a next sub-sequence of pulses, wherein for the next sub-sequence of pulses: the next buffer frequency is the first function of the pulse frequencies of the next sub-sequence of pulses, such that the next buffer frequency and each of the pulse frequencies of the next sub-sequence of pulses differ by less than the frequency error margin, the next buffer duty cycle is the second function of the pulse duty cycles of the next sub-sequence of pulses, such that the next buffer duty cycle and each of the pulse duty cycles of the next sub-sequence of pulses differ by less than the duty cycle error margin, and the next buffer duration is the sum of the pulse durations over the next sub-sequence of pulses; and representing the next sub-sequence of pulses by the next buffer frequency, the next buffer duty cycle, and the next buffer duration, wherein the next sub-sequence of pulses starts from a pulse such that: the buffer frequency and the frequency of the pulse differ by an amount equal to or more than the frequency error margin, and/or the buffer duty cycle and the duty cycle of the pulse differ by an amount equal to or more than the duty cycle error margin.
3. The method of claim 2, wherein the buffer frequency, buffer duty cycle, and buffer duration representing each of the sub-sequences of pulses forms a compact digital waveform (CDW) point for the corresponding sub-sequence of pulses, and the method further comprises: storing a representation of the digital signal as a series of CDW points, each CDW point being representative of one of the plurality of sub-sequences of pulses.
4. The method of claim 3, wherein the buffer frequencies, buffer duty cycles, and buffer durations of the CDW points differ between two CDW points.
5. The method of claim 1, wherein the first function is an average.
6. The method of claim 5, wherein the second function is an average.
7. The method of claim 1, wherein the signal is represented in Simulation Program with Integrated Circuit Emphasis (SPICE) format.
8. The method of claim 1, wherein the signal is represented in Value Change Dump (VCD) or equivalent format.
9. The method of claim 1, wherein the buffer frequency, buffer duty cycle, and buffer duration representing each of the sub-sequences of pulses forms a compact digital waveform (CDW) point for the corresponding sub-sequence of pulses, and the method further comprises: storing a representation of the digital signal as a series of CDW points, each CDW point being representative of one of the plurality of sub-sequences of pulses.
10. The method of claim 9, wherein the buffer frequencies, buffer duty cycles, and buffer durations of the CDW points vary over time.
11. The method of claim 1, wherein the digital signal is a voltage applied to a gate terminal of a transistor as a function of time.
12. The method of claim 11, wherein the transistor is a p-channel metal-oxide-semiconductor field-effect transistor, n-channel metal-oxide-semiconductor field-effect transistor, junction field effect transistor, high electron mobility transistor, or insulated-gate bipolar transistor.
13. The method of claim 1, wherein the digital signal is a voltage applied to an input port of a logic cell of a circuit or device as a function of time, the circuit or device comprising a plurality of logic cells.
14. The method of claim 13, wherein the logic cell is a memory cell of static random-access memory (SRAM).
15. The method of claim 1, wherein the pulse frequencies and the pulse duty cycles of the sequence of pulses vary over time.
16. The method of claim 1, wherein the response value of the performance metric and the probability of occurrence corresponding to the response value are determined based on the point and on one or more additional points representing additional sub-sequences of the plurality of sub-sequences of pulses.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The invention will now be described further, by way of example, with reference to the accompanying drawings, in which:
(32) The drawings are only schematic and are non-limiting. In the drawings, the size of some of the elements may be exaggerated and not drawn on scale for illustrative purposes. The dimensions and the relative dimensions do not necessarily correspond to actual reductions to practice of the invention.
(33) Any reference signs in the claims shall not be construed as limiting the scope.
(34) In the different drawings, the same reference signs refer to the same or analogous elements.
DETAILED DESCRIPTION OF CERTAIN ILLUSTRATIVE EMBODIMENTS
(35) The disclosed technology will be described with respect to particular embodiments and with reference to certain drawings, but the invention is not limited thereto but only by the claims.
(36) The terms first, second and the like in the description and in the claims, are used for distinguishing between similar elements and not necessarily for describing a sequence, either temporally, spatially, in ranking or in any other manner. It is to be understood that the terms so used are interchangeable under appropriate circumstances and that the embodiments of the invention described herein are capable of operation in other sequences than described or illustrated herein.
(37) Moreover, directional terminology such as top, bottom, front, back, leading, trailing, under, over and the like in the description and the claims is used for descriptive purposes with reference to the orientation of the drawings being described, and not necessarily for describing relative positions. Because components of embodiments of the disclosed technology can be positioned in a number of different orientations, the directional terminology is used for purposes of illustration only, and is in no way intended to be limiting, unless otherwise indicated. It is, hence, to be understood that the terms so used are interchangeable under appropriate circumstances and that the embodiments of the invention described herein are capable of operation in other orientations than described or illustrated herein.
(38) Reference throughout this specification to “one embodiment” or “an embodiment” means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the disclosed technology. Thus, appearances of the phrases “in one embodiment” or “in an embodiment” in various places throughout this specification are not necessarily all referring to the same embodiment, but may. Furthermore, the particular features, structures or characteristics may be combined in any suitable manner, as would be apparent to one of ordinary skill in the art from this disclosure, in one or more embodiments.
(39) Similarly it should be appreciated that in the description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the disclosure and aiding in the understanding of one or more of the various inventive aspects. This method of disclosure, however, is not to be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of this invention.
(40) Furthermore, while some embodiments described herein include some but not other features included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the invention, and form different embodiments, as would be understood by those in the art. For example, in the following claims, any of the claimed embodiments can be used in any combination.
(41) It should be noted that the use of particular terminology when describing certain features or aspects of the invention should not be taken to imply that the terminology is being re-defined herein to be restricted to include any specific characteristics of the features or aspects of the invention with which that terminology is associated.
(42) In the description provided herein, numerous specific details are set forth. However, it is understood that embodiments of the invention may be practiced without these specific details. In other instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.
(43) An electronic circuit in the context of the disclosed technology relates to a circuit assembled of individual electronic devices such as transistors, diodes, resistors, capacitors, inductors. These electronic devices are connected to one another by means of conductive paths. Electronic devices include semiconductor devices.
(44) Various kinds of electronic devices and device technologies may be investigated by means of embodiments of the disclosed technology. Whilst of particular application to p- and n-channel FETs (MOSFETs), any other kind of electronic device for which the characteristics are influenced by capture and emission events, covering both fabrication time and operation, may be considered. Every effect in such a device that can be modelled by capture and emission events can be simulated by the disclosed technology. This includes bipolar junction transistors (BJTs), junction field effect transistors (JFETs), high electron mobility transistors (HEMTs), insulated-gate bipolar transistors (IGBTs) and so on. The disclosed technology is, however, not restricted to transistors as devices; for example various kinds of diodes known in the art may also be considered. Embodiments of the disclosed technology may be applied to any device where there are effects that degrade the devices in a partly statistical, partly deterministic manner, depending on the workload.
(45) Embodiments of the disclosed technology relate to workload-dependent reliability simulations of complex circuits and systems. In the context of the disclosed technology, “simulation” means a transient simulation over a given time period. During such simulation, the parameters of the technology are fixed for the active and passive elements that are used in the electronic circuit to be simulated. Hence a given parameter file is used with a predetermined set for all components (NMOS slow, PMOStyp, Rmin, Ctyp, etc.). The environmental conditions, e.g., temperature and supply voltage, determine the workload, and may be fixed for the entire simulation. Therefore, one process implementation may be simulated in a set of particular environmental conditions, such as at one predetermined temperature condition, and at a particular power supply. Nevertheless, such simulations can be repeated for different process parameters and/or for different environmental conditions, e.g., temperatures or power supply values, to get a more complete overview of the simulation results. For instance, one could wish to see the effect of a spike/dip on the supply or ground line, and can introduce this in the simulation by taking another V.sub.dd value. Furthermore, environmental conditions, e.g., temperature and/or supply voltage, may vary across the overall simulation time. Only within a single CDW segment, in accordance with particular embodiments of the disclosed technology, the environmental conditions are kept constant.
(46) A first aspect is a method for simulating an electronic circuit. The method includes creating a finite set of circuit or device parameter points selected from within an n-dimensional parameter space. The set of circuit or device parameter points is a subset, i.e. less than all points of the total amount of possible circuit or device parameter points within that parameter space. The method further includes, for each circuit or device parameter point of the set, determining a corresponding response value of the performance metric and a corresponding probability of occurrence, and determining, for a predetermined value of the performance metric, the total probability of occurrence.
(47) Because their devices have minimum dimensions and are therefore particularly subject to variability, SRAM cells are excellent for a high sigma evaluation case study. Therefore, embodiments will be explained and elucidated by referring to SRAM cells and circuits as particular examples of electronic circuits. This, however, is not intended to be limiting for the disclosed technology, which can be applied to any type of electronic circuit. The disclosed methods and devices are particularly useful for simulation of large electronic circuits, for instance electronic circuits including at least thousands of gates, for instance tens of thousands, hundreds of thousands, or millions of gates.
(48) Furthermore, features and advantages are explained with reference to BTI as one particular example of a time-varying phenomenon that is critical to reliability. This, however, is not intended to be limiting for the disclosed technology. The methods and devices according to embodiments of the disclosed technology apply also to other workload-dependent reliability simulations of complex circuits and systems.
(49)
(50) TABLE 1
Technology                     BULK      FinFET    FinFET
Tech node [nm]                 28        14        10
A.sub.VT [mV μm]               2         1.3       1
σ.sub.0,PU [mV]                35        22        19
σ.sub.0,PD [mV]                30        16        14
σ.sub.0,AC [mV]                35        22        19
sizing ratio                   2-2-3     1-1-2     1-1-2
V.sub.DD [V]                   1         0.8       0.7
V.sub.TH0 [V]                  0.6       0.25      0.25
T.sub.INV [nm]                 1.5       1.3       1.2
V.sub.OV/T.sub.INV [MV/cm]     2.7       4.2       3.7
A.sub.n [V]                    3.5e−2    3.1e−2    3.1e−2
A.sub.p [V]                    1.9e−2    2.0e−2    2.0e−2
γ.sub.n                        4.8       5.2       5.2
γ.sub.p                        2.5       3         3
n.sub.n                        0.203     0.158     0.158
n.sub.p                        0.186     0.173     0.173
(51) BTI ΔV.sub.TH is simulated using distribution (1) with the power-law exponents (γ, n) and prefactors A. The mean shift <ΔV.sub.TH> follows a power-law dependency
μ.sub.ΔV.sub.TH=<ΔV.sub.TH>=A·(V.sub.OV/T.sub.INV).sup.γ·t.sup.n (4)
(52) where t is the total DC stress time and V.sub.OV is the overdrive voltage calculated as
V.sub.OV=V.sub.G−V.sub.TH=V.sub.DD−V.sub.TH (5)
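By way of a non-limiting arithmetic illustration, Equation (5) can be checked against the V.sub.OV/T.sub.INV row of TABLE 1 by recomputing the overdrive voltage and the resulting oxide field from the tabulated V.sub.DD, V.sub.TH0 and T.sub.INV values. In the sketch below, the dictionary layout and the function name are assumptions made for illustration only, and V.sub.TH0 is used in place of V.sub.TH.

```python
# Illustrative check of Equation (5) using the TABLE 1 values.
# The data layout and the helper name are assumptions made for this sketch.
TABLE_1 = {
    28: {"vdd": 1.0, "vth0": 0.60, "tinv_nm": 1.5},
    14: {"vdd": 0.8, "vth0": 0.25, "tinv_nm": 1.3},
    10: {"vdd": 0.7, "vth0": 0.25, "tinv_nm": 1.2},
}

def overdrive_field(vdd, vth, tinv_nm):
    """Return (V_OV in V, V_OV/T_INV in MV/cm) with V_OV = V_DD - V_TH."""
    v_ov = vdd - vth
    tinv_cm = tinv_nm * 1e-7            # 1 nm = 1e-7 cm
    return v_ov, v_ov / tinv_cm / 1e6   # V/cm converted to MV/cm

for node, p in TABLE_1.items():
    v_ov, field = overdrive_field(p["vdd"], p["vth0"], p["tinv_nm"])
    print(f"{node} nm: V_OV = {v_ov:.2f} V, V_OV/T_INV = {field:.2f} MV/cm")
```

For the 28 nm column this gives V.sub.OV = 0.4 V over 1.5 nm, i.e. roughly 2.7 MV/cm, in agreement with the tabulated value.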
(53) A correlation between the time-zero and time-dependent variances has been observed experimentally and can be used to estimate the average impact per defect η by first calculating the time dependent variance as
(54)
(55) To quantify the impact of BTI stress, static margins may be used to determine a performance metric. In the example illustrated, not intended to be limiting for the disclosed technology, the performance metric is the static noise margin (SNM), which may be calculated from a butterfly curve analysis as illustrated in
(56) Taking this into account, the SNM becomes a 6-dimensional non-analytical function. Numerical representation of the response surface (
(57) In embodiments of the disclosed technology, other performance metrics may be used, for instance write margin, circuit delay, and dynamic and/or leakage power, the disclosed technology not being limited thereto.
(58) In embodiments of the disclosed technology, the response surface may be used for calculation of output statistics.
(59)
(60) By integration of the PDF, one can obtain the cumulative distribution function (CDF) of the performance metric SNM:
(61)
(62) This problem cannot be solved analytically, however, since the response surface has no closed-form solution and is evaluated using a simulator, for instance SPICE. Simulating the response surface in a finite amount of time requires truncating it. This truncation introduces an error on the final SNM CDF that has to be considered. The error depends on the range taken for the circuit or device parameter domain, hence on the lower bound and upper bound of parameter values taken into account. This error, as illustrated in
(63)
(64) This error can be reduced to very low levels by suitable selection of lower bounds and upper bounds of the plurality of parameter values, and by suitable distribution of the selected parameter values between the lower and upper bounds, making it possible to reach deep into tails.
(65) In alternative embodiments of the disclosed technology, rather than using the response surface for calculating the output statistics as explained above, a mathematically equivalent calculation method may be used. One embodiment thereof is by making use of line integrals. However, calculating line integrals for a large enough set of SNM values can be computationally quite intensive. In accordance with embodiments of the disclosed technology, a numerical method may be used to calculate the CDF using a similar methodology but circumventing the cumbersome line integrals.
(66) Shown in
(67)
(68) which is a mathematically equivalent calculation performed by sorting all circuit or device parameters PDF points g(ξ.sub.i) by their corresponding output value C.sub.i (
(69)
(70) The weights w.sub.i are proportional to the n-dimensional volume of the circuit or device parameter points ξ.sub.i. The methodology can thus take advantage of optimal meshing schemes for the parameter space, e.g., adaptive meshing.
(71) The above embodiment is explained for the case where all circuit or device parameters PDF points g(ξ.sub.i) are sorted by their corresponding output value C.sub.i. To this end, the response values C.sub.i may be grouped based on value so that response values that do not deviate more than a predetermined error margin from one another are grouped in a same bin. The bins may then be ranked, for instance from high to low or vice versa. Determining the total probability of occurrence of a predetermined value of the performance metric may then include selecting one of the bins, and for the selected bin, summing all probabilities of occurrence.
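By way of a non-limiting illustration, the sorting and binning described above might be sketched as follows. The function names are hypothetical, and each entry of `probs` is assumed to already hold the probability of occurrence of one parameter point (i.e. the PDF value multiplied by its volume weight w.sub.i). Bins of fixed width equal to the error margin are one possible grouping, chosen here for simplicity.

```python
# Sketch: output statistics from weighted parameter points.
# responses[i] is C_i; probs[i] is the probability of occurrence of point i
# (assumed here to be w_i * g(xi_i)). All names are illustrative.
from itertools import accumulate

def cdf_by_sorting(responses, probs):
    """Sort points by response value and accumulate their probabilities."""
    order = sorted(range(len(responses)), key=lambda i: responses[i])
    sorted_c = [responses[i] for i in order]
    cum_p = list(accumulate(probs[i] for i in order))
    return sorted_c, cum_p

def bin_probabilities(responses, probs, margin):
    """Group responses into bins of width `margin` and sum probabilities per bin."""
    bins = {}
    for c, p in zip(responses, probs):
        key = int(c // margin)            # values in one bin differ by less than margin
        bins[key] = bins.get(key, 0.0) + p
    return dict(sorted(bins.items()))     # bins ranked from low to high

if __name__ == "__main__":
    c, f = cdf_by_sorting([0.3, 0.1, 0.2], [0.5, 0.2, 0.3])
    print(list(zip(c, f)))
```

The cumulative list returned by `cdf_by_sorting` is the discrete CDF evaluated at each sorted response value, while `bin_probabilities` returns the total probability of occurrence per bin.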
(72) In alternative embodiments, rather than sorting the circuit or device parameters PDF points g(ξ.sub.i) by their corresponding output value C.sub.i, the output values C.sub.i may each be simply taken and compared to a reference value. In this case, equation (8) can be rewritten as
F(SNM)=Σ.sub.i=0.sup.m g(ξ.sub.i) for C.sub.i<SNM (12)
(73) Determining the total probability of occurrence of a predetermined value of the performance metric in this case may include comparing each response value with the predetermined response value, and for all response values lower than the predetermined response value, summing the probabilities of occurrence.
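By way of a non-limiting illustration, the comparison-based summation of Equation (12) might be sketched as below for a one-dimensional toy case. The Gaussian spread of 30 mV, the ±8 sigma grid, the linear toy response and all names are assumptions chosen only so that the grid estimate can be compared with an analytic Gaussian tail; in practice the response values would come from circuit simulation.

```python
import math

def normal_pdf(x, sigma):
    return math.exp(-0.5 * (x / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))

def total_probability_below(responses, probs, ref):
    """Equation (12): sum the probabilities of all points with C_i < ref."""
    return sum(p for c, p in zip(responses, probs) if c < ref)

# Toy 1-D setup (all values assumed): one threshold-voltage shift xi [V],
# Gaussian with sigma = 30 mV, sampled on a uniform grid of +/- 8 sigma.
sigma, n = 0.030, 4001
step = 16.0 * sigma / (n - 1)
xi = [-8.0 * sigma + i * step for i in range(n)]
probs = [normal_pdf(x, sigma) * step for x in xi]    # volume weight times PDF
responses = [0.200 - 0.5 * x for x in xi]            # toy linear metric [V]

# Probability that the toy metric falls below its value at xi = +6 sigma.
ref = 0.200 - 0.5 * (6.0 * sigma)
tail = total_probability_below(responses, probs, ref)
exact = 0.5 * math.erfc(6.0 / math.sqrt(2.0))        # analytic Gaussian tail
print(f"grid estimate {tail:.3e}, analytic {exact:.3e}")
```

Because every grid point carries its own probability of occurrence, the deep tail is reached with a few thousand deterministic points rather than with random sampling; the residual difference with respect to the analytic value stems from the finite grid step.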
(74)
(75) A method according to embodiments of the disclosed technology (illustrated in
(76) Experiments
(77) Using the methodology in accordance with embodiments of the disclosed technology, as described hereinabove, the SRAM structure 100 has been simulated with the parameters as given in TABLE 1. When simulating Hold SNM it is sufficient to create the response surface as a function of the threshold voltages of the four core transistors PU1, PU2, PD1, PD2, since in this mode of operation the access transistors AC1, AC2 are off and will have no impact on the butterfly curves. By contrast, for Read SNM calculation, where the Word Lines WL are high, the access transistors AC1, AC2 are on and do impact the butterfly curves. Here the response surface has to be calculated as a function of the threshold voltages of all six transistors PU1, PU2, PD1, PD2, AC1, AC2. Furthermore, margins that assess the writability of the cell, for instance Write Trip Point (WTP) analysis, also require calculating the response surface as a function of the threshold voltages of all six transistors.
(78) Reducing the dimensionality of the problem using sensitivity analysis is desirable, since the number of simulation points (i.e. the finite set of circuit or device parameter points) to create the response surface increases rapidly for higher dimensions. Specifically for SRAM analysis a significant reduction can be achieved by exploiting the symmetry of the cell as illustrated in
(79)
(80) This allows simulation of one side of the SRAM cell instead of two for function ƒ(V, V.sub.TH,PU, V.sub.TH,PD, V.sub.TH,AC) and reconstruction of the response surface using equation (13). This reduces the 6-dimensional problem of the experiment to a 3-dimensional problem.
(81) Shown in
(82) Since the SNM is defined as the minimum of the two Seevinck squares that can be fit inside the butterfly curve, it will at some point collapse to zero. This will happen abruptly when the two voltage transfer curves stop overlapping. Consequently, the SNM response surface has a discontinuity at SNM=0 which will propagate to the SNM distribution. However, discontinuities of the response surface are inherently handled by the method, which is crucial for numerous circuit output performance parameters. Shown in
(83)
(84) Using the method according to embodiments of the disclosed technology, fast sensitivity analysis for worst case −6 quantile can be performed for supply voltage lowering and as a function of stress time using equation (4), as shown in
(85) In accordance with further embodiments of the disclosed technology, the probabilities of occurrence of the circuit or device parameter points are time dependent. Simulating a probability of occurrence of a value of a performance metric of an electronic circuit, for instance such simulations as described in more detail above, can be applied for a sequence of time instances in a potentially very long time period, e.g., over a time period as long as a few minutes to a few hours. The time period over which the simulations can be carried out can be at least 10.sup.8 clock cycles, for instance 10.sup.12 clock cycles or more. The method then includes repeating the steps of determining the probability of occurrence of the circuit or device parameter points, and determining, for a predetermined value of the performance metric, the total probability of occurrence.
(86) Each simulation, i.e. each evaluation of the response value of the performance metric is dependent on the active workload (value of input signals) applied to the electronic circuit at that time instance. Each workload signal can be represented by a waveform over time, as for instance illustrated in
(87) In accordance with embodiments of the disclosed technology, a waveform representation called compact digital waveform (CDW) can be used. Being of tunable accuracy, this format exploits optimally the accuracy versus complexity tradeoff. It can be used for fast BTI simulations.
(88) For ease of explanation, a transient waveform is assumed, which in the most detailed case is represented as a SPICE signal. SPICE is considered the industry standard for IC simulation; in its transient simulation mode, signals are adaptively discretized across time. It can be assumed that a SPICE signal is the most accurate signal representation.
(89) In embodiments of the disclosed technology, it is proposed to group consecutive signal regions that feature similar frequency f and duty cycle α and that occupy a duration Δt, at particular environmental conditions such as supply voltage V.sub.DD and temperature T. Each region of the signal that adheres to this property is represented by a single CDW point with coordinates (f,α,Δt). To derive these points, the initial signal is scanned, which for the sake of simplicity is represented by a Value Change Dump (VCD) equivalent. Such VCD representation creates an abstraction of voltage signals to logic levels.
(90) It will be assumed that the VCD format includes only the high (V.sub.dd) and low (V.sub.ss) levels, although the VCD standard does support multiple V.sub.dd representations. The input can thus be perceived as a succession of low and high voltage levels, e.g., a sequence of AC pulses that have, in the general case, varying f and α values. As the digital signal is scanned, a buffer accumulates pulses that satisfy Equation (14):
|f.sub.pulse−f.sub.buffer|<ε.sub.f ∧ |α.sub.pulse−α.sub.buffer|<ε.sub.α (14)
(91) When a digital pulse is encountered that cannot be appended (i.e., does not satisfy Equation (14)), the buffer is flushed to a single CDW point and re-initialized to receive the next pulses. ε.sub.f (Hz) and ε.sub.α (p.u.) are user-defined error margins. These margins enable the tunable accuracy of the CDW approximation. The crudest approximation is achieved with ε.sub.f→+∞ and ε.sub.α→1, since all digital pulses are then appended to the buffer. Conversely, the finest approximation is obtained with ε.sub.f→0 and ε.sub.α→0. With this configuration, each individual AC pulse constitutes a single CDW point, in the worst case. Using the CDW format, any signal may be represented by a series of periodic activities, each having a period, a duty cycle and a duration over which the activity is periodically repeated. A method according to embodiments of the disclosed technology may include such grouping of a series of distinct periodic activities with similar period and duty cycle by temporally distinguishing such regions.
(92) The values f.sub.buffer and α.sub.buffer are defined in Equation (15) for a buffer containing n pulses and are computed every time a new digital AC pulse is appended to the buffer
(93)
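By way of a non-limiting illustration, the buffer scan governed by Equation (14) might be sketched as follows. The sketch assumes that f.sub.buffer and α.sub.buffer are plain arithmetic means over the pulses currently in the buffer (consistent with the claims, in which the first and second functions may be averages) and that the buffer duration is the sum of the pulse durations. The names `CDWPoint` and `compress_to_cdw` and the tuple layout of a pulse are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple

# A pulse is (frequency [Hz], duty cycle [p.u.], duration [s]); layout assumed.
Pulse = Tuple[float, float, float]

@dataclass
class CDWPoint:
    f: float       # buffer frequency [Hz]
    alpha: float   # buffer duty cycle [p.u.]
    dt: float      # buffer duration [s]

def compress_to_cdw(pulses: Iterable[Pulse], eps_f: float, eps_alpha: float) -> List[CDWPoint]:
    """Scan pulses, append to the buffer while Equation (14) holds, else flush."""
    points: List[CDWPoint] = []
    buf: List[Pulse] = []
    f_buf = a_buf = 0.0
    for pulse in pulses:
        f, a, _ = pulse
        fits = abs(f - f_buf) < eps_f and abs(a - a_buf) < eps_alpha
        if buf and not fits:
            # Flush the buffer to a single CDW point and re-initialize it.
            points.append(CDWPoint(f_buf, a_buf, sum(p[2] for p in buf)))
            buf = []
        buf.append(pulse)
        # Recompute the buffer values every time a pulse is appended
        # (arithmetic mean is an assumption of this sketch).
        f_buf = sum(p[0] for p in buf) / len(buf)
        a_buf = sum(p[1] for p in buf) / len(buf)
    if buf:
        points.append(CDWPoint(f_buf, a_buf, sum(p[2] for p in buf)))
    return points

if __name__ == "__main__":
    demo = [(1e9, 0.5, 1e-9), (1.05e9, 0.52, 1e-9), (2e9, 0.5, 0.5e-9), (2e9, 0.5, 0.5e-9)]
    for point in compress_to_cdw(demo, eps_f=1e8, eps_alpha=0.1):
        print(point)
```

With both margins set to zero the strict inequality never holds, so every pulse becomes its own CDW point; with an infinite frequency margin and a duty cycle margin above 1, the whole sequence collapses into a single point.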
(94) The concept of the waveform approximation according to embodiments of the disclosed technology, as explained above, has been implemented in a tool, which produces CDW representations given SPICE or VCD waveforms. The functionality of this tool can be seen in the flowchart of method 1700 in
(95) In decision block 1710, method 1700 identifies SPICE, VCD, and CDW formats. Method 1700 routes SPICE waveforms from block 1710 to block 1720, which converts SPICE waveforms to VCD waveforms, given V.sub.dd and V.sub.ss margins from block 1740. Method 1700 routes VCD waveforms from blocks 1710 and 1720 to block 1730, which converts VCD waveforms to CDW representations, given ε.sub.f and ε.sub.α from data block 1750. Data block 1760 illustrates CDW representations from decision block 1710 or block 1730.
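By way of a non-limiting illustration, the abstraction of an analog waveform to logic levels, and the extraction of pulses from the resulting value changes, might be sketched as follows. The thresholding rule (a sample within the margin of V.sub.dd is high, within the margin of V.sub.ss is low, and otherwise the last level is held), the definition of a pulse as one high phase followed by one low phase, and all function names are assumptions of this sketch rather than the behavior of the tool described above.

```python
def to_logic_events(samples, v_dd, v_ss, margin):
    """Return [(time, level)] value changes; level is 1 near V_dd, 0 near V_ss."""
    events, level = [], None
    for t, v in samples:
        if v >= v_dd - margin:
            new = 1
        elif v <= v_ss + margin:
            new = 0
        else:
            new = level            # inside the transition band: hold the last level
        if new is not None and new != level:
            events.append((t, new))
            level = new
    return events

def events_to_pulses(events, t_end):
    """Turn value changes into (frequency, duty cycle, duration) pulses.

    A pulse is assumed to start at a rising edge and to end at the next
    rising edge (or at t_end); time before the first rising edge is ignored.
    """
    rises = [t for t, lvl in events if lvl == 1]
    falls = [t for t, lvl in events if lvl == 0]
    pulses = []
    for k, t_rise in enumerate(rises):
        t_next = rises[k + 1] if k + 1 < len(rises) else t_end
        t_fall = next((t for t in falls if t_rise < t <= t_next), t_next)
        period = t_next - t_rise
        if period > 0:
            pulses.append((1.0 / period, (t_fall - t_rise) / period, period))
    return pulses

if __name__ == "__main__":
    wave = [(0, 0.0), (1, 1.0), (2, 0.5), (3, 0.0), (5, 1.0), (6, 0.0)]
    ev = to_logic_events(wave, v_dd=1.0, v_ss=0.0, margin=0.2)
    print(ev)
    print(events_to_pulses(ev, t_end=7))
```

The pulse tuples produced in this way have the (f, α, Δt) layout on which a subsequent grouping into CDW points, given ε.sub.f and ε.sub.α, could operate.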
(96) A qualitative example of the produced CDWs can be seen in
e.sub.s(t)=V.sub.SPICE(t)−V.sub.CDW(t) (16)
and the mean (
(97) Apart from the accuracy of the CDW representation according to embodiments of the disclosed technology from a signal integrity point of view, it is relevant to consider the achieved compression in the number of points it uses for representation.
(98) The following explores the impact that aggressive CDW compression has on the accuracy of simulation modeling.
(99) Given the compression capabilities of a CDW according to embodiments of the disclosed technology, it is important to explore its usability for the target goal, namely to execute the electronic circuit simulation, for instance atomistic BTI/RTN simulation, over strides of V.sub.gs workload. As an example, in accordance with embodiments of the disclosed technology, a version of the atomistic BTI model will be altered and reformulated as follows: Starting from the premise of first order kinetics, the capture probability of a gate stack defect follows the ODE of Equation (17). The general solution returns the capture probability (P.sub.c), after n V.sub.gs pulses of specific frequency f and duty cycle α have been applied (equivalent duration Δt=n/f). The proposed model fully retains workload memory through the initial condition P.sub.c0. Parameters a and b are functions of f and α:
(100)
(101) The initial condition P.sub.c0 is absent from the previous version of the AC BTI model. Hereinafter, P.sub.c0≠0 will be assumed, thus introducing memory to the AC BTI model (i.e., workload dependency over time). Each iteration of the general solution of Equation (17) is used as initial condition for the next model evaluation. The tool implementing the compact digital waveform approximation, presented hereinabove, identifies regions of similar frequency f and duty cycle α in the V.sub.gs (i.e., a CDW representation is produced). Then, at each (f,α,Δt) point, a single iteration of Equation (17) is evaluated, representing the impact of the entire duration of that CDW segment. The latter can potentially be very long, encompassing a huge amount of individual time points. In this way, the CDW representation according to embodiments of the disclosed technology reduces aggressively the number of model iterations required for the calculation of P.sub.c. That way, time-dependent circuit simulation, for instance BTI simulation, becomes computationally feasible over long time spans (see
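By way of a non-limiting illustration, the chaining of one model evaluation per CDW point, with each result serving as the initial condition P.sub.c0 of the next segment, might be sketched as follows. The sketch assumes a generic first-order linear ODE of the form dP.sub.c/dt = a − b·P.sub.c, whose closed-form solution is used for each segment; this form, the toy dependence of a and b on the duty cycle, the time constants and all function names are assumptions of this sketch and are not taken from Equation (17).

```python
import math

def step_capture_probability(p_c0, a, b, dt):
    """Closed-form solution of dP/dt = a - b*P after time dt, starting at p_c0."""
    p_inf = a / b                               # steady-state occupancy
    return p_inf + (p_c0 - p_inf) * math.exp(-b * dt)

def capture_probability(cdw_points, rates, p_c0=0.0):
    """Evaluate the model once per CDW point (f, alpha, dt), keeping memory."""
    p_c = p_c0
    for f, alpha, dt in cdw_points:
        a, b = rates(f, alpha)                  # a and b depend on f and alpha
        p_c = step_capture_probability(p_c, a, b, dt)   # seeds the next segment
    return p_c

def toy_rates(f, alpha, tau_c=1e-6, tau_e=1e-5):
    """Toy rate model (assumption): duty-cycle-weighted capture and emission.

    The frequency f is accepted for interface completeness but unused here.
    """
    a = alpha / tau_c
    b = a + (1.0 - alpha) / tau_e
    return a, b

if __name__ == "__main__":
    workload = [(1e9, 0.5, 1e-6), (2e9, 0.1, 5e-6), (1e9, 0.9, 2e-6)]
    print(capture_probability(workload, toy_rates))
```

Under these assumptions the cost of the evaluation scales with the number of CDW points rather than with the number of individual pulses, which is the property that makes long time spans tractable.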
(102) A method according to embodiments of the disclosed technology then only has to be applied to the CDW segments or primitives and not to the individual time instances. Or stated in other words: the number of simulations to be carried out can be reduced, thus improving (reducing) the time required to simulate the electronic circuit.
(103) In what follows, as an example, BTI evaluation is performed using CDW in accordance with embodiments of the disclosed technology.
(104) The accuracy, from a signal integrity point of view, of the concept of the waveform approximation according to embodiments of the disclosed technology, has been assessed hereinabove. Given the model of Equation (17), the accuracy of a CDW approximation in terms of evaluation of capture probability Pc will now be evaluated.
(105) For experiments, 100 random bit streams with an average frequency (f.sub.avg) of 1 GHz and a duration of 1 μs have been used. Given each signal, an error metric for BTI evaluation is defined, according to Equation (18):
e.sub.p=P.sub.c,SPICE−P.sub.c,CDW (18)
where P.sub.c,SPICE is the capture probability for a defect at the end of the signal based on SPICE evaluation (e.g., using Equation (19)). P.sub.c,CDW is a similar capture probability, using a CDW file (for a specific pair of ε.sub.f and ε.sub.α) and the model of Equation (17).
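As an illustration of how the error metric of Equation (18) may be aggregated over a set of signals, the following Python sketch computes the mean, standard deviation and maximum of e.sub.p. The function name `error_stats` and the numeric values are hypothetical, and the use of the population standard deviation is an assumption, since the text does not specify the estimator.

```python
from statistics import mean, pstdev

def error_stats(p_spice, p_cdw):
    """Aggregate e_p = P_c,SPICE - P_c,CDW (Equation (18)) over a set of signals."""
    errors = [s - c for s, c in zip(p_spice, p_cdw)]
    return {
        "mean": mean(errors),
        "std": pstdev(errors),  # population standard deviation (assumption)
        "max": max(errors),
    }

# Made-up capture probabilities for three signals, for illustration only.
stats = error_stats([0.50, 0.40, 0.30], [0.45, 0.40, 0.28])
```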
(106)
(107) Equation (19) is the core of a transient workload dependent implementation of the atomistic BTI model as described for instance by B. Kaczer et al. in “Atomistic approach to variability of bias-temperature instability in circuit simulations”, in Proc. IEEE IRPS, 2011, pp. XT.3.1-XT.3.5.
(108) Five different cases are used, ranging from “fast” (τ.sub.cH=10.sup.−11 s) to “slow” (τ.sub.cH=10.sup.−3 s) defects. That way, the mean, standard deviation and maximum of e.sub.p across the 100 signals can be calculated for various values of (ε.sub.f, ε.sub.α) and for the five different defects (
(109) A direct correlation exists between how “fast” a defect is and the average frequency of the V.sub.gs bit stream (f.sub.avg). More specifically, the defects that are significantly “slower” in comparison to the average frequency of the bit stream are accurately modeled, regardless of the compression imposed by the concept of the waveform approximation according to embodiments of the disclosed technology (
(110) The relation between the average frequency f.sub.avg of the bit stream and the defect time constants has been identified earlier; however, no comprehensive study has been performed for the accuracy limitations of the AC BTI/RTN model. Given that a version of this model, enhanced with workload memory, is the kernel of the simulation methodology according to particular embodiments of the disclosed technology, its accuracy limitations for each model iteration can be exhaustively explored. More specifically, it is important to identify for which defects and under which conditions the accuracy of a model according to embodiments of the disclosed technology is acceptable.
(111) In order to conclude on the accuracy limitations of an approach according to embodiments of the disclosed technology, first a set of 400 different defects is created, with time constants (τ.sub.cH, τ.sub.cL, τ.sub.eH, τ.sub.eL) uniformly distributed in the logarithmic scale. The defects are arranged in a two dimensional plane according to min {τ.sub.cH, τ.sub.cL} and min {τ.sub.cH, τ.sub.eL}.
(112) Three sets of bit streams are used, with average frequency f.sub.avg equal to 1 GHz, 100 MHz and 10 MHz, respectively. Each set contains 100 signals, each one with a duration of 1 μs. Each signal is CDW-approximated with maximum compression (ε.sub.f→+∞ and ε.sub.α→1) and e.sub.p is calculated according to Equation (18). That way, μ{e.sub.p}, σ{e.sub.p} and max{e.sub.p} can be calculated for each defect and for each of the selected average frequencies f.sub.avg. The results are shown in
(113) For all average frequency values f.sub.avg, a subset of defects can always be found that cannot be accurately modelled. As the average frequency f.sub.avg is increasing, this subset is reduced and an acceptable accuracy of the model according to embodiments of the disclosed technology spreads to more defect time scales. For the case of f.sub.avg=1 GHz, the model according to embodiments of the disclosed technology can accurately simulate all defects that satisfy min {τ.sub.cH, τ.sub.cL}≥10.sup.−7 s and min {τ.sub.cH, τ.sub.eL}≥10.sup.−7 s.
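The defect population and the accuracy criterion quoted above for f.sub.avg=1 GHz can be sketched in Python as follows. The sampling range of the time constants (10.sup.−12 s to 10.sup.12 s) is an assumption borrowed from TABLE 2, since the range used for the 400 defects is not stated here; the names `sample_defects` and `is_accurately_modelled` are hypothetical. The two minima are taken exactly as quoted in the paragraph above.

```python
import random

NAMES = ("tau_cH", "tau_cL", "tau_eH", "tau_eL")

def sample_defects(n, lo=-12.0, hi=12.0, seed=0):
    """Draw n defects with time constants uniformly distributed on a log10 scale."""
    rng = random.Random(seed)
    return [{k: 10.0 ** rng.uniform(lo, hi) for k in NAMES} for _ in range(n)]

def is_accurately_modelled(defect, threshold=1e-7):
    """Criterion quoted for f_avg = 1 GHz: both minima must be at least 1e-7 s."""
    return (min(defect["tau_cH"], defect["tau_cL"]) >= threshold
            and min(defect["tau_cH"], defect["tau_eL"]) >= threshold)

defects = sample_defects(400)
slow = [d for d in defects if is_accurately_modelled(d)]
fast = [d for d in defects if not is_accurately_modelled(d)]
```

The `slow` subset corresponds to the defects that the compressed model is stated to handle accurately at this frequency; the `fast` subset is the remainder.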
(114) With dynamic voltage and frequency scaling (DVFS) being increasingly common among modern ICs, it is relevant to also include the impact of voltage bias on the BTI simulation. Furthermore, aggressive integration may also cause significant thermal gradients across an IC. As a result, the BTI simulator can also be aware of the local temperature conditions. It has been previously demonstrated that the time constants of BTI/RTN defects are affected by the operating bias (V.sub.dd) and temperature conditions. Based on state-of-the-art measurements, two orthogonal scaling rules of a time constant are proposed, in view of temperature and bias variations (see Equations (20) and (21)):
(115)
(116) These rules constitute a first order approximation of time constant scaling, as observed in measurements from real devices. It can be seen that in each case, the new time constant (τ) depends on its original value (τ.sub.0), as well as the original and new value of bias (V.sub.dd,0 and V.sub.dd) or temperature (T.sub.0 and T), respectively. The original temperature (T.sub.0) and bias (V.sub.dd,0) values refer to the conditions under which the time constant measurement was performed on a sample of devices from a specific technology node. The scaling factors (−2 for bias and −1/50 for temperature) are also technology dependent. An example of bias and temperature scaling (applied individually) appears in
(117) It is easy to prove that the scaling operators and
maintain a commutative composition, thus they can be applied in an arbitrary sequence. Equations (20) and (21) require an extension of the CDW representation in order to include bias and temperature information. Thus, before the occupancy of the defects is resolved, the time constants can be scaled appropriately (see, for example, the flowchart of method 2400 in
(118) It should be noted that temperature scaling affects all four time constants. Bias affects only the time constants that refer to a non-zero voltage, namely τ.sub.cH and τ.sub.eH for the nMOS and τ.sub.cL and τ.sub.eL for the pMOS.
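The scaling of time constants before defect occupancy is resolved can be sketched as follows. Equations (20) and (21) are not reproduced in this text, so the sketch assumes, purely for illustration, that each rule multiplies τ.sub.0 by a power of ten whose exponent is the stated scaling factor (−2 for bias, −1/50 for temperature) times the change in bias or temperature. That functional form, the units (volts and kelvin) and all function names are assumptions and not the disclosed equations.

```python
import math

def scale_bias(tau0, vdd, vdd0, k=-2.0):
    """Assumed form: tau = tau0 * 10**(k * (Vdd - Vdd0)); not Equation (20) itself."""
    return tau0 * 10.0 ** (k * (vdd - vdd0))

def scale_temp(tau0, temp, temp0, k=-1.0 / 50.0):
    """Assumed form: tau = tau0 * 10**(k * (T - T0)); not Equation (21) itself."""
    return tau0 * 10.0 ** (k * (temp - temp0))

# Bias affects only the time constants that refer to a non-zero voltage.
BIAS_SENSITIVE = {"nMOS": ("tau_cH", "tau_eH"), "pMOS": ("tau_cL", "tau_eL")}

def scale_defect(taus, device, vdd, vdd0, temp, temp0):
    """Apply temperature scaling to all four time constants and bias scaling
    only to the bias-sensitive ones of the given device type."""
    scaled = {}
    for name, tau in taus.items():
        tau = scale_temp(tau, temp, temp0)
        if name in BIAS_SENSITIVE[device]:
            tau = scale_bias(tau, vdd, vdd0)
        scaled[name] = tau
    return scaled

# Order of application does not matter for multiplicative rules (up to rounding).
a = scale_temp(scale_bias(1e-3, 1.1, 1.0), 350.0, 300.0)
b = scale_bias(scale_temp(1e-3, 350.0, 300.0), 1.1, 1.0)
```

Because both assumed rules are multiplicative in τ.sub.0, `a` and `b` agree up to floating-point rounding, which mirrors the commutative composition mentioned above.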
(119)
(120) From the results of the accuracy assessment of BTI evaluation using CDW, it can be seen that, given the time constants of a defect 2520 and the workload of the respective device 2510, it can be identified whether this defect is suitable for pseudo-transient simulation 2550 (see
(121) If that is the case, V.sub.gs intervals can be stridden over using a CDW approximation 2540 in accordance with embodiments of the disclosed technology. This course of action (pseudo-transient simulation 2550) involves a few iterations of the proposed BTI/RTN model and comes at a very small CPU overhead. If the defect is too “fast” for the pseudo-transient model, a more detailed simulation 2570 (with a SPICE or VCD waveform 2560) is deemed necessary. However, its duration need not cover the entire device lifetime: the impact of fast defects can be seen within a small time interval. For the subset of the “fast” defects, a margin of V.sub.th short-term variation can be derived that can be superimposed on the long-term BTI/RTN impact derived by the pseudo-transient model. These impacts can be combined in block 2580, resulting in graph 2590 showing long-term impact and short-term variations.
(122) In order to verify the aspects of this methodology, it has been tested on a single device. First, this device has been populated with defects and these have been arranged in the min {τ.sub.cH, τ.sub.cL} versus min{τ.sub.eH, τ.sub.eL} plane (
(123) The device workloads presented above are artificial. However, they can be easily derived from industry standard tools when handling realistic circuits and applications. A brief SPICE simulation of the target circuit can produce a representative workload for the “fast” defect simulation. Duty cycle (α) values for different circuit operation modes can be easily derived using timing analysis tools. The above methodology has no inter-device dependencies and can be generalized to a larger device inventory. This data parallelism is highly compatible with the multi-/many-core paradigm observed in the computing infrastructure of modern design houses.
(124) Given that the CDW signal representation according to embodiments of the disclosed technology leads to an acceleration of electronic circuit simulation, e.g., BTI simulation, across device lifetime, it is good to reflect on the usability of the proposed framework for realistic circuits. As an example, the accurate estimation of functional yield of a circuit over three years of operation is explored.
(125) A subset of an embedded static random access memory (SRAM) is used, isolating a representative path, from a memory cell to the SRAM's input/output ports. This exemplary circuit contains 63 devices and it supports reading and writing of the cell. A sample of this design is defined as functionally correct if a zero value can be written and then read successfully (
(126) The evolution of V.sub.th is perceived for each device according to Equation (22):
V.sub.th(t)=V.sub.th,TZ+ΔV.sub.th,TD(t), where: V.sub.th,TZ˜Norm(μ.sub.0,σ.sub.0) and ΔV.sub.th,TD˜AtomisticModel. (22)
(127) V.sub.th,TZ represents the time-zero component of V.sub.th fluctuations, whereas ΔV.sub.th,TD represents the time-dependent counterpart. The former component is constant throughout the device lifetime but varies across device samples. In the context of the current invention, the latter component is attributed solely to BTI. ΔV.sub.th,TD is a function of time and varies across different device samples. The atomistic model for ΔV.sub.th,TD is configured according to TABLE 2 and is solved in a pseudo-transient way. TABLE 2 lists atomistic model configuration parameters for yield analysis of the target SRAM circuit, according to BTI/RTN measurement as disclosed by M. Toledano-Luque et al. in “Response of a single trap to AC negative bias temperature stress”, Proc. IEEE IRPS, 2011, pp. 4A.2.1-4A.2.8, and the 90 nm predictive technology model disclosed at http://ptm.asu.edu/.
(128) TABLE-US-00002 TABLE 2
Parameter                     Distribution
V.sub.th,TZ
  Time-Zero pFET              V.sub.th,TZ~Norm (−0.339 V, −0.033 V)
  Time-Zero nFET              V.sub.th,TZ~Norm (0.397 V, 0.04 V)
ΔV.sub.th,TD (t)
  Defects per pFET            N.sub.p~Pois (λ = 10.sup.11 × Area[cm.sup.2])
  Defects per nFET            N.sub.n~Pois (λ = 6.7 × 10.sup.10 × Area[cm.sup.2])
  Time Const.                 log.sub.10 {τ*.sub.pV}~Unif (a = −12, b = 12)
  ΔV.sub.th per Defect (mV)   ΔV.sub.th~Exp (η = 5), for pFETs and nFETs alike
*p: process, either capture (c) or emission (e); V: voltage, either high (H) or low (L)
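The initialization of one device sample according to Equation (22) and the nFET rows of TABLE 2 can be sketched in Python as follows. The device area passed in the usage example, the reading of η as the mean of the exponential distribution (in mV), and the names `poisson`, `sample_nfet` and `vth` are assumptions made for illustration; the occupancy of each defect would be updated by the pseudo-transient model, which is not shown here.

```python
import math
import random

def poisson(rng, lam):
    """Knuth's multiplication method for a Poisson draw; adequate for moderate lam."""
    limit, k, p = math.exp(-lam), 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1

def sample_nfet(rng, area_cm2):
    """Draw one nFET sample using the nFET rows of TABLE 2."""
    vth_tz = rng.gauss(0.397, 0.04)              # time-zero V_th in volts
    n_defects = poisson(rng, 6.7e10 * area_cm2)  # defects per device
    defects = []
    for _ in range(n_defects):
        taus = {k: 10.0 ** rng.uniform(-12.0, 12.0)
                for k in ("tau_cH", "tau_cL", "tau_eH", "tau_eL")}
        dvth_mv = rng.expovariate(1.0 / 5.0)     # mean 5 mV (assumed reading of eta)
        defects.append({"taus": taus, "dvth_mv": dvth_mv, "occupied": False})
    return {"vth_tz": vth_tz, "defects": defects}

def vth(device):
    """Equation (22): time-zero component plus the shift of the occupied defects."""
    shift_mv = sum(d["dvth_mv"] for d in device["defects"] if d["occupied"])
    return device["vth_tz"] + shift_mv / 1000.0

rng = random.Random(1)
device = sample_nfet(rng, area_cm2=9e-10)  # hypothetical device area
```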
(129) To emphasize the workload memory/dependency of the proposed flow, five different cases are assumed in terms of voltage supply for the test case circuit (
(130) The functional yield of the circuit is estimated for all five V.sub.dd cases and the 95% confidence interval is also provided based on 150 Monte Carlo iterations for each case. The execution time has been measured at roughly 3 hours of processing per 150 Monte Carlo iterations on a desktop computer with an Intel Core 2 Quad processor operating at 2.66 GHz. Each iteration starts with initialization of V.sub.th,TZ (from the normal distribution) and of the gate stack defects (based on the atomistic model). At each time step, ΔV.sub.th,TD is evaluated per device and the circuit's functionality is tested.
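A functional yield estimate with its 95% confidence interval can be derived from the Monte Carlo pass count as sketched below. The text does not state which interval estimator was used; the normal-approximation (Wald) interval for a binomial proportion is assumed here, and the function name `yield_with_ci` and the pass count in the usage example are hypothetical.

```python
import math

def yield_with_ci(passes, iterations, z=1.96):
    """Yield estimate and normal-approximation confidence interval (z=1.96 for 95%)."""
    y = passes / iterations
    half_width = z * math.sqrt(y * (1.0 - y) / iterations)
    return y, max(0.0, y - half_width), min(1.0, y + half_width)

# Illustrative only: 135 functionally correct samples out of 150 iterations.
estimate, lower, upper = yield_with_ci(135, 150)
```

With only 150 iterations the interval is fairly wide, so the normal approximation should be treated with care when the yield is close to 0 or 1.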
(131) For the sake of brevity, the impact of the “fast” defects on the functional yield analysis has been ignored. Transient BTI/RTN simulations of these defects are covered by previous work and have been omitted here, since their impact can be incorporated into the time-zero (and ever present) variations of V.sub.th by adding a “safety” margin. Significant work exists in experimentally delivering yield and failure metrics for SRAMs, strictly due to “fast” defect activity. However, to the best of the inventors' knowledge, a time- and workload-dependent yield analysis of memory circuits in accordance with embodiments of the disclosed technology has never before been performed with the atomistic BTI model for the entire circuit lifetime.
(132) The yield estimations of
(133) The proposed pseudo-transient, atomistic simulation scheme for BTI according to embodiments of the disclosed technology is a major enabler for a thorough yield analysis. By retaining the time dependency of the atomistic model, the evolution of functional yield can be monitored throughout the circuit lifetime. Based on the model's workload dependency, dynamic alterations to the circuit's workload can be tested and their impact on the functional yield can be observed. Such capabilities are very welcome for the design of reliable ICs, especially at aggressively downscaled technologies.
(134) The previous description of the disclosed examples is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these examples will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other examples without departing from the spirit or scope of the invention. As will be recognized, certain embodiments of the invention described herein can be embodied within a form that does not provide all of the features and benefits set forth herein, as some features can be used or practiced separately from others. The scope of the invention disclosed herein is indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope. Thus, the present invention is not intended to be limited to the examples shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.