SYSTEM AND METHODS OF REDUCING WIDEBAND SERIES RESONANT CLOCK SKEW
20240338053 ยท 2024-10-10
Inventors
- Riadul Islam (Windsor Mill, MD, US)
- Dhandeep Challagundla (Baltimore, MD, US)
- Ignatius Bezzam (LOS ALTOS, CA, US)
Cpc classification
H03K3/012
ELECTRICITY
International classification
H03K19/00
ELECTRICITY
Abstract
A resonant clocking system for reducing wideband series resonant clock skew is provided. The resonant clocking system includes at least one Pulsed Series Resonance (PSR) driver, at least one clock gater, and at least one clock buffer. The PSR driver is connected with at least one on-chip inductor. The PSR driver receives a boosted-amplitude pulsed signal V.sub.SR that is generated using a matching pulse generator and at least one on-chip inductor connected with the at least one Pulsed Series Resonance (PSR) driver resonates with a capacitance of the resonant clocking system to generate a pulse signal R.sub.CLK. The at least one clock gater and the at least one clock buffer propagate the pulse signal R.sub.CLK to clock pins of resonant flip-flops. An inductance value of the at least one on-chip inductor is matched with a load capacitance of a corresponding branch capacitance using an inductor tuning technique to obtain equal frequency signals in all clock branches, thereby storing dissipated energy in a form of a magnetic field in the at least one on-chip inductor and reducing wideband series resonant clock skew.
Claims
1. A resonant clocking system for reducing wideband series resonant clock skew, comprising: at least one Pulsed Series Resonance (PSR) driver that receives a boosted-amplitude pulsed signal V.sub.SR from a matching pulse generator; at least one on-chip inductor connected with the at least one Pulsed Series Resonance (PSR) driver resonates with a capacitance of the resonant clocking system to generate a pulse signal R.sub.CLK; at least one clock gater and at least one clock buffer that receive the pulse signal R.sub.CLK from the at least one PSR driver and propagate the pulse signal R.sub.CLK to clock pins of a resonant pulsed flip-flop, wherein an inductance value of the at least one on-chip inductor is matched with a load capacitance of a corresponding branch capacitance using an inductor tuning technique to obtain equal frequency signals in all clock branches, thereby storing dissipated energy in a form of a magnetic field in the at least one on-chip inductor and reducing wideband series resonant clock skew.
2. The resonant clocking system of claim 1, further comprises a dual-rail booster that is integrated with the matching pulse generator to generate the boosted amplitude signal using a series resonance by matching a shared inductor L.sub.2 with a load capacitance C.sub.2.
3. The resonant clocking system of claim 1, wherein the resonant pulsed flip-flop comprises a transistor M2 and a transistor M3, wherein input data is inverted and provided to the transistors M2 and M3, and drain of the transistors M2 and M3 are connected to storage cells for storing logic 1 or logic 0.
4. The resonant clocking system of claim 1, wherein the resonant pulsed flip-flop further comprises a transistor M4 and a transistor M5 controlled by a reset signal, wherein when the reset signal is low, the transistor M4 is turned on and the transistor M5 is turned off, resulting in a logic 1 at node SB and writing a logic 0 to an output Q.
5. The resonant clocking system of claim 1, wherein the resonant pulsed flip-flop further comprises a clock signal P.sub.clk, wherein when the clock signal P.sub.clk is 0, a transistor M1 is turned off, maintaining the resonant pulsed flip-flop in a hold or a retained state with unaltered values of nodes S and SB.
6. The resonant clocking system of claim 1, wherein when the input data is 1 and the clock signal P.sub.clk is 1, the transistors M2 and M1 are turned on, connecting node SB to ground, discharging the node, and resulting in a logic 1 at node Q, thereby writing a 1 into the at least one pulsed register.
7. The resonant clocking system of claim 1, wherein when the input data is 0 and the clock signal Pclk is 1, the transistors M3 and M1 are turned on, writing a logic 0 at node Q.
8. The resonant clocking system of claim 1, wherein the at least one PSR driver generates a clock pulse using the dissipated energy stored in the at least one on-chip inductor to drive clock input pins of the at least one pulsed register.
9. The resonant clocking system of claim 1, wherein the dissipated energy stored in the at least one on-chip inductor is recycled in a next rising clock edge.
10. The resonant clocking system of claim 1, wherein the pulse signal (R.sub.CLK) is a rail-to-rail swing signal.
11. The resonant clocking system of claim 1, wherein the output signal (R.sub.CLK) of the at least one PSR driver is inverted using the at least one clock gater and the at least one clock buffer and employed as the clock input signal (P.sub.clk) for the at least one pulsed register.
12. The resonant clocking system of claim 1, wherein the wideband series resonant clock skew is reduced at runtime by dynamically adjusting a resonant pulse width T.sub.d to compensate for capacitance and inductance mismatch induced by at least one of an open circuit voltage (OCV) and process variations; and finely adjusting equivalent inductance with granularity using a knob, thereby reducing the wideband series resonant clock skew in runtime.
13. A resonant clock mesh system for reducing wideband series resonant clock skew, comprising: at least one Pulsed Series Resonance (PSR) driver that receives a boosted-amplitude pulsed signal V.sub.SR from a matching pulse generator; at least one on-chip inductor connected with the at least one Pulsed Series Resonance (PSR) driver resonates with a capacitance of the resonant clocking system to generate a pulse signal R.sub.CLK; a clock mesh that is connected with the at least one Pulsed Series Resonance (PSR) driver; at least one clock gater and at least one clock buffer that receive the pulse signal R.sub.CLK from the at least one PSR driver through the clock mesh and propagate the pulse signal R.sub.CLK to clock pins of a resonant pulsed flip-flop, wherein an inductance value of the at least one on-chip inductor is matched with a load capacitance of the clock mesh, thereby storing dissipated energy in a form of a magnetic field in the at least one on-chip inductor and reducing wideband series resonant clock skew.
14. The resonant clock mesh system of claim 13, wherein the load capacitance is determined by accumulating a local mesh capacitance (C.sub.mLocal) and corresponding branch capacitances (C.sub.lLocal).
15. The resonant clock mesh system of claim 13, wherein the resonant pulsed flip-flop comprises a transistor M2 and a transistor M3, wherein input data is inverted and provided to the transistors M2 and M3, and drain of the transistors M2 and M3 are connected to storage cells for storing logic 1 or logic 0.
16. The resonant clock mesh system of claim 13, wherein the resonant pulsed flip-flop further comprises a transistor M4 and a transistor M5 that are controlled by a reset signal, wherein when the reset signal is low, the transistor M4 is turned on and the transistor M5 is turned off, resulting in a logic 1 at node SB and writing a logic 0 to an output Q.
17. The resonant clock mesh system of claim 13, wherein the resonant pulsed flip-flop further comprises a clock signal P.sub.clk, wherein when the clock signal P.sub.clk is 0, a transistor M1 is turned off, maintaining the resonant pulsed flip-flop in a hold or a retained state with unaltered values of nodes S and SB.
18. The resonant clock mesh system of claim 13, wherein when the input data is 1 and the clock signal P.sub.clk is 1, the transistors M2 and M1 are turned on, connecting node SB to ground, discharging the node, and resulting in a logic 1 at node Q, thereby writing a 1 into the at least one pulsed register.
19. The resonant clock mesh system of claim 13, wherein when the input data is 0 and the clock signal Pclk is 1, the transistors M3 and M1 are turned on, writing a logic 0 at node Q.
20. A method for reducing wideband series resonant clock skew, comprising: receiving, using at least one Pulsed Series Resonance (PSR) driver, a boosted-amplitude pulsed signal V.sub.SR from a matching pulse generator; generating a pulse signal R.sub.CLK using at least one on-chip inductor connected with the at least one Pulsed Series Resonance (PSR) driver resonates with a capacitance of a resonant clocking system; receiving, using at least one clock gater and at least one clock buffer, the pulse signal R.sub.CLK from the at least one PSR driver and propagate the pulse signal R.sub.CLK to clock pins of a resonant flip-flop; and matching an inductance value of the at least one on-chip inductor with a load capacitance of a corresponding branch capacitance using an inductor tuning technique to obtain equal frequency signals in all clock branches, thereby storing dissipated energy in a form of a magnetic field in the at least one on-chip inductor and reducing wideband series resonant clock skew.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0024] The embodiments herein will be better understood from the following detailed description with reference to the drawings, in which:
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
[0036] The embodiments herein and the various features and advantageous details thereof are explained more fully with reference to the non-limiting embodiments that are illustrated in the accompanying drawings and detailed in the following description. Descriptions of well-known components and processing techniques are omitted to not unnecessarily obscure the embodiments herein. The examples used herein are intended merely to facilitate an understanding of ways in which the embodiments herein may be practiced and to further enable those of skill in the art to practice the embodiments herein. Accordingly, the examples should not be construed as limiting the scope of the embodiments herein.
[0037] As mentioned, there remains a need for a resonant clocking system for reducing wideband series resonant clock skew. Referring now to the drawings, and more particularly to
Definitions
[0038] Skew: Skew is defined as the temporal variation in the arrival time of clock transition at two different locations.
[0039]
[0040] In some embodiments, a clock and a delayed signal are fed into an XNOR gate to generate the pulsed signal V.sub.SR at both clock edges with a pulse width T.sub.d. In some embodiments, a voltage doubler circuit is employed to invert a generated dual-edge triggered pulse resulting in the boosted signal V.sub.SR. The voltage doubler circuit may use a pulsed series resonance technique to generate the boosted amplitude pulsed signal V.sub.SR. When the V.sub.SR is low, PMOS transistors M1 and M3 are ON, and the inductor resonates with the load capacitance C.sub.2, which represents the total capacitance load of the one or more PSR drivers 106A-N. In some embodiments, the value of the series inductors is small for large load capacitances. The inductor in the voltage doubler circuit may be adjusted based on a load of the matching pulse generator 104 to generate boosted amplitude pulsed signal V.sub.SR. In some embodiments, the resonant clocking system 100 includes a dual-rail booster circuit 105 to reduce the power consumed by the voltage doubler by decreasing the resistance of a pull-up network.
[0041] The one or more clock garters 108A-N and the one or more clock buffers 110A-N receive the pulse signal R.sub.CLK from the one or more PSR drivers 106A-N connected with the one or more on-chip inductors (L.sub.SR1, L.sub.SR7, to L.sub.SR8) and propagate the pulse signal R.sub.CLK to clock pins of resonant flip-flops. An inductance value of the one or more on-chip inductors (L.sub.SR1, L.sub.SR7, to L.sub.SR8) is matched with a total capacitance of a corresponding branch capacitance (C.sub.SR1, (C.sub.SR7, to C.sub.SR8) using an inductor tuning technique to obtain equal frequency signals in all clock branches, thereby storing dissipated energy in a form of a magnetic field in the one or more on-chip inductors (L.sub.SR1, L.sub.SR7, to L.sub.SR8) and reducing wideband series resonant clock skew. In some embodiments, the inductance for each PSR driver is determined using
[0043]
[0044]
[0045]
[0046]
[0047] Table: 1 shows a 13TPFF exhibits a better setup and hold times than the industry-standard PSFF while consuming more dynamic power and area; however, it consumes lower static power than PSFF and enables power saving in overall clock architecture.
TABLE-US-00001 Static Dynamic Type of Normalized Delay (ps) power (pW) Power (?W) Register area tc-q ts th D = 0 D = 1 1 GHz 2 GHz 3 GHz 4 GHz 5 GHz MSFF 1 32.5 14 2 1550 593 8.3 14.1 21 28 35.1 PRFF 0.59 35.195 96 278 272 7.16 13.8 20.4 27.1 33.8 TSPCFF 0.84 41.992 93 283 664 12.3 20.2 28 35.9 43.7 13TPFF 1.75 37.325 60 501 538 16.2 31.1 46 61 76
[0048] Table: 2 shows resonant clock tree system using 14 nm and 7 nm technology nodes
TABLE-US-00002 Power consumed (mW) Type of Technology Skew 1 2 3 4 5 network node (ps) GHz GHz GHz GHz GHz PSFF 14 nm 51.1 30.8 60.6 89.2 116 138 network 7 nm 32.4 59.8 119 172 234 296 PRFF 14 nm 4.61 17.4 34.1 50.3 65.7 78.7 network 7 nm 3.95 32.5 64.9 92.4 127 160 TSPCFF 14 nm 2.05 22.2 43.6 64.6 84.4 102 network 7 nm 3.66 42.1 84.3 123 167 209 13TPFF 14 nm 3.92 23.8 46.8 69.4 90.7 110 network 7 nm 4.14 43.9 87.2 126 173 218
[0049] Power and skew comparisons of the clock tree system for frequencies ranging from 1 to 5 GHz are shown in above Table 2. The power and skew values of flip-flop networks by scaling frequency from 1 to 5 GHz depict consistent power savings and skew reduction while using a resonance technique.
[0050]
[0051] Moreover, the resonant clock mesh system 200 using TSPCFF saves 64% power while lowering the skew by 92%, and the resonant clock network using 13TPFF saves 59% power with 88% skew reduction.
[0052] The resonant clock mesh system 200 using PRFF also has 65% reduced power consumption with 83% reduced skew if the ISPD 2010 testbench circuit has more sink density with 1107 sinks. Also, the resonant clock mesh system 200 with TSPCFF saves 61% power and 91% skew, and the resonant clock mesh system 200 using 13TPFF reduces 57% power consumption with 89.5% skew reduction. The resonant clock mesh system 200 using PRFF saves 63% of the power while reducing the skew by 87% if the ISCAS89 testbench circuit has 179 clock sinks and 2779 logic gates. Moreover, the resonant clock mesh system 200 using TSPCFF has 61.7% reduced power consumption with 90% reduced skew, and the resonant clock network using 13TPFF has 57.6% reduced power consumption with 85% reduced skew. Table: 3 shows an analysis of the resonant clock mesh system 200 using 14 nm and 7 nm technology nodes.
TABLE-US-00003 Power consumed (mW) Type of Technology Skew 1 2 3 4 5 network Node (ps) GHz GHz GHz GHz GHz PSFF 14 nm 108 35.3 68.5 102 131 162 network 7 nm 74 74.4 151 221 294 368 PRFF 14 nm 10.3 19.2 38.1 56.5 74.2 88.7 network 7 nm 8.4 40.2 82.7 121 163 202 TSPCFF 14 nm 9.74 23.5 46.7 69.3 91.1 110 network 7 nm 7.94 49.8 102 149 195 242 13TPFF 14 nm 9.16 25.2 50.1 74.4 97.9 118 network 7 nm 8.12 54 110 164 215 267
[0053] The results obtained by scaling the frequency from 1 to 5 GHz support the resonant clock tree analysis showing similar power saving and reduction in skew even on a lower technology node.
[0054] Table: 4 shows the power and skew analysis of the resonant clocking system 100 resulting in consistent power savings and balanced skew on the ISPD 2009 s1r1, ISPD 2010 01. in, and ISCAS89 s5738 circuits; the PRFF network saves 64% power on average with 85% reduced skew at 1 GHz frequency.
TABLE-US-00004 Frequency (GHz) 1 GHz 2 GHz 3 GHz 4 GHz 5 GHz Type of Skew Power Skew Power Skew Power Skew Power Skew Power Benchmark network (ps) (mW) (ps) (mW) (ps) (mW) (ps) (mW) (ps) (mW) ISPD 2009 PSFF 29 6.65 33 12.7 31 18.62 37 25.4 34 30.25 s1r1 network PRFF 4.4 2.15 2.7 4.3 3.6 6.44 3.9 8.58 3.3 10.3 network TSPCFF 2.28 2.32 2.4 4.55 2.3 6.79 2.9 9.02 2.4 10.8 network 13TPFF 3.72 2.57 4.3 5.14 4.2 7.6 3.8 10.1 4.1 12.2 network ISPD 2010 PSFF 22.7 53.3 26.7 107 33.4 160 31.3 213 42.6 269 01. in network PRFF 4.25 18.4 6.37 35.7 6.65 54.3 5.2 70.9 5.73 88 network TSPCFF 2.1 20.3 3.32 39.2 2.92 59 4.22 75 3.7 92 network 13TPFF 12.4 23.4 3.4 44.7 3.1 67.4 3.5 89.2 4.1 105 network ISCAS89 PSFF 24.9 14.1 25.8 28.1 24.9 42.1 23.4 56.1 21.4 70.1 s5378 network PRFF 3.11 5.09 3.72 9.94 3.16 14.4 3.94 19.2 3.31 22.9 network TSPCFF 2.42 5.4 2.75 10.6 2.51 15.9 2.77 21.5 3.1 24.4 network 13TPFF 3.7 5.97 4.22 11.4 3.73 17.1 4.3 22.6 4.74 26.3 network
[0055]
[0056] While using the ISPD 2009 testbench circuit (s1r1 with 81 sinks), the resonant clocking system 100 using PRFF reduces the energy consumption by 66.2% on average compared to conventional clock architecture using PSFF across 1-5 GHz frequency, while the resonant clocking system 100 using 13TPFF saves 59.9% energy on average. While using the ISPD 2010 testbench circuit (01. in with 1107 sinks), the resonant clocking system 100 using TSPCFF saves 63.7% energy on average across 1-5 GHz frequency, while the resonant clocking system 100 using PRFF saves 66.4% energy compared to conventional clock architectures using PSFF. The resonant clocking system 100 using PRFF saves 65.4% average energy consumption on the ISCAS89 testbench circuit (s5378 with 179 sinks) while saving 59.7% average energy while using 13TPFF across 1-5 GHz clock frequency. The heat generated throughout the circuit can be estimated through the junction temperature (Tj) using the transient thermal resistance (?J A) for a pulse length tp, which can be represented as:
[0058] Table: 5 shows the resonant clocking system 100 has 67.2% lower energy and 7.2? C. lower junction temperature while using PRFF for ISPD 2010 01. in the circuit compared to conventional clock architectures at 5 GHz frequency; besides the 13TPFF has a 59.3% lower average energy consumption compared to conventional clock architectures using PSFF.
TABLE-US-00005 1 GHz 2 GHz 3 GHz 4 GHz 5 GHz Type of Energy Temp. Energy Temp. Energy Temp. Energy Temp. Energy Temp. Benchmark network (pJ) (? C.) (pJ) (? C.) (pJ) (? C.) (pJ) (? C.) (pJ) (? C.) ISPD 2009 PSFF 6.65 27.266 6.35 27.508 6.21 27.7448 6.35 28.016 6.05 28.21 s1r1 network PRFF 2.15 27.086 2.15 27.172 2.15 27.2576 2.15 27.3432 2.06 27.412 network TSPCFF 2.32 27.0928 2.28 27.182 2.26 27.2716 2.26 27.3608 2.16 27.432 network 13TPFF 2.57 27.1028 2.57 27.2056 2.53 27.304 2.53 27.404 2.44 27.488 network ISPD 2010 PSFF 53.30 29.132 53.50 31.28 53.33 33.4 53.25 35.52 53.80 37.76 01. in network PRFF 18.40 27.736 17.85 28.428 18.10 29.172 17.73 29.836 17.60 30.52 network TSPCFF 20.30 27.812 19.60 28.568 19.67 29.36 18.75 30 18.40 30.68 network 13TPFF 23.40 27.936 22.35 28.788 22.47 29.696 22.30 30.568 21.00 31.2 network ISCAS89 PSFF 14.10 27.564 14.05 28.124 14.03 28.684 14.03 29.244 14.02 29.804 s5378 network PRFF 5.09 27.2036 4.97 27.3976 4.80 27.576 4.80 27.768 4.58 27.916 network TSPCFF 5.40 27.216 5.30 27.424 5.30 27.636 5.38 27.86 4.88 27.976 network 13TPFF 5.97 27.2388 5.70 27.456 5.70 27.684 5.65 27.904 5.26 28.052 network
[0059] Table 5 compares the junction temperatures of the resonant clocking system 100 with the conventional clock architecture using PSFF at 5 GHz frequency with 27? C. ambient temperature (TA). The resonant clocking system 100 uses lower power than conventional clock distribution networks and thus produces lower heat than existing clock networks. For example, a 48-pin Ceramic Leadless Chip Carrier (CLCC) package has a thermal resistance ({circle around (1)} J A) of 40? C./W [33]. Considering a power difference of 0.181 W would result in a 7.2? C. higher temperature than the resonant clocking system 100 for ISPD 2010 01. in a circuit at 5 GHz frequency. For a typical SOC implementation with 10 sinks as ISPD 2010 01. in a circuit, a power difference of 1.81 W would result in a 72.4? C. higher temperature than the resonant clocking system 100. In a clock network, the variations in branch capacitances require different inductance values.
[0060]
[0061] Table: 6 shows a comparison of the PRFF network with previous low-power clocking techniques depicting an average skew reduction of 78% in the ISPD 2009 s1r1 benchmark and 80.8% while using ISPD 2010 01. in benchmarks.
TABLE-US-00006 Technology Power Skew Benchmark Methodology node Frequency (mW) (ps) ISPD 2009 ROCKS 45 nm 1 GHz 71.1 20 s1r1 LARCS 45 nm 1 GHz HCDN 45 nm 1 GHz 20.2 21 PRFF 14 nm 1 GHz 2.15 4.4 network ISPD 2010 ROCKS 45 nm 1 GHz 179.3 77 01. in LARCS 45 nm 1 GHz 368 32 HCDN 45 nm 1 GHz 38.2 11 PRFF 14 nm 1 GHz 18.4 4.25 network
[0062] Table 6 compares results with the implementation of distributed LC resonant clock grid synthesis (ROCKS), library-aware resonant clock synthesis (LARCS), and hybrid-mode clock distribution networks (HCDN). The HCDN clocking scheme may use global bufferless current-mode (CM) clocking and locally buffered voltage-mode (VM) clocking.
[0063]
[0064]
[0065]
[0066] In some embodiments, the one or more PSR drivers 106A-N and the size of the one or more on-chip inductors (L.sub.SR1, L.sub.SR7, to L.sub.SR8) for a resonant tree are determined using algorithm 1 in the following method. Algorithm 1 obtains a clock network (clk.sub.tree), list of branch capacitances (C.sub.br), system clock frequency (F.sub.clk), duty cycle for resonance (DC.sub.rez), and a maximum skew constraint (S.sub.max) along with a predetermined range of inductor quality factor (Q.sub.range). The output of the algorithm 1 is a resonant clock tree, i.e., the resonant clocking system 100. The method includes transforming a conventional clock network by replacing standard drivers with the one or more PSR drivers 106A-N using a Replace Drivers( ) function. The method includes determining the sizes of the one or more on-chip inductors (L.sub.SR1, L.sub.SR7, to L.sub.SR8) for each PSR driver using an Ind Sizing( ) function. This calculation ensures appropriate inductor values aligned with the capacitance of each branch. Following this, transient simulations are conducted using Synopsys PrimeSim HSPICE to extract key parameters such as rise times, fall times, and initial skew. The method includes verifying an output swing of the replaced PSR drivers within a calculated period (T.sub.d) from a second rising edge to a falling edge of the clock signal. The method includes adjusting driver strength using the Resize Driver( ) function as necessary to meet swing requirements. If the output swing fails to meet V.sub.DD to V.sub.OL requirements, branch partitioning is performed, and new PSR drivers are placed accordingly. The method includes performing transient simulations to determine the latency of each branch and sorting latency values in a descending order using a Sort( ) function to derive the initial skew of the resonant clocking system 100. If the skew exceeds a maximum skew constraint (S.sub.max), branch partitioning is initiated for a branch with the highest latency. Therefore, New PSR drivers are placed, and appropriate inductor sizing is determined. This iterative process continues until the obtained skew falls within the maximum skew constraint or until no significant improvement in skew is observed after an iteration.
Algorithm 1. Determining the Resonant Drivers and Inductor Size for Resonant Tree
[0067]
TABLE-US-00007 Input: Input network (clk.sub.tree), Branch capacitances (C.sub.br), System clock frequency (F.sub.clk), Duty cycle for resonance (DC.sub.rez), Skew constraint (S.sub.max), Q-factor range (Q.sub.range); Output: Resonant clock tree; N.sub.Dr = ReplaceDrivers(clk.sub.tree); L.sub.Dr = IndSizing(C.sub.br, DC.sub.rez, N.sub.Dr, Q.sub.range); TransientSimulation( ) while swing at all nodes ? 90%(VDD) do driver.sub.SizeNew = ResizeDriver(driver size); if swing at all nodes ? 90%(VDD) then N.sub.DrNew = PartitionBranch(localBranch); PlaceDriver(NDrNew); IndSizing(C.sub.br, DC.sub.rez, N.sub.DrNew, Q.sub.range); end if end while TransientSimulation( ) Branch.sub.List = Sort(N.sub.Dr, latency) while skew > S.sub.max do N.sub.DrNew = PartitionBranch(localBranch) PlaceDriver(N.sub.DrNew); IndSizing(C.sub.br, DC.sub.rez, N.sub.DrNew, Q.sub.range) end while
[0068] In some embodiments, the one or more PSR drivers 106A-N and the size of the one or more on-chip inductors (L.sub.SR1, L.sub.SR7, to L.sub.SR8) for a resonant mesh using the following method. The method includes initializing an algorithm with a uniform clock mesh of (d?d) dimension, mesh capacitance (C.sub.m), and load capacitances (C.sub.l). Parameters such as system clock frequency (F.sub.clk), the duty cycle for resonance (DC.sub.rez), maximum skew constraint (S.sub.max), and a predetermined range of inductor quality factors (Q.sub.range) are also considered. The method includes dividing the mesh into smaller grids of size (m?n) based on an initial driver strength. Each partitioned grid is then equipped with a PSR driver placed at its center. The method includes calculating an inductance for each PSR driver using an Ind Sizing( ) function. The method includes executing transient simulations to ensure that all nodes of the clock mesh exhibit a V.sub.DD to V.sub.OL swing within a specified period (T.sub.d). If swing requirements are not met, the one or more PSR drivers 106A-N are resized using a Resize Driver( ) function, or the grid is divided into two equal parts. New PSR drivers are placed, and new inductor values are determined iteratively until swing criteria are fulfilled. In some embodiments, another transient simulation is conducted to determine the latency of each branch to further refine the clock mesh. In some embodiments, the latencies obtained are then sorted in descending order to obtain the initial skew of the clock mesh network. If the skew exceeds the maximum constraint (S.sub.max), the local grid is divided, and the one or more PSR drivers 106A-N are placed accordingly. This process continues until the generated skew is within the specified limit or if no improvement in skew is observed.
Algorithm II. Determining the Resonant Drivers and Inductor Size for Resonant Mesh
[0069]
TABLE-US-00008 Input: Uniform mesh dimensions (d x d), Mesh capacitance (C.sub.m), Load capacitances (C.sub.l), System clock frequency (F.sub.clk), the Duty cycle for resonance (DC.sub.rez), Skew constraint (S.sub.max), Q-factor range (Q.sub.range); Output: Resonant grid; N.sub.Dr = PartitionMesh(DM); PlaceDriver(N.sub.Dr); L.sub.Dr = IndSizing(C.sub.m, C.sub.l, DC.sub.rez, N.sub.Dr, Q.sub.range); TransientSimulation( ) while swing at all nodes ? 90%(VDD) do driver.sub.sizeNew = ResizeDriver(driver size); if swing at all nodes ? 90%(VDD) then N.sub.DrNew = PartitionMesh(localMesh); PlaceDriver(N.sub.DrNew); IndSizing(C.sub.mLocal, C.sub.lLocal, DC.sub.rez, N.sub.DrNew, Q.sub.range); end if end while TransientSimulation( ) Branch.sub.List = Sort(N.sub.Dr, latency) while skew > S.sub.max do N.sub.DrNew = PartitionMesh(localMesh) PlaceDriver(N.sub.DrNew); IndSizing(C.sub.mLocal, C.sub.lLocal, DC.sub.rez, N.sub.DrNew, Q.sub.range) end while
[0070] Energy recycling resonant pulsed flip-flops enable the reuse of dissipated energy through series inductor-capacitor (LC) resonance, thereby increasing energy efficiency and sustainability. Additionally, the unveiling of wideband clocking architectures utilizing series LC resonance and inductor tuning techniques enhances the efficiency and reliability of clock networks. Through the utilization of pulsed resonance, the recycling of switching power is achieved, leading to substantial energy savings. Moreover, the incorporation of inductor tuning techniques contributes to skew reduction, thereby enhancing the overall performance and reliability of clock networks. The resonant clocking system includes enhanced energy efficiency, reduced power consumption, improved performance, and innovative solutions to longstanding challenges in clock network design.
[0071] The foregoing description of the specific embodiments will so fully reveal the general nature of the embodiments herein that others can, by applying current knowledge, readily modify and/or adapt for various applications such specific embodiments without departing from the generic concept, and, therefore, such adaptations and modifications should and are intended to be comprehended within the meaning and range of equivalents of the disclosed embodiments. It is to be understood that the phraseology or terminology employed herein is for the purpose of description and not of limitation. Therefore, while the embodiments herein have been described in terms of preferred embodiments, those skilled in the art will recognize that the embodiments herein can be practiced with modification within the spirit and scope of the appended claims.