Circuits and Methods to harvest energy from transient on-chip data

20220321123 · 2022-10-06

Assignee

Inventors

Cpc classification

International classification

Abstract

Circuits and methods that harvest electrostatic energy from transient on-chip data are described in the Application. In one aspect, a method and inverter circuit harvests electrostatic charge held at its output node at an electric potential comparable to the power supply voltage rail to a common grid/node as the output makes a 1.fwdarw.0 logic transition. This charge harvested at a common grid/node can be used by circuits (described in applications 63/090,169, 63/139,744) to drive 0.fwdarw.1 logic transition at their output nodes at lower energy drain from the on-chip power grid than a conventional CMOS inverter would with similar performance, slew rates at inverter input and output and with similar output driving transistor geometries.

Claims

1. An inverter harvesting charge from its output node comprising of N and P channel FETS with their drain terminals shorted together at the output terminal of the inverter. The source terminals of the N and P channel FETs are connected to the reference Ground and Power supply rails respectively. a second P channel FET whose source and drain terminals couple the output terminal of the inverter with a grid/node whose capacitance holds harvested charge at a voltage lower than the Power supply rail voltage. an input terminal and an output terminal of the inverter whose electric potentials makes full-swing transitions between the power rail voltage and the reference ground rail voltage. The input terminal of the inverter connected directly to the gate input terminal of the first P channel FET. a small HVT keeper N channel FET whose gate input terminal is driven by the input terminal of the inverter and whose source and drain terminals are connected to the reference ground rail at voltage VSS=0V and the output terminal of the inverter respectively. a 2-input NAND gate with its inputs driven by the input and output terminals of the inverter. The 2-input NAND gate output drives the gate input terminal of the second P channel FET and the input terminal of a delay element whose inverted output drives the N channel FET of the inverter

2. The device as recited in claim 1 wherein the second P channel FET is enabled to move charge to the grid/node holding harvested charge from the output terminal of the inverter following a 0.fwdarw.1 logic transition at the input terminal of the inverter with this charge transfer self-disabled by a decreasing inverter output voltage that sets the output of the NAND gate to the power supply voltage as the inverter output voltage decreases below the logic threshold voltage of the NAND gate.

3. The device as recited in claims 1,2 wherein the decreasing inverter output voltage is reinforced by the N channel FET of the inverter when the delayed, leading-edge 0.fwdarw.1 transition at the gate input terminal of the inverter N channel FET completes the 1.fwdarw.0 transition at the output of the inverter while transferring charge from the output of the inverter to the reference ground rail at voltage VSS=0V.

4. The NAND gate is designed to have a logic threshold such that the voltage V2 at which harvested charge is held is lower than the logic threshold of the NAND gate. The delay element is designed to have a delay that is comparable to the time it takes for the output to decrease from voltage power supply rail voltage=VDD to a voltage comparable to the logic threshold of the NAND.

Description

BRIEF DESCRIPTION OF DRAWINGS

[0013] FIG. 1 is a schematic illustrating conventional CMOS circuit schematic of an inverter and its operation in response to 1.fwdarw.0 and 0.fwdarw.1 input transitions

[0014] FIG. 2 is a circuit simulation of the conventional CMOS inverter circuit that shows voltage waveforms at the input and at the output terminals of the inverter in response to 1.fwdarw.0 and 0.fwdarw.1 input transitions. The Figure also shows the current waveform that illustrates current flow dependence on time for the 0.fwdarw.1 transition at the output and the 1.fwdarw.0 transition at the output

[0015] FIG. 3 is a schematic illustrating the proposed circuit of an inverter that harvests charge from its output at VDD to a common grid/node capacitance when the output makes a 1.fwdarw.0 logic transition moving 0.4x−0.25x of the charge held at its output at VDD to the harvest grid/node V2.

[0016] FIG. 4 is a circuit simulation of the proposed inverter circuit that harvests charge during a 1.fwdarw.0 logic transition at its output—showing voltage waveforms at the input and output terminals (that are practically identical to those observed in a conventional CMOS inverter (FIG. 2)) and current waveforms corresponding to current drawn from the power rail at voltage of VDD and current to the common grid/node V2 at voltage VDD/2

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

[0017] FIG. 1 is a schematic illustrating operation of a conventional CMOS inverter 100 driving output node OUT 102 with a capacitive load C.sub.out 104. The power rail 106 at electric potential V.sub.DD provides total energy equal to C.sub.outV.sub.DD.sub.2 (derived in equation (1) below) during a 0.fwdarw.1 transition 108 at the output node OUT 102, storing energy of (½)C.sub.outV.sub.DD.sub.2 on the capacitor 104 at the output 102 modeled by equation (2) below. A 1.fwdarw.0 112 transition at the output 102 discharges from C.sub.out 104 all of this stored energy on the capacitor C.sub.out 104 at the output 102 to the reference ground node 110 at electric potential V.sub.SS=0V [0018] Energy drawn from VDD supply (during 0.fwdarw.1 transition at output)


I.sub.VDD(t)V.sub.DDdt=∫.sub.VSS.sup.VDDC.sub.outV.sub.DDdV.sub.out=C.sub.outV.sub.DD.sup.2   (1) [0019] Energy stored at output


I.sub.VDD(t)V.sub.out(t)dt=∫.sub.VSS.sup.VDDC.sub.outV.sub.outdV.sub.out=½C.sub.outV.sub.DD.sup.2   (2) [0020] Energy discharged from output (during 1.fwdarw.0 transition at output)


I.sub.VSS(t)V.sub.out(t)dt=∫.sub.VDD.sup.VSSC.sub.outV.sub.outdV.sub.out=½C.sub.outV.sub.DD.sup.2   (3)

[0021] FIG. 2 200 is an illustration of the time dependent voltage waveforms of the output node OUT (102 in FIG. 1 100) shown as the waveform V.sub.OUT 202 in FIG. 2. The voltage waveform driving the input of the inverter 118 in FIG. 1, is shown in FIG. 2 as V.sub.IN 204

[0022] The waveform of current flow 206 into the inverter from the power rail at voltage V.sub.DD (106 in FIG. 1) is shown along the same x-axis of time (as used to plot voltage waveforms) in FIG. 2. The absolute value of the integral of the current waveform 206 over time in FIG. 2 200 equals the total charge Q drained from the power rail (106 in FIG. 1) to drive the output node 102 from 0.fwdarw.1. The energy consumed from the power rail 106 to accomplish this logic transition at the output node 102 equals [Q⋅V.sub.DD]=C.sub.outV.sub.DD.sub.2 modeled in equation (1) above.

[0023] In FIG. 3 the proposed circuit schematic functioning as an inverter 300 shows parts highlighted in blue: a 2-input NAND 302 and a delay element 304 that also inverts its input. These highlighted parts have devices with much smaller widths (˜⅕ of driver transistors) than the other transistors (326,314,320) shown in black in this schematic 300.

[0024] The NAND gate 302 in this schematic generates an active low pulse at its output node 306 whose leading edge is triggered by a 0.fwdarw.1 transition at the input 308 and whose trailing edge is triggered by a 1.fwdarw.0 transition at the output node 310 loaded with a total capacitance C.sub.out 312.

[0025] The leading edge of this active low pulse turns on PFET P2 314 which drives charge from the output node at logic ‘1’ and voltage VDD to be harvested on the common grid/node V2 316 (typically at a voltage between VSS and VDD and preferably at a voltage comparable to or lower than the logic threshold of the NAND gate 302).

[0026] The leading edge of the active low pulse at the output of the NAND gate 306, when delayed and inverted to drive the gate input 318 of NFET N1 326, turns on NFET N1 326 to begin discharging the output 310 to VSS—as the output voltage at node OUT 310 approaches V2. Note that a design requirement on the logic threshold voltage of the NAND gate 302 is that it is higher than the typical voltage node V2 would be raised to with harvested charge or during a dynamic equilibrium when rate of charge transfer to and from the common grid/node are balanced. Thus, node OUT 310 when being discharged to V2 through PFET P2 314, can trip the NAND 302 to produce the trailing low.fwdarw.high transition of the active low pulse at output of the NAND gate 306 to turn-off P2 314.

[0027] The NAND 302 would also trip when the N channel FET N1 326 begins conducting after the delayed and inverted leading edge of the active low pulse output from the NAND is inverted by the inverter 304 whose output turns on N1 326.

[0028] The output continues being discharged toward VSS—the reference ground terminal 324 as N1 320 is turned on. The trailing edge of the active high pulse driving the gate input terminal of the N channel FET, N1 326 turns this FET, N1 320 off. A small geometry keeper HVT NFET 328 holds the output to VSS. Its gate input is driven by the inverter input 308 with its source terminal connected to the reference ground voltage rail 322 at voltage VSS=0V and its drain terminal connected to OUT 310.

[0029] The trailing edge of the active low pulse at the output of the NAND 306 is triggered by the transition at the output node from VDD toward V2 since the logic threshold of the NAND 302 is higher than the voltage at which node V2 316 is typically charged to with harvested charge. The trailing edge is triggered by this feedback from OUT 310 to the input of the NAND 306.

[0030] The proposed circuit (1) maintains rail-rail operation (2) drives practically the same waveforms at its output as a conventional inverter and (3) while harvesting about 25%-40% of the total charge it discharges from its output 310—to the harvest grid node V2 316, instead of discharging all of that charge to the reference ground supply rail 322. The primary overhead in area is consumed by the PFET P2 in FIG. 3. The gates highlighted in blue in

[0031] FIG. 3 are small and can be replaced by equivalent standard cells. Transistors N1 320, P1 326 in FIG. 3 are identical to the transistors 116 and 114 in the schematic of the inverter in FIG. 100.

[0032] The NAND gate 302 and the delay element 304 can be optimized to maximize the energy harvested at the grid/node from the output node of the inverter—according to what voltage the harvested charge is typically held at when using the proposed inverter. The closer the voltage of the harvested charge at V2 316 is to VDD, the higher the optimal logic threshold voltage of the NAND gate 302 should be (to avoid reverse flow of current from harvest grid node to output node of inverter) and the shorter the delay value of the delay element 304 needs to be to minimize the delay overheads to accomplish the same 1.fwdarw.0 transition at the output of the inverter. This optimization is especially useful when operating at low, near threshold voltages

[0033] FIG. 4 400 is an illustration of the time dependent voltage waveforms of the output node OUT (310 in FIG. 3) shown as V.sub.OUT 402 in FIG. 4. The input waveform driving the input 308 of the inverter in FIG. 3, V.sub.IN 404 is also shown in FIG. 4

[0034] The waveform of current flow 406 into the inverter from the VDD power rail (324 in FIG. 3) is shown along the same x-axis (as used to plot voltage waveforms) in FIG. 4. The absolute value of the integral of this current 406 over time in FIG. 4 400 equals the total charge drained from the power rail (324 in FIG. 3) to drive the output from 0.fwdarw.1. Note that this charge is the same as drained by a conventional CMOS inverter shown in FIG. 2. The total charge transferred to the common grid/node V2 316 in FIG. 3 is the total area under the curve 408 in FIG. 4.

[0035] Note that the voltage waveform at the output node 310 in FIG. 3 is practically the same as the voltage waveform 202 of the output node (102 in schematic shown in FIG. 1) in FIG. 2 of a conventional inverter. In the schematic in FIG. 3 300 the total charge harvested from the output node 310 during a 140 transition to the common grid/node V2 316 is 25%-40% of the total charge drained from the power rail 324 in FIG. 3. Total current in the comparison is based on simulation of the entire circuit shown in FIG. 1 100 and FIG. 3 300—and thus includes parasitic contributions of all transistors to circuit operation. All parasitic capacitances of transistors and local wires in the complete schematic contribute to slew rate seen at the output and overheads incurred in propagation delay.

[0036] Switching energy consumption by logic gates with low fanouts (<4) are typically small. Gates driving a high fanout (>10) and/or long wires consume more energy and are best candidates for the proposed scheme that harvests charge from these large loads as they are discharged.

[0037] The transistor count increases in the proposed schematic shown in FIG. 3 300 compared to the 2 transistors used in a conventional CMOS inverter. However, area consumed by the proposed schematics in FIG. 3 300 does not increase proportionally with the number of transistors because the transistors of gates highlighted in blue (302, 304 in FIG. 3) are ˜5× smaller than any of the transistors drawn in black (326, 314 and 320 in FIG. 3 300). This because the load seen by the NAND gate 302 highlighted in blue is small—essentially lust the gate input of a single PFET (P2 314 in FIG. 3)—with the load from the delay element 304 much smaller. The transistors P1 320, N1 326 and P2 314 in FIG. 3 are comparable (in dimensions) to the transistors P1 114 and Ni 116 in a conventional CMOS inverter shown in FIG. 1 100 that drives the same capacitive load C.sub.OUT 104 in FIG. 1 and 312 in FIG. 3. The gate footprint of the proposed schematic 300 (in FIG. 3) is not expected to be larger than 1.7×-2.0× of the CMOS inverter it replaces. Note that the proposed schematics are preferred as replacement candidates of CMOS inverters only when driving large loads—that offer the opportunity for larger energy reductions.

[0038] Although illustrative embodiments of the present invention have been described herein, it is to be understood that the invention is not limited to those precise embodiments, and that various other changes and modifications may be made by one skilled in the art without departing from the scope of the invention.

REFERENCES

[0039] [1] B S Kong, J S Choi, S J Lee, and K. Lee “Charge Recycling Differential Logic (CRDL) for Low Power Application,”IEEE JSSC, vol. 31, No. 9, 1996.

[0040] [2] S Y Cheo, G A Rigby, and G R Hellestrand “Half-Rail Differential Logic,”in ISSCC Dig. Tech. Papers. February 1997, pp, 420-42

[0041] [3] J Lee, J park, B Song, W Kim, “Split-level Precharge Differential Logic: A New type of High-Speed Charge Recycling Differential Logic”, IEEE JSSC Vol 36, No. 8 August 2001, pp 1276-1280

[0042] [4] Tien-Ju Yang et al, “Designing Energy Efficient Convolutional Neural Networks using Energy-aware pruning”, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

[0043] [5] Y-H Chen et al, “Understanding the Limitations of Existing Energy-Efficient Design Approaches for Deep Neural Networks”, SYSML'18, February 2018, Stanford, Calif. USA

[0044] [6] Y. Liu et al., “A 0.1pJ/b 5-10 Gb/s Charge-Recycling Stacked Low-Power I/O for On-Chip Signaling in 45 nm CMOS SOI,”ISSCC Dig. Tech. Papers, pp. 400-401, 2013

[0045] [7] J Wilson et al., “A 6.5-to-23.3fJ/b/mm Balanced Charge-Recycling Bus in 16 nm FinFET CMOS at 1.7-to-2.6 Gb/s/wire with Clock Forwarding and Low-Crosstalk Contraflow Wiring”, ISSCC Dig. Tech. Papers, pp. 156-157, 2016

[0046] [8] S Ralapandian et al, “Energy-Efficient Low-Voltage Operation of Digital CMOS Circuits through Charge-recycling-”, 2004 Swap on VLSI Ckts, pp 330-331.

[0047] [9] M. Alimadadi, S. Sheikhaei, G. Lemieux, S. Mirabbasi, W. Dunford, and P. Palmer, “A 4 GHz non-resonant clock driver with inductor-assisted energy return to power grid,”IEEE Trans Circuits Svst. I, Reg. Papers, vol. 57, pp. 2099-2108, August 2010

[0048] [10] K Sim, H Mahmoodi, and K Roy, “A Low-Power SRAM Using Bit-Line Charge-Recycling”, IEEE Journal of Solid-State Circuits, Vol. 43, NO. 2, February 2008

[0049] [11] B D Yang, “A Low-Power SRAM Using Bit-Line Charge-Recycling for Read and Write Operations”, IEEE Journal of Solid-State Circuits, Vol. 45, NO. 10, October 2010.

[0050] [12] A. Bhavnagarwala, et. al, ‘Fluctuation Limits and Scaling Opportunities for CMOS SRAM Cells’, Tech. Dig. IEDM 2005, Pp. 675-678, December 2005.

[0051] [13] WC fit bias et al., “A low-power microprocessor based on resonant energy”, IEEE Journal of Solid-State Circuits, Vol: 32, issue: 11, pp 169-,-1701, November 1997

[0052] [1] L. Svensson, “Adiabatic Switching”, Chapter 6, Low Power Digital CMOS Design, Kluwer Academic Publishers, 1995