Adiabatic logic-in-memory architecture
10868534 ยท 2020-12-15
Assignee
Inventors
Cpc classification
International classification
H03K19/00
ELECTRICITY
Abstract
An adiabatic logic-in-memory based complementary metal-oxide-semiconductor/magnetic-tunnel-junction (ALiM CMOS/MTJ) circuit utilizes an adiabatic logic based pre-charged sense amplifier (PCSA) to recover energy from its output load capacitors. The ALiM CMOS/MTJ includes a non-volatile magnetic-tunnel-junction (MTJ) based memory. The ALiM CMOS/MTJ also includes a dual rail complementary metal-oxide-semiconductor (CMOS) logic that performs logic operations in association with the MTJ, and thereby generates logic outputs based on logic inputs. The ALiM CMOS/MTJ also includes the adiabatic PCSA, which is operatively coupled to the dual rail CMOS logic. The adiabatic logic based PCSA includes PCSA circuitry for which an input is a multi-phase power clock, and a charge recovery circuit having the output load capacitors. The charge recovery circuit is operatively coupled to the PCSA circuitry such that the ALiM CMOS/MTJ circuit uses the power clock to recover energy from the output load capacitors.
Claims
1. An adiabatic logic-in-memory based complementary metal-oxide-semiconductor/magnetic-tunnel-junction (ALiM CMOS/MTJ) circuit, comprising: a set of magnetic-tunnel-junctions (MTJs) configured to store non-volatile data; a logic network, comprising the set of MTJs and a set of complementary metal-oxide-semiconductor (CMOS) transistors configured to together perform logic operations so as to generate logic outputs based on logic inputs; and an adiabatic logic based pre-charged sense amplifier (PCSA) operatively coupled to the logic network, the adiabatic logic based PCSA comprising: PCSA circuitry for which an input is a multi-phase power clock, the PCSA circuitry including cross-coupled transistors coupled to a common discharge transistor; charge recovery circuitry, including output load capacitors, wherein the charge recovery circuitry is operatively coupled to the PCSA circuitry such that the ALiM CMOS/MTJ circuit uses the power clock to recover energy from output load capacitors.
2. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as an OR gate.
3. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as an XOR gate.
4. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as a NOR gate.
5. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as an XNOR gate.
6. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as an AND gate.
7. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as a NAND gate.
8. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as a NOT gate.
9. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as a MUX gate.
10. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as an inverter logic.
11. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as an encoder logic.
12. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as a decoder logic.
13. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as a full-adder logic.
14. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as a half-adder logic.
15. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as a full-subtractor logic.
16. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as a half-subtractor logic.
17. The adiabatic logic-in-memory based ALiM CMOS/MTJ circuit of claim 1, wherein the logic network operates as a D-flip-flop logic.
Description
BRIEF DESCRIPTION OF THE DRAWING(S)
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
(15) The above described drawing figures illustrate the disclosed invention in at least one embodiment, which is further defined in detail in the following description. Those having ordinary skill in the art may be able to make alterations and modifications to what is described herein without departing from its spirit and scope. While the invention is susceptible of embodiment in many different forms, there is shown in the drawings and will herein be described in detail a preferred embodiment of the invention with the understanding that the present disclosure is to be considered as an exemplification of the principles of the invention and is not intended to limit the broad aspects of the invention to any embodiment illustrated. Therefore, it should be understood that what is illustrated is set forth only for the purposes of example and should not be taken as a limitation on the scope of the disclosed invention.
(16)
(17) The non-volatile logic or memory, preferably comprising a plurality of MTJs, is configured to store non-volatile data. In particular, the non-volatile memory is configured to store one-bit (i.e., binary) data. Although a single MTJ based memory is shown for illustration of the principles of the invention, a plurality of MTJ based memories may be utilized without departing from the scope of the invention.
(18) The dual rail CMOS logic is operatively coupled to the non-volatile memory, and is configured to perform desired logic operations in association with the non-volatile memory. The CMOS logic may be any CMOS logic, including one or more of: NOT, AND, OR, NAND, NOR, XOR and XNOR, MUX, inverter, encoder, decoder, full and half adder, full and half subtractor, D flip-flop, and other known logic, configured to perform the desired logical operations. Since the MTJ/CMOS based circuits are dual rail in nature, the complementary outputs are available as one of the output. For example, a XOR based MTJ/CMOS circuit will also include an XNOR output.
(19) The adiabatic logic based PCSA circuit is a modified PCSA circuit that includes a charge recovery circuit such that the ALiM based CMOS/MTJ circuit uses the power clock VCLK to recover the energy from its output load capacitors. This is in contrast to typical LiM based CMOS/MTJ circuits, which use constant voltage inputs V.sub.dd and do not recover energy from their load capacitors. Accordingly, the data stored in the non-volatile memory is sensed and held by the adiabatic logic based PCSA, while the adiabatic logic based PCSA also acts to recover the charge from the output capacitors of the ALiM based CMOS/MTJ circuit.
(20) The inputs to the dual rail CMOS logic tree and the non-volatile memory are the logic inputs and their complements to the CMOS logic functions, A, , B and
(21)
(22) Adiabatic Logic-In-Memory XOR Gate:
(23)
(24) The dual rail CMOS logic 320 and non-volatile memory 420 implement the logic functions, while the adiabatic PCSA 220 reads the logic outputs XOR and XNOR. Due to the adiabatic logic principle, the energy dissipated in an adiabatic circuit when the charge is supplied by a constant current source is very small as compared to conventional CMOS XOR gates.
(25) As shown in
(26) In particular, source terminals of transistors MP1 and MP2 are operatively coupled to the multi-phase clock that provides the multi-phase clock VCLK signal. The gate terminal of transistor MP1 is operatively coupled to the drain terminal of transistor MP2 and to the drain terminal of discharge transistor MN1. The gate terminal of transistor MP2 is operatively coupled to the drain terminal of transistor MP1 and to the source terminal of discharge transistor MN1. The gate terminal of discharge transistor MN1 set to discharge. MP1 and MP2 drain terminals are also operatively coupled to dual branch CMOS logic, and to respective outputs XOR and XNOR, including respective load capacitors C.sub.L.
(27) The ALiM based gate consists of two cross-coupled PMOS devices, transistors T1-T2 and T3-T4, and magnetic tunnel junctions MTJ1 and MTJ2, which store the information. The logic function is constructed through the dual rail CMOS logic and the MTJ devices.
(28) In operation, for example, the logic inputs may be A=0 and B=0, where logic 0 represents ground and logic 1 represents the V.sub.dd. The dual input and
(29) The different phases of the multi-phased clock signal VCLK can be classified as a wait phase t.sub.1, an evaluate phase t.sub.2, a hold phase t.sub.3, and a recover phase t.sub.4. During the wait phase t.sub.1, the clock signal VCLK is at ground (i.e., logic=0). During the evaluate phase t.sub.2, the clock signal VCLK slowly increases from ground to V.sub.dd. During the hold phase t.sub.3, the clock signal VCLK is at V.sub.dd. During the recover phase t.sub.4, the clock signal VCLK slowly decreases from V.sub.dd to ground.
(30) Accordingly, at different phases of the multiphase clock signal VCLK, the operation of the exemplary ALiM based XOR gate is as follows:
(31) The ALiM based XOR gate has an initial state in which all nodes are initially at ground. When the input values are: A=0, =1, B=0 and
(32)
(33) During a hold phase t.sub.3, the clock signal VCLK is at V.sub.dd and the discharge signal DISCHARGE is at 0. While in the hold phase t.sub.3, the outputs are held.
(34)
(35) During the recover phase t.sub.4, the clock signal VCLK gradually decreases from V.sub.dd to 0. The charge stored in the output load capacitor of XNOR is slowly recovered back to the CLK through transistor MP2. The recovery of charge continues until the MP2 transistor is OFF, which occurs when V.sub.scp becomes less than the threshold voltage V.sub.tp.
(36) These steps continue in each cycle ensuring the proper operation of the circuit along with the energy-efficiency by using time ramp voltages to slowly charge and discharge the load capacitors.
(37) Adiabatic Logic-In-Memory Fully Adder:
(38)
(39) The dual rail CMOS logic 330 and non-volatile memory 430 implement the logic functions, while the adiabatic PCSA 230 reads the logic outputs SUM and C.sub.out, and their complements. Due to the adiabatic logic principle, the energy dissipated in an adiabatic circuit when the charge is supplied by a constant current source is very small as compared to conventional CMOS based adders.
(40) The logic inputs to the full adder circuit are A, B, and C.sub.in and respective complement inputs. The outputs are SUM and C.sub.out and their complements. The ALiM MFA circuit also receives a multiphase clock signal VCLK, which slowly charges and discharges the load capacitors in accordance with the principles described herein. Transistors MP1, MP2, MP3, MP4, MN1 and MN2 are operatively coupled to form the adiabatic PCSA. Transistors T1 through T12 along with MTJ1 through MTJ4 are operatively coupled to form the CMOS/MTJ logic structure for the SUM and CARRY outputs.
(41) In other words, the ALiM MFA includes inputs A, B and C.sub.in, and outputs SUM and C.sub.out. Transistors MP1 and MP2 are configured to charge and recover charge from outputs SUM and
(42) The CMOS tree structure of the ALiM MFA is based on the following equations:
SUM=A.Math.B.Math.C.sub.in+
C.sub.out=A.Math.B+A.Math.C.sub.in+B.Math.C.sub.in(5)
(43) As an illustrative example, the input values may be: A=1, B=1 and C.sub.in=1, and the timing diagram for the ALiM MFA may be in accordance with
(44) During the wait phase t.sub.1 of the clock signal VCLK, the inputs A, B, and C.sub.in (and complements) of the ALiM MFA are passed to the circuit and the non-volatile data is stored in MTJ1, MTJ2, MTJ3 and MTJ4. MTJ1 and MTJ3 will have lower resistances as compared to the MTJ2 and MTJ4, due to the parallel orientation of MTJ1 and MTJ3 (B=1) as compared to the anti-parallel orientation of MTJ2 and MTJ4 (
(45) During the evaluate phase t.sub.2 of the clock signal VCLK, the inputs A, B, and C.sub.in (and compliments) are evaluated by the CMOS logic tree. For A=1 and Cin=1, output
(46) In other words, during the evaluate phase, the clock signal VCLK increases from 0 to V.sub.dd. When the VCLK reaches V.sub.tp, transistors MP2 and MP4 are turned ON. Thus, the SUM and C.sub.out outputs follow the clock signal VCLK.
(47) During the hold phase t.sub.3 phase of the clock signal VCLK, the outputs are held.
(48) During the recover phase t.sub.4, the clock signal VCLK slowly decreases from V.sub.dd to 0. The charge stored at SUM and C.sub.out is accordingly recovered in accordance with the principles discussed herein.
(49) These steps continue in each cycle ensuring the proper operation of the circuit along with the energy-efficiency by using time ramp voltages to slowly charge and discharge the load capacitors.
(50) Experimental Results
(51) Simulations were performed on each of the ALiM XOR gate and the ALiM MFA. Simulations were performed using a Cadence Spectre simulator with 45 nm standard CMOS technology with perpendicular anisotropy CoFeB/MgO MTJ model. Table I shows the MTJ device parameters used.
(52) In each of the test simulations, the size of the transistors, except the discharge transistors MN1 and MN2, were W/L=120 nm/45 nm. The discharge transistors MN1 and MN2 were W/L=300 nm/45 nm. The discharge transistors MN1 and MN2 were sized bigger to completely discharge and reset the outputs before the evaluation phase of the subsequent cycle. Simulations are performed at 250 MHz with V.sub.dd=0.9 V.
(53) TABLE-US-00001 TABLE I MTJ DEVICE PARAMETERS USED FOR SIMULATIONS Parameter Description Value t.sub.a1 Thickness of the free layer 1.3 nm a Length of surface long axis 40 nm b Width of surface short axis 40 nm t.sub.ox Thickness of the Oxide barrier 0.85 nm TMR Tunnel Magneto Resistance ratio 150% RA Resistance Area Product 5 ohm m.sup.2 Area MTJ layout surface 40 nm 40 nm /4 I.sub.co Critical switching current Min. 40 A
(54) Table II provides a performance comparison of the PCSA based XOR gate and the ALiM XOR gate. As Table II shows, the ALiM XOR gate has 62% and 50% of energy and power savings as compared to the PCSA based XOR gate. Further, the ALiM XOR gate has 31.7% area savings as compared to the PCSA based XOR gate.
(55) TABLE-US-00002 TABLE II PERFORMANCE COMPARISON OF PCSA BASED MFA AND ALIM BASED XOR GATE PCSA based ALIM based XOR [3] XOR (proposed) % impr. Avg. energy (fJ) 0.98 0.363 62 Avg. power (W) 0.36 0.18 50 Device count 11MOS + 2MTJ 7MOS + 2MTJ Area (m.sup.2) 5.29 3.61 31.7
(56)
(57)
(58) Table III provides a performance comparison of the PCSA based MFA and the ALiM MFA. As Table III shows, the ALiM MFA saves 37% of energy as compared to the PCSA based MFA. The ALiM MFA saves 43% of power consumption as compared to the PCSA based MFA.
(59) TABLE-US-00003 TABLE III PERFORMANCE COMPARISON OF PCSA BASED MFA AND ALIM BASED MFA PCSA based ALIM based MFA [3] MFA (proposed) % impr. Avg. energy (fJ) 4.39 2.77 37 Avg. power (W) 1.81 1.03 43 Device count 26MOS + 4MTJ 18MOS + 4MTJ Area (m.sup.2) 14.4 9 38
(60) Along with reduced energy and power consumption, the ALiM MFA also used a smaller number of devices than the PCSA based MFA. The PCSA based MFA used 26 MOS devices and 4 MTJs. The ALiM MFA used 18 MOS devices and 4 MTJs. Moreover, the ALiM MFA had 38% area savings as compared to the PCSA based MFA.
(61) Turning now to
(62) A novel architecture for forming ALiM based CMOS/MTJ circuits is disclosed herein. These ALiM based CMOS/MTJ circuits have increased energy and power savings in comparison to the existing PCSA based CMOS/MTJ circuits for corresponding logic. The low-power consumption, low-energy consumption and low area of the ALiM based CMOS/MTJ circuits makes them particularly suited for use in ultra-low-power portable electronic devices.
(63) While the principles of the ALiM based CMOS/MTJ circuits are described herein with specific reference to XOR and MFA logics, ALiM CMOS/MTJ circuits corresponding to other logics are expressly contemplated. For example, the ALiM based CMOS/MTJ circuit may include CMOS logic corresponding to any of the known gates, multi-bit adders, and flip-flops, as discussed above.
(64) For example,
(65) The enablements described in detail above are considered novel over the prior art of record and are considered critical to the operation of at least one aspect of the invention and to the achievement of the objectives of the invention. The words used in this specification to describe the exemplary embodiments are to be understood not only in the sense of their commonly defined meanings, but also to include any special definition with regard to structure, material or acts that would be understood by one of ordinary skilled in the art to apply in the context of the entire disclosure.
(66) The definitions of the words or drawing elements described herein are meant to include not only the combination of elements which are literally set forth, but all equivalent structures, materials or acts for performing substantially the same function in substantially the same way to obtain substantially the same result. In this sense it is therefore contemplated that an equivalent substitution of two or more elements may be made for any one of the elements described and its various embodiments or that a single element may be substituted for two or more elements in a claim without departing from the scope of the invention.
(67) Changes from the claimed subject matter as viewed by a person with ordinary skill in the art, now known or later devised, are expressly contemplated as being equivalents within the scope intended and its various embodiments. Therefore, obvious substitutions now or later known to one with ordinary skill in the art are defined to be within the scope of the defined elements. This disclosure is thus meant to be understood to include what is specifically illustrated and described above, what is conceptually equivalent, what can be obviously substituted, and also what incorporates the essential ideas.
(68) The scope of this description is to be interpreted in conjunction with the appended claims.