METHOD AND APPARATUS FOR USING REFERENCE RESISTOR IN ONE-TIME PROGRAMMABLE MEMORY OF AN ARTIFICIAL INTELLIGENCE INTEGRATED CIRCUIT
20190180173 ยท 2019-06-13
Inventors
Cpc classification
G11C11/005
PHYSICS
International classification
Abstract
An integrated circuit may include an AI logic circuit, and an embedded one-time programmable (OTP) MRAM memory electrically coupled to the AI logic circuit. The embedded OTP MRAM memory may include multiple storage cells, one or more reference resistors, and a memory-reading circuit for determining the state of each storage cell. The reading circuit may include: a multiplexer configured to electrically couple each storage cell to a reference resistor; a source line selectively providing an input electrical signal to each storage cell to generate a first output signal; a driving circuit providing an input electrical signal to the reference resistor to generate a second output signal; and a comparator configured to compare the first output signal and the second output signal to generate an output signal that indicates the state of each storage cell. Each reference resistor may be shared among multiple storages in an array or multiple storage arrays.
Claims
1. An integrated circuit for artificial intelligence (AI) processing comprising: an AI logic circuit; and an embedded one-time programmable (OTP) MRAM memory electrically coupled to the AI logic circuit for storing AI model parameters, the embedded OTP MRAM memory comprising: a plurality of storage cells, each storage cell comprising a one-time programmable MTJ bit cell for storing AI model parameters that do not frequently change, and a reference resistor, and a memory-reading circuit comprising: a multiplexer configured to electrically couple each storage cell to the reference resistor, a source line selectively providing a first electrical signal to each storage cell to generate a first output signal, a driving circuit providing a second electrical signal to the reference resistor to generate a second output signal, and a comparator configured to compare the first output signal and the second output signal to generate an output that indicates a state of each storage cell.
2. The integrated circuit of claim 1, wherein the OTP MRAM is a spin orbit torque (SOT), spin transfer torque (STT), magnetoelectric RAM (MeRAM)/Voltage-controlled magnetic anisotropy (VCMA) MRAM or orthogonal spin transfer (OST) MRAM.
3. The integrated circuit of claim 1, wherein the reference resistor is a constant resistor having a constant resistance value.
4. The integrated circuit of claim 1, wherein the reference resistor is formed in a bottom-electrode (BE) layer, a top-electrode (TE) layer, or a metal layer of a CMOS transistor.
5. The integrated circuit of claim 1, wherein: each of the first and second electrical signals is a current; and each of the first and second output signals is a voltage.
6. The integrated circuit of claim 1, wherein the comparator is a sense amplifier.
7. The integrated circuit of claim 1, wherein the multiplexer of the embedded OTP MRAM memory is configured to electrically couple each storage cell in an array of storage cells to the reference resistor.
8. The integrated circuit of claim 7, wherein the embedded OTP MRAM memory further comprises: an additional multiplexer configured to electrically couple each storage cell in an additional array of storage cells to the reference resistor; an additional source line selectively providing a third electrical signal to each storage cell in the additional array of storage cells to generate a third output signal; and an additional comparator configured to compare the third output signal and the second output signal to generate an output that indicates a state of each storage cell in the additional array of storage cells.
9. The integrated circuit of claim 1, wherein: each storage cell in the embedded OTP MRAM memory has a read-margin window; and the reference resistor has a value that is in a mid-range of the read-margin window.
10. The integrated circuit of claim 1, wherein: the plurality of storage cells in the embedded OTP MRAM store one or more weights of a convolutional neural network (CNN); and the AI logic circuit is configured to execute certain AI functions using the one or more weights of the CNN.
11. A method of reading an embedded one-time programmable (OTP) MRAM memory for storing artificial intelligence (AI) model parameters in an AI integrated circuit, the method comprising: coupling, by a multiplexer, each of a plurality of storage cells in the OTP MRAM to a reference resistor having a constant resistance value, wherein each storage cell comprises a one-time programmable MTJ bit cell for storing AI model parameters that do not need frequent change and wherein one-time programmable MTJ bit cell is embedded in the AI integrated circuit; providing, by a source line, a first electrical signal to each storage cell to generate a first output signal; providing, by a driving circuit, a second electrical signal to the reference resistor to generate a second output signal; and comparing a first output signal and the second output signal to generate an output signal that indicates a state of each storage cell.
12. The method of claim 11, wherein the reference resistor is formed in a bottom-electrode (BE) layer, a top-electrode (TE) layer, or a metal layer of a CMOS transistor.
13. The method of claim 11, wherein: each of the first and second electrical signals is a current; and each of the first and second output signals is a voltage.
14. The method of claim 11, wherein comparing the first output signal and the second output signal uses a sense amplifier configured to receive input signals from the first output signal and the second output signal.
15. The method of claim 14, further comprising: coupling, by an additional multiplexer, each of an additional plurality of storage cells in the embedded OTP MRAM to the reference resistor, wherein each of the additional plurality of storage cells comprises a one-time programmable MTJ bit cell; providing, by an additional source line, a third electrical signal to each of the additional plurality of storage cells to generate a third output signal; and comparing the third output signal and the second output signal to generate an output signal that indicates a state of each of the additional plurality of storage cells.
16. The method of claim 11, wherein: each storage cell in the embedded OTP MRAM memory has a read-margin window; and the reference resistor has a value that is in a mid-range of the read-margin window.
17. The method of claim 11, further comprising: storing in the plurality of storage cells in the embedded OTP MRAM one or more weights of a convolutional neural network (CNN); and causing the AI logic circuit to execute certain AI functions using the one or more weights of the CNN.
18. An integrated circuit for artificial intelligence (AI) processing comprising: an AI logic circuit; an embedded one-time programmable (OTP) MRAM memory of a first type electrically coupled to the AI logic circuit fir storing AI model parameters, the embedded OTP MRAM memory comprising: a plurality of storage cells, each storage cell comprising at least a one-time programmable MTJ bit cell for storing AI model parameters that do not need frequent change, and a reference resistor, and a memory-reading circuit comprising: a multiplexer configured to electrically couple each storage cell to the reference resistors, a source line selectively providing a first electrical signal to each storage cell to generate a first output signal, a driving circuit providing a second electrical signal to the reference resistor to generate a second output signal, and a comparator configured to compare the first output signal and the second output signal to generate an output signal that indicates a stage of each storage cell; and one or more additional embedded RAM memories of types that are different from the first type.
19. The integrated circuit of claim 18, wherein the embedded OTP MRAM memory of the first type is a spin orbit torque (SOT), spin transfer torque (STT), magnetoelectric RAM (MeRAM)/Voltage-controlled magnetic anisotropy (VCMA) MRAM or orthogonal spin transfer (OST) MRAM.
20. The integrated circuit of claim 18, wherein each of the one or more additional embedded RAM memories is a static random access memory (SRAM), spin orbit torque (SOT), spin transfer torque (STT), magnetoelectric RAM (MeRAM)/Voltage-controlled magnetic anisotropy (VCMA) MRAM or orthogonal spin transfer (OST) MRAM.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0013]
[0014]
[0015]
[0016]
DETAILED DESCRIPTION
[0017] It will be readily understood that the components of the embodiments as generally described herein and illustrated in the appended figures could be arranged and designed in a wide variety of different configurations. Thus, the following more detailed description of various embodiments, as represented in the figures, is not intended to limit the scope of the present disclosure, but is merely representative of various embodiments. While the various aspects of the embodiments are presented in drawings, the drawings are not necessarily drawn to scale unless specifically indicated.
[0018] The present invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described embodiments are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is, therefore, indicated by the appended claims rather than by this detailed description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.
[0019] References throughout this specification to features, advantages, or similar language do not imply that all of the features and advantages that may be realized with the present invention should be or are in any single embodiment of the invention. Rather, language referring to the features and advantages is understood to mean that a specific feature, advantage, or characteristic described in connection with an embodiment is included in at least one embodiment of the present invention. Thus, discussions of the features and advantages, and similar language, throughout the specification may, but do not necessarily, refer to the same embodiment.
[0020] Furthermore, the described features, advantages and characteristics of the invention may be combined in any suitable manner in one or more embodiments. One skilled in the relevant art will recognize, in light of the description herein, that the invention can be practiced without one or more of the specific features or advantages of a particular embodiment. In other instances, additional features and advantages may be recognized in certain embodiments that may not be present in all embodiments of the invention.
[0021] As used in this document, the singular form a, an, and the include plural references unless the context clearly dictates otherwise. Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art. As used in this document, the term comprising means including, but not limited to. Unless defined otherwise, all technical and scientific terms used in this document have the same meanings as commonly understood by one of ordinary skill in the art.
[0022] Each of the terms artificial intelligence logic circuit and AI logic circuit refers to a logic circuit that is configured to execute certain AI functions such as a neural network in AI or machine learning tasks. An AI logic circuit can be a processor. An AI logic circuit can also be a logic circuit that is controlled by an external processor and executes certain AI functions.
[0023] Each of the terms integrated circuit, semiconductor chip, chip and semiconductor device refers to an integrated circuit (IC) that contains electronic circuits on semiconductor materials, such as silicon, for performing certain functions. For example, an integrated circuit can be a microprocessor, a memory, a programmable array logic (PAL) device, an application-specific integrated circuit (ASIC) or others. An integrated circuit that contains an AI logic circuit is referred to as an AI integrated circuit or an AI chip.
[0024] The term wafer level for the purpose of programming a memory in an integrated circuit refers to programming the memory while the chip is still in wafer form. Wafer-level programming is generally performed at foundries at the time of manufacturing.
[0025] Each of the terms chip packaging level or chip level for the purpose of programming a memory in an integrated circuit (i.e., the semiconductor chip) refers to programming the semiconductor chip while the chip is already packaged. Chip-level programming can be performed at foundries or at a user site after the chip has been manufactured.
[0026] The term electrically coupled refers to establishing a current flow from an electrical component to another. The establishment of the current flow may be done via applying a voltage, a current or use of a switching device (e.g., a transistor) that is operable to turn on/off the current flow between components.
[0027] In
[0028] For example, in performing an AI task, the OTP MRAM memory may store weights of a convolutional neural network and the AI logic circuit, such as the CNN logic circuit may retrieve the stored weights from the OTP MRAM memory 104. A specific application such as face recognition requires a particular set of filter coefficients, which can be permanently written to the AI chip during fabrication. Alternatively, and/or additionally, for a specific application (e.g., security for certain application or user), certain data can be permanently written to the AI chip.
[0029] In some scenarios, OTP MRAM memory 104 may include one or more storage cells 106, one or more reference resistors 108, and a memory reading circuit 110, which will be explained later in the document. The AI chip also may include a second type of memory 112 co-existing inside the chip. The second type of memory 112 may be a random access memory (RAM) to store programming instructions for the AI logic circuit or an external processor to perform various AI tasks, or to store intermediate results in performing the AI tasks. In some scenarios, the second type of memory 112 may include static random access memory (SRAM), STT-MRAM, MeRAM/VCMA MRAM, SOT MRAM, and/or OST MRAM.
[0030]
[0031] With reference to
[0032] In an OTP MRAM memory, the low resistance Rp of the MTJ bit cell becomes zero when the MTJ bit cell is broken down. This causes the read-margin window of the OTP MRAM memory to become Rp (i.e., zero)+Rc (i.e., the resistance of a transistor coupled to the MTJ bit cell))(Rap+Rc)=Rc(Rap+Rc). In the example described earlier in this document, when Rp is about 1.6 Kohms, Rap is about 3.2 Kohms, and Rc is about 1.6 Kohms, the read margin window for OTP MRAM memory is now 1.6 Kohms4.8 Kohms, which is much higher than 3.2 Kohms4.8 Kohms with the MRAM memory. Due to the increase of read-margin window in OTP MRAM memory, in some scenarios, the reading of the OTP MRAM memory storage cells may not need to use a typical reference cell but instead use a constant reference resistor having a constant resistance value. Further, a lower current or voltage may be applied in reading the memory. Device and method for reading OTP MRAM memory cells are further described in detail.
[0033] With reference to
[0034] Embedded OTP MRAM memory 200 may include multiple storage cells, each storage cell including a one-time programmable MTJ bit cell 206. The embedded OTP MRAM memory may also include a reference resistor 210, which may be a constant reference resistor having a constant resistance value. The constant reference resistor is preferably formed in a non-MTJ bit cell, such as in a bottom-electrode (BE) layer, a top-electrode (TE) layer, or a metal layer of a CMOS transistor. The constant reference resistor may also be formed in an MTJ bit cell layer.
[0035] The embedded OTP MRAM memory 200 also may include a memory-reading circuit for reading the states of each storage cell contained in the memory. In some scenarios, the memory-reading circuit includes a multiplexer 204 for electrically coupling a storage cell 204 to the reference resistor 210. For example, multiplexer 204 may operably switch on to electrically couple the storage cell 206 to reference resistor 210. As illustrated in
[0036] Returning to
[0037] The embedded memory 200 also may include a comparator 212 configured to compare the first output signal for the storage cell and the second output signal for the reference resistor to generate an output signal 214. In some scenarios, the output signal 214 indicates the state of the selected storage cell. In some scenarios, the comparator is a sense amplifier commonly used in a memory circuit.
[0038] With reference to
[0039] In determining the state of selected storage cell from the output signal of the comparator, in some scenarios, the output signal of the sense amplifier may be compared to a threshold. If the output signal exceeds the threshold, a state corresponding to a value of one in the selected storage cell may be determined. If the output signal is below the threshold, a state corresponding to a value of zero in the selected storage cell may be determined.
[0040] In
[0041] With reference to
[0042] For example, in the above illustrated example, each MTJ bit cell in the storage cell of the embedded OTP MRAM memory has a high-resistance value Rap and a low-resistance value Rp, for example, at 3.2 Kohms and 1.6 Kohms, respectively. Each storage cell also includes a CMOS transistor electrically coupled to the MTJ bit cell in the storage cell, wherein the CMOS transistor has a resistance value Rc, for example, at 1.6 Kohms. In such a case, the reference resistor may have a value that is in a mid-range of the read-margin window. In the above example, the read-margin window may be Rc to (Rc+Rap), i.e., 1.6 Kohms4.8 Kohms. The reference resistor may have a value in the mid-range of the window, e.g., (1.6 Kohms+4.8 Kohms)/2=3.2 Kohms. Due to the increased read-margin window associated with OTP MRAM memory as compared to the conventional MRAM memory, the source line and driving circuit may provide a lower current or voltage, which will result in decreased size of the chip and lower power consumption.
[0043] With further reference to
[0044] In case of an OTP MRAM memory array as shown in
[0045] MRAM memory may include additional comparators, e.g., 318 corresponding to MRAM array 304, in which the storage cell is selected.
[0046] The embedded memory may include a source line 309, 311 that provide a first electrical signal to the selected storage cell in each respective MRAM array to generate a first output signal. The embedded memory also may include a driving circuit 314 that provides a second electrical signal to the reference resistor 312 to generate a second output signal. The embedded memory also may include multiple comparators, such as 316, 318, each configured to compare the first output signal and the second output signal for each respective MRAM array 302, 304. The comparator 316, 318 each generates an output signal 320, 322 which indicates the state of the selected storage cell Similar to the single storage cell in
[0047] In the case of a voltage sense amplifier, similar to
[0048] The above example of OTP MRAM memory is described to have two arrays that have a common word-line driver for illustration purpose only. Other variations of the circuit layout may be possible. For example, the source line 309, 311 may share one common voltage line. Multiple memory arrays may not be required to share a common word-line driver. Each of the memory arrays may have its own bit-line multiplexer and word-line driver. Multiple memory arrays also may have a common bit-line multiplexer. Further, multiple memory arrays may not need to share a common reference resistor 312 as shown in
[0049] A process for memory reading in the above illustrated embedded OTP MRAM memory is now further described. With reference to
[0050] The various structures and methods disclosed in this patent document provide advantages over the prior art, whether standalone or combined. The above illustrated embedded OTP MRAM memory and memory reading methods use one or more reference resistors, in lieu of conventional reference cells in existing MRAM memory. This may avoid excessive use of reference or memory cell data redundancy and decrease overhead capacities. The constant reference resistor may not be required to be placed near the storage cells or have a similar structure as that of the storage cells to maintain uniformity and accurate reading. Instead, the reference resistor may be flexibly placed anywhere in the layout, and the number of reference resistors may also be minimized. Due to increased read-margin window associated with OTP MRAM, a lower current/voltage is required to read the memory. These advantages make it possible to optimize the memory chip in size, power consumption and performance.
[0051] The advantages of smaller chip size and low-power consumption achieved from above described embodiments make the AI chip particularly suitable for many mobile and Internet-of-things (IoT) applications. For example, in an AI application, the OTP MRAM memory may store trained weights of a convolutional neural network (CNN) architecture, and the AI logic circuit in the AI chip may be configured to execute certain AI functions using one or more weights stored in the OTP MRAM memory. In some scenarios, for example, in a security application, the OTP MRAM memory may store registered human faces and/or features of the human faces, and a face recognition application may retrieve the features of registered human faces from the OTP MRAM memory while executing programming instructions to perform the face recognition task.
[0052] Other advantages can be apparent to those skilled in the art from the foregoing specification. For example, the above described various embodiments are illustrated for one-transistor-one-magnetic-tunnel-junction (1T-1MTJ), but can also be applicable for other variations, such as 2T-2MTJ and so on. The driving circuit for the reference resistor may provide a current signal, and the output signal at the input of the comparator may be a voltage signal. Alternatively, the driving circuit for the reference resistor may provide a voltage signal, and the output signal at the input of the comparator may be a current signal. Further,
[0053] Accordingly, it will be recognized by those skilled in the art that changes or modifications may be made to the above-described embodiments without departing from the broad inventive concepts of the invention. It should, therefore, be understood that this invention is not limited to the particular embodiments described herein, but is intended to include all changes, modifications, and all combinations of various embodiments that are within the scope and spirit of the invention as defined in the claims.