Physical unclonable function using augmented memory for challenge-response hashing
09787481 · 2017-10-10
Assignee
Inventors
Cpc classification
G09C1/00
PHYSICS
H04L2209/12
ELECTRICITY
International classification
H04L9/32
ELECTRICITY
Abstract
A technique is presented for performing a physical unclonable function (PUF) using an array of SRAM cells. The technique can be viewed as an attempt to read multiple cells in a column at the same time, creating contention that is resolved according to process variation. An authentication challenge is issued to the array of SRAM cells by activating two or more wordlines concurrently. The response is simply the value that the SRAM produces from a read operation when the challenge condition is applied. The number of challenges that can be applied the array of SRAM cells grows exponentially with the number of SRAM rows and these challenges can be applied at any time without power cycling.
Claims
1. A method for performing a physical unclonable function using an array of static random-access memory (SRAM) cells, comprising arranging an array of SRAM cells in rows and columns, where SRAM cells in each row in the array share a common wordline and SRAM cells in each column in the array share common bitlines; issuing an authentication challenge to the array of SRAM cells by activating two or more wordlines concurrently; and detecting, in response to the authentication challenge, an output for each column of the array, where the outputs collectively form a response to the authentication challenge.
2. The method of claim 1 wherein issuing an authentication challenge further comprises writing values into SRAM cells and loading drivers for each wordline prior to activating the two or more wordlines.
3. The method of claim 2 further comprises loading drivers for a given wordline using a control circuit, where the control circuit includes a flip-flop circuit that operates to store the activation signal.
4. The method of claim 1 wherein detecting an output for each column further comprises detecting a differential voltage across bitlines associated with each column in the array of SRAM cells and, for each differential voltage, assigning a value to a binary number based on the differential voltage, and aggregating the values from each column in the array of SRAM cells to form the output.
5. The method of claim 4 further comprises performing an authentication function on a computing device based on the output.
6. The method of claim 1 wherein each SRAM cell includes a pair of inverters cross coupled to each other.
7. The method of claim 2 further comprises issuing a first authentication challenge to the array of SRAM cells by activating a first subset of wordlines concurrently and detecting, in response to the first authentication challenge, an output for the first authentication challenge.
8. The method of claim 7 further comprises issuing a second authentication challenge to the array of SRAM cells by activating a second subset of wordlines concurrently and detecting, in response to the second authentication challenge, an output for the second authentication challenge, where the second subset of wordlines is mutually exclusive from the first subset of wordlines.
9. A method for performing a physical unclonable function using an array of static random-access memory (SRAM) cells, comprising: loading values in each SRAM cell in an array of SRAM cells, where the array of SRAM cells is arranged in rows and columns, the SRAM cells in each row in the array share a common wordline, and SRAM cells in each column in the array share common bitlines; loading drivers for each wordline in the array of SRAM cells; asserting two or more wordlines concurrently and thereby issue an authentication challenge to the array of SRAM cells; detecting a differential voltage across bitlines associated with each column in the array of SRAM cells; assigning, for each differential voltage, a value to a binary number based on the differential voltage; and aggregating the value of the binary number from each column in the array of SRAM cells to form an output.
10. The method of claim 9 further comprises loading drivers for a given wordline using a flip-flop circuit.
11. The method of claim 9 wherein each SRAM cell includes a pair of inverters cross coupled to each other.
12. The method of claim 9 further comprises performing an authentication function on a computing device based on the output.
Description
DRAWINGS
(1) The drawings described herein are for illustrative purposes only of selected embodiments and not all possible implementations, and are not intended to limit the scope of the present disclosure.
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17) Corresponding reference numerals indicate corresponding parts throughout the several views of the drawings.
DETAILED DESCRIPTION
(18) Example embodiments will now be described more fully with reference to the accompanying drawings.
(19) Static Random-Access Memory (SRAM) is a ubiquitous building block of integrated circuits that is found in caches, register files, and buffers. Single VLSI circuits commonly contain millions of bits of SRAM storage. Each bit of SRAM is typically implemented by a pair of inverters cross coupled to each other. More specifically, an example single 6-transistor cell 12 as seen in FIG. 1A. In this arrangement, each SRAM cell has two stable states. In each stable state node A or B is pulled high through transistor p.sub.1 or p.sub.2 while the other is pulled low through n.sub.1 or n.sub.2. The cell is read and written using complementary bitlines (BL) and (BLB) through two access transistors n.sub.3 and n.sub.4. The two access transistors of a cell are controlled by a single wordline. While reference is made to a six transistor cell, other circuit arrangements for a SRAM cell are also contemplated by this disclosure.
(20) Referring to
(21)
(22)
(23)
(24) The technique can be viewed as an attempt to read multiple cells in a column at the same time, creating contention that is resolved according to process variation. An authentication challenge is issued to the array of SRAM cells at 32 by activating two or more wordlines concurrently. More specifically, a challenge is applied to the array of SRAM cells 14 by pre-loading chosen values into the SRAM cells, and choosing the wordlines to concurrently activate. The PUF response is simply the value that the SRAM produces from a read operation when the challenge condition is applied. Thus, in response to the challenge, an output is detected at 33 for each column of the array, such that the outputs collectively form a response to the authentication challenge. In an example implementation, the response to the authentication challenge is compared to a previous challenge of the same array. When the responses match, the challenge is deemed to be authenticated and thus a requested function may be performed by the computing device. On the other hand, the challenge is not authenticated when the responses are mismatched. This technique for performing a physical unclonable function may also serve as the basis for other types of computing functions.
(25) The array of SRAM cells 14 requires additional circuitry to enable the concurrent activation of multiple wordlines because the capability of activating multiple wordlines has no use in SRAM's traditional tasks of reading, writing, and storing data. While a particular type of PUF response is described herein, the PUF can be view more broadly as causing electrical contention across different memory cells. The accumulator circuit is a way of causing this contention in one embodiment, but even if the SRAM architecture changes in the future, different ways of modifying the architecture to cause contention are envisioned.
(26) This technique is further described as follows. Let the challenge applied to an array be C: {c.sub.0, c.sub.1, . . . , c.sub.Y-1}, where Y is the number of rows in the array of SRAM cells. Each element c.sub.i of the challenge corresponds to SRAM row i as follows, and say that any row is active in a challenge if its corresponding challenge element (c.sub.i) is either 0 or 1. For example,
(27) if c.sub.i=0, then row i is loaded with 0s and WL.sub.i is on during evaluation.
(28) if c.sub.i=1, then row i is loaded with 1 s and WL.sub.i is on during evaluation.
(29) if c.sub.i=2, then row i is loaded with 0s and WL.sub.i is off during evaluation
(30) if c.sub.i=3, then row i is loaded with 1 s and WL.sub.i is off during evaluation
(31) A single SRAM column constitutes a Bitline PUF with a 1-bit response, and Bitline PUFs are therefore inherently parallel because a challenge is applied concurrently to many SRAM columns. Let a 1-bit PUF at column i be denoted P.sub.i, and its response to challenge C be denoted P.sub.i (C). Let an X-column Bitline PUF be denoted P.sub.0:X-1 and its response be P.sub.0:X-1 (C)={P.sub.0 (C), P.sub.1 (C), P.sub.X-1 (C)}. Note that for simplicity of description the same challenge is applied to all columns of the SRAM PUF. Different challenges can be applied to different columns provided that the challenges agree on which rows are active. This can be particularly useful in the case of inactive rows that retain pre-existing data through a challenge. Therefore, a Bitline PUF with Y rows and X columns has 4.sup.Y possible challenges and 2.sup.X possible responses.
(32) The sequence of events necessary to implement the challenge technique is shown in
(33) With reference to
(34) At the start of the second phase 42, the accumulator of every wordline is reset. In each of the subsequent Y or fewer cycles, a log.sub.2 (Y)-bit select signal sets high the flip-flop of one active wordline as indicated at 52. Once all flip-flops are appropriately loaded, an evaluation signal passes the loaded values to the wordline drivers, so that multiple wordlines are asserted in the same cycle during the PUF evaluation.
(35) The proposed technique requires augmentation to the wordline control circuitry so that multiple wordlines can be concurrently enabled during PUF evaluation. In a typical SRAM, an externally supplied log.sub.2 (Y)-bit address is decoded to select exactly one of the Y rows for reading; the selected row then uses a clocked driver to set its wordline high at the appropriate time during the clock cycle. The proposed PUF technique requires multiple wordlines to be concurrently enabled, and this can be accomplished by having a flip-flop that accumulates wordline activation signals at the input of each wordline driver. An example wordline control circuit 60 is shown in
(36) Evaluating the PUF response is identical to an SRAM read operation, except that multiple wordlines are asserted. For each column, the cells at any active row will discharge one of the two complementary bitlines, and considering that different cells in the column will discharge each bitline, this causes both bitlines of a column to be discharged during the evaluation. The discharging of bitlines for a variety of challenges are shown in
(37) The three phases of operation as seen in
(38)
For a 256-column by 256-row SRAM with a 5 ns cycle time, this corresponds to a latency of 2.6 μs and a response throughput of 99.8 Mbps.
(39) The results in this disclosure are obtained from circuit simulation using the Ngspice simulator (Rev 25). On account of the long runtimes of large SPICE simulations, the columns of the simulated bitline PUFs have only 16 rows, whereas a real SRAM would typically have hundreds of rows.
(40) Transistor and interconnect models used to validate the design are from the freely-available Predictive Technology Model (PTM). More specifically, the transistor models are BSIM4 PTM models for a 90 nm process. Transistor sizes are shown in Table 1 below.
(41) TABLE-US-00001 TABLE 1 Process Variation Sizing vth0 [mV] lint [nm] W [nm] L [nm] μ σ μ σ SRAM CELL n1, n2 200 90 397 13.4 7.5 3 n3, n4 140 90 397 16.0 7.5 3 p1, p2 140 90 −339 16.0 7.5 3 Sense Amp & NMOS 1000 90 397 6.0 7.5 3 Percentage PMOS 1000 90 −339 6.0 7.5 3
(42) To better represent a real design, the 16 SRAM rows simulated are modeled as being distributed over a typical-length bitline. In this way, the 16 rows can be considered as existing among many others within a realistic-sized SRAM. Assume for bitline modeling an SRAM with 520 rows and a cell height of 0.72 μm, for a total length of 374.4 μm per bitline. According to the PTM interconnect calculator, a 374.4 μm local interconnect in 90 nm technology has a total resistance of 183.04Ω and capacitance of 69.67 fC. The resistance and capacitance is distributed such that the bitlines between each pair of adjacent rows is implemented by a wire model with an 11.44Ω resistance between two capacitors of 2.17 fC each.
(43) To model process variations from fabrication, random parameter variation is applied to every transistor of each PUF instance. The transistor parameters determining threshold voltage and length are replaced by normally distributed N (μ, σ.sup.2) random variables. Table 1 shows the mean and standard deviation for each such parameter.
(44) Random dopant fluctuation is represented in transistor parameter vth0. The mean value for threshold voltage is the default value in the transistor model, and the standard deviation depends on transistor geometry according to Eq. 1; larger devices have less threshold variation than the small devices in the SRAM cells. We use a value of 1.8 mV μm for A.sub.VT.
(45)
Variations in effective transistor length are represented by changes to parameter lint. Lint, standing for internal length, represents the difference between nominal and effective transistor length. The nominal value of lint is 7.5 nm and its standard deviation is set to 3 nm based on the observation that effective transistor length has a 3σ value that is 10% of overall transistor length.
(46) Thermal noise is modeled in SPICE by transient random voltage sources. As represented by small grey circles in
(47)
(48) The simulation methodology explained in the previous section is used for experimental evaluation of the Bitline PUF. Uniqueness of responses, and reliability with respect to temperature and supply voltage variation are evaluated. Finally, power consumption and susceptibility to modeling attacks are considered. These experimental results indicate that the Bitline PUF is promising as a reliable and unique strong PUF.
(49) The mixture of c.sub.i values in each challenge can bias PUFs toward producing 0-responses or 1-responses, but ideal challenges should produce either response with equal probability across a population. From a circuit perspective, ideal challenges should discharge both bitlines with equal strength to increase the sensitivity of response to process variations. For a symmetric SRAM cell, where only variation differentiates n.sub.1 and p.sub.1 from n.sub.2 and p.sub.2, the two complementary bitlines discharge with equal strength when the same number of NMOS transistors (i.e, n.sub.1 or n.sub.2 of each active cell) are discharging each one. The challenges that cause this situation are those having an equal number of c.sub.i=0 and c.sub.i=1 values, along with some unspecified mixture of inactive rows with c.sub.i=2 or c.sub.i=3; challenges satisfying this condition are therefore denoted as “unbiased”.
(50) The heat map of
(51) The number of unbiased challenges having exactly k challenge values with c.sub.i=0 and k with c.sub.i=1 is given by n′.sub.k (Y) (Eq. 3). The number of total unbiased challenges with any number of c.sub.i=0 and c.sub.i=1 values is given by n(Y) (Eq. 4). The number of unbiased challenges is exponential in the number of rows Y (i.e. the challenge size). Therefore, an adversary cannot hope to mimic a PUF by simply recording all challenge-response pairs, and must instead resort to predicting responses using a parametric model.
(52)
(53) A single PUF should always respond to the same challenge similarly, and two PUF instances should never respond to the same unbiased challenges similarly. For a challenge C, a comparison of two responses from the same PUF is denoted “within-class”, and a comparison of responses from two different PUFs is denoted “between-class.” Hamming distance (Eq. 5) is used to quantify the similarity of responses in each between-class or within-class comparison. Within-class distances are a measure of unreliability, and between-class distances are a measure of uniqueness.
(54) Within-class and between-class Hamming distances are evaluated experimentally on 32-column bitline PUFs. For each of 200 random unbiased challenges, 5 PUF instances are generated and the challenge is applied 6 times to each. Within-class distances are obtained by comparing the responses of the same PUF to the same challenge, and between-class distances are obtained by comparing the response of different PUFs to the same challenge. The separability of within-class and between-class Hamming distances (
HD(P.sub.0:X-1,P′.sub.0:X-1,C)=Σ.sub.i=0 . . . X-1P.sub.i(C)⊕p′.sub.i(C) (5)
(55) A PUF response should not be highly sensitive to changes in supply voltage or temperature, as this would restrict its useful application to tightly controlled environments. PUF responses at the nominal operating conditions of 1.2 V supply and 27° C. are compared against a variety of temperatures from 0° C. to 80° C. and supply voltages from 1.1 V to 1.3 V (
(56) The Bitline PUF is susceptible to modeling attacks if the challenge-response pairs (CRPs) can be observed, and therefore care must be taken to avoid or obfuscate the CRPs of the Bitline PUF. Otherwise, an adversary can use a parametric model to predict the PUF response to any challenge, without needing a dictionary of all possible challenge response pairs.
(57) A modeling attack on bitline PUFs is demonstrated using support vector machine (SVM) classification. The task of the SVM classifier is, after training on some number of observed CRPs, to correctly predict responses to new challenges. To use SVM classification, each CRP is converted to a pair (x, y)|x ε{0, 1}.sup.4Y, yε{−1, +1} where Y is the number of rows in the PUF and the number of values in the challenge. In the pair (x, y), x represents the challenge and is determined according to Eq. 6, while y represents the response of the PUF to the challenge. Note that for SVM classification, negative responses are entered as the value −1 instead of 0.
(58)
(59)
(60) Parametric models exist for many PUFs including the arbiter PUF. Yet, the practical usefulness of PUFs with parametric models is not diminished because modeling resistance can be assured through obfuscating or preventing access to the PUF responses. The Bitline PUF is uniquely suited to protection via access control because it uses ordinary SRAM, and as such can employ SRAM access protection mechanisms such as ARM TrustZone.
(61) The power consumption of a bitline PUF evaluation is higher than that of a standard SRAM read operation. More specifically, a bitline PUF draws significant current during metastability when the bitline potentials are approximately equal. During metastability, all of the cells that are active in the challenge are drawing current, either through p.sub.1 and n.sub.2, or else through p.sub.2 and n.sub.1.
(62)
(63) This disclosure presents a new PUF design termed the Bitline PUF. The Bitline PUF is a low cost solution that shares most of its circuitry with SRAM, and is created by adding two logic gates and a flip-flop to the wordline driver of each SRAM row to enable challenge-response hashing. The Bitline PUF, applied to a SRAM of typical size, has a response latency of 2.6 μs and response throughput of 99.8 Mbps. Circuit simulation indicates that responses produced by the Bitline PUF in 90 nm technology have a nominal bit error rate of 0.023, and that the bit error rate does not exceed 0.076 for any supply voltage between 1.1 V and 1.3 V, or temperature between 0° C. and 80° C.
(64) The foregoing description of the embodiments has been provided for purposes of illustration and description. It is not intended to be exhaustive or to limit the disclosure. Individual elements or features of a particular embodiment are generally not limited to that particular embodiment, but, where applicable, are interchangeable and can be used in a selected embodiment, even if not specifically shown or described. The same may also be varied in many ways. Such variations are not to be regarded as a departure from the disclosure, and all such modifications are intended to be included within the scope of the disclosure.