Three-dimensional array of re-programmable non-volatile memory elements having vertical bit lines and a single-sided word line architecture
09721653 ยท 2017-08-01
Assignee
Inventors
Cpc classification
G11C2213/78
PHYSICS
H10N70/235
ELECTRICITY
H10N70/823
ELECTRICITY
G11C11/5685
PHYSICS
H10N70/245
ELECTRICITY
G11C13/0011
PHYSICS
H10B63/20
ELECTRICITY
B82Y10/00
PERFORMING OPERATIONS; TRANSPORTING
H10B63/84
ELECTRICITY
H10B63/30
ELECTRICITY
H10N70/231
ELECTRICITY
G11C11/5614
PHYSICS
G11C2213/75
PHYSICS
H10N70/884
ELECTRICITY
G11C2013/0088
PHYSICS
H10N70/826
ELECTRICITY
G11C13/0007
PHYSICS
H10B63/845
ELECTRICITY
G11C2213/55
PHYSICS
G11C11/06035
PHYSICS
G11C2213/19
PHYSICS
H10N70/24
ELECTRICITY
G11C11/5678
PHYSICS
G11C13/025
PHYSICS
International classification
G11C11/00
PHYSICS
G11C13/00
PHYSICS
B82Y10/00
PERFORMING OPERATIONS; TRANSPORTING
G11C13/02
PHYSICS
G11C11/56
PHYSICS
G11C11/06
PHYSICS
Abstract
A three-dimensional array especially adapted for memory elements that reversibly change a level of electrical conductance in response to a voltage difference being applied across them. Memory elements are formed across a plurality of planes positioned different distances above a semiconductor substrate. Bit lines to which the memory elements of all planes are connected are oriented vertically from the substrate and through the plurality of planes.
Claims
1. A data memory, comprising: a plurality of non-volatile re-programmable memory elements arranged in a three dimensional pattern defined by rectangular coordinates along x, y and z directions and with a plurality of parallel planes stacked in the z direction on top of a semiconductor substrate; a plurality of first conductive lines elongated in the z direction through the plurality of planes and arranged in a two dimensional rectangular array of rows in the x direction and columns in the y-direction; a plurality of second conductive lines elongated in the x-direction across each of the planes and spaced apart in the y-direction between and separated from the first plurality of conductive lines in each of the planes, wherein the first and second conductive lines cross adjacent each other at a plurality of locations across each of the planes, the plurality of second conductive lines forming pairs of adjacent second conductive lines, wherein each non-volatile re-programmable memory element is connected between a corresponding first conductive line and a corresponding second conductive line crossing each other at an associated location of the plurality of locations; and a plurality of select devices arranged to individually couple a selected row of first conductive lines to respective sensing circuits via a plurality of third conductive lines, wherein, for each of the pairs of adjacent second conductive lines, each second conductive line is connected to only one corresponding row of first conductive line such that both of the second conductive lines of each pair are connected to the same row of first conductive lines and such that each connection made with each first conductive line of the corresponding row of first conductive lines is via a single corresponding non-volatile re-programmable memory element, wherein each memory element is positioned between a respective first conductive line and a respective second conductive line, and is in the plane of said plurality of planes where said respective second conductive line is placed, to contact the first conductive line in the y-direction, and wherein the individual memory elements are characterized by a level of electrical conductance that changes in response to an electrical stimulus applied thereto through the first and second conductive lines.
2. The data memory of claim 1, wherein: the third conductive lines are elongated in the y-direction, the select devices are arranged to make those of the first conductive lines aligned in the y-direction connectable with selected ones of the plurality of third conductive lines, and a plurality of control lines extending in the x-direction and individually connected with a plurality of the select devices aligned in the x-direction to enable connection of a plurality of first conductive lines aligned in the x-direction with different ones of the third conductive lines.
3. The data memory of claim 2, wherein the plurality of select devices and the plurality of third conductive lines are formed in the semiconductor substrate and the plurality of planes are formed as a stack over the semiconductor substrate.
4. The data memory of claim 1, wherein the plurality of select devices and the plurality of third conductive lines are formed in the semiconductor substrate and the plurality of planes are formed as a stack over the semiconductor substrate.
5. The data memory of claim 4, wherein the individual memory elements are characterized by including a material that reversibly changes its level of electrical conductance between at least first and second stable levels in response to an electrical stimulus being applied through the first and second conductive lines between which the memory element is connected.
6. The data memory of claim 1, wherein the individual memory elements are characterized by including a material that reversibly changes its level of electrical conductance between at least first and second stable levels in response to an electrical stimulus being applied through the first and second conductive lines between which the memory element is connected.
7. A data memory, comprising: a plurality of non-volatile re-programmable memory elements arranged in a three dimensional pattern defined by rectangular coordinates along x, y and z directions and with a plurality of parallel planes stacked in the z direction on top of a semiconductor substrate; a plurality of first conductive lines elongated in the z direction through the plurality of planes and arranged in a two dimensional rectangular array of rows in the x direction and columns in the y-direction; a plurality of second conductive lines elongated in the x-direction across each of the planes and spaced apart in the y-direction between and separated from the first plurality of conductive lines in each of the planes, wherein the first and second conductive lines cross adjacent each other at a plurality of locations across each of the planes, the plurality of second conductive lines forming pairs of adjacent second conductive lines, wherein each non-volatile re-programmable memory element is connected between a corresponding first conductive line and a corresponding second conductive line crossing each other at an associated location of the plurality of locations; and a plurality of select devices arranged to individually couple a selected row of first conductive lines to respective sensing circuits via a plurality of third conductive lines, wherein, for each of the pairs of adjacent second conductive lines, each second conductive line is connected to only one corresponding row of first conductive line such that both of the second conductive lines of each pair are connected to the same row of first conductive lines and such that each connection made with each first conductive line of the corresponding row of first conductive lines is via a single corresponding non-volatile re-programmable memory element, wherein each memory element is positioned between a respective first conductive line and a respective second conductive line, and is in the plane of said plurality of planes where said respective second conductive line is placed, to contact the first conductive line in the y-direction, and wherein memory elements include individual quantities of material adjacent individual ones of the crossings of the first and second conductive lines, the memory elements being separated from one another in all of the x, y and z-directions.
8. The data memory of claim 1, wherein the memory elements are additionally positioned to contact the first conductive lines in the y-direction.
9. The data memory of claim 1, additionally comprising a circuit connected to the first and second conductive lines to apply electrical stimuli to selected ones of the memory elements, the electrical stimuli being applied to cause the selected memory elements to switch from their first to second stable states having substantially the same magnitude but different polarities as the electrical stimuli applied to cause the memory elements to switch from their second to first stable states.
10. A data memory, comprising: a plurality of non-volatile re-programmable memory elements arranged in a three dimensional pattern defined by rectangular coordinates along x, y and z directions and with a plurality of parallel planes stacked in the z direction on top of a semiconductor substrate; a plurality of first conductive lines elongated in the z direction through the plurality of planes and arranged in a two dimensional rectangular array of rows in the x direction and columns in the y-direction; a plurality of second conductive lines elongated in the x-direction across each of the planes and spaced apart in the y-direction between and separated from the first plurality of conductive lines in each of the planes, wherein the first and second conductive lines cross adjacent each other at a plurality of locations across each of the planes, the plurality of second conductive lines forming pairs of adjacent second conductive lines, wherein each non-volatile re-programmable memory element is connected between a corresponding first conductive line and a corresponding second conductive line crossing each other at an associated location of the plurality of locations; a plurality of select devices arranged to individually couple a selected row of first conductive lines to respective sensing circuits via a plurality of third conductive lines; and data input-output circuits connected to the third plurality of conductive lines, wherein, for each of the pairs of adjacent second conductive lines, each second conductive line is connected to only one corresponding row of first conductive line such that both of the second conductive lines of each pair are connected to the same row of first conductive lines and such that each connection made with each first conductive line of the corresponding row of first conductive lines is via a single corresponding non-volatile re-programmable memory element, and wherein each memory element is positioned between a respective first conductive line and a respective second conductive line, and is in the plane of said plurality of planes where said respective second conductive line is placed, to contact the first conductive line in the y-direction.
11. The data memory of claim 10, wherein the data input-output circuits include a plurality of sense amplifiers that are connected with the third conductive lines in a manner to provide a binary representation of data carried by the third conductive lines when data are being read from the memory.
12. The data memory of claim 10, wherein the data input-output circuits additionally include data programming circuits that apply voltages to selected ones of the third conductive lines for programming data into at least some of the memory elements connected thereto through the plurality of select devices.
13. A method of operating a re-programmable non-volatile memory system, comprising: utilizing at least one integrated circuit that includes a three-dimensional pattern of memory elements defined by rectangular coordinates along x, y and z-directions and which includes: a plurality of parallel planes stacked in the z-direction on top of a semiconductor substrate, a plurality of conductive local bit lines elongated in the z-direction through the plurality of planes and arranged in a two-dimensional rectangular array of rows in the x and columns in the y-directions, a plurality of word lines elongated in the x-direction across each of the planes and spaced apart in the y-direction between and separated from the plurality of local bit lines in each of the planes, wherein the local bit lines and word lines cross adjacent each other at a plurality of locations across each of the planes, the plurality of word lines forming pairs of adjacent word lines, a plurality of re-programmable non-volatile memory elements, each non-volatile re-programmable memory element being connected between a corresponding local bit line and a corresponding word line crossing each other at an associated location of the plurality of locations, and a plurality of select devices arranged to individually couple a selected row of local bit lines to respective sensing circuits via a plurality of global bit lines in response to a select control signal; applying said select control signal to the plurality of select devices in order to connect the selected row of local bit lines to individual ones of the global bit lines; and causing a selected one or more of the plurality of memory elements to simultaneously change between at least first and second states by applying one of first and second stimuli through one or more of the plurality of word lines and global bit lines between which the selected one or more of the plurality of memory elements are operably connected, wherein, for each of the pairs of adjacent word lines, each word line is connected to only one corresponding row of local bit lines, such that both of the word lines of each pair are connected to the same row of local bit lines and such that each connection made with each local bit line of the corresponding row of local bit lines is via a single corresponding non-volatile re-programmable memory element, and wherein each memory element is positioned between a respective local bit line and a respective word line to contact the respective local bit line in the y-direction.
14. The method of claim 13, wherein causing a selected one or more of the plurality of memory elements to simultaneously change between their at least first and second states includes simultaneously resetting to the first state two rows of memory elements connected to the selected row of local bit lines and along opposite sides thereof in the y-direction by applying the second electrical stimulus to a plurality of the global bit lines and to two of the word lines adjacent the selected row of local bit lines on said opposite sides thereof.
15. The method of claim 14, wherein causing a selected one or more of the plurality of memory elements to simultaneously change between their at least first and second states additionally includes subsequently programming data into one of the two rows of reset memory elements by applying the first electrical stimulus to a plurality of the global bit lines and one of the word lines adjacent the selected row of local bit lines on the side thereof of the one row of memory elements being programmed.
16. The method of claim 14, additionally comprising reading the states of a row of memory elements along one side in the y-direction of the selected row of local bit lines by applying reading electrical stimuli to the global bit lines and a word line such that the states of the memory elements are read from an electrical quantity appearing on the global bit lines.
17. The method of claim 14, wherein causing a selected one or more of the plurality of memory elements to simultaneously change between their at least first and second states additionally includes applying a plurality of pulses of the one of the first and second stimuli and, in between successive pulses, verifying the state of the one or more of the plurality of memory elements.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
DESCRIPTION OF EXEMPLARY EMBODIMENTS
(18) Referring initially to
(19) A circuit for selectively connecting internal memory elements with external data circuits is preferably formed in a semiconductor substrate 13. In this specific example, a two-dimensional array of select or switching devices Q.sub.xy are utilized, where x gives a relative position of the device in the x-direction and y its relative position in the y-direction. The individual devices Q, may be a select gate or select transistor, as examples. Global bit lines (GBL.sub.x) are elongated in the y-direction and have relative positions in the x-direction that are indicated by the subscript. The global bit lines (GBL.sub.x) are individually connectable with the source or drain of the select devices Q having the same position in the x-direction, although during reading and also typically programming only one select device connected with a specific global bit line is turned on at time. The other of the source or drain of the individual select devices Q is connected with one of the local bit lines (LBL.sub.xy). The local bit lines are elongated vertically, in the z-direction, and form a regular two-dimensional array in the x (row) and y (column) directions.
(20) In order to connect one set (in this example, designated as one row) of local bit lines with corresponding global bit lines, control gate lines SG.sub.y are elongated in the x-direction and connect with control terminals (gates) of a single row of select devices Q.sub.xy having a common position in the y-direction. The select devices Q.sub.xy therefore connect one row of local bit lines (LBL.sub.xy) across the x-direction (having the same position in the y-direction) at a time to corresponding ones of the global bit-lines (GBL.sub.x), depending upon which of the control gate lines SG.sub.y receives a voltage that turns on the select devices to which it is connected. The remaining control gate lines receive voltages that keep their connected select devices off. It may be noted that since only one select device (Q.sub.xy) is used with each of the local bit lines (LBL.sub.xy), the pitch of the array across the semiconductor substrate in both x and y-directions may be made very small, and thus the density of the memory storage elements large.
(21) Memory storage elements M.sub.zxy are formed in a plurality of planes positioned at different distances in the z-direction above the substrate 13. Two planes 1 and 2 are illustrated in
(22) Each plane of the three-dimensional memory cell structure is typically formed of at least two layers, one in which the conductive word lines WL.sub.zy are positioned and another of a dielectric material that electrically isolates the planes from each other. Additional layers may also be present in each plane, depending for example on the structure of the memory elements M.sub.zxy. The planes are stacked on top of each other on a semiconductor substrate with the local bit lines LBL.sub.xy being connected with storage elements M.sub.zxy of each plane through which the local bit lines extend.
(23)
(24) The memory system controller 25 typically receives data from and sends data to a host system 31. The controller 25 usually contains an amount of random-access-memory (RAM) 34 for temporarily storing such data and operating information. Commands, status signals and addresses of data being read or programmed are also exchanged between the controller 25 and host 31. The memory system operates with a wide variety of host systems. They include personal computers (PCs), laptop and other portable computers, cellular telephones, personal digital assistants (PDAs), digital still cameras, digital movie cameras and portable audio players. The host typically includes a built-in receptacle 33 for one or more types of memory cards or flash drives that accepts a mating memory system plug 35 of the memory system but some hosts require the use of adapters into which a memory card is plugged, and others require the use of cables therebetween. Alternatively, the memory system may be built into the host system as an integral part thereof.
(25) The memory system controller 25 conveys to decoder/driver circuits 37 commands received from the host. Similarly, status signals generated by the memory system are communicated to the controller 25 from the circuits 37. The circuits 37 can be simple logic circuits in the case where the controller controls nearly all of the memory operations, or can include a state machine to control at least some of the repetitive memory operations necessary to carry out given commands. Control signals resulting from decoding commands are applied from the circuits 37 to the word line select circuits 27, local bit line select circuits 29 and data input-output circuits 21. Also connected to the circuits 27 and 29 are address lines 39 from the controller that carry physical addresses of memory elements to be accessed within the array 10 in order to carry out a command from the host. The physical addresses correspond to logical addresses received from the host system 31, the conversion being made by the controller 25 and/or the decoder/driver 37. As a result, the circuits 29 partially address the designated storage elements within the array 10 by placing proper voltages on the control elements of the select devices Q.sub.xy to connect selected local bit lines (LBL.sub.xy) with the global bit lines (GBL.sub.x). The addressing is completed by the circuits 27 applying proper voltages to the word lines WL.sub.zy of the array.
(26) Although the memory system of
(27) Although each of the memory elements M.sub.zxy in the array of
(28) Previously programmed memory elements whose data have become obsolete may be addressed and re-programmed from the states in which they were previously programmed. The states of the memory elements being re-programmed in parallel will therefore most often have different starting states among them. This is acceptable for many memory element materials but it is usually preferred to re-set a group of memory elements to a common state before they are re-programmed. For this purpose, the memory elements may be grouped into blocks, where the memory elements of each block are simultaneously reset to a common state, preferably one of the programmed states, in preparation for subsequently programming them. If the memory element material being used is characterized by changing from a first to a second state in significantly less time than it takes to be changed from the second state back to the first state, then the reset operation is preferably chosen to cause the transition taking the longer time to be made. The programming is then done faster than resetting. The longer reset time is usually not a problem since resetting blocks of memory elements containing nothing but obsolete data is typically accomplished in a high percentage of the cases in the background, therefore not adversely impacting the programming performance of the memory system.
(29) With the use of block re-setting of memory elements, a three-dimensional array of variable resistive memory elements may be operated in a manner similar to current flash memory cell arrays. Resetting a block of memory elements to a common state corresponds to erasing a block of flash memory cells to an erased state. The individual blocks of memory elements herein may be further divided into a plurality of pages of storage elements, wherein the memory elements of a page are programmed and read together. This is like the use of pages in flash memories. The memory elements of an individual page are programmed and read together. Of course, when programming, those memory elements that are to store data that are represented by the reset state are not changed from the reset state. Those of the memory elements of a page that need to be changed to another state in order to represent the data being stored in them have their states changed by the programming operation.
(30) An example of use of such blocks and pages is illustrated in
(31) A page is also illustrated in
(32) Example resetting, programming and reading operations of the memory array of
(33) To reset (erase) a block of memory elements, the memory elements in that block are placed into their high resistance state. This state will be designated as the logical data state 1, following the convention used in current flash memory arrays but it could alternatively be designated to be a 0. As shown by the example in
(34) The following steps may be taken to reset all the memory elements of a block, using the block illustrated in
(35) The result is that H volts are placed across each of the memory elements of the block. In the example block of
(36) It may be noted that no stray currents will flow because only one word line has a non-zero voltage. The voltage on the one word line of the block can cause current to flow to ground only through the memory elements of the block. There is also nothing that can drive any of the unselected and electrically floating local bit lines to H volts, so no voltage difference will exist across any other memory elements of the array outside of the block. Therefore no voltages are applied across unselected memory elements in other blocks that can cause them to be inadvertently disturbed or reset.
(37) It may also be noted that multiple blocks may be concurrently reset by setting any combination of word lines and the adjacent select gates to H or H respectively. In this case, the only penalty for doing so is an increase in the amount of current that is required to simultaneously reset an increased number of memory elements. This affects the size of the power supply that is required.
(38) The memory elements of a page are preferably programmed concurrently, in order to increase the parallelism of the memory system operation. An expanded version of the page indicated in
(39) For programming a page, only one row of select devices is turned on, resulting in only one row of local bit lines being connected to the global bit lines. This connection alternatively allows the memory elements of both pages of the block to be programmed in two sequential programming cycles, which then makes the number of memory elements in the reset and programming units equal.
(40) Referring to
(41) The result of this operation, for the example memory element material mentioned above, is that a programming current I.sub.prog is sent through the memory element M.sub.124, thereby causing that memory element to change from a reset to a set (programmed) state. The same will occur with other memory elements (not shown) that are connected between the selected word line WL.sub.12 and a local bit line (LBL) that has the programming voltage level H applied.
(42) An example of the relative timing of applying the above-listed programming voltages is to initially set all the global bit lines (GBLs), the selected select gate line (SG), the selected word line and two adjacent word lines on either side of the selected word line on the one page all to the voltage level M. After this, selected ones of the GBLs are raised to the voltage level H according to the data being programmed while simultaneously dropping the voltage of the selected word line to 0 volts for the duration of the programming cycle. The word lines in plane 1 other than the selected word line WL.sub.12 and all word lines in the unselected other planes can be weakly driven to M, some lower voltage or allowed to float in order to reduce power that must be delivered by word line drivers that are part of the circuits 27 of
(43) By floating all the local bit lines other than the selected row (in this example, all but LBL.sub.12, LBL.sub.22 and LBL.sub.32), voltages can be loosely coupled to outer word lines of the selected plane 1 and word lines of other planes that are allowed to float through memory elements in their low resistance state (programmed) that are connected between the floating local bit lines and adjacent word lines. These outer word lines of the selected plane and word lines in unselected planes, although allowed to float, may eventually be driven up to voltage level M through a combination of programmed memory elements.
(44) There are typically parasitic currents present during the programming operation that can increase the currents that must be supplied through the selected word line and global bit lines. During programming there are two sources of parasitic currents, one to the adjacent page in a different block and another to the adjacent page in the same block. An example of the first is the parasitic current I.sub.P1 shown on
(45) Other parasitic currents can similarly flow from the same local bit line LBL.sub.22 to an adjacent word line in other planes. The presence of these currents may limit the number of planes that can be included in the memory system since the total current may increase with the number of planes. The limitation for programming is in the current capacity of the memory power supply, so the maximum number of planes is a tradeoff between the size of the power supply and the number of planes. A number of 4-8 planes may generally be used in most cases.
(46) The other source of parasitic currents during programming is to an adjacent page in the same block. The local bit lines that are left floating (all but those connected to the row of memory elements being programmed) will tend to be driven to the voltage level M of unselected word lines through any programmed memory element on any plane. This in turn can cause parasitic currents to flow in the selected plane from these local bit lines at the M voltage level to the selected word line that is at zero volts. An example of this is given by the currents I.sub.P2, I.sub.P3 and I.sub.P4 shown in
(47) The above-described programming techniques ensure that the selected page is programmed (local bit lines at H, selected word line at 0) and that adjacent unselected word lines are at M. As mentioned earlier, other unselected word lines can be weakly driven to M or initially driven to M and then left floating. Alternately, word lines in any plane distant from the selected word line (for example, more than 5 word lines away) can also be left uncharged (at ground) or floating because the parasitic currents flowing to them are so low as to be negligible compared to the identified parasitic currents since they must flow through a series combination of five or more ON devices (devices in their low resistance state). This can reduce the power dissipation caused by charging a large number of word lines.
(48) While the above description assumes that each memory element of the page being programmed will reach its desired ON value with one application of a programming pulse, a program-verify technique commonly used in NOR or NAND flash memory technology may alternately be used. In this process, a complete programming operation for a given page includes of a series of individual programming operations in which a smaller change in ON resistance occurs within each program operation. Interspersed between each program operation is a verify (read) operation that determines whether an individual memory element has reached its desired programmed level of resistance or conductance consistent with the data being programmed in the memory element. The sequence of program/verify is terminated for each memory element as it is verified to reach the desired value of resistance or conductance. After all of memory elements being programmed are verified to have reached their desired programmed value, programming of the page of memory elements is then completed. An example of this technique is described in U.S. Pat. No. 5,172,338.
(49) With reference primarily to
(50) Parasitic currents during such a read operation have two undesirable effects. As with programming, parasitic currents place increased demands on the memory system power supply. In addition, it is possible for parasitic currents to exist that are erroneously included in the currents though the addressed memory elements that are being read. This can therefore lead to erroneous read results if such parasitic currents are large enough.
(51) As in the programming case, all of the local bit lines except the selected row (LBL.sub.12, LBL.sub.22 and LBL.sub.32 in the example of
(52) Although the neighboring word lines should be at V.sub.R to minimize parasitic currents, as in the programming case it may be desirable to weakly drive these word lines or even allow them to float. In one variation, the selected word line and the neighboring word lines can be pre-charged to V.sub.R and then allowed to float. When the sense amplifier is energized, it may charge them to V.sub.R so that the potential on these lines is accurately set by the reference voltage from the sense amplifier (as opposed to the reference voltage from the word line driver). This can occur before the selected word line is changed to V.sub.RVsense but the sense amplifier current is not measured until this charging transient is completed.
(53) Reference cells may also be included within the memory array 10 to facilitate any or all of the common data operations (erase, program, or read). A reference cell is a cell that is structurally as nearly identical to a data cell as possible in which the resistance is set to a particular value. They are useful to cancel or track resistance drift of data cells associated with temperature, process non-uniformities, repeated programming, time or other cell properties that may vary during operation of the memory. Typically they are set to have a resistance above the highest acceptable low resistance value of a memory element in one data state (such as the ON resistance) and below the lowest acceptable high resistance value of a memory element in another data state (such as the OFF resistance). Reference cells may be global to a plane or the entire array, or may be contained within each block or page.
(54) In one embodiment, multiple reference cells may be contained within each page. The number of such cells may be only a few (less than 10), or may be up to a several percent of the total number of cells within each page. In this case, the reference cells are typically reset and written in a separate operation independent of the data within the page. For example, they may be set one time in the factory, or they may be set once or multiple times during operation of the memory array. During a reset operation described above, all of the global bit lines are set low, but this can be modified to only set the global bit lines associated with the memory elements being reset to a low value while the global bit lines associated with the reference cells are set to an intermediate value, thus inhibiting them from being reset. Alternately, to reset reference cells within a given block, the global bit lines associated with the reference cells are set to a low value while the global bit lines associated with the data cells are set to an intermediate value. During programming, this process is reversed and the global bit lines associated with the reference cells are raised to a high value to set the reference cells to a desired ON resistance while the memory elements remain in the reset state. Typically the programming voltages or times will be changed to program reference cells to a higher ON resistance than when programming memory elements.
(55) If, for example, the number of reference cells in each page is chosen to be 1% of the number of data storage memory elements, then they may be physically arranged along each word line such that each reference cell is separated from its neighbor by 100 data cells, and the sense amplifier associated with reading the reference cell can share its reference information with the intervening sense amplifiers reading data. Reference cells can be used during programming to ensure the data is programmed with sufficient margin. Further information regarding the use of reference cells within a page can be found in U.S. Pat. Nos. 6,222,762, 6,538,922, 6,678,192 and 7,237,074.
(56) In a particular embodiment, reference cells may be used to approximately cancel parasitic currents in the array. In this case the value of the resistance of the reference cell(s) is set to that of the reset state rather than a value between the reset state and a data state as described earlier. The current in each reference cell can be measured by its associated sense amplifier and this current subtracted from neighboring data cells. In this case, the reference cell is approximating the parasitic currents flowing in a region of the memory array that tracks and is similar to the parasitic currents flowing in that region of the array during a data operation. This correction can be applied in a two-step operation (measure the parasitic current in the reference cells and subsequently subtract its value from that obtained during a data operation) or simultaneously with the data operation. One way in which simultaneous operation is possible is to use the reference cell to adjust the timing or reference levels of the adjacent data sense amplifiers. An example of this is shown in U.S. Pat. No. 7,324,393.
(57) In conventional two-dimensional arrays of variable resistance memory elements, a diode is usually included in series with the memory element between the crossing bit and word lines. The primary purpose of the diodes is to reduce the number and magnitudes of parasitic currents during resetting (erasing), programming and reading the memory elements. A significant advantage of the three-dimensional array herein is that resulting parasitic currents are fewer and therefore have a reduced negative effect on operation of the array than in other types of arrays.
(58) Diodes may also be connected in series with the individual memory elements of the three-dimensional array, as currently done in other arrays of variable resistive memory elements, in order to reduce further the number of parasitic currents but there are disadvantages in doing so. Primarily, the manufacturing process becomes more complicated. Added masks and added manufacturing steps are then necessary. Also, since formation of the silicon p-n diodes often requires at least one high temperature step, the word lines and local bit lines cannot then be made of metal having a low melting point, such as aluminum that is commonly used in integrated circuit manufacturing, because it may melt during the subsequent high temperature step. Use of a metal, or composite material including a metal, is preferred because of its higher conductivity than the conductively doped polysilicon material that is typically used for bit and word lines because of being exposed to such high temperatures. An example of an array of resistive switching memory elements having a diode formed as part of the individual memory elements is given in patent application publication no. US 2009/0001344 A1.
(59) Because of the reduced number of parasitic currents in the three-dimensional array herein, the total magnitude of parasitic currents can be managed without the use of such diodes. In addition to the simpler manufacturing processes, the absence of the diodes allows bi-polar operation; that is, an operation in which the voltage polarity to switch the memory element from its first state to its second memory state is opposite of the voltage polarity to switch the memory element from its second to its first memory state. The advantage of the bi-polar operation over a unipolar operation (same polarity voltage is used to switch the memory element from its first to second memory state as from its second to first memory state) is the reduction of power to switch the memory element and an improvement in the reliability of the memory element. These advantages of the bi-polar operation are seen in memory elements in which formation and destruction of a conductive filament is the physical mechanism for switching, as in the memory elements made from metal oxides and solid electrolyte materials.
(60) The level of parasitic currents increases with the number of planes and with the number of memory elements connected along the individual word lines within each plane. But since the number of word lines on each plane does not significantly affect the amount of parasitic current, the planes may individually include a large number of word lines. The parasitic currents resulting from a large number of memory elements connected along the length of individual word lines can further be managed by segmenting the word lines into sections of fewer numbers of memory elements. Erasing, programming and reading operations are then performed on the memory elements connected along one segment of each word line instead of the total number of memory elements connected along the entire length of the word line.
(61) The re-programmable non-volatile memory array being described herein has many advantages. The quantity of digital data that may be stored per unit of semiconductor substrate area is high. It may be manufactured with a lower cost per stored bit of data. Only a few masks are necessary for the entire stack of planes, rather than requiring a separate set of masks for each plane. The number of local bit line connections with the substrate is significantly reduced over other multi-plane structures that do not use the vertical local bit lines. The architecture eliminates the need for each memory cell to have a diode in series with the resistive memory element, thereby further simplifying the manufacturing process and enabling the use of metal conductive lines. Also, the voltages necessary to operate the array are much lower than those used in current commercial flash memories.
(62) Since at least one-half of each current path is vertical, the voltage drops present in large cross-point arrays are significantly reduced. The reduced length of the current path due to the shorter vertical component means that there are approximately one-half the number memory cells on each current path and thus the leakage currents are reduced as is the number of unselected cells disturbed during a data programming or read operation. For example, if there are N cells associated with a word line and N cells associated with a bit line of equal length in a conventional array, there are 2N cells associated or touched with every data operation. In the vertical local bit line architecture described herein, there are n cells associated with the bit line (n is the number of planes and is typically a small number such as 4 to 8), or N+n cells are associated with a data operation. For a large N this means that the number of cells affected by a data operation is approximately one-half as many as in a conventional three-dimensional array.
(63) Materials Useful for the Memory Storage Elements
(64) The material used for the non-volatile memory storage elements M.sub.zxy in the array of
(65) Metal oxides are characterized by being insulating when initially deposited. One suitable metal oxide is a titanium oxide (TiO.sub.x). A previously reported memory element using this material is illustrated in
(66) But when a large negative voltage (such as 1.5 volt) is applied across the structure, the oxygen vacancies drift toward the top electrode and, as a result, the potential barrier Pt/TiO.sub.2 is reduced and a relatively high current can flow through the structure. The device is then in its low resistance (conductive) state. Experiments reported by others have shown that conduction is occurring in filament-like regions of the TiO.sub.2, perhaps along grain boundaries.
(67) The conductive path is broken by applying a large positive voltage across the structure of
(68) While this specific conduction mechanism may not apply to all metal oxides, as a group, they have a similar behavior: transition from a low conductive state to a high conductive occurs state when appropriate voltages are applied, and the two states are non-volatile. Examples of other materials include HfOx, ZrOx, WOx, NiOx, CoOx, CoalOx, MnOx, ZnMn.sub.2O.sub.4, ZnOx, TaOx, NbOx, HfSiOx, HfAlOx. Suitable top electrodes include metals with a high work function (typically >4.5 eV) capable to getter oxygen in contact with the metal oxide to create oxygen vacancies at the contact. Some examples are TaCN, TiCN, Ru, RuO, Pt, Ti rich TiOx, TiAlN, TaAlN, TiSiN, TaSiN, IrO.sub.2. Suitable materials for the bottom electrode are any conducting oxygen rich material such as Ti(O)N, Ta(O)N, TiN and TaN. The thicknesses of the electrodes are typically 1 nm or greater. Thicknesses of the metal oxide are generally in the range of 5 nm to 50 nm.
(69) Another class of materials suitable for the memory storage elements is solid electrolytes but since they are electrically conductive when deposited, individual memory elements need to be formed and isolated from one another. Solid electrolytes are somewhat similar to the metal oxides, and the conduction mechanism is assumed to be the formation of a metallic filament between the top and bottom electrode. In this structure the filament is formed by dissolving ions from one electrode (the oxidizable electrode) into the body of the cell (the solid electrolyte). In one example, the solid electrolyte contains silver ions or copper ions, and the oxidizable electrode is preferably a metal intercalated in a transition metal sulfide or selenide material such as A.sub.x(MB2).sub.1-x, where A is Ag or Cu, B is S or Se, and M is a transition metal such as Ta, V, or Ti, and x ranges from about 0.1 to about 0.7. Such a composition minimizes oxidizing unwanted material into the solid electrolyte. One example of such a composition is Ag.sub.x(TaS2).sub.1-x. Alternate composition materials include -AgI. The other electrode (the indifferent or neutral electrode) should be a good electrical conductor while remaining insoluble in the solid electrolyte material. Examples include metals and compounds such as W, Ni, Mo, Pt, metal silicides, and the like.
(70) Examples of solid electrolytes materials are: TaO, GeSe or GeS. Other systems suitable for use as solid electrolyte cells are: Cu/TaO/W, Ag/GeSe/W, Cu/GeSe/W, Cu/GeS/W, and Ag/GeS/W, where the first material is the oxidizable electrode, the middle material is the solid electrolyte, and the third material is the indifferent (neutral) electrode. Typical thicknesses of the solid electrolyte are between 30 nm and 100 nm.
(71) In recent years, carbon has been extensively studied as a non-volatile memory material. As a non-volatile memory element, carbon is usually used in two forms, conductive (or graphene like-carbon) and insulating (or amorphous carbon). The difference in the two types of carbon material is the content of the carbon chemical bonds, so called sp.sup.2 and sp.sup.3 hybridizations. In the sp.sup.3 configuration, the carbon valence electrons are kept in strong covalent bonds and as a result the sp.sup.3 hybridization is non-conductive. Carbon films in which the sp.sup.3 configuration dominates, are commonly referred to as tetrahedral-amorphous carbon, or diamond like. In the sp.sup.2 configuration, not all the carbon valence electrons are kept in covalent bonds. The weak tight electrons (phi bonds) contribute to the electrical conduction making the mostly sp.sup.2 configuration a conductive carbon material. The operation of the carbon resistive switching nonvolatile memories is based on the fact that it is possible to transform the sp.sup.3 configuration to the sp.sup.2 configuration by applying appropriate current (or voltage) pulses to the carbon structure. For example, when a very short (1-5 ns) high amplitude voltage pulse is applied across the material, the conductance is greatly reduced as the material sp.sup.2 changes into an sp.sup.3 form (reset state). It has been theorized that the high local temperatures generated by this pulse causes disorder in the material and if the pulse is very short, the carbon quenches in an amorphous state (sp.sup.3 hybridization). On the other hand, when in the reset state, applying a lower voltage for a longer time (300 nsec) causes part of the material to change into the sp.sup.2 form (set state). The carbon resistance switching non-volatile memory elements have a capacitor like configuration where the top and bottom electrodes are made of high temperature melting point metals like W, Pd, Pt and TaN.
(72) There has been significant attention recently to the application of carbon nanotubes (CNTs) as a non-volatile memory material. A (single walled) carbon nanotube is a hollow cylinder of carbon, typically a rolled and self-closing sheet one carbon atom thick, with a typical diameter of about 1-2 nm and a length hundreds of times greater. Such nanotubes can demonstrate very high conductivity, and various proposals have been made regarding compatibility with integrated circuit fabrication. It has been proposed to encapsulate short CNT's within an inert binder matrix to form a fabric of CNT's. These can be deposited on a silicon wafer using a spin-on or spray coating, and as applied the CNT's have a random orientation with respect to each other. When an electric field is applied across this fabric, the CNT's tend to flex or align themselves such that the conductivity of the fabric is changed. The switching mechanism from low-to-high resistance and the opposite is not well understood. As in the other carbon based resistive switching non-volatile memories, the CNT based memories have capacitor-like configurations with top and bottom electrodes made of high melting point metals such as those mentioned above.
(73) Yet another class of materials suitable for the memory storage elements is phase-change materials. A preferred group of phase-change materials includes chalcogenide glasses, often of a composition Ge.sub.xSb.sub.yTe.sub.z, where preferably x=2, y=2 and z=5. GeSb has also been found to be useful. Other materials include AgInSbTe, GeTe, GaSb, BaSbTe, InSbTe and various other combinations of these basic elements. Thicknesses are generally in the range of 1 nm to 500 nm. The generally accepted explanation for the switching mechanism is that when a high energy pulse is applied for a very short time to cause a region of the material to melt, the material quenches in an amorphous state, which is a low conductive state. When a lower energy pulse is applied for a longer time such that the temperature remains above the crystallization temperature but below the melting temperature, the material crystallizes to form poly-crystal phases of high conductivity. These devices are often fabricated using sub-lithographic pillars, integrated with heater electrodes. Often the localized region undergoing the phase change may be designed to correspond to a transition over a step edge, or a region where the material crosses over a slot etched in a low thermal conductivity material. The contacting electrodes may be any high melting metal such as TiN, W, WN and TaN in thicknesses from 1 nm to 500 nm.
(74) It will be noted that the memory materials in most of the foregoing examples utilize electrodes on either side thereof whose compositions are specifically selected. In embodiments of the three-dimensional memory array herein where the word lines (WL) and/or local bit lines (LBL) also form these electrodes by direct contact with the memory material, those lines are preferably made of the conductive materials described above. In embodiments using additional conductive segments for at least one of the two memory element electrodes, those segments are therefore made of the materials described above for the memory element electrodes.
(75) Steering elements are commonly incorporated into controllable resistance types of memory storage elements. Steering elements can be a transistor or a diode. Although an advantage of the three-dimensional architecture described herein is that such steering elements are not necessary, there may be specific configurations where it is desirable to include steering elements. The diode can be a p-n junction (not necessarily of silicon), a metal/insulator/insulator/metal (MIIM), or a Schottky type metal/semiconductor contact but can alternately be a solid electrolyte element. A characteristic of this type of diode is that for correct operation in a memory array, it is necessary to be switched on and off during each address operation. Until the memory element is addressed, the diode is in the high resistance state (off state) and shields the resistive memory element from disturb voltages. To access a resistive memory element, three different operations are needed: a) convert the diode from high resistance to low resistance, b) program, read, or reset (erase) the memory element by application of appropriate voltages across or currents through the diode, and c) reset (erase) the diode. In some embodiments one or more of these operations can be combined into the same step. Resetting the diode may be accomplished by applying a reverse voltage to the memory element including a diode, which causes the diode filament to collapse and the diode to return to the high resistance state.
(76) For simplicity the above description has consider the simplest case of storing one data value within each cell: each cell is either reset or set and holds one bit of data. However, the techniques of the present application are not limited to this simple case. By using various values of ON resistance and designing the sense amplifiers to be able to discriminate between several of such values, each memory element can hold multiple-bits of data in a multiple-level cell (MLC). The principles of such operation are described in U.S. Pat. No. 5,172,338 referenced earlier. Examples of MLC technology applied to three dimensional arrays of memory elements include an article entitled Multi-bit Memory Using Programmable Metallization Cell Technology by Kozicki et al., Proceedings of the International Conference on Electronic Devices and Memory, Grenoble, France, Jun. 12-17, 2005, pp. 48-53 and Time Discrete Voltage Sensing and Iterative Programming Control for a 4F2 Multilevel CBRAM by Schrogmeier et al. (2007 Symposium on VLSI Circuits).
(77) Specific Structural Examples of the Three-Dimensional Array
(78) Three alternative semiconductor structures for implementing the three-dimensional memory element array of
(79) A first example, illustrated in
(80) Referring to
(81) Each bit line pillar is connected to one of a set of global bit lines (GBL) in the silicon substrate running in the y-direction at the same pitch as the pillar spacing through the select devices (Q.sub.xy) formed in the substrate whose gates are driven by the select gate lines (SG) elongated in the x-direction, which are also formed in the substrate. The switching devices Q.sub.xy may be conventional CMOS transistors (or vertical npn transistors) and fabricated using the same process as used to form the other conventional circuitry. In the case of using npn transistors instead of MOS transistors, the select gate (SG) lines are replaced with the base contact electrode lines elongated in the x-direction. Also fabricated in the substrate but not shown in
(82) Each vertical strip of non-volatile memory element (NVM) material is sandwiched between the vertical local bit lines (LBL) and a plurality of word lines (WL) vertically stacked in all the planes. Preferably the NVM material is present between the local bit lines (LBL) in the x-direction. A memory storage element (M) is located at each intersection of a word line (WL) and a local bit line (LBL). In the case of a metal oxide described above for the memory storage element material, a small region of the NVM material between an intersecting local bit line (LBL) and word line (WL) is controllably alternated between conductive (set) and non-conductive (reset) states by appropriate voltages applied to the intersecting lines.
(83) There may also be a parasitic NVM element formed between the LBL and the dielectric between planes. By choosing the thickness of the dielectric strips to be large compared to the thickness of the NVM material layer (that is, the spacing between the local bit lines and the word lines), a field caused by differing voltages between word lines in the same vertical word line stack can be made small enough so that the parasitic element never conducts a significant amount of current. Similarly, in other embodiments, the non-conducting NVM material may be left in place between adjacent local bit lines if the operating voltages between the adjacent LBLs remain below the programming threshold.
(84) An outline of a process for fabricating the structure of
(85) A significant advantage of the configuration of
(86) A second example of implementing the three-dimensional memory cell array of
(87)
(88) An outline of a process for forming one plane of the three-dimensional structure of
(89) A third specific structural example is shown by
(90) The structure shown in
(91) The second example structure of
Embodiments with Reduced Leakage Currents
(92) Conventionally, diodes are commonly connected in series with the variable resistive elements of a memory array in order to reduce leakage currents that can flow through them. The highly compact 3D reprogrammable memory described in the present invention has an architecture that does not require a diode in series with each memory element while able to keep the leakage currents reduced. This is possible with short local vertical bit lines which are selectively coupled to a set of global bit lines. In this manner, the structures of the 3D memory are necessarily segmented and couplings between the individual paths in the mesh are reduced.
(93) Even if the 3D reprogrammable memory has an architecture that allows reduced current leakage, it is desirable to further reduce them. As described earlier and in connection with
(94)
(95) In accordance with the general principle described in connection with
(96) The architecture shown in
(97) Double-Global-Bit-Line Architecture
(98) According to one aspect of the invention, a 3D memory includes memory elements arranged in a three-dimensional pattern defined by rectangular coordinates having x, y and z-directions and with a plurality of parallel planes stacked in the z-direction. The memory elements in each plane are accessed by a plurality of word lines and local bit lines in tandem with a plurality of global bit lines. The plurality of local bit lines are in the z-direction through the plurality of planes and arranged in a two dimensional rectangular array of rows in the x-direction and columns in the y-directions. The plurality of word lines in each plane are elongated in the x-direction and spaced apart in the y-direction between and separated from the plurality of local bit lines in the individual planes. A non-volatile, reprogramming memory element is located near a crossing between a word line and local bit line and accessible by the word line and bit line and wherein a group of memory elements are accessible in parallel by a common word line and a row of local bit lines. The 3D memory further includes a double-global-bit line architecture with two global bit lines respectively serving even and odd local bit lines in a column thereof in the y-direction. This architecture allows one global bit line to be used by a sense amplifier to access a selected local bit line and the other global bit line to be used to access an unselected local bit lines adjacent the selected local bit line in the y-direction. In this way the adjacent, unselected local lines can be set to exactly a reference voltage same as that of the selected local bit line in order to eliminate leakage currents between adjacent bit lines.
(99)
(100) Memory storage elements M.sub.zxy are formed in a plurality of planes positioned at different distances in the z-direction above the substrate 13. Two planes 1 and 2 are illustrated in
(101) Each plane of the three-dimensional memory cell structure is typically formed of at least two layers, one in which the conductive word lines WL.sub.zy are positioned and another of a dielectric material that electrically isolates the planes from each other. Additional layers may also be present in each plane, depending for example on the structure of the memory elements M.sub.zxy. The planes are stacked on top of each other on a semiconductor substrate with the local bit lines LBL.sub.xy being connected with storage elements M.sub.zxy of each plane through which the local bit lines extend.
(102) Essentially the three-dimensional memory 10 shown in
(103) A circuit for selectively connecting internal memory elements with external data circuits is preferably formed in a semiconductor substrate 13. In this specific example, a two-dimensional array of select or switching devices Q.sub.xy are utilized, where x gives a relative position of the device in the x-direction and y its relative position in the y-direction. The individual devices Q.sub.xy may be a select gate or select transistor, as examples.
(104) A pair of global bit lines (GBL.sub.xA, GBL.sub.xB) is elongated in the y-direction and have relative positions in the x-direction that are indicated by the subscript. The individual devices Qxy each couples a local bit line to one global bit line. Essentially, each local bit line in a row is coupleable to one of a corresponding pair of global bit lines. Along a column of local bit lines, even local bit lines are coupleable to a first one of a corresponding pair of global bit line while odd local bit lines are coupleable to a second one of the corresponding pair of global bit line.
(105) Thus, a pair of global bit lines (GBL.sub.xA, GBL.sub.xB) at about the x-position, are individually connectable with the source or drain of the select devices Q in such a manner that local bits (LBLxy) at the x-position and along the y-direction are coupleable alternately to the pair of global bit lines (GBL.sub.xA, GBL.sub.xB). For example, the odd local bit lines along the column in the y-direction at the x=1 position (LBL.sub.1, LBL.sub.13, . . . ) are coupleable respectively via select devices (Q.sub.11, Q.sub.13, . . . ) to a first one GBL.sub.1A of the pair of global bit line at x=1. Similarly, the even local bit lines along the same column at the x=1 position (LBL.sub.12, LBL.sub.14, . . . ) are coupleable respectively via select devices (Q.sub.12, Q.sub.14, . . . ) to a second one GBL.sub.1B of the pair of global bit line at x=1.
(106) During reading and also typically programming, each global bit line is typically coupled to one local bit line by accessing through a corresponding select device that has been turned on. In this way a sense amplifier can access the local bit line via the coupled global bit line.
(107) In order to connect one set (in this example, designated as one row) of local bit lines with a corresponding set of global bit lines, control gate lines SG.sub.y are elongated in the x-direction and connect with control terminals (gates) of a single row of select devices Q.sub.xy having a common position in the y-direction. In this way, a set or page of memory elements can be accessed in parallel. The select devices Q.sub.xy therefore connect one row of local bit lines (LBL.sub.xy) across the x-direction (having the same position in the y-direction) at a time to corresponding ones of the global bit-lines, depending upon which of the control gate lines SG.sub.y receives a voltage that turns on the select devices to which it is connected. In the double-global-bit line architecture, there is a pair of global bit lines at about each x-position. If a row of local bit lines along the x-directions are coupleable to the first one of each pair of corresponding global bit lines, then along the y-direction, an adjacent row of local bit lines will be coupleable to the second one of each pair of corresponding global bit lines. For example, the row of local bit lines (LBL.sub.11, LBL.sub.21, LBL.sub.31, . . . ) along the x-direction are coupled to the first of each pair of corresponding global bit lines (GBL.sub.1A, GBL.sub.2A, GBL.sub.3A, . . . ) by turning on select devices (Q.sub.11, Q.sub.21, Q.sub.31, . . . ) via the control gate line SG.sub.1. Along the y-direction, an adjacent row of local bit lines (LBL.sub.12, LBL.sub.22, LBL.sub.32, . . . ) along the x-direction are coupled to the second of each pair of corresponding global bit lines (GBL.sub.1B, GBL.sub.2B, GBL.sub.3B, . . . ) by turning on select devices (Q.sub.12, Q.sub.22, Q.sub.32, . . . ) via the control gate line SG.sub.2. Similarly, a next adjacent row of local bit lines (LBL.sub.13, LBL.sub.23, LBL.sub.33, . . . ) are coupled to the first of each pair of corresponding global bit lines (GBL.sub.1A, GBL.sub.2A, GBL.sub.3A, . . . ) in an alternating manner between the first and second one of each pair.
(108) By accessing a row of local bit lines and an adjacent row using different ones of each pair of corresponding global bit lines, the row and adjacent row of local bit lines can be accessed independently at the same time. This is in contrast to the case of the single-global-bit-line architecture shown in
(109) As discussed in connection with
(110)
(111) The double-global-bit-line architecture doubles the number of global bit lines in the memory array compared to the architecture shown in
(112) Single-Sided Word Line Architecture
(113) According to another embodiment of the invention, a 3D memory includes memory elements arranged in a three-dimensional pattern defined by rectangular coordinates having x, y and z-directions and with a plurality of parallel planes stacked in the z-direction. The memory elements in each plane are accessed by a plurality of word lines and local bit lines in tandem with a plurality of global bit lines. The plurality of local bit lines are in the z-direction through the plurality of planes and arranged in a two dimensional rectangular array of rows in the x-direction and columns in the y-directions. The plurality of word lines in each plane are elongated in the x-direction and spaced apart in the y-direction between and separated from the plurality of local bit lines in the individual planes. A non-volatile, reprogramming memory element is located near a crossing between a word line and local bit line and accessible by the word line and bit line and wherein a group of memory elements are accessible in parallel by a common word line and a row of local bit lines. The 3D memory has a single-sided word line architecture with each word line exclusively connected to one row of memory elements. This is accomplished by providing one word line for each row of memory elements instead of sharing one word line between two rows of memory elements and linking the memory element across the array across the word lines. While the row of memory elements is also being accessed by a corresponding row of local bit lines, there is no extension of coupling for the row of local bit lines beyond the word line.
(114) A double-sided word line architecture has been described earlier in that each word line is connected to two adjacent rows of memory elements associated with two corresponding rows of local bit lines, one adjacent row along one side of the word line and another adjacent row along the other side. For example, as shown in
(115)
(116) The 3D memory array with the double-sided word line architecture illustrated in
(117)
(118)
(119)
(120) The single-sided word-line architecture doubles the number of word lines in the memory array compared to the architecture shown in
(121)
(122) The 3D array is configured for use of memory element (NVM) material that is non-conductive when first deposited. A metal oxide of the type discussed earlier has this characteristic. As explained with respect to
(123) Referring to
(124) Each bit line pillar is connected to one of a set of global bit lines (GBL) in the silicon substrate running in the y-direction at the same pitch as the pillar spacing through the select devices (Q.sub.xy) formed in the substrate whose gates are driven by the select gate lines (SG) elongated in the x-direction, which are also formed in the substrate. The switching devices Q.sub.xy may be conventional CMOS transistors (or vertical npn transistors) and fabricated using the same process as used to form the other conventional circuitry. In the case of using npn transistors instead of MOS transistors, the select gate (SG) lines are replaced with the base contact electrode lines elongated in the x-direction. Also fabricated in the substrate but not shown in
(125) Each vertical strip of non-volatile memory element (NVM) material is sandwiched between the vertical local bit lines (LBL) and a plurality of word lines (WL) vertically stacked in all the planes. Preferably the NVM material is present between the local bit lines (LBL) in the x-direction. A memory storage element (M) is located at each intersection of a word line (WL) and a local bit line (LBL). In the case of a metal oxide described above for the memory storage element material, a small region of the NVM material between an intersecting local bit line (LBL) and word line (WL) is controllably alternated between conductive (set) and non-conductive (reset) states by appropriate voltages applied to the intersecting lines.
(126) There may also be a parasitic NVM element formed between the LBL and the dielectric between planes. By choosing the thickness of the dielectric strips to be large compared to the thickness of the NVM material layer (that is, the spacing between the local bit lines and the word lines), a field caused by differing voltages between word lines in the same vertical word line stack can be made small enough so that the parasitic element never conducts a significant amount of current. Similarly, in other embodiments, the non-conducting NVM material may be left in place between adjacent local bit lines if the operating voltages between the adjacent LBLs remain below the programming threshold.
(127) The single-sided word line architecture almost double the number of word line in the memory array compared to the double-sided one. This disadvantage is offset by providing a more partitioned memory array with less leakage currents among the memory elements.
(128) While the exemplary embodiments have been described using a 3D co-ordinate system preferably with orthogonal axes, other embodiment in which the local bit lines LBL, word lines WL and global bit lines GBL cross at angles different than 90 degrees are also possible and contemplated.
(129) Conclusion
(130) Although the various aspects of the present three-dimensional non-volatile memory and methods have been described with respect to exemplary embodiments thereof, it will be understood that the present memory and methods are entitled to protection within the full scope of the appended claims.