SYSTEMS AND METHODS FOR MACHINE LEARNING PROCESSOR WITH INTRA-DIE AND INTER-DIE WIRELESS COMMUNICATION
20190171931 ยท 2019-06-06
Inventors
Cpc classification
H04B5/266
ELECTRICITY
H01L25/0652
ELECTRICITY
H01L23/481
ELECTRICITY
H01L2223/6677
ELECTRICITY
G06F17/16
PHYSICS
H01L2225/06565
ELECTRICITY
H01L23/5227
ELECTRICITY
H01L2225/06541
ELECTRICITY
International classification
H01L25/065
ELECTRICITY
Abstract
The need for specialized machine learning processors has become a major focal point in the industry as the computation demanded by machine learning workloads grows rapidly. However, the industry has quickly come to a roadblock as the industry realizes, in the context of machine learning, the device memory is more important than complex computation ability. As a result, there has been renewed interest in three dimensional and 2.5D machine learning processors, which are more suited to handle large volume of data. However, conventional multi-layer devices use through silicon vias (TSVs) which have a number of disadvantages and drawbacks. To address these issues, method and devices are disclosed that allow wireless communication between processing layers in a 3D and/or 2.5D integrated dice machine learning processors.
Claims
1. A machine learning processor optimized for carrying out machine learning operations, the processor comprising: a substrate; a plurality of processing dice on the substrate, wherein one or more of the plurality of the processing dice are substantially vertically stacked on one another and on the substrate and the processing dice comprise circuitry to carry out machine learning operations and one or more machine learning operations is carried out with circuitry on or embedded in two or more of the plurality of processing dice; and one or more wireless communication components embedded in and/or on one or more of the substrate and the plurality of processing dice, wherein the wireless communication components carry wireless communication signals between one or more of the substrate and the plurality of processing dice and the circuitry therein and wherein the operations of the machine learning processor comprises the communication signals.
2. The processor of claim 1, wherein the one or more wireless communication components comprise components providing electromagnetic coupling between the components.
3. The processor of claim 2, wherein the electromagnetic coupling comprises one or more of capacitive coupling and inductive coupling.
4. The processor of claim 3, wherein the electromagnetic coupling comprises capacitive coupling and metal layers in the plurality of dice and/or substrate form the capacitive coupling between the plurality of dice and/or the substrate.
5. The processor of claim 3, wherein the electromagnetic coupling comprises capacitive coupling, and wherein a transistor gate in a processing die is used to form a parallel plate of a capacitive coupling.
6. The processor of claim 3, wherein the electromagnetic coupling comprises inductive coupling and metal layers in a processing die are arranged in substantially polygonic shape to form an inductor coil.
7. The processor of claim 1, wherein the wireless communication components carry wireless communication signals between one or more regions of one or more of the substrate and the plurality of dice.
8. The processor of claim 1, wherein the one or more wireless communication components comprise components providing communication via electromagnetic radiation.
9. The processor of claim 8, wherein the wireless communication components comprise an antenna and/or an antenna array.
10. The processor of claim 1, wherein the wireless communication components comprise components providing wireless communication via one or more of capacitive coupling, inductive coupling and electromagnetic radiation.
11. The processor of claim 1, wherein the communication signals are generated via a communication protocol.
12. The processor of claim 11, wherein the communication protocol comprises differential signaling.
13. The processor of claim 12, wherein differential signaling comprises one or more of Chordal coding, PAM-X, CNRZ-5, CNRZ-X, permutation vectors, vector coding, and line coding.
14. A method of manufacturing a machine learning processor, comprising: vertically stacking a plurality of processing dice, wherein each die comprises circuitry configured to carry out machine learning processes; and forming a wireless communication link between processing dice, wherein the circuitry in two or more processing dice are configured to carry out machine learning processes via the wireless communication link.
15. The method of claim 14, wherein the wireless communication link comprises one or more of capacitive coupling, inductive coupling and electromagnetic radiation.
16. The method of claim 14, wherein forming the wireless communication link comprises: forming metal layers in the plurality of processing dice to generate parallel plate capacitors between the plurality of processing dice.
17. The method of claim 14, wherein forming the wireless communication link comprises: using transistor gates in the plurality of processing dice to form parallel plate capacitors between the plurality of processing dice.
18. The method of claim 14, wherein forming the wireless communication link comprises arranging metal layers in the plurality of processing dice in substantially polygonic shape to form inductor coils.
19. A machine learning processor comprising: a plurality of vertically stacked processing dice comprising circuitry to carry out machine learning operations; wireless communication means embedded in two or more of the plurality of processing dice and configured to carry wireless communication signals between one or more of the plurality of processing dice and the circuitry therein and wherein the machine learning operations comprise the communication signals.
20. The machine learning processor of claim 19 wherein the wireless communication means comprises one or more of electromagnetic coupling means and electromagnetic radiation means.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0025] These drawings and the associated description herein are provided to illustrate specific embodiments of the invention and are not intended to be limiting.
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
DETAILED DESCRIPTION
[0036] The following detailed description of certain embodiments presents various descriptions of specific embodiments of the invention. However, the invention can be embodied in a multitude of different ways as defined and covered by the claims. In this description, reference is made to the drawings where like reference numerals may indicate identical or functionally similar elements.
[0037] Unless defined otherwise, all terms used herein have the same meaning as are commonly understood by one of skill in the art to which this invention belongs. All patents, patent applications and publications referred to throughout the disclosure herein are incorporated by reference in their entirety. In the event that there is a plurality of definitions for a term herein, those in this section prevail.
[0038] When the terms one, a or an are used in the disclosure, they mean at least one or one or more, unless otherwise indicated.
[0039] Before any embodiments are explained in detail, it is understood that the disclosed technology is not limited in its application to the details of construction and the arrangement of components set forth in the following description or illustrated in the following drawings. The disclosed technology is capable of other embodiments and of being practiced or of being carried out in various ways. Furthermore, it is understood that sometimes a different number of apparati, systems and methods may be illustrated and/or described but the disclosed technology may be embodied as containing any number of such aspects. Finally, it should be understood that some embodiments may combine presented aspects. For example, a specific embodiment may utilize both inductive coupling and capacitive coupling.
[0040] Machine learning processors can be fabricated by vertically stacking die (e.g., round or square silicon die) layers building three-dimensional processors to carry out the operations of the processor. Each layer can include various logic circuits, transistors, capacitors, resistors, inductors or other electrical components. The various layers within a 3D machine learning processor need to be connected or communicate electrically or wirelessly to provide processor functionality. Sometimes a signal on one layer is communicated to another layer. TSV or other physical, electrical connection between vertically stacked layers achieve the communication objective, but introduce technical difficulties of their own.
[0041]
[0042]
[0043]
[0044] Wireless communication within and/or between layers of a vertically stacked machine learning processor can address several issues present in conventional technologies. Such wireless communication can be implemented via electromagnetic coupling mechanisms, capacitive links, inductive links, electromagnetic radiation and/or other on-chip wireless technology.
[0045] Examples of communication within and/or between layers and circuits therein can include, generating a signal, encoding and/or decoding a signal using Os and is, communication using a communication protocol, such as differential signaling (e.g., Chordal coding, PAM-X, CNRZ-5, CNRZ-X, permutation vectors, vector coding, line coding and any combination of the aforementioned), and/or other on-chip communication technology.
[0046]
[0047] Example capacitive coupling links 0090, 0105, 0115 and other similar capacitive links can be created by fabricating metal layer regions in stacked dice, such that alignment of metal layers in dice forms one or more capacitors between the metal layers. Charging and discharging such capacitive coupling links can be used to implement communication between stacked dice. In one embodiment, the gate of transistors from different stacked dice layers can be used and aligned such that a capacitor forms between the transistor gates and charging/discharging that capacitor can be used to provide communication between stacked dice where the transistor gates are formed. Various semiconductor fabrication processes, for example, deposition, etching, patterning using lithography, doping and/or other suitable fabrication techniques may be employed to fabricate the capacitive coupling links described above.
[0048]
[0049]
[0050] As described, the processor communications can be carried out via wireless links described herein as well as conventional communication techniques (e.g., TSV) and/or any combination thereof as determined by the design objectives/constrains of a machine learning processor.
[0051] In one embodiment, the inductive links 0200, 0205, 0210 and 0215 can be formed by depositing or implanting a plurality of inductor coils. In one embodiment, the metal layers of a semiconductor chip may be used for the creation of inductor coils. In some embodiments, metal layers in dice can be arranged in polygonic shape (e.g., square, rectangle, circle, oval, etc.) to form inductive coils. In another embodiment, a ferromagnetic core may be placed in the center and/or above the inductor coil regions to enhance the inductance. In one embodiment, topological insulator-based quantum inductors may be used. Various semiconductor fabrication processes, for example, deposition, etching, patterning using lithography, doping and/or other suitable fabrication techniques may be employed to fabricate the inductive coupling links described above.
[0052]
[0053] As described, the illustrated wireless communication links and/or their locations are provided as examples. It is understood that the number, location and/or type of inter/intra die wireless communication links vary depending on the design objectives/constraints of a multilayer machine learning processors and/or circuits and signals implemented in its respective layers.
[0054]
[0055] As described earlier, the disclosed wireless communication links address several shortcoming and issues present in conventional communication techniques such as TSVs. For example, the disclosed wireless communication links have a smaller foot print on a die and can be less complex and costly to fabricate and therefore can improve yield when fabricating machine learning processors.
[0056] The term processor can refer to various microprocessors, controllers, and/or hardware and software optimized for loading and executing software programming instructions or processors including processing units optimized for handling high volume matrix data related to machine learning algorithms. Examples of processors built according to the described techniques can be used in a variety of applications, including applications in the field of artificial intelligence (AI) and related fields. For example, a machine learning processor built according to the described embodiments can be configured to perform inference and/or training of a neural network. Other applications of the disclosed technology exist and can readily be ascertained by persons of ordinary skill in the art.
[0057] While the foregoing has described what are considered to be the best mode and/or other examples, it is understood that various modifications may be made therein and that the subject matter disclosed herein may be implemented in various forms and examples, and that the teachings may be applied in numerous applications, only some of which have been described herein.
[0058] Except as stated immediately above, nothing that has been stated or illustrated is intended or should be interpreted to cause a dedication of any component, step, feature, object, benefit, advantage, or equivalent to the public, regardless of whether it is or is not recited in the claims.
[0059] It will be understood that the terms and expressions used herein have the ordinary meaning as is accorded to such terms and expressions with respect to their corresponding respective areas of inquiry and study except where specific meanings have otherwise been set forth herein. Relational terms such as first, second, other and another and the like may be used solely to distinguish one entity or action from another without necessarily requiring or implying any actual such relationship or order between such entities or actions.
[0060] The terms comprises, comprising, or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. An element proceeded by a or an does not, without further constraints, preclude the existence of additional identical elements in the process, method, article, or apparatus that comprises the element.
[0061] The Abstract of the Disclosure is provided to allow the reader to quickly ascertain the nature of the technical disclosure. It is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. In addition, in the foregoing Detailed Description, it can be seen that various features are grouped together in various implementations. This is for purposes of streamlining the disclosure and is not to be interpreted as reflecting an intention that the claimed implementations require more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter lies in less than all features of a single disclosed implementation. Thus, the following claims are hereby incorporated into the Detailed Description, with each claim standing on its own as a separately claimed subject matter.