COHERENT NODE CONTROLLER
20210064530 ยท 2021-03-04
Inventors
Cpc classification
G06F2212/621
PHYSICS
Y02D10/00
GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
G06F12/0833
PHYSICS
International classification
Abstract
A cache coherent node controller at least includes one or more network interface controllers, each network interface controller includes at least one network interface, and at least two coherent interfaces each configured for communication with a microprocessor. A computer system includes one or more of nodes wherein each node is connected to at least one network switch, each node at least includes a cache coherent node controller.
Claims
1.-12. (canceled)
13. A cache coherent node controller comprising: a. one or more network interface controllers, NIC, each network interface controller includes at least one network interface, c. at least two coherent interfaces, CIF, each configured for communication with a microprocessor, d. two or more interconnected protocol engines, PE, and e. at least two coherent link layers, each coherent link layer at least connected to one protocol engine, each coherent link layer provides coherent interfaces, wherein each network interface controller is connected with two coherent link layers, each coherent link layer is connected with one neighbouring coherent link layer, and where the protocol engines are directly or indirectly connected with coherent link layers.
14. A computer system comprising one or more nodes wherein each node is connected to at least one network switch, each node comprising: a. a cache coherent node controller comprising: i. one or more network interface controllers, each network interface controller includes at least one network interface, ii. at least two coherent interfaces each connected to a microprocessor, and iii. one or more interconnected protocol engines, PE.
15. The computer system according to claim 14, wherein the cache coherent node controller further comprises: iv. one or more coherent link layers, each coherent link layer connected to one protocol engine, each coherent link layer provides coherent interfaces.
16. The computer system according to claim 15, wherein each network interface controller is connected with a coherent link layer, each coherent link layers is connected with one neighbouring coherent link layer, and where the protocol engines are directly or indirectly connected with coherent link layers.
17. The computer system according to claim 15, wherein connection between each node and the network switches is provided by the network interfaces.
18. The system according to claim 15, wherein one network interface of each cache coherent node controller is connected to one network switch to form a 1-dimensional fat tree topology.
19. The system according to claim 15, wherein two network interfaces of each cache coherent node controller are connected to two network switches to form a 2-dimensional fat tree topology.
20. The system according to claim 15, wherein three network interfaces of each cache coherent node controller are connected to three network switches to form a 3-dimensional fat tree topology.
21. The system according to claim 15, wherein four network interfaces of each cache coherent node controller are connected to four network switches to form a 4-dimensional fat tree topology.
22. The system according to claim 15, wherein n network interfaces of each cache coherent node controller are connected to n network switches to form a n-dimensional fat tree topology.
23. The system according to claim 17, wherein the network switches are one of: an Omnipath switch, an InfiniBand switch, a Gen-Z switch or an Ethernet switch.
24. The system according to claim 17, wherein connection between each node and the network switches is provided by fibre optic links to the network switches.
Description
BRIEF DESCRIPTION OF DRAWINGS
[0031] Following is a brief description of the drawings in order to make the invention more readily understandable, the discussion that follows will refer to the accompanying drawings, in which
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
DETAILED DESCRIPTION OF THE INVENTION
[0043] In the following it is firstly disclosed general embodiments in accordance to the present invention, thereafter particular exemplary embodiments will be described. Where possible reference will be made to the accompanying drawings and where possible using reference numerals in the drawings. It shall be noted however that the drawings are exemplary embodiments only and other features and embodiments may well be within the scope of the invention as described.
[0044] In the context of the present invention a processor is defined as a module containing one or more CPU cores, one or more memory channels, one or more I/O channels, one or more coherent links and memory in the form caches or with or without main memory in any form included in the processor module.
[0045] A node contains one or more processor modules, one or more memory banks, zero or one or more I/O modules and one or more coherent node controllers.
[0046] In the present invention it is not necessarily different bandwidths within a fat tree configuration as indicated in the figures.
[0047] The present invention is based on integrating several functions that in combination will produce a flexible and versatile interconnect component for building large scale computer systems. The key of the invention is the close connection between a packet-based network switch and a cache coherent switch for inter-processor communication inside the node. This connection enables building of scalable systems with low latency and high bandwidth communication with very low power consumption. As an example, a device with 8 coherent links to CPUs and 4 interconnect links to connect to other devices of the same kind will consume less than 100 W of power. Such a device will reduce the main system interconnect port requirement with a factor of four compared to the traditional configuration for HPC systems with 2 CPUs per network interface.
[0048] A Network Interface Controller, NIC, can be implemented for use with different types of high performance networks like Ethernet, InfiniBand, Omnipath (Intel), BXI (ATOS/Bull), and Gen-Z (Gen-Z consortium standard). Gen-Z defines a load/store architecture where CPU instructions can address memory directly without going through a message passing protocol through software driver layers. Omnipath has a similar function in addition to RDMA functionality for message passing. The ability for processors to use load and store instructions to operate directly on memory locations across the interconnect reduces the overhead for data movement. This is most important for short data transfers where latency and overhead becomes a significant factor for performance.
First Embodiment of the Present Invention
[0049] The present invention will now be described with a specified example. The invention introduces a new element, namely a cache coherent node controller, it also provides examples of the use of said new element in system configurations from 1-dimensional fat tree configurations to n-dimensional fat tree configurations.
[0050] The cache coherent node controller may at least comprise: [0051] a. one or more network interface controllers, each network interface controller includes at least one network interface. An example of such a coherent node controller is shown schematically in
[0054] The cache coherent node controller can be scaled up or down maintaining the same principle of incorporating one or more NICs.
[0055] Obviously, the design layout shown in
Second Embodiment of the Present Invention
[0056] In the second embodiment of the present invention the coherent node controller with NICs described above is used in a computer system. The computer system comprises one or more nodes where each node is connected to at least one network switch,
[0057] In the event that only one NIC is included or only one NIC is connected to network switches one will have a one dimensional system as depicted in
[0058]
[0059] The cache coherent node controller of
TABLE-US-00001 100a, 100b, A computer, a server or a blade server - above referred 100c, 100d to as a node 200 On-chip distributed switching, such as but not limited to. Ethernet switch, Omnipath switch, Infiniband switch, and Gen-Z switch. CIF Coherent Interface CLL Coherent Link Layer NIC Network Interface Controllers NIF Network Interface PE Protocol Engine HPC High performance computing CNCNIC Coherent Node Controller w/Integrated NICs CPU Central Processing Unit RDMA Remote direct memory access