G06F9/4405

APPROXIMATE SYNCHRONIZATION FOR PARALLEL DEEP LEARNING
20170351530 · 2017-12-07 ·

Techniques facilitating synchronization of processing engines for parallel deep learning are provided. In one example, a first processing component associated with a processor and processing components can: generate first output data based on input data associated with a machine learning process, wherein the processing components are communicatively coupled with an assignment component via a network; transmit the first output data to a second processing component of the processing components, wherein the first processing component and the second processing component comprise a first group of the processing components and the first group of the processing components is determined by the assignment component based on a first defined criterion; receive communication data generated by the second processing component; and generate second output data based on the communication data, wherein the second output data is an updated version of the first output data stored in the memory of the first processing component.

APPLICATION PROGRAMMING INTERFACE TO LIMIT MEMORY
20230185611 · 2023-06-15 ·

Apparatuses, systems, and techniques to limit memory during execution of one or more kernels and/or thread groups during PPU execution. In at least one embodiment, a process indicates a memory limit for one or more kernels and/or thread groups to a parallel processing library, and said parallel processing library restricts memory allocation for said one or more kernels and/or thread groups according to said memory limit.

ELECTRONIC DEVICE FOR BOOTING OPERATING SYSTEM USING PLURALITY OF CORES AND OPERATION METHOD THEREOF

An electronic device is provided. The electronic device includes a processor including a plurality of cores and a memory electrically connected with the processor and storing instructions. The instructions store instructions, when executed, causing the processor to control a dispatcher to determine whether loading of a plurality of modules for booting an operating system of the electronic device is completed, identify at least one module, dependency of which is cleared, based on a module information table stored in the memory, as it is determined that there is a module, loading of which is not completed, among the plurality of modules, identify at least one of state information and priority information of a module loader allocated to each of two or more cores among the plurality of cores based on a core information table stored in the memory, as the at least one module is identified, and select at least one of two or more module loaders respectively allocated to the cores and distribute the at least one module, based on the identified information.

Out-of-band custom baseboard management controller (BMC) firmware stack monitoring system and method

An Information Handling System (IHS) includes multiple hardware devices, and a baseboard Management Controller (BMC) in communication with the plurality of hardware devices. The BMC includes executable instructions for monitoring a parameter of one or more of the hardware devices of the IHS when a custom BMC firmware stack is executed on the BMC. The instructions that monitor the parameter are separate and distinct from the instructions of the custom BMC firmware stack. When the parameter exceeds a specified threshold, the instructions are further executed to control the BMC to perform one or more operations to remediate the excessive parameter.

Coordinated initialization system

A coordinated initialization system includes a computing system with first and second initialization subsystems coupled to a coordinated initialization subsystem. The coordinated initialization subsystem receives first and second initialization progress information associated with respective first and second initialization subsystem operations performed by the respective first and second initialization subsystems. Using a coordinated initialization database that identifies dependences between the first and second initialization operations, the coordinated initialization subsystem determines that the first initialization progress information identifies a first initialization operation that is going to be performed by the first initialization subsystem and that is dependent on a second initialization operation that is identified by the second initialization progress information and that has not yet been performed by the second initialization subsystem and, in response, causes the first initialization subsystem to pause the first initialization subsystem operations until the second initialization operation has been performed.

SCALE OUT FILE SYSTEM USING REFS AND SCALE OUT VOLUME

Some storage systems are configured with VDL (valid data length) type controls that are implemented on a per cluster basis and, in some instances, on a sub-cluster basis, rather than simply a per file basis. In some instances, per-cluster VDL metadata for the storage clusters is stored and referenced at the edge data volume nodes of a distributed network for the storage system rather than, and/or without, storing or synchronizing the per-cluster VDL metadata at a master node that manages the corresponding storage clusters for the different data volume nodes. Sequence controls are also provided and managed by the master node and synchronized with the edge data volume nodes to further control access to data contained in the storage clusters.

DUAL PURPOSE BOOT REGISTERS

Techniques for detecting an early boot error are provided. In one aspect, a host processor may transition to a first phase of an early boot process. The early boot process may occur before the host processor initializes a primary link between the host processor and a management controller. The host processor may then update a dual purpose boot register to store an early boot phase identifier corresponding to the first phase and an early boot status identifier corresponding to the first phase.

Display Method for Operating Systems, Display Device for Operating Systems, and Multi-System Terminal
20170308414 · 2017-10-26 ·

The present disclosure provides a display method for operating systems, a display device for operating systems, and a multi-system terminal. The display method includes: running multiple operating systems simultaneously; and displaying each of the multiple operating systems in a preset display mode. The multi-system operating system that are run simultaneously is displayed on one or more display screens, and restarting a terminal is avoided when switching the operating systems, thus a user operates the multiple operating system simultaneously or separately, which facilitates user's operation and improves user's experience.

ENERGY-EFFICIENT CORE VOLTAGE SELECTION APPARATUS AND METHOD

A processor core energy-efficiency core ranking scheme akin to a favored core in a multi-core processor system. The favored core is the energy-efficient core that allows an SoC to use the core with the lowest V.sub.min for energy-efficiency. Such V.sub.min values may be fused in appropriate registers or stored in NVM during HVM. An OS scheduler achieves optimal energy performance using the core ranking information to schedule certain applications on the core with lowest V.sub.min. A bootstrap flow identifies a bootstrap processor core (BSP) as the most energy efficiency core of the SoC and assigns that core the lowest APIC ID value according to the lowest V.sub.min. Upon reading the fuses or NVM, the microcode/BIOS calculates and ranks the cores. As such, microcode/BIOS calculates and ranks core APIC IDs based on efficiency around LFM frequencies. Based on the calculated and ranked cores, the microcode or BIOS transfers BSP ownership to the most efficiency core.

Methods and apparatus for recovering errors with an inter-processor communication link between independently operable processors
09798377 · 2017-10-24 · ·

Methods and apparatus for an inter-processor communication (IPC) link between two (or more) independently operable processors. In one aspect, the IPC protocol is based on a “shared” memory interface for run-time processing (i.e., the independently operable processors each share (either virtually or physically) a common memory interface). In another aspect, the IPC communication link is configured to support a host driven boot protocol used during a boot sequence to establish a basic communication path between the peripheral and the host processors. Various other embodiments described herein include sleep procedures (as defined separately for the host and peripheral processors), and error handling.