Patent classifications
G06F7/556
Computer-Implemented Method of Executing SoftMax
The present disclosure concerns a method of executing a SoftMax function, the method comprising: (i) pre-storing in memory M fraction components (fc.sub.j) in binary form, derived from the expression 2.sup.(j/M), said fc.sub.j forming a lookup table (T) of size M; (ii) calculating, for each z.sub.i, an element y.sub.i of a number of the form 2.sup.y.sup.
HARDWARE ACCELERATOR METHOD AND DEVICE
A processor-implemented hardware accelerator method includes: receiving input data; loading a lookup table (LUT); determining an address of the LUT by inputting the input data to a comparator; obtaining a value of the LUT corresponding to the input data based on the address; and determining a value of a nonlinear function corresponding to the input data based on the value of the LUT, wherein the LUT is determined based on a weight of a neural network that outputs the value of the nonlinear function.
HARDWARE ACCELERATOR METHOD AND DEVICE
A processor-implemented hardware accelerator method includes: receiving input data; loading a lookup table (LUT); determining an address of the LUT by inputting the input data to a comparator; obtaining a value of the LUT corresponding to the input data based on the address; and determining a value of a nonlinear function corresponding to the input data based on the value of the LUT, wherein the LUT is determined based on a weight of a neural network that outputs the value of the nonlinear function.
EXPONENTIAL CALCULATOR USING PARALLEL PROCESSOR SYSTEMS
An exponential calculator based on parallel computing is disclosed. The exponential calculator includes a master system and a plurality of nodes interconnected with each other to transfer and receive information and perform sub-computations independently and simultaneously. The master system is configured to select a number of nodes from the plurality of nodes required to perform sub-computations for calculation of an integer exponent. The selected nodes is configured to receive value of a node base and a node exponent from the master system. The selected nodes calculate a first computation value, a second computation value and a third computation value. The master system is further configured to instruct a sub-set of the selected nodes to perform summation of a final sub-computation of the selected nodes and provide an output.
EXPONENTIAL CALCULATOR USING PARALLEL PROCESSOR SYSTEMS
An exponential calculator based on parallel computing is disclosed. The exponential calculator includes a master system and a plurality of nodes interconnected with each other to transfer and receive information and perform sub-computations independently and simultaneously. The master system is configured to select a number of nodes from the plurality of nodes required to perform sub-computations for calculation of an integer exponent. The selected nodes is configured to receive value of a node base and a node exponent from the master system. The selected nodes calculate a first computation value, a second computation value and a third computation value. The master system is further configured to instruct a sub-set of the selected nodes to perform summation of a final sub-computation of the selected nodes and provide an output.
Method, device, and program product for determining model compression rate
A method for determining a model compression rate comprises determining a near-zero importance value subset from an importance value set associated with a machine learning model, a corresponding importance value in the importance value set indicating an importance degree of a corresponding input of a processing layer of the machine learning model, importance values in the near-zero importance value subset being closer to zero than other importance values in the importance value set; determining a target importance value from the near-zero importance value subset, the target importance value corresponding to a turning point of a magnitude of the importance values in the near-zero importance value subset; determining a proportion of importance values less than the target importance value in the importance value set in the importance value set; and determining the compression rate for the machine learning model based on the determined proportion.
Method, device, and program product for determining model compression rate
A method for determining a model compression rate comprises determining a near-zero importance value subset from an importance value set associated with a machine learning model, a corresponding importance value in the importance value set indicating an importance degree of a corresponding input of a processing layer of the machine learning model, importance values in the near-zero importance value subset being closer to zero than other importance values in the importance value set; determining a target importance value from the near-zero importance value subset, the target importance value corresponding to a turning point of a magnitude of the importance values in the near-zero importance value subset; determining a proportion of importance values less than the target importance value in the importance value set in the importance value set; and determining the compression rate for the machine learning model based on the determined proportion.
FPGA specialist processing block for machine learning
The present disclosure describes a digital signal processing (DSP) block that includes a plurality of columns of weight registers and a plurality of inputs configured to receive a first plurality of values and a second plurality of values. The first plurality of values is stored in the plurality of columns of weight registers after being received. Additionally, the DSP block includes a plurality of multipliers configured to simultaneously multiply each value of the first plurality of values by each value of the second plurality of values.
FPGA specialist processing block for machine learning
The present disclosure describes a digital signal processing (DSP) block that includes a plurality of columns of weight registers and a plurality of inputs configured to receive a first plurality of values and a second plurality of values. The first plurality of values is stored in the plurality of columns of weight registers after being received. Additionally, the DSP block includes a plurality of multipliers configured to simultaneously multiply each value of the first plurality of values by each value of the second plurality of values.
Calculation processor and calculation method for determining an exponential function
A calculation processor for determining a digital output value from a digital input value based on an exponent value a, the processor comprising a first calculation block, a second calculation block and a final calculation block. The first calculation block initializes an intermediate value and an error value depending on a position of a Most Significant Bit of a significant part of the input value. The second calculation block is configured to perform repeatedly, until an exit criterion is fulfilled, the incrementation of a counter value, the determination of a power error value based on the error value and, if the power error value is larger than or equal to an error threshold, adjustment of the intermediate value y by multiplying the intermediate value with an adaptation value and setting the error value to the power error value divided by the base value. If the power error value is smaller than the error threshold, the error value is set to the power error value. The final calculation block is configured to set the output value to the intermediate value.