IPIQ

G06F7/556

Computer-Implemented Method of Executing SoftMax

20220383077 · 2022-12-01 ·

Christof Petig

The present disclosure concerns a method of executing a SoftMax function, the method comprising: (i) pre-storing in memory M fraction components (fc.sub.j) in binary form, derived from the expression 2.sup.(j/M), said fc.sub.j forming a lookup table (T) of size M; (ii) calculating, for each z.sub.i, an element y.sub.i of a number of the form 2.sup.y.sup.i; (iii) separating y.sub.i into an integral part (int.sub.i) and a fractional part (fract.sub.i); (iv) determining a lookup index (ind.sub.i) that corresponds to fract.sub.i scaled by the size M; (v) retrieving a fraction component fc.sub.i from T with ind.sub.i; (vi) generating, in a result register, a binary number representative of the exponential value of said z.sub.i, by combining said fc.sub.i retrieved from T and said int.sub.i; (v) adding the K result registers corresponding to z.sub.i into a sum register R7; and (vi) determining the K probability values p.sub.i from the K result registers and the sum register.

HARDWARE ACCELERATOR METHOD AND DEVICE

20220383103 · 2022-12-01 ·

Samsung Electronics Co., Ltd.

A processor-implemented hardware accelerator method includes: receiving input data; loading a lookup table (LUT); determining an address of the LUT by inputting the input data to a comparator; obtaining a value of the LUT corresponding to the input data based on the address; and determining a value of a nonlinear function corresponding to the input data based on the value of the LUT, wherein the LUT is determined based on a weight of a neural network that outputs the value of the nonlinear function.

HARDWARE ACCELERATOR METHOD AND DEVICE

20220383103 · 2022-12-01 ·

Samsung Electronics Co., Ltd.

EXPONENTIAL CALCULATOR USING PARALLEL PROCESSOR SYSTEMS

20220382518 · 2022-12-01 ·

Bhavesh Lakhotia

An exponential calculator based on parallel computing is disclosed. The exponential calculator includes a master system and a plurality of nodes interconnected with each other to transfer and receive information and perform sub-computations independently and simultaneously. The master system is configured to select a number of nodes from the plurality of nodes required to perform sub-computations for calculation of an integer exponent. The selected nodes is configured to receive value of a node base and a node exponent from the master system. The selected nodes calculate a first computation value, a second computation value and a third computation value. The master system is further configured to instruct a sub-set of the selected nodes to perform summation of a final sub-computation of the selected nodes and provide an output.

EXPONENTIAL CALCULATOR USING PARALLEL PROCESSOR SYSTEMS

20220382518 · 2022-12-01 ·

Bhavesh Lakhotia

Method, device, and program product for determining model compression rate

11507782 · 2022-11-22 ·

Emc IP Holding Company Llc

A method for determining a model compression rate comprises determining a near-zero importance value subset from an importance value set associated with a machine learning model, a corresponding importance value in the importance value set indicating an importance degree of a corresponding input of a processing layer of the machine learning model, importance values in the near-zero importance value subset being closer to zero than other importance values in the importance value set; determining a target importance value from the near-zero importance value subset, the target importance value corresponding to a turning point of a magnitude of the importance values in the near-zero importance value subset; determining a proportion of importance values less than the target importance value in the importance value set in the importance value set; and determining the compression rate for the machine learning model based on the determined proportion.

Method, device, and program product for determining model compression rate

11507782 · 2022-11-22 ·

Emc IP Holding Company Llc

FPGA specialist processing block for machine learning

11494186 · 2022-11-08 ·

Intel Corporation

The present disclosure describes a digital signal processing (DSP) block that includes a plurality of columns of weight registers and a plurality of inputs configured to receive a first plurality of values and a second plurality of values. The first plurality of values is stored in the plurality of columns of weight registers after being received. Additionally, the DSP block includes a plurality of multipliers configured to simultaneously multiply each value of the first plurality of values by each value of the second plurality of values.

FPGA specialist processing block for machine learning

11494186 · 2022-11-08 ·

Intel Corporation

Calculation processor and calculation method for determining an exponential function

11573767 · 2023-02-07 ·

AMS AG

Stefaan Margriet Albert Van Hoogenbemt

A calculation processor for determining a digital output value from a digital input value based on an exponent value a, the processor comprising a first calculation block, a second calculation block and a final calculation block. The first calculation block initializes an intermediate value and an error value depending on a position of a Most Significant Bit of a significant part of the input value. The second calculation block is configured to perform repeatedly, until an exit criterion is fulfilled, the incrementation of a counter value, the determination of a power error value based on the error value and, if the power error value is larger than or equal to an error threshold, adjustment of the intermediate value y by multiplying the intermediate value with an adaptation value and setting the error value to the power error value divided by the base value. If the power error value is smaller than the error threshold, the error value is set to the power error value. The final calculation block is configured to set the output value to the intermediate value.

Patent classifications

G06F7/556