G06F7/544

Creating a machine learning model with k-means clustering
11544596 · 2023-01-03 · ·

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, that creates a machine learning model with k-means clustering. In some implementations, an instruction to create a model is obtained. A data set including geographic data and non-geographic data is received. The data set includes multiple data entries. Geographic centroids are determined from the geographic data. The data set is analyzed to obtain statistics of the data set. Transformed data is generated from the data set, the statistics, and the geographic centroids. A model is generated with the transformed data, the model indicating multiple data groupings.

Low area multiply and accumulate unit

An improved electronic mixed mode multiplier and accumulate circuit for artificial intelligence and computing system applications that perform vector-vector, vector-matrix and other multiply-accumulate computations. The circuit is provided is a high resolution, high linearity, low area, low power multiply—accumulate (MAC) unit to interface with a memory device for storing computation output results. The MAC unit uses a less number of current carrying elements resulting in much lower integrated circuit area, and provides a tight matching between the current elements thus preserving inherent linearity requirements due to current mode operation. Further the MAC performs current scaling using switches and current division where the current switches occupy minimum size transistors requiring a small area to implement that renders it compatible with MRAM such as a magnetic tunnel junction device. The MAC is hierarchically extended for increased number of bits to provide a delay implementation using orthogonal vector and current addition.

Accelerating binary neural networks within latch structure of non-volatile memory devices

A non-volatile memory device includes an array of non-volatile memory cells that are configured to store weights of a neural network. Associated with the array is a data latch structure that includes a page buffer, which can store weights for a layer of the neural network that is read out of the array, and a transfer buffer, that can store inputs for the neural network. The memory device can perform multiply and accumulate operations between inputs and weight of the neural network within the latch structure, avoiding the need to transfer data out of the array and associated latch structure for portions of an inference operation. By using binary weights and inputs, multiplication can be performed by bit-wise XNOR operations. The results can then be summed and activation applied, all within the latch structure.

COMPUTATION APPARATUS, METHOD AND PROGRAM FOR THE SAME

A computation apparatus, a method of the same, and a program which perform a secure computation using fixed-point arithmetic, and overflow is unlikely to occur and the occurrence of division by zero can be detected when an odds ratio is calculated. The computation apparatus includes an odds ratio computation unit for obtaining an odds ratio between a first group (a+b) and a second group (c+d) based on four plaintext values a, b, c, and d, by means of secure computation; a zero-division detection unit for determining, by means of secure computation, whether or not at least one of the plaintext values b and c is not zero, and detecting division by zero; and a selection unit for selecting the odds ratio if division by zero is not detected, by means of secure computation.

INFORMATION PROCESSING CIRCUIT AND METHOD OF DESIGNING INFORMATION PROCESSING CIRCUIT
20220413806 · 2022-12-29 · ·

The information processing circuit 10 performs operations on layers in deep learning, and includes a product sum circuit 11 which performs a product-sum operation using input data and parameter values, and a parameter value output circuit 12 which outputs the parameter values, wherein the parameter value output circuit 12 is composed of a combinational circuit.

OPTICAL COMPUTING APPARATUS AND SYSTEM, AND COMPUTING METHOD

An optical computing apparatus and system and a computing method are provided. The optical computing apparatus includes a linear operation module, a first delay module, and a coupler. The linear operation module can modulate, based on received electrical signals, optical signals input to the linear operation module; the first delay module may adjust a delay of optical signals output by the linear operation module; and after the first delay module adjusts the delay of the optical signals output by the linear operation module, the coupler may combine a plurality of groups of optical signals successively output by the linear operation module, to output one group of optical signals used to indicate a computing result that is obtained after a multiply-add operation is performed on one group of data and weights.

SYSTOLIC ARRAY HAVING SUPPORT FOR OUTPUT SPARSITY

A processing apparatus is described herein that includes a general-purpose parallel processing engine comprising a matrix accelerator including one or more systolic arrays, at least one of the one or more systolic arrays comprising multiple pipeline stages, each pipeline stage of the multiple pipeline stages including multiple processing elements, the multiple processing elements configured to perform processing operations on input matrix elements based on output sparsity metadata. The output sparsity metadata indicates to the multiple processing elements to bypass multiplication for a first row of elements of a second matrix and multiply a second row of elements of the second matrix with a column of matrix elements of a first matrix.

METHOD, SYSTEM, AND CIRCUIT FOR EXTRACTING FEATURES FOR USE IN EMBEDDED ARTIFICIAL INTELLIGENCE MECHANISMS

System, method, and circuitry for utilizing sequential input inertial sensor data to calculate recursive features for training a machine learning algorithm or for classifying the data as a known class. The recursive feature values of a current data sample are calculating based on comparisons between the current data sample value and previous recursive feature values. The recursive features include a recursive maximum, recursive minimum, recursive peak to peak, recursive average, recursive root mean square, and recursive variance.

PARTIAL SUM COMPRESSION
20220413805 · 2022-12-29 ·

A method for performing a neural network operation. In some embodiments, method includes: calculating a first plurality of products, each of the first plurality of products being the product of a weight and an activation; calculating a first partial sum, the first partial sum being the sum of the products; and compressing the first partial sum to form a first compressed partial sum.

CIRCUIT FOR HANDLING PROCESSING WITH OUTLIERS

A system and method for handling processing with outliers. In some embodiments, the method includes: reading a first activation and a second activation, each including a least significant part and a most significant part, multiplying a first weight and a second weight by the respective activations, the multiplying of the first weight by the first activation including multiplying the first weight by the least significant part of the first activation in a first multiplier, the multiplying of the second weight by the second activation including: multiplying the second weight by the least significant part of the second activation in a second multiplier, and multiplying the second weight by the most significant part of the second activation in a shared multiplier, the shared multiplier being associated with a plurality of rows of an array of activations.