G06N3/105

Dynamic placement of computation sub-graphs

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for assigning operations of a computational graph to a plurality of computing devices are disclosed. Data characterizing a computational graph is obtained. Context information for a computational environment in which to perform the operations of the computational graph is received. A model input is generated, which includes at least the context information and the data characterizing the computational graph. The model input is processed using the machine learning model to generate an output defining placement assignments of the operations of the computational graph to the plurality of computing devices. The operations of the computational graph are assigned to the plurality of computing device according to the defined placement assignments.

DAG MODIFICATION MODULE, PROCESSING DEVICE INCLUDING SAME, AND DAG MODIFICATION METHOD OF PROCESSING DEVICE
20230214636 · 2023-07-06 ·

A DAG modification module, a processing device including the same, and a DAG modification method of the processing device are provided. The DAG modification module comprises an identification module configured to receive a directed acyclic graph (DAG) as an input, identify sub-graphs including non-unit operations that are not predefined unit operations out of the DAG, and replace the sub-graphs with transformed sub-graphs to thereby generate a transformed DAG, a transform module configured to receive the sub-graphs including the non-unit operations, transform the sub-graphs into the transformed sub-graphs including the unit operations, and transfer the transformed sub-graphs to the identification module, a unit operation database configured to provide a unit operation list in which the unit operations are recorded to the identification module, and an optimization module configured to receive the transformed DAG, receive a calculation method table for each of the unit operations from the unit operation database.

Dynamic reconfiguration training computer architecture
11551026 · 2023-01-10 · ·

A dynamic reconfiguration training machine learning computer architecture is disclosed. According to some aspects, a computing machine accesses a configuration file. The configuration file specifies parameters for a machine learning session. The computing machine trains a machine learning module to solve a problem, where the machine learning module operates according to the parameters specified in the configuration file. The computing machine generates an output representing the trained machine learning module.

Debugging deep neural networks

A method, computer system, and a computer program product for debugging a deep neural network is provided. The present invention may include identifying, automatically, one or more debug layers associated with a deep learning (DL) model design/code, wherein the identified one or more debug layers include one or more errors, wherein a reverse operation is introduced for the identified one or more debug layers. The present invention may then include presenting, to a user, a debug output based on at least one break condition, wherein in response to determining the at least one break condition is satisfied, triggering the debug output to be presented to the user, wherein the presented debug output includes a fix for the identified one or more debug layers in the DL model design/code and at least one actionable insight.

Credit eligibility predictor

Aspects extract, from payroll data of employees of an organization, data historically associated to previous instances of certified tax credit eligibility; normalize the extracted data with respect to data type and data value; generate from the normalized extracted data via a neural network classifier multi-class outputs for each employee that indicate strengths of likelihood that each employee is currently eligible for each of a plurality of different tax credits; filter the normalized extracted data by removing portions associated to employees indicated within the multi-class outputs as having no currently eligible likelihood for the different tax credits, thereby generating a remainder set of normalized extracted data associated to remainder eligible ones of the employees; and prioritize application for the tax credits for the remainder eligible employees as a function of respective values and likelihoods of eligibility within the remainder set of normalized extracted data.

System and method for executing convolution in a neural network
11544559 · 2023-01-03 · ·

A system and method of executing a convolution layer of a neural network may include: (a) selecting an output spatial position (OSP) of an output matrix data element of the convolution layer; (b) selecting, based on the selected OSP, a non-zero input element of an input matrix data element; (c) producing, based on the selected OSP, a vector of kernel elements from a kernel matrix data element; (d) performing a vectoral multiplication operation of the selected non-zero input element and the vector of kernel elements, and accumulating a product of the vectoral multiplication in a vector register of a processor; (e) repeating (c) and (d) with subsequent non-zero input elements and corresponding vectors of kernel elements to obtain an outcome of the convolution of the selected OSP; and (f) repeating (a) through (e) with subsequent selection of OSPs, to obtain an outcome of the convolution layer.

Integrating machine learning models into an interpreted software development environment

The subject technology provides for parsing a line of code in a project of an integrated development environment (IDE). The subject technology executes indirectly, using the interpreter, the parsed line of code. The interpreter references a translated source code document generated by a source code translation component from a machine learning (ML) document written in a particular data format. The translated source code document includes code in a chosen programming language specific to the IDE, and the code of the translated source code document is executable by the interpreter. Further the subject technology provides, by the interpreter, an output of the executed parsed line of code.

System and method for generating photorealistic synthetic images based on semantic information

Embodiments described herein provide a system for generating semantically accurate synthetic images. During operation, the system generates a first synthetic image using a first artificial intelligence (AI) model and presents the first synthetic image in a user interface. The user interface allows a user to identify image units of the first synthetic image that are semantically irregular. The system then obtains semantic information for the semantically irregular image units from the user via the user interface and generates a second synthetic image using a second AI model based on the semantic information. The second synthetic image can be an improved image compared to the first synthetic image.

System and method for compact neural network modeling of transistors
11537841 · 2022-12-27 · ·

A method for generating a model of a transistor includes: initializing hyper-parameters; training the neural network in accordance with the hyper-parameters and training data relating transistor input state values to transistor output state values to compute neural network parameters; determining whether the transistor output state values of the training data match an output of the neural network; porting the neural network to a circuit simulation code to generate a ported neural network; simulating a test circuit using the ported neural network to simulate behavior of a transistor of the test circuit to generate simulation output; determining whether a turnaround time of the generation of the simulation output is satisfactory; in response to determining that the turnaround time is unsatisfactory, re-training the neural network based on updated hyper-parameters; and in response to determining that the turnaround time is satisfactory, outputting the ported neural network as the model of the transistor.

ELECTRONIC DEVICE FOR CONVERTING ARTIFICIAL INTELLIGENCE MODEL AND OPERATING METHOD THEREOF
20220405546 · 2022-12-22 ·

A method performed by an electronic device is provided. The method includes obtaining an artificial intelligence model based on a first framework, determining a second framework on which an artificial intelligence model converted from the artificial intelligence model based on the first framework is to be based, obtaining a conversion graph comprising a plurality of nodes representing a plurality of frameworks, respectively, and a plurality of edges each representing that one framework is convertible into another framework, determining a path that leads from a node representing the first framework to a node representing the second framework based on the conversion graph, and converting the artificial intelligence model based on the first framework into an artificial intelligence model based on the second framework, according to the determined path.