SYSTEM AND METHOD FOR CONSTRUCTING A MATHEMATICAL MODEL OF A SYSTEM IN AN ARTIFICIAL INTELLIGENCE ENVIRONMENT

Abstract

A system and method for constructing a mathematical model of a system. The method includes constructing an initial mathematical system representation with a combination of terms, the terms comprising mathematical functions including independent variables dependent on an input signal. A first set of known data is inputted to the initial mathematical representation to generate a corresponding set of output data. The corresponding set of output data of the initial mathematical representation and a second set of known data, correlated to the first set of known data, is fed to a comparator to generate error signals representing differences between output data and correlated members of the second set of known data. A parameter of the combination of terms is iteratively varied to produce a refined mathematical representation of the system until a measure of the error signals is reduced to a value wherein the set of corresponding output data of the refined mathematical representation over a desired range is approximately equivalent to the second set of known data.

Claims

1. A method in an artificial intelligence (AI) system of constructing a mathematical model of a system, comprising: constructing an initial mathematical representation of said system with a combination of terms, said terms comprising mathematical functions including independent variables dependent on an input signal; inputting a first set of known data to said initial mathematical representation to generate a corresponding set of output data; feeding said corresponding set of output data of said initial mathematical representation and a second set of known data correlated to said first set of known data, to a comparator, said comparator generating error signals representing a difference between members of said set of output data and correlated members of said second set of known data; and iteratively varying a parameter of at least one of said combination of terms comprising said initial mathematical representation to produce a refined mathematical representation of said system until a measure of said error signals is reduced to a value wherein the set of corresponding output data of said refined mathematical representation over a desired range is approximately equivalent to said second set of known data.

2. The method recited in claim 1, wherein said iteratively varying a parameter of at least one of said combination of terms includes setting a coefficient of each term to a value between 0 and 1 such that all coefficients sum to 1.

3. The method recited in claim 1, wherein said combination of terms comprises at least one of a transcendental function, a polynomial function, and a Boolean function.

4. The method recited in claim 1, wherein said first set of known data and said second set of known data respectively comprise known input data and corresponding known output data for said real system.

5. The method recited in claim 1, wherein said first set of known data and said second set of known data both comprise known output data for said real system.

6. The method recited in claim 1, wherein said first set of known data and said second set of known data both comprise known input data for said real system.

7. The method as recited in claim 1, wherein said first set of known data and said second set of known data are a subset of all known data for said real system.

8. The method recited in claim 7, wherein said subset of all known data is utilized to produce said refined mathematical representation of said system and remaining data of said all known data is utilized to test said refined mathematical representation for coherence over a fuller range of data.

9. The method recited in claim 1, wherein said measure of said error signals corresponds to a maximum error signal for the first and second sets of known data.

10. The method recited in claim 1, wherein said measure of said error signals is a root-mean-square (RMS) value of said error signals.

11. A system for constructing an artificial intelligence (AI) mathematical model of a system, comprising: a processor; and, a memory, said memory storing instructions which, when executed by said processor, are operative to: construct an initial mathematical representation of said system with a combination of terms, said terms comprising mathematical functions including independent variables dependent on an input signal; input a first set of known data to said initial mathematical representation to generate a corresponding set of output data; feed said corresponding set of output data of said initial mathematical representation and a second set of known data, correlated to said first set of known data, to a comparator, said comparator generating error signals representing a difference between members of said set of output data and correlated members of said second set of known data; iteratively vary a parameter of at least one of said combination of terms comprising said initial mathematical representation to produce a refined mathematical representation of said system until a measure of said error signals is reduced to a value wherein the set of corresponding output data of said refined mathematical representation over a desired range is approximately equivalent to said second set of known data.

12. The system recited in claim 11, wherein iteratively varying a parameter of at least one of said combination of terms includes setting a coefficient of each term to a value between 0 and 1 such that all coefficients sum to 1.

13. The system recited in claim 11, wherein said combination of terms comprises at least one of a transcendental function, polynomial function, and a Boolean function.

14. The system recited in claim 11, wherein said first set of known data and said second set of known data respectively comprise known input data and corresponding known output data for said real system.

15. The system recited in claim 11, wherein said first set of known data and said second set of known data both comprise known output data for said real system.

16. The system recited in claim 11, wherein said first set of known data and said second set of known data both comprise known input data for said real system.

17. The system as recited in claim 11, wherein said first set of known data and said second set of known data are a subset of all known data for said real system.

18. The system recited in claim 17, wherein said subset of all known data is utilized to produce said refined mathematical representation of said system and remaining data of said all known data is utilized to test said refined mathematical representation for coherence over a fuller range of data.

19. The system recited in claim 11, wherein said measure of said error signals corresponds to a maximum error signal for the first and second sets of known data.

20. The system recited in claim 11, wherein said measure of said error signals is a root-mean-square (RMS) value of said error signals.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

[0034] For a more complete understanding of the present disclosure, reference is now made to the following detailed description taken in conjunction with the accompanying drawings, in which:

[0035] FIG. 1 illustrates an artificial intelligence, machine learning, and deep learning hierarchy;

[0036] FIG. 2 illustrates an elementary artificial neural network model architecture;

[0037] FIG. 3 illustrates a simplistic artificial neural network model architecture for deep learning;

[0038] FIG. 4 illustrates a system architecture showing a mathematical model coupled to a subtractor;

[0039] FIG. 5 illustrates a generic mathematical model for input/output;

[0040] FIG. 6 illustrates a generic mathematical model for input/input;

[0041] FIG. 7 illustrates a mathematical model for system identification;

[0042] FIG. 8 illustrates a mathematical model for reinforcement learning;

[0043] FIG. 9 illustrates a mathematical model for Fourier series;

[0044] FIG. 10 illustrates a mathematical model for order finding;

[0045] FIG. 11 illustrates a Boolean circuit for classical logic;

[0046] FIG. 12 illustrates a mathematical model for a power series;

[0047] FIG. 13 illustrates a mathematical model for clustering;

[0048] FIG. 14 illustrates a flow diagram of an embodiment of a method of constructing a mathematical model of a real system; and,

[0049] FIG. 15 illustrates a block diagram of an embodiment of an apparatus for constructing a mathematical model of a real system.

[0050] Corresponding numerals and symbols in the different figures generally refer to corresponding parts unless otherwise indicated and, in the interest of brevity, may not be described after the first instance.

DETAILED DESCRIPTION

[0051] A unifying system architecture adaptable to a wide range of technological applications (e.g., machine, deep, and reinforcement learning; dynamic systems; cryptography; and quantum computation/information) is introduced herein. System architectures may contain nonlinearities, nonconvexities, and/or discontinuities. The designer has control over what is being learned and thus provides inherent elucidation of the results. This lends transparency and explanation to applications based on interpretable artificial neural networks. Furthermore, less data is needed to discover cause-effect relationships.

[0052] With limited success, artificial neural networks bring several disadvantages. The design process becomes an academic exercise in numerical investigation resulting in an untrusted black box where the designer has no influence over what is being learned. In the end, because of the depth of complexity, it is virtually impossible to understand how conclusions were reached.

[0053] A novel system architecture is introduced herein where the designer has control over what is being learned and thus provides inherent elucidation. This lends transparency and explanation to applications based on artificial neural networks. Embodiments include forms of artificial intelligence: machine, deep, and reinforcement learning; dynamic systems; cryptography; and quantum computation/information.

[0054] The making and using of the present exemplary embodiments are discussed in detail below. It should be appreciated, however, that the embodiments provide many applicable inventive concepts that can be embodied in a wide variety of specific contexts. The specific embodiments discussed are merely illustrative of specific ways to make and use the systems, subsystems, and modules for estimating the state of a system in a real-time, noisy measurement, machine-learning environment. While the principles will be described in the environment of a linear system in a real-time machine-learning environment, any environment such as a nonlinear system, or a non-real-time machine-learning environment, is well within the broad scope of the present disclosure.

[0055] Where the current state of the art creates a connection between two sets of data with a multitude of nodes, layers, and arbitrarily simple functions, the novel process introduced herein instead inserts a curated set of lucid mathematical functions between the two sets of data. This is a fundamental difference in that mathematical nonlinearities, and/or nonconvexities, and/or discontinuities can more quickly be approximated to reveal relationships between the two sets of data.

[0056] Referring to the system architecture (400) illustrated in FIG. 4, signal (420) is sent to the mathematical model (460) yielding output signal (430). The error signal (440), which is a difference between the feedforward signal (410) and the output signal (430), is minimized. The mathematical model (460) may be generic or specific, depending on the application. If available, one skilled in the art should incorporate a priori knowledge into the design of the mathematical model architecture. For example, if the problem is associated with mechanical vibration, then the mathematical model (460) should include Fourier sine and cosine terms. Minimization of the error signal (440) is achieved through optimization techniques. Through this process, signal (430) is forced to match signal (410) by adjusting parameters associated with the mathematical model (460).

[0057] This approach is unique in that it serves as a unifying system architecture among the many varied specialized sciences, including machine learning (Table 1). For example, in supervised learning (classification), output is related to input. Referring again to FIG. 4, an embodiment of the proposed invention solves this type of problem by simply connecting known input data to signal (420) and known output data to signal (410). In supervised learning (regression), output is related to output. An embodiment of the novel process solves this type of problem by simply connecting known output data to signal (420) and known output data to signal (410). For both supervised learning cases, parameters associated with the mathematical model (460) are varied until the computed result matches the known result. In the case of unsupervised learning (clustering), another embodiment of the proposed invention solves this type of problem by connecting the known input to both signal (410) and signal (420). By minimizing the error, signal (430) will match signal (410) and thus, characterize the input data based on the mathematical model (460). While various embodiments leverage the same system architecture, only the assignment of signal (410), signal (420), and the mathematical model architecture differ.

[0058] Any theory has two parts: a mathematical description and an interpretation of the mathematical formulas. Clearly, the model forms the mathematical description and because of an overt design, the transparent mathematical model is interpretable and explainable.

[0059] To understand how the system operates, consider an embodiment of the system architecture (400) where the designer has no a priori knowledge about the relationships of the data. In this case, assume the mathematical model contains generic mathematical functions such as a polynomial such as a second-order polynomial function, transcendental functions such as sine and cosine terms, exponential functions, and logarithmic functions. An example sum of terms is a.sub.0+a.sub.1x+a.sub.2x.sup.2+ . . . +b.sub.s sin(nx)+b.sub.c cos(nx)+c exp(nx)+d ln(nx). Other embodiments can involve different mathematical functions and operations, including classical Boolean/logic functions or quantum logic gates. To guard against such discontinuities that can be produced by logic functions, a novel optimization algorithm is employed which avoids partial derivatives and their associated numerical instabilities.

[0060] The coefficients a.sub.0, a.sub.1, a.sub.2, b.sub.s, b.sub.c, c, d are random variables between 0 and 1 and weighted such that they sum to 1. Because the system architecture is designed to minimize a differential error between some computed quantity and a known quantity, the coefficients are changed to place different weights on each of the mathematical function. Since the coefficients are random variables, their adaptation (over multiple Monte Carlo iterations) is probabilistic. All the statistics are available such that the designer can explore any set of coefficients for interesting (rare condition) cases. Nominally, however, the designer selects the median coefficient values which define a transparent, interpretable, and explainable relationship between the known input and the computed output. The system architecture is self-defined because the coefficients are determined empirically. There is no need for the designer to perform a numerical investigation of trial and error as in the case for artificial neural nets. The system architecture is transparent, interpretable, and explainable because the designer can show the mathematical function that relates known data to computed data.

[0061] FIG. 5 refers to a generic mathematical model for input/output problems. Let the integer 5 be the known input (signal 420 of FIG. 4) and serve as the independent variable for a generic mathematical model a.sub.0+a.sub.1x+a.sub.2x.sup.2+a.sub.e exp(nx)+a.sub.l ln(x)+a.sub.s sin(x)+a.sub.c cos(x). Let the integer 10 be the known output (signal 410 of FIG. 4). Minimizing a difference between the computed output and the known output (signal 440 of FIG. 4) determines the coefficients a.sub.0, a.sub.1, a.sub.2, a.sub.e, a.sub.l, a.sub.s, a.sub.c of the mathematical functions. These coefficients describe the mathematical model and are used to explain the relationship between the input and output. FIG. 6 refers to a generic mathematical model for input/input problems and follows a similar approach as described in the preceding paragraph. However, these coefficients are used to explain the characteristics of the input. Both examples (input/output of FIG. 5 and input/input of FIG. 6) demonstrate the system architecture of the proposed invention supports a unified approach to supervised and unsupervised learning, respectively.

[0062] As a practical example, consider the process of system identification as applied to the estimation of the rolling moment aerodynamic parameter, C.sub.l. One artificial neural net approach uses 5 independent variables to determine 3 dependent variables. After a preliminary exercise in numerical investigation (input/output scaling, initial network weights, number of hidden nodes, learning rate, momentum parameter, and slope factors of the sigmoidal activation functions) convergence is achieved after 2000 iterations. The result is a complex, opaque, uninterpretable, unexplainable relationship between the inputs and outputs. Also, if there are any changes to the inputs or outputs, the model must be retrained.

[0063] FIG. 7 refers to a mathematical model for system identification problems using the proposed invention. Let the aileron deflection be the known input (signal 420 of FIG. 4). Let the roll moment aerodynamic parameter be the known output (signal 410 of FIG. 4). One skilled in the art will recognize the direct relationship between aileron deflection and rolling moment aerodynamics. Minimizing a difference between the computed output and the known output (signal 440 of FIG. 4) determines the coefficients of the mathematical functions. Assuming the aerodynamic relationship between input and output is unknown, a generic mathematical model is used: a.sub.0+a.sub.1x+a.sub.2x.sup.2+a.sub.e exp(nx)+a.sub.l ln(x)+a.sub.s sin(x)+a.sub.c cos(x). The coefficients describe the model and are used to explain the relationship between the input (aileron deflection) and output (roll moment aerodynamic parameter). Rather than using an input/output ratio of 5:3, a 1:1 ratio is used with the proposed invention. Much less data is required to determine the relationship between the two data sets. Also, the results are achieved in 200 iterationsan order of magnitude less than required by the artificial neural net approach. Furthermore, the artificial neural net approach required the time series data to be in chronological order. The proposed invention is agnostic to any timestamp. The relationship between the two data sets is important, not the time at which they occur. While the model is still relatively complex, it is transparent, interpretable, and explainable. Because of these attributes, the proposed invention is much more reliable for flight safety certification. Finally, the mathematical model can be subsequently exercised to explore extreme cases, e.g., letting variables go to zero and letting variables approach infinity. Hence increasing confidence model deployment.

[0064] The designer has complete control over what is being learned using the novel process introduced herein. If the designer has a priori knowledge, mathematical or logical representations may or may not be included accordingly. The adaptive discovery of the proposed invention finds the best configuration of terms contributing to a scientific equation (based on a combination of elementary mathematical functions) which matches real-world observations. Because of mathematical transparency, the designer can easily interpret the results to see if they correspond with intuition and explain how the system works.

[0065] Back-propagation methods are replaced by an adaptive system for solving nonlinear, nonconvex problems. Paired with a rich set of options for mathematical functions, the system can be optimized for a training set of nearly any size. There are no restrictions on the problem space, including nonlinearities and/or discontinuities. In the case of multiple inputs/outputs, prior knowledge of the hyperspace is not needed. The mathematical architecture is independent of the input/output complexity. Inputs and outputs can be discrete, continuous, deterministic, random, or any combination thereof.

[0066] Regarding data, normalization may be performed to avoid domination by any one input. Otherwise, there is no need to manipulate the data. Furthermore, much less data is needed for the system identification architecture embodiment compared with the artificial neural net approach. This demonstrates no need for massive training sets.

[0067] There is also no need for enormous computing power. Every embodiment discussed in this specification runs on a laptop personal computer.

[0068] In the case of unsupervised learning (clustering), the number of clusters is not required to be known a priori, data does not have to be labelled, and an artificial neural net model does not have to be trained.

[0069] The novel process disclosed herein lends transparency and explanation to applications based on artificial neural networks. Benefits include, but aren't limited to, minimizing risk associated with data security legislation, reducing reliance on large, clean data sets which otherwise limit practical applications, and reducing footprint for real-time applications dominating networks, servers, and GPUs.

[0070] The following embodiments are just a few examples and are discussed with intentions to demonstrate the flexibility of the system architecture as applicable to the problem space of current technologies, e.g., reinforcement learning, cryptography, information theory, and quantum computation/information. Those skilled in these arts will understand and appreciate their content.

[0071] In one embodiment, the present invention can be used to emulate reinforcement learning. Reinforcement learning is the science of optimal decision-making. An agent, operating in an environment, is rewarded based on actions taken. The agent tries to figure out the optimal way to act within the environment. In mathematical terms, this is known as a Markov Decision Process (MDP). For this example, assume a manufacturer has a machine that is critical in the production process. The machine is evaluated each week to determine its operating condition. The state of the machine is either good as new, functioning with minor defects, functioning with major defects, or inoperable. Statistical data shows how the machine evolves over time. As the machine deteriorates, the manufacturer may select from the following options: do nothing, overhaul the machine, or replace the machineall with corresponding immediate and subsequent costs. The manufacturer's objective is to select the optimal maintenance policy, as illustrated by the example shown in FIG. 8.

[0072] As another example, in one embodiment of the present invention (emulating cryptography) a sinusoidal signal, composed of a summation of many individual frequency components, is used as an input to a mathematical model of a discrete Fourier transform. By minimizing a difference between the computed signal and the reference signal, the reference signal is decomposed to determine its frequency content (FIG. 9). Continuing with another cryptography example, an embodiment of the present invention is used to perform the task of order finding (FIG. 10). Efficient order-finding can be used to break RSA public key cryptosystems. In this problem, the integer value of r is sought which satisfies the expression a.sup.r1(mod N) where mod N means modulus N. In this example embodiment, the problem has been formulated as a.sup.r(modN)1, where a difference has been minimized over different integer values of r. Again, the same architectural approach is applied to a completely different problem type. Additional embodiments may be extended from Fourier transforms and cryptography to their quantum counterparts, i.e., quantum Fourier transforms and quantum cryptography.

[0073] Another example of an embodiment of the present invention (emulating Boolean logic) is a discontinuous classical circuit with three AND gates serving as the mathematical model (FIG. 11), i.e. A AND B AND C.

[0074] A truth table, Table 2, responsive to the binary inputs A, B, and C, showing the logical result A AND B AND C is illustrated below:

TABLE-US-00002 TABLE 2 A B C A&B&C 0 0 0 0 0 0 1 0 0 1 0 0 0 1 1 0 1 0 0 0 1 0 1 0 1 1 0 0 1 1 1 1
Minimizing the output yields seven of the 2.sup.3=8 truth table values (0), while maximizing the output yields the final entry in a truth table, e.g., in Table 2. When maximizing this logic architecture, there is only one solution, i.e., A=B=C=1. Likewise, minimizing the architecture will yield all other results. This is significant because while some mathematical models may include many logic gates (e.g., decision-making) the complexity of the model architecture may render the problem intractable. Yet, the process introduced herein allows a practitioner to simply exercise the system to yield the corresponding truth table leading to the discovery of cause-effect relationships. Classical computation with Boolean circuits, using an acyclic directed graph, may be extended to another example embodiment of quantum computation/information by implementation of quantum circuits. These circuits form the basis for implementing various computations. While physicists and mathematicians view quantum computation as hypothetical experiments, computer scientists view quantum computation as games where players, typically Alice and Bob, optimize their performance in various abstractions. Applications include the minimization of bits for quantum error correction, and GHZ (Greenberger, Home, and Zeilinger) and CHSH (Clauser, Horne, Shimony, and Holt) games.

[0075] Another example of an embodiment of the present invention emulates information and self-organized complex systems. The human brain and behavior are shown to exhibit features of pattern-forming, dynamical systems, including multi-stability, abrupt phase changes, crises, and intermittency. How human beings perceive, intend, learn, control, and coordinate complex behaviors is understood through dynamic systems. Here, a dynamic system is modeled by a power series (.sub.n a.sub.nx.sup.n) as a solution to an ordinary differential equation. A second-order harmonic oscillator (mass, spring, damper system) is used to create a set of input-output relations. Using the novel process introduced herein, the (spring and damping) coefficients are determined through the power series implementation of the differential equation (FIG. 12). Again, this demonstrates the flexibility of this unifying system architecture which is adaptable to a wide range of technological applications.

[0076] An example embodiment of the present invention applied to unsupervised learning is clustering. This example combines the benefits of hard and soft clustering, i.e., the number of clusters does not need to be known, data may belong to more than one cluster, ellipsoidal clusters may have different sizes. Because data does not have to be labelled, dimensionality reduction techniques (e.g., Principal Component Analysis) are unnecessary and subsequently dismissed. Also, since the approach does not use artificial neural nets, a model does not need to be trained and thus, no training data is required. Furthermore, since the approach is stochastic, it allows for black swan clusters to be identified, if they exist.

[0077] The number of clusters, k, is determined automatically. After processing the data for a given cluster number, a histogram displays the number of data points assigned to each cluster. When the histogram is uniform, the data is over-fitted. Hence, the number of clusters (k) is one less than the current number. To identify the clusters, select k random points out of the n data points as medoids. Associate each data point with the nearest medoid by selecting the minimum distance. The sum of all minimums (for each data point) is the cost (objective function). Minimize (optimize) the cost to identify the clusters. Once the clusters have been identified, it's rudimentary to determine which data point is associated with each cluster. With the data clustered accordingly, it is a simple exercise to determine the centroid of the ellipsoidal cluster.

[0078] By avoiding deep learning techniques based upon artificial neural net architectures, all corresponding disadvantages (lack of transparency, lack of explainability, and the need to reserve training data and the time spent training the artificial neural net) are dismissed. Because data does not have to be cleaned or labelled, dimensionality reduction techniques (e.g., Principal Component Analysis) are unnecessary. Instead, statistical distributions of the data are applied. This approach does not rely on stochastic gradient descent (random guesses at partial derivatives) which can become numerically unstable with practical conditions. Alternatively, the objective function is evaluated directly using Monte Carlo techniques. The solution is scalable and may be implemented for real-time analysis.

[0079] To conclude, consider an example embodiment for real-time systems. As one skilled in the art is aware, real-time requirements for aerospace guidance, navigation, and control processes are different than real-time requirements for e-commerce transactions. However, in either case, the system may be augmented such that known constraints (if any) could be built into the objective function a priori. Also, by selecting an appropriate resolution, the system may be configured to execute in a deterministic time frame. This single approach for multifunctional systems may be used for industrial applications. These multifunctional systems must manage diverse objectives, multiple resources, and numerous constraints. A factory might use several types of power (e.g., pneumatic, electrical, and hydraulic), several types of labor skills, many different raw materials, all while making multiple products. A production optimization system based on the Industrial Internet of Things (IIoT) can collect data from thousands of sensors. A system with the computational efficiency to support real-time monitoring and control is a valuable advance in optimization techniques.

[0080] Again, the foregoing embodiments serve as examples across relevant technologies and are not meant to be exhaustive.

[0081] Turning now to FIG. 14, illustrated is a flow diagram of an embodiment of a method 1400 of constructing a mathematical model of a system that can be a real system. The method 1400 is operable on a processor such as a microprocessor coupled to a memory. The method 1400 begins at a start step or module 1410.

[0082] At a step or module 1420, an initial mathematical representation of the system is constructed with a combination of terms, the terms comprising mathematical functions including independent variables dependent on an input signal. The combination of terms includes at least one of a transcendental function, a polynomial function, and a Boolean function. A transcendental function can be a trigonometric function, a logarithmic function, an exponential function, or another analytic function.

[0083] At a step or module 1430, a first set of known data (corresponding to the signal 420 in FIG. 4) is inputted to the initial mathematical representation to generate a corresponding set of output data (corresponding to signal 430 in FIG. 4).

[0084] At a step or module 1440, the corresponding set of output data (corresponding to the signal 430 in FIG. 4) of the initial mathematical representation and a second set of known data (corresponding to the signal 410 in FIG. 4) correlated to the first set of known data, is fed to a comparator, the comparator generating error signals (corresponding to the signal 440 in FIG. 4) representing a difference between members of the set of output data (corresponding to the signal 430 in FIG. 4) and correlated members of the second set of known data (corresponding to the signal 410 in FIG. 4).

[0085] In one embodiment, the first set of known data and the second set of known data respectively comprise known input data and corresponding known output data for the real system; as such, this represents a supervised-classification learning mode. In another embodiment, the first set of known data and the second set of known data both comprise known output data for the real system; as such, this represents a supervised-regression learning mode. In a third embodiment, the first set of known data and the second set of known data both comprise known input data for the system; as such, this represents an unsupervised-clustering learning mode.

[0086] In an embodiment, the first set of known data and the second set of known data are a subset of all known data for the real system. As an example, the signal 420 illustrated in FIG. 4 can have multiple values. In a related embodiment, the subset of all known data is utilized to produce the refined mathematical representation of the real system and remaining data is utilized to test the refined mathematical representation for coherence over a fuller range of data.

[0087] At a step or module 1450, a parameter of at least one of the combination of terms comprising the initial mathematical representation is iteratively varied to produce a refined mathematical representation of the real system until a measure of the error signals is reduced to a value wherein the set of corresponding output data of the refined mathematical representation over a desired range is suitably equivalent to the second set of known data.

[0088] In an embodiment, the measure of the error signals corresponds to a maximum error signal for the first and second sets of known data. In an alternative embodiment, the measure of the error signals is a root-mean-square (RMS) value of the error signals.

[0089] In an embodiment, the step of iteratively varying a parameter of at least one of the combination of terms includes setting the coefficient of each term to a value between 0 and 1 such that all coefficients sum to 1. Setting the coefficient of each term to a value between 0 and 1 can be employed to normalize the terms.

[0090] The method 1400 terminates at end step or module 1460.

[0091] Turning now to FIG. 15, illustrated is a block diagram of an embodiment of an apparatus 1500 for-constructing a mathematical model of a system. The apparatus 1500 is configured to perform functions described hereinabove of constructing the mathematical model of the system. The apparatus 1500 includes a processor (or processing circuitry) 1510, a memory 1520 and a communication interface 1530 such as a graphical user interface.

[0092] The functionality of the apparatus 1500 may be provided by the processor 1510 executing instructions stored on a computer-readable medium, such as the memory 1520 shown in FIG. 15. Alternative embodiments of the apparatus 1500 may include additional components (such as the interfaces, devices and circuits) beyond those shown in FIG. 15 that may be responsible for providing certain aspects of the device's functionality, including any of the functionality to support the solution described herein.

[0093] The processor 1510 (or processors), which may be implemented with one or a plurality of processing devices, perform functions associated with its operation including, without limitation, performing the operations of constructing the mathematical model of the system. The processor 1510 may be of any type suitable to the local application environment, and may include one or more of general-purpose computers, special purpose computers, microprocessors, digital signal processors (DSPs), field-programmable gate arrays (FPGAs), application-specific integrated circuits (ASICs), and processors based on a multi-core processor architecture, as non-limiting examples.

[0094] The processor 1510 may include, without limitation, application processing circuitry. In some embodiments, the application processing circuitry may be on separate chipsets. In alternative embodiments, part or all of the application processing circuitry may be combined into one chipset, and other application circuitry may be on a separate chipset. In still alternative embodiments, part or all of the application processing circuitry may be on the same chipset, and other application processing circuitry may be on a separate chipset. In yet other alternative embodiments, part or all of the application processing circuitry may be combined in the same chipset.

[0095] The memory 1520 (or memories) may be one or more memories and of any type suitable to the local application environment, and may be implemented using any suitable volatile or nonvolatile data storage technology such as a semiconductor-based memory device, a magnetic memory device and system, an optical memory device and system, fixed memory and removable memory. The programs stored in the memory 1520 may include program instructions or computer program code that, when executed by an associated processor, enable the respective device 1500 to perform its intended tasks. Of course, the memory 1520 may form a data buffer for data transmitted to and from the same. Exemplary embodiments of the system, subsystems, and modules as described herein may be implemented, at least in part, by computer software executable by the processor 1510, or by hardware, or by combinations thereof.

[0096] The communication interface 1530 modulates information for transmission by the respective apparatus 1500 to another apparatus. The respective communication interface 1530 is also configured to receive information from another processor for further processing. The communication interface 1530 can support duplex operation for the respective other processor 1510.

[0097] As described above, the exemplary embodiments provide both a method and corresponding apparatus consisting of various modules providing functionality for performing the steps of the method. The modules may be implemented as hardware (embodied in one or more chips including an integrated circuit such as an application specific integrated circuit), or may be implemented as software or firmware for execution by a processor. In particular, in the case of firmware or software, the exemplary embodiments can be provided as a computer program product including a computer readable storage medium embodying computer program code (i.e., software or firmware) thereon for execution by the computer processor. The computer readable storage medium may be non-transitory (e.g., magnetic disks; optical disks; read only memory; flash memory devices; phase-change memory) or transitory (e.g., electrical, optical, acoustical or other forms of propagated signals-such as carrier waves, infrared signals, digital signals, etc.). The coupling of a processor and other components is typically through one or more busses or bridges (also termed bus controllers). The storage device and signals carrying digital traffic respectively represent one or more non-transitory or transitory computer readable storage medium. Thus, the storage device of a given electronic device typically stores code and/or data for execution on the set of one or more processors of that electronic device such as a controller.

[0098] Thus, as introduced herein, the novel unified system architecture is adaptable to a wide range of technological applications. The unified system architecture is employed to construct a mathematical model of a system. The system architecture produces results that are transparent, interpretable, and can be used for explainable artificial intelligence. Control can be exercised over what is being learned by the model. The model may contain nonlinearities, nonconvexities, and discontinuities. Less data is needed for the model to discover cause-effect relationships.

[0099] Although the embodiments and its advantages have been described in detail, it should be understood that various changes, substitutions, and alterations can be made herein without departing from the spirit and scope thereof as defined by the appended claims. For example, many of the features and functions discussed above can be implemented in software, hardware, or firmware, or a combination thereof. Also, many of the features, functions, and steps of operating the same may be reordered, omitted, added, etc., and still fall within the broad scope of the various embodiments.

[0100] Moreover, the scope of the various embodiments is not intended to be limited to the embodiments of the process, machine, manufacture, composition of matter, means, methods and steps described in the specification. As one of ordinary skill in the art will readily appreciate from the disclosure, processes, machines, manufacture, compositions of matter, means, methods, or steps, presently existing or later to be developed, that perform substantially the same function or achieve substantially the same result as the corresponding embodiments described herein may be utilized as well. Accordingly, the appended claims are intended to include within their scope such processes, machines, manufacture, compositions of matter, means, methods, or steps.

SYSTEM AND METHOD FOR CONSTRUCTING A MATHEMATICAL MODEL OF A SYSTEM IN AN ARTIFICIAL INTELLIGENCE ENVIRONMENT

Assignee

Inventors

Cpc classification

Classification Explorer

G06F17/142

PHYSICS

Classification Explorer

G06N20/00

PHYSICS

Classification Explorer

G06N3/084

PHYSICS

Classification Explorer

G06F7/023

PHYSICS

Classification Explorer

G06N3/08

PHYSICS

Classification Explorer

G06F17/18

PHYSICS

Classification Explorer

G06F2207/4824

PHYSICS

Classification Explorer

G06F11/3452

PHYSICS

Classification Explorer

G06N3/045

PHYSICS

Classification Explorer

G06F7/22

PHYSICS

Classification Explorer

G06N3/048

PHYSICS

Classification Explorer

G06N3/042

PHYSICS

Classification Explorer

G06F30/27

PHYSICS

Classification Explorer

G06N7/01

PHYSICS

Classification Explorer

G06F17/16

PHYSICS

Classification Explorer

G06N5/01

PHYSICS

Classification Explorer

G06F2111/10

PHYSICS

Classification Explorer

G06N3/105

PHYSICS

Classification Explorer

G06F17/11

PHYSICS

International classification

Classification Explorer

G06F30/27

PHYSICS

Classification Explorer

G06N20/00

PHYSICS

Classification Explorer

G06N7/00

PHYSICS

Abstract

Claims

Description