Process Modelling Method and System for Non-Linear Continuous-Like Process and Application in Pulp and Paper Industry

Abstract

A method, system, and computer program product are described capable of controlling an industrial based chemical process through accessing sensor data, pre-processing accessed sensor data through an automated process, forming an initial prediction model of the chemical process, and automatically determining linearity or non-linearity of the initial prediction model. As a function of quantitatively measured linearity, being non-linear, quasilinear, or linear, automatically train the initial prediction model, then deploy the trained prediction model in a manner that controls the subject chemical process at the industrial plant, including optimizing consumption of a certain resource. Data-driven modeling of non-linear, continuous-like industrial or chemical processes results.

Claims

1. A computer-implemented method of controlling an industrial based chemical process, the method comprising: obtaining working data representative of a subject chemical process of an industrial plant, said obtaining being automatically performed by a digital processor; responsively in computer memory, forming an initial model of the subject chemical process based on the working data, the working data having empirical values of dependent variables and independent variables of the subject chemical process, different characteristics of the subject chemical process being represented by different mathematical relationships of respective dependent variables and independent variables, including a certain resource consumption being indicated by one or more independent variables; said forming including the processor automatically determining non-linearity of the initial model as a function of non-linearity of the different mathematical relationships in the working data, and automatically training the initial model based on results of the determining, such that: (a) where the different relationships are determined to be substantially linear, then training the initial model as a linear predictive model and resulting in a trained prediction model, and (b) where the different relationships are determined to be quasi-linear or non-linear, then (i) training the initial model as a non-linear predictive model, (ii) combining the trained non-linear predictive model with one or more local linear models dynamically adjusting control variables of the certain resource consumption in respective windows of time, and (iii) producing a resulting trained prediction model based on the non-linear predictive model combined with the one or more local linear models; and deploying the resulting trained prediction model in a manner controlling the subject chemical process at the industrial plant, including optimizing consumption of the certain resource.

2. The method of claim 1, wherein the subject chemical process is continuous, semi-continuous, or continuous in one or more parts.

3. The method of claim 1, wherein the subject chemical process is a Kraft process or similar, and the industrial plant is of a pulp and paper industry.

4. The method of claim 3, wherein the working data represents quality of production of the subject chemical process as a function of the certain resource consumption, and the quality of production is determined by any one or combination of: a measurement of completeness of a pulping process, a Kappa number measurement or equivalent, total alkaline charge, and amount of residual alkali.

5. A method as claimed in claim 3, wherein the certain resource is a reagent utilized in the Kraft process.

6. A method as claimed in claim 5, wherein the reagent is white liquor.

7. A method as claimed in claim 3, wherein output of the deployed prediction model further controls any one or combination of: total H factor, a black liquor stream after extraction, digester discharge consistency of a digester at the industrial plant, and liquor temperature in the digester.

8. A method as claimed in claim 3, wherein the prediction model is further configured to manipulate control parameters of the subject chemical process at the industrial plant in a manner that minimizes any one or more of: toxic waste of the industrial plant, waste water, and chemicals in a drying process.

9. A method as claimed in claim 1, wherein obtaining working data includes accessing sensor output data indicative of the subject chemical process; and the method further comprising automatically adjusting the prediction model over time based on additional sensor output data.

10. A method as claimed in claim 1 wherein training the initial model as a linear predictive model employs partial least squares (PLS) regression, and number of components for the PLS regression is an estimated rank among the independent variables and dependent variables.

11. A method as claimed in claim 1 wherein training the initial model as a non-linear predictive model employs extreme gradient boosting (XGBoost), and number of estimators for XGBoost is based on an estimated rank among the independent variables and dependent variables.

12. A method as claimed in claim 1 wherein obtaining working data includes: accessing sensor output data indicative of the subject chemical process, the accessed sensor output data including one or more time periods of operating states of the subject chemical process; and pre-processing the accessed sensor output data in a manner that: (i) groups data based on time, and (ii) removes outlier data from groups of data, said pre-processing resulting in the working data, the accessing and pre-processing being automatically performed by the digital processor.

13. A system for controlling an industrial based chemical process, the system comprising: a digital processor; and a process modeler executable by the digital processor such that during execution the digital processor: automatically obtains working data representative of a subject chemical process in an industrial plant; responsively forms in computer memory, an initial model of the subject chemical process based on the working data, the working data having empirical values of dependent variables and independent variables of the subject chemical process, different characteristics of the subject chemical process being represented by different mathematical relationships of respective dependent variables and independent variables, including a certain resource consumption being indicated by one or more independent variables; said forming including the digital processor automatically determining non-linearity of the initial model as a function of non-linearity of the different mathematical relationships in the working data, and automatically training the initial model based on results of the determining, such that: (a) where the different relationships are determined to be substantially linear, then training the initial model as a linear predictive model and resulting in a trained prediction model, and (b) where the different relationships are determined to be quasi-linear or non-linear, then (i) training the initial model as a non-linear predictive model, (ii) combining the trained non-linear predictive model with one or more local linear models dynamically adjusting control variables of the certain resource consumption in respective windows of time, and (iii) producing a resulting trained prediction model based on the non-linear predictive model combined with the one or more local linear models; and deploys the resulting trained prediction model in a manner controlling the subject chemical process at the industrial plant, including optimizing consumption of the certain resource.

14. The system of claim 13, wherein the subject chemical process is continuous, semi-continuous, or continuous in one or more parts.

15. The system of claim 13, wherein the subject chemical process is a Kraft process or similar, and the industrial plant is of a pulp and paper industry.

16. The system of claim 15, wherein the working data represents quality of production of the subject chemical process as a function of the certain resource consumption, and the quality of production is determined by any one or combination of: a measurement of completeness of a pulping process, a Kappa number measurement or equivalent, total alkaline charge, and amount of residual alkali.

17. A system as claimed in claim 15, wherein the certain resource is a reagent utilized in the Kraft process that is white liquor.

18. A system as claimed in claim 15, wherein output of the deployed prediction model further controls any one or combination of: total H factor, a black liquor stream after extraction, digester discharge consistency of a digester at the industrial plant, and liquor temperature in the digester.

19. A system as claimed in claim 15, wherein the prediction model is further configured to manipulate control parameters of the subject chemical process at the industrial plant in a manner that minimizes any one or more of: toxic waste of the industrial plant, waste water, and chemicals in a drying process of the Kraft process.

20. A system as claimed in claim 13 wherein the digital processor obtaining working data includes accessing sensor output data indicative of the subject chemical process; and the process modeler when executed further comprising the digital processor automatically adjusting the prediction model over time based on additional sensor output data.

21. A system as claimed in claim 13, wherein the digital processor obtaining working data includes: automatically accessing sensor output data indicative of the subject chemical process, the accessed sensor output data including one or more time periods of operating states of the chemical process at the industrial plant; and automatically pre-processing the accessed sensor output data in a manner that: (i) groups or otherwise classifies data based on time, and (ii) removes outlier data from the groups of data, said pre-processing resulting in the working data.

22. A non-transitory computer program product controlling an industrial based chemical process, the computer program product comprising a computer-readable medium with computer code instructions stored thereon, the computer code instructions being configured, when executed by a processor, to cause an apparatus associated with the processor to: obtain working data representative of a subject chemical process in an industrial plant; responsively in computer memory, form an initial model of the subject chemical process based on the working data, the working data having empirical values of dependent variables and independent variables of the subject chemical process, different characteristics of the subject chemical process being represented by different mathematical relationships of respective dependent variables and independent variables, including representing quality of production of the subject chemical process as a function of a certain resource consumption, quality of production corresponding to one or more dependent variables, and the certain resource consumption being indicated by one or more independent variables; said responsively form including the apparatus automatically determining non-linearity of the initial model as a function of non-linearity of the different mathematical relationships in the working data, and automatically training the initial model based on results of the determining, such that: (a) where the different relationships are determined to be substantially linear, then training the initial model as a linear predictive model and resulting in a trained prediction model, and (b) where the different relationships are determined to be quasi-linear or non-linear, then (i) training the initial model as a non-linear predictive model, (ii) combining the trained non-linear predictive model with one or more local linear models dynamically adjusting control variables of the certain resource consumption in respective windows of time, and (iii) producing a resulting trained prediction model based on the non-linear predictive model combined with the one or more local linear models; and deploy the resulting trained prediction model in a manner controlling the subject chemical process at the industrial plant, including optimizing consumption of the certain resource.

23. The computer program product of claim 22, wherein obtaining the working data includes: accessing sensor output data indicative of the subject chemical process, the accessed sensor output data including one or more time periods of operating states of the chemical process at the industrial plant; and pre-processing the accessed sensor output data in a manner that: (i) groups or otherwise classifies data based on time, and (ii) removes outlier data from the groups of data, said pre-processing resulting in working data.

24. The computer program product of claim 22 wherein the apparatus automatically determining non-linearity of the initial model includes quantitatively measuring non-linearity of the working data.

25. A computer-implemented method of modeling an industrial-based process, the method comprising: obtaining working data based on the past collected data and real-time collected data indicative of a subject industrial-based process, said obtaining being automatically performed by one or more digital processors; as a function of linearity of the obtained working data, automatically selecting between linearly modeling the subject industrial-based process and non-linearly modeling the subject industrial-based process, said selecting being automatically performed by the one or more digital processors, the working data having empirical values of dependent variables and independent variables of the subject industrial-based process, different characteristics of the subject industrial-based process being represented in the working data by different mathematical relationships of respective dependent variables and independent variables; the one or more digital processors automatically selecting being by: (a) testing linearity of the different mathematical relationships in the obtained working data, and (b) where the different relationships are determined to be substantially linear, then selecting and training a linear predictive model as representative of the subject industrial-based process, and where the different relationships are determined to be quasi-linear or non-linear, then selecting and training a non-linear predictive model as representative of the subject industrial-based process; and generating a resulting model of the subject industrial-based process based on one of: (i) the selected and trained linear predictive model, and (ii) a combination of the selected and trained non-linear predictive model and one or more local linear models dynamically adjusting certain variables of the subject industrial-based process.

26. A computer-implemented method as claimed in claim 25 wherein the testing linearity of the different mathematical relationships in the obtained working data includes the one or more digital processors quantitatively measuring non-linearity of the different mathematical relationships in the obtained working data.

27. A computer-implemented method as claimed in claim 25 wherein the industrial-based process is a chemical process, an industry plant process, or the like.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

[0022] The foregoing will be apparent from the following more particular description of example embodiments, as illustrated in the accompanying drawings in which like reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating embodiments.

[0023] FIG. 1 is a schematic illustration of a chemical process of some embodiments, specifically showing a pulp and paper mill Kraft process and liquor recovery.

[0024] FIG. 2 is a schematic illustration of recycling of white liquor in the Kraft process of FIG. 1.

[0025] FIG. 3 is a schematic illustration depicting a workflow for determining Kappa number through stages of the Kraft process of FIGS. 1 and 2.

[0026] FIGS. 4A and 4B are flowcharts of methods of controlling an industrial process, chemical process, and the like in an embodiment of the present invention.

[0027] FIG. 5 is a schematic illustration of data processing and workflow including three modules of: data selection, model selection, and dynamic sensitivity analysis and optimization. The data processing and workflow produce a prediction model that models a subject chemical process (the Kraft process of FIG. 1 for non-limiting example) in an embodiment.

[0028] FIG. 6 is a z-score visualization depicting sensor data normalization and selection for excluding sensor variance in the data processing of FIG. 5 toward removal of outliers.

[0029] FIG. 7 shows density map 2D projections of sensor data collected and serving as dimension reduced visualizations in the data selection (pre-processing) steps for excluding outlier data in the data processing and workflow of FIG. 5. Represented are density maps by PCA, tSNE, and UMAP dimension-reduction methods.

[0030] FIG. 8 is a timeline visualization of data collected from the sensors throughout the subject chemical process of FIG. 5 based on visual alignments and response variables. In the workflow of FIG. 5, the corresponding timeline visualizations of FIG. 8 may be combined with the 2D projections of FIG. 7 (and other dimension reduced data distribution types) to make better sense of the data distributions of the sensor data.

[0031] FIG. 9 shows cluster overlays on 2D projection maps of multiple operating states of the subject chemical process of FIG. 5. Represented are dimension reduction methods PCA, tSNE, and UMAP applied to collected sensor data, and clustering methods GMM and HDBSCAN applied to the same to improve automation in selecting desired training data without the outliers (isolated clusters with small numbers of data points).

[0032] FIG. 10 is a flowchart of steps for clustering data and testing linearity in an embodiment of the present invention, for non-limiting example, in the model selection module and data processing of FIG. 5.

[0033] FIG. 11 shows multiple bar charts depicting nonlinearity measurement predictors and response variables used or represented in model selection of FIG. 5.

[0034] FIG. 12 shows multiple plots of actual chemical process responses versus predictions by a PLS prediction model considered in model selection of FIG. 5.

[0035] FIG. 13 shows multiple plots of actual chemical process responses versus predictions by an XGBoost (Extreme Gradient Boosting) prediction model considered in model selection of FIG. 5.

[0036] FIG. 14 shows multiple plots of contribution coefficients from PLS and LASSO prediction models with different data conditions in the dynamic sensitivity analysis and model optimization of FIG. 5.

[0037] FIG. 15 shows multiple plots of contribution coefficients for a non-linear function from LASSO/PLS/XGBoost prediction models in the dynamic sensitivity analysis and model optimization of FIG. 5.

[0038] FIG. 16 is a schematic view of a computer network or similar digital processing environment in which embodiments of the present invention may be implemented.

[0039] FIG. 17 is a block diagram of the internal structure of a computer node or device (e.g., client processor/device or server computers) in the computer network of FIG. 16.

[0040] FIG. 18 is a block diagram illustrating an example network environment for data collection, monitoring, and controlling a plant process of the example embodiments disclosed herein.

[0041] FIG. 19 is a block diagram of a process control (or more generally a process modeling and simulation) method and system embodying the present invention and thus controlling an industrial based chemical process, for non-limiting example the Kraft process of FIG. 1.

DETAILED DESCRIPTION

[0042] A description of example embodiments follows. Embodiments of the present invention provide methods and systems of generating data-driven models of an industrial (e.g., chemical) process of interest, especially of non-linear continuous chemical processes. By way of non-limiting example and for purposes of illustrating the principles of the present invention, a pulp and paper industry chemical process is described next. It is understood that embodiments (computer-based methods and systems) of the present invention may be applied to various and numerous other industrial (chemical) processes, non-linear such processes, continuous-like processes and the like, producing data-driven models of the same.

[0043] In the pulp and paper industry, the Kraft process is a process for conversion of wood into wood pulp, which comprises almost pure cellulose fibers, the main component of paper. The Kraft process involves treatment of wood chips with a hot mixture of water, sodium hydroxide, and sodium sulfide, known as white liquor, that breaks the bonds that link lignin, hemicellulose, and cellulose. This treatment, and more generally, the Kraft process entails several steps, both mechanical and chemical including the stages of: impregnation, cooking, recovery, blowing, screening, washing, and bleaching. FIG. 1 presents a flowchart depicting the Kraft process 110 in pertinent detail. Wood and wood chips 111a are added to a container of hot white liquor for chemical digestion 111b. From the chemical digestion, a resulting solution comprising chemicals and wood pulp are washed and bleached with oxygen 111c. Once completed, bleaching 111d is the next step followed by drying 111e producing a product of treated wood pulp that can be formed into paper.

[0044] In addition, the digestion process 111b comprises a separate cycle relative to the cycle described above. From the digestion process 111b, a thin liquor 112a is removed and evaporated 112b, yielding a black liquor 112c. The black liquor is fed to a recovery boiler 112d, where the cycle splits to afford a green liquor 112f and released energy 112e. The green liquor 112f then undergoes causticization 112g that yields white liquor 112k that is fed back into the digestor 111b as so called recovered white liquor. The causticization 112g has an additional cycle where calcium carbonate 112h is removed from the recovered white liquor 112k into a lime kiln 112i to produce lime 112j. The lime 112j then reenters the causticization reactor 112g to complete the cycle.

[0045] At pulp and paper mill factories or industrial plants implementing the Kraft process, there is a mathematical value, called a Kappa number for pulp and paper. The Kappa number provides a quantitative test method for determining a level of lignin remaining in a sample, either finished from the pulp process (Kraft process 110) or during/undergoing the pulp process. The Kappa number is a value that: (a) measures the completeness of the pulping process, as well as (b) provides information about properties of the produced pulp (especially the level of residual lignin present). The Kappa number is based on the various chemical reactions performed during the process, and specifically on potassium permanganate oxidizing the wood to release lignin. Kappa number has a range from 1 to 100. Depending on the wood species involved, it is possible for the Kappa number to be above 100, which may be indicative of precision of the test decreasing and of relationship between Kappa number and lignin content decreasing.

[0046] An equation to represent or approximate Kappa number is shown in Equation 1, wherein lignin percent is equal to Kappa number multiplied by a value (a constant). (A more precise relationship can be established by testing the specific pulp of interest through in-process test to reflect the changing conditions of process flow described in FIG. 3. An embodiment of the present invention may employ a machine learning approach to handle the nonlinearity when the equation cannot be approximated by a linear relationship.)

[00001] $\begin{matrix} Lignin level (%) = Kappa number 0.13 & Equation 1 \end{matrix}$

[0047] Turning to FIGS. 2 and 3, there is a cycle inside of the Kraft process 110 for recovering residual chemicals. Initially, the recycling process 220 begins at a recausticizing plant 221a that outputs white liquor 221b, comprising sodium hydroxide (NaOH) and sodium sulfide (Na.sub.2S). The white liquor 221b is fed into a digestor 221c with the wood chips such as described at 111a and 111b in FIG. 1 above. Through the digester 221c process, the white liquor 221b is converted to what is called black liquor 221d, comprising sodium carbonate (Na.sub.2CO.sub.3), sodium sulfate (Na.sub.2SO.sub.4), sodium thiosulfate (Na.sub.2S.sub.2O.sub.3), and other degraded organics and inorganics from the digestion of the wood chips. The black liquor 221d proceeds into a recovery boiler 221e (like that of 112d in FIG. 1), where a chemical reaction occurs to convert the sodium sulfate and sodium thiosulfate back into sodium sulfide to produce green liquor 221f, which comprises sodium carbonate and sodium sulfide. The recovery boiler 221e also separates the degraded organics from the green liquor 221f. The green liquor is then used in the recausticizing plant 221a to produce white liquor and repeat the cycle (recycling process 220).

[0048] Based on the recovery cycle 220 described herein, it is possible to determine the Kappa number 332 from this cycle. FIG. 3 illustrates and depicts the recovery cycle/recycling process 220 of FIG. 2 in simplified terms. In FIG. 3, the digester 221c at an input end receives or otherwise intakes wood chips and white liquor 221b. At an output end of digester 221c, an interim product flows to a filter 331, where the degraded organics and inorganics are separated from the black liquor 221d. The black liquor is converted to green liquor 221f, which is then converted to white liquor 221b to repeat the process 220 as detailed above. During this process, it is possible to determine the Kappa number 332 based on the amount of lignin found in the pulp interim products post digesting (at 221c) and filtering (at 331).

[0049] The amount of white liquor that goes into the process of digesting wood chips to produce the pulp and eventually the paper of interest is a variable that can be optimized. The optimization of white liquor consumption would result in reducing production of hazardous waste from the Kraft process 110. Such optimization challenges advanced modeling of the intricate wood variable, process variables, and variable interactions in the Kraft process. Applicant's approach of using advanced machine learning (ML) algorithms to model non-linear interactions advantageously unlocks substantial cost savings and waste reduction by reducing chemical and fuel consumption, thereby enhancing cellulose pulp production within existing facilities.

[0050] FIGS. 4A and 4B illustrate a method 440 for controlling an industrial-based chemical process (such as the Kraft process 110 of FIG. 1) in embodiments of the present invention. The method 440 is computer implemented and may be performed via any combination of hardware and software as is known in the art. For non-limiting example, the method 440 may be implemented via one or more digital processors with associated memory storing computer code that causes the processor(s), a controller, a process modeling system, other apparatus, subsystem, or the like to implement steps 441, 442, 443, and 444 of the method 440. Further, the method 440 may be implemented in existing simulation software. In such an implementation, the method 440 and/or any other embodiments described herein, may be implemented in existing software products or systems. Further, it is noted that herein embodiments are not limited to being implemented into existing software and, instead, embodiments can be performed using any combination of hardware and software as is known in the art. Embodiments may be part of the software system or suite that supports, monitors, controls, provides maintenance of, etc. industrial processing plants, refineries, pulp and paper mill factories, chemical or pharmaceutical processing plants, and the like.

[0051] Returning to FIG. 4A, the method 440 begins at step 441 by a digital processor automatically obtaining working data representative of a subject chemical process of an industrial plant. According to an embodiment, the subject chemical process is the Kraft process 110 (FIG. 1) or similar, and the industrial plant is of a mill, factory, refinery, or the like of the pulp and paper industry. Other non-linear continuous-like industrial (e.g., chemical) processes are in the purview of one skilled in the art given this disclosure.

[0052] FIG. 4B elaborates on the step 441 obtaining working data representative of a subject chemical process of an industrial plant. The digital processor in one embodiment and one or more digital processors in other embodiments automatically perform the following tasks 451, 452 of step 441.

[0053] According to an embodiment, obtaining working data includes: accessing sensor output data indicative of the subject chemical process of an industrial plant as shown in step 451. The accessed sensor output data includes one or more time periods of operating states of the subject chemical process.

[0054] Step 452 continues the tasks of step 441 by pre-processing the accessed sensor output data (from step 451). In particular, the digital processor automatically preprocesses the sensor output data at step 452 by: (i) grouping or otherwise classifying data based on time, and (ii) removing outlier data from the groups of data. The pre-processing by step 452 results in working data obtained at step 441.

[0055] The method 440 continues at step 442 (FIG. 4A) where the digital processor responsively forms in computer memory, an initial model of the subject chemical process based on the working data (output from step 441). The working data has empirical values of dependent variables and independent variables of the subject chemical process. Different characteristics of the subject chemical process are represented by different mathematical relationships of respective dependent variables and independent variables. For non-limiting example, the different characteristics include different physics-based or physical aspects of the subject chemical process, different chemical aspects of the subject chemical process, etc. For non-limiting example, quality of production of the subject chemical process is represented as a function of a certain resource consumption. The quality of production corresponds to one or more dependent variables, and the certain resource consumption is indicated by one or more independent variables including a control variable used in process control of the industrial plant. In embodiments where the subject chemical process is the Kraft process 110 in a pulp and paper industry plant, the quality of production of the subject chemical process is determined by any one or combination of: a measurement of completeness of the pulping process, a Kappa number measurement or equivalent, total alkaline charge, and amount of residual alkali. The certain resource consumption is consumption of white liquor or similar reagent in the Kraft process represented by independent (control) variables.

[0056] Next at step 443, the processor automatically determines non-linearity of the initial model (formed by step 442) as a function of non-linearity of the different mathematical relationships in the working data output by step 441. The processor/step 443 automatically trains the initial model based on results of the linearity/non-linearity. Specifically: (a) where the different mathematical relationships in the obtained working data are determined to be substantially linear, then the processor trains the initial model as a linear predictive model resulting in a trained prediction model, and (b) where the different mathematical relationships are determined to be quasi-linear or non-linear, then the processor (i) trains the initial model as a non-linear predictive model, (ii) combines the trained non-linear predictive model with one or more local linear models dynamically adjusting control variables of the certain resource consumption in respective windows of time (time periods), and (iii) produces a resulting trained prediction model based on the non-linear predictive model combined with the one or more local linear models. In some embodiments, step 443 training of the initial model as a linear predictive model employs partial least squares (PLS) regression, and the number of components for the PLS regression is an estimated rank among independent variables and dependent variables. In embodiments, step 443 training of the initial model as a non-linear predictive model employs extreme gradient boosting (XGBoost), and the number of estimators for XGBoost is based on an estimated rank among the independent variables and dependent variables as will be made clear below.

[0057] At step 444, the method 440/processor deploys the resulting trained prediction model (from step 443) in a manner controlling the subject chemical process at the industrial plant, including optimizing consumption of the certain resource, such as white liquor in the above non-limiting example embodiment. Restated, output of the deployed prediction model includes values of plant equipment settings or subject chemical process (Kraft process 110) settings that are the control variables corresponding to the one or more independent variables indicative of the certain resource (white liquor) consumption. In some embodiments, the subject chemical process is continuous, semi-continuous, continuous in parts (or portions), or a combination thereof. According to an embodiment, the method 440 of FIGS. 4A and 4B automatically adjusts the prediction model over time based on additional sensor output data.

[0058] In some embodiments, working data is representative of a continuous, semi-continuous, or continuous in one or more parts or portions process that is based on sensor output data and/or derived from sensor output data.

[0059] In some embodiments, continuous-like can refer to a process that is continuous, semi-continuous, or continuous in one or more parts or portions of the process. It is understood that as used herein the process may be a chemical process, an industrial plant process, a manufacturing process, a reactor process, and the like. The terms industrial-based process, process of interest, and equivalents are used interchangeably with plant process or process given the context of the present disclosure.

[0060] In some embodiments, output of the deployed prediction model further controls any one or combination of: total H factor, a black liquor stream after extraction (filter 331), digester discharge consistency of a digester 221c at the industrial plant, and liquor temperature in the digester. The method 440 may also further configure the prediction model to manipulate control parameters of the subject chemical process (Kraft process 110) at the industrial plant in a manner that minimizes: toxic waste overall, wastewater, and chemicals in a drying stage of the Kraft process in some embodiments.

[0061] In an embodiment, the resulting trained prediction model (deployed at step 444 of FIG. 4A) predicts and optimizes the white liquor consumption using subspace analysis and nonlinear machine learning models detailed next. FIG. 5 is a schematic view of data processing workflow 550 depicting data selection (like that at step 452), model selection (like that at steps 442, 443), and dynamic sensitivity analysis and optimization (like that of step 443 discussed in FIG. 4A). In the workflow 550, data 551 includes raw process sensor data, independent variables data (X), and dependent variables data (Y). As will become clearer below, the workflow and data processing 550 preprocesses the data 551 in data selection module 552 by employing: (i) regime identification (identifying operating states and time periods), (ii) outlier data detection and removal, and (iii) linearity recognition (including quantitative measurements of severity of non-linearity and of gradual gain in non-linearity over time). The data processing in data selection module 552 finds significant operating conditions (of the subject chemical process, e.g., Kraft process 110) to train the prediction model for each condition. Based on the linearity recognition, the data processing workflow 550 determines which route to pursue, either determining the best fit for the data 551 is a linear approach 553a or determining a nonlinear approach 553b is the best fit for the data 551. The determined approach 553a, b informs the workflow model selection 554. In an embodiment, model selection module 554 utilizes partial least squares (PLS) prediction and optimization for linear fits of data (determined at 553a) or utilizes nonlinear (extreme gradient boosting, XGBoost for non-limiting example) prediction and optimization for nonlinear fits of data (determined at 553b). A nonlinear fitting model is needed for variables, like Kappa number prediction, as the relationship between Kappa number and lignin percent varies significantly based on different conditions of the subject Kraft process 110.

[0062] During model training, embodiments estimate contribution coefficients or ranks advantageously increasing accuracy of Applicant's resulting trained prediction model 560. Embodiments provide this and other advantages over the prior art including: [0063] a novel method to measure the degree of nonlinearity by estimating the partial ranks of projections of independent and dependent variables represented by sensor data; [0064] a novel combination of machine learning methods PCA (Principal Component Analysis), tSNE, and UMAP followed by clustering methods HDBSCAN and GMM on the tasks of identifying nonuniform distribution of covariances and spatial connectedness of data 551; [0065] application of XGBoost to model both linear and nonlinear relationships between the independent and dependent variables; [0066] a novel scheme of using local LASSO (Least Absolute Shrinkage and Selection Operator) and PLS to approximate global nonlinear correlations to estimate the sensitivity to perform local optimization; and [0067] novel applications of three data processing results: (a) a normalized score in the range [0,1] for different algorithms to measure the distance of a sample to its cluster center for outlier removal; (b) the optimal number of model components or estimators using the partial ranks; and (c) the partial ranks of concatenated independent and dependent variables to infer the data redundancy.

[0068] In embodiments, data selection module 552 includes both (i) user interactive selection of outlier data for removal from model training, and (ii) automated processor selection of outlier data in source data 551 and removal of same from model training. The user-interactive selection allows domain knowledge to guide outlier data removal in data selection module 552 as illustrated next. In an embodiment, preprocessing in data selection module 552 includes at least the following steps: 1) data dimension reduction by PCA (Principal Component Analysis), tSNE (t-distributed Stochastic Neighbor Embedding), and/or UMAP (Uniform Manifold Approximation and Projection) to visually identify three different types of data distributions: concentration of variances, statistical density, and spatial density; and 2) clustering through combining gaussian mixture models (GMM) and hierarchical density-based spatial clustering of applications with noise (HDBSCAN) on original data 551 and projected data to combine visual observations and machine learning in a hybrid model for outlier removal. Doing so: (a) enables visualization for user-interactive selection of data 551 to be excluded from model training applying domain knowledge at runtime, (b) enables visualization of clusters from high-dimensional algorithms in dimension-reduced 2D planes for automated inspection by black-box machine learning algorithms, and (c) provides visual insights on a timeline and spatial location for each of different types of conditions of the subject chemical process, including in some embodiments each pulp type.

[0069] Data selection module 552 projects high dimension data 551 onto a 2D plane for visualizing spatial and statistical distribution in an augmented (dimension reduced) way. In particular, data processing 550 projects source data 551 into a 2D plane through linear projection, statistical transformation, and spatial mapping using PCA, tSNE, and UMAP, respectively. Data selection module 552 continues with user-interactive steps and processor automated steps as follows.

[0070] In non-limiting embodiments, linear models or algorithms that may be used by the present invention include: principal component analysis (PCA), PLS and LASSO, and combinations thereof. In other non-limiting embodiments, quasi-linear models or algorithms that may be used by the present invention include: GMM, HDBSCAN, SVM, and combinations thereof. In other non-limiting embodiments, non-linear models or algorithms that may be used by the present invention include: GMM, HDBSCAN, tSNE, UMAP, XGBoost, SVM, and combinations thereof. In other non-limiting embodiments, other models or algorithms that may be used to improve the predictions made by the present invention include: Auto Encoder, ridge regressions, NNs (e.g., long short-term memory (LSTM), temporal convolutional network (TCN), Transformer, etc.), and combinations thereof.

Data Selection Module 552: User-Interactive

[0071] Data Selection module 552 uses density mapping techniques to present the 2D projections (visualizations thereof) to a user for interactive visual inspection. The density maps visually illustrate statistical distribution and spatial distribution of sensor data 551. In FIG. 7 density maps of 2D projections 770 are shown for a PCA projection 771a, a tSNE projection 771b, and a UMAP projection 771c. In each of the 2D projections, there are highlighted regions in a circle 772 that denotes outlier data that would be excluded from model training. For clusters revealed in dimension reduced 2D plane, users may perform interactive masking to select data to be excluded from training models. For example, the data points in circled areas 772 are also visualized along a timeline so that users can match the clusters in a 2D plane and the time intervals to apply domain knowledge in outlier data removal. The interactive process is one type of human-in-the-loop approach for online process monitoring.

[0072] Data selection module 552 also performs a Z-score visualization for sensor data normalization and selection. This involves an application of sigma rules, which are as follows: 1) exclusion of sensor data with little variances (no dynamics except bias); 2) exclusion of data outside of 3-sigma for each sensor; and 3) use of selected data to calculate minimum ranks. An example output is shown in FIG. 6, wherein a time series from 42 variables is z-normalized as a 2D plot 660 over the course of time. From this visualization, further 2D plane scans are evaluated by a user in the interactive visualization and selection of model training data.

[0073] The interactive visualizations also can have a timeline visualization based on operating states from GMM and HDBSCAN 880 as shown in FIG. 8. In our Kraft process 110 example case, operating states along the timeline include GMM states 881a and HDBSCAN states 881b, and the visual alignment with three response variables: total alkaline charge 881c, Kappa number 881d, and residual alkali extraction 881c.

Data Selection Module 552: Processor-Automated

[0074] 2D projections can incorporate states from GMM and HDBSCAN. As shown from the 2D view of high-dimensional multivariate, there are several clusters that represent different process conditions. Manual masking is slow and inconsistent over time. To automatically extract the clusters without manual masking and maintain the consistency, the data selection module 552 applies GMM and HDBSCAN to identify the clusters with domain knowledge guided specification of model parameters. Here are some illustrations of the clusters from clustering methods GMM and HDBSCAN (for non-limiting example) in the 2D plane. In FIG. 9, multiple 2D projected clusters 990 are shown from operating states of GMM and HDBSCAN on 2D projections by PCA, tSNE, and UMAP data dimension reductions. These include GMM states on PCA projection 991a, HDBSCAN states on PCA projection 991b, GMM states on tSNE projection 992a, HDBSCAN states on tSNE projection 992b, GMM states on UMAP projection 993a, and HDBSCAN states on UMAP projection 993b. A corresponding timeline visualization is included to show how to combine 2D and timeline visualization to make better sense of the data distributions.

[0075] To improve automation in selecting desired training data without the outliers, the data selection module 552 applies GMM and HDBSCAN to cluster a z-transformed data into multiple components, then an automated processor (without human interactive intervention) masks isolated components with a small number (defined threshold) of data points as outliers to be excluded in training models. This processing advantageously compliments and improves on outcomes of the user-interactive steps described above. For a given test dataset, data selection module 552 performs a dimension reduction-clustering pair of algorithms from dimension-reduction methods PCA, tSNE, and UMAP and clustering methods GMM and HDBSCAN. In turn, data selection module 552 shows the overlay of clusters on a 2D plane for visual data validation by domain experts. To associate the clusters in the 2D plane and location of these clusters along the timeline, the same data points are visualized in FIG. 8 as GMM states along timeline 881a and HDBSCAN states along timeline 881b. Such visualization provides further visual interpretation of the significant data clusters for training the subject model.

[0076] According to the above, embodiments of the present invention provide novel data analysis techniques and use including: (1) a combination of density distribution by GMM and spatial connectedness by HDBSCAN for data conditions, (2) 2D visualization of projected data conditions for interactive user-guided selection, and (3) a unified score of all data conditions from different methods.

[0077] Returning to FIG. 5, data processing workflow 550 next performs model selection 554. In existing model process controllers or industrial plant process modelers, the process model generally used is PLS (Partial Least Squares) or other types of machine learning models through a manual selection process with trial and error. In embodiments of the present invention, the model selection process (module 554) is automatically performed by a digital processor. The digital processor/model selection module 554 measures nonlinearity of the data 551 and provides an empirical criterion to automatically select a linear or nonlinear prediction model as heretofore unachieved and not contemplated in the art. For nonlinear dependency, an embodiment uses the extreme gradient boosting (XGBoost) method to train the prediction model with high accuracy.

[0078] With reference to FIG. 10, a method or workflow for quantitatively testing linearity and measuring severity of non-linearity is presented. In the method/workflow 1010, controllable sensor data 1011a and measured monitored sensor data 1011b are input into clustering step 1012. The clustering step 1012 applies multiple or a combination of GMM and HDBSCAN clustering algorithms (described above) that recognize operating states and produce distance scores. Other clustering and distance scoring techniques are suitable. Once clustered, the data is evaluated and organized at an operating states step 1013. Results (evaluated data clusters per operating states) are output from operating states step 1013 and fed to linearity testing 1014. In embodiments, the outcome of operating states step 1013 includes multiple groups of data with each group representing certain condition(s) of the subject chemical process dynamics. Each group of data is separately fed to the linearity testing 1014. Responsively, linearity test 1014 begins determining whether a linear or a non-linear prediction model should be used given the sensor data 551. In non-limiting examples, the linearity test 1014 may check whether the relationship between chemical/refinery plant feed materials and resulting products are linear and may check whether relationship between chemical/refinery operation control parameters and system output variables are linear. It is noted, such an approach as linearity testing method 1010 may be applicable to any industry that includes both inputs and outputs for selecting linear or nonlinear models.

[0079] In embodiments, the model selection module 552 evaluates a nonlinearity measurement as follows. For a semi-continuous process (such as a non-limiting example Kraft process 110), the nonlinearity measurement may be expressed by a mathematical function between an independent (predictor) variable X and a dependent (response) variable Y=(X). If function (X.sub.t) is linear, then the rank of covariance of sample sets {X.sub.t} and {Z.sub.t}, where Z=(X.sub.t, Y.sub.t), would be the same due to the linear correlation. Thus, the rank difference between the covariances can be used to estimate the degree of nonlinearity. Accordingly, embodiments employ the following steps. The data processing 550 (i.e., modules 552, 554): [0080] (1) perform z-score normalization for both X and Y; [0081] (2) calculate minimum dimensions of selected samples of X and Z=(X, Y) for a given percentage of total mean squared error in a projected subspace; [0082] (3) average dimensions and differences for evenly sampled percents in [0.90, 0.99] (with a default number of 5 samples); [0083] One embodiment utilizes a regression linearity ranking function. The regression function measures the partial dimension of combined data Z to keep the same percent of covariance from predictor variables X and the combined variables Z. Briefly the regression function in pseudo code is: [0084] For a percent , calculate the difference of minimum ranks of {X.sub.t} and {X.sub.t, Y.sub.t} with keeping percent of covariances; [0085] Take evenly sampled percents in [0.90, 0.99], and then regress the integer ranks to estimate the average of extra dimensions introduced by nonlinear (); [0086] The extra dimension is zero if the function (X.sub.t) is linear but could become a fraction depending on the nonlinearity of (X.sub.t); [0087] (4) use an average dimension to define the model size when initializing a prediction model for both linear and nonlinear cases; [0088] (5) use the difference (from step 3 extra dimension) to measure the nonlinearity with the following criteria: (a) very linear for a range 0.0 to about 0.2 not inclusive, i.e., [0.0, 0.20) which will recommend a linear model; (b) quasilinear for a range of about 0.2 to about 0.5 not inclusive, i.e., [0.20, 0.50) which will prefer to use a nonlinear model; and (c) nonlinear for a range about 0.5 to about dimension of Y not inclusive, i.e., [0.50, dimension of Y) which will recommend to use a nonlinear model. Use of other criteria ranges for linear, quasilinear, and/or nonlinear are suitable. In this way, embodiments quantitatively test linearity and measure severity of non-linearity (including gain in non-linearity over time).

[0089] In some embodiments, the trained prediction model 560 predicts at least three response values. In embodiments pertaining to the Kraft process 110, the three response values are: Kappa number, total alkaline charge, and residual alkali. In that case, the data processing 550 includes: (1) the data selection module 552 recognizing an operating state (i.e. a combination of certain types of wood chips and pulps) and calculating ranks of concentrated independent and dependent variables (i.e. measured sensor data 551); (2) the data selection module 552 using increased ranks and determining whether a linear relationship exists, where a determined nonlinear relationship exists, then use a nonlinear algorithm (e.g. XGBoost) to train the prediction model 560; (3) the model selection module 554 using estimated ranks to initialize a number of estimators for the nonlinear algorithm (e.g. XGBoost), or a number of components for a linear algorithm (e.g. PLS) such as: the number of components for the linear algorithm is the estimated rank among the independent and dependent variables, and a multiplication of number of estimators and depth of the nonlinear algorithm is equal to twice the estimated ranks; (4) the model selection module 554 balancing at least three prediction variables and configuring a single model with multiple inputs and multiple outputs for all used algorithms (e.g. global XGBoost and local LASSO) through packing/unpacking high-dimensional data; and (5) prediction accuracy being measured by a normalized mean square error which is consistent with model requirements.

[0090] In other embodiments, a nonlinearity measurement of predictor and response variables were evaluated as shown in FIG. 11. Comparative data 1110 were assessed as nonlinear data 1111a and 1111b and linear data 1112a and 1112b using the above-described nonlinearity measurement and a rainbow statistical method. Based on the results, embodiments were significantly better than the rainbow method, as the rainbow method test did not work with data except some simulation data.

[0091] In other embodiments, data was evaluated from the Kraft process 110 using a linear PLS algorithm from the largest component by HDBSCAN as shown in FIG. 12. Each of the three response variable values 1210: Kappa number 1211b and 1212b, total alkaline charge 1211a and 1212a, and residual alkali 1211c and 1212c, were evaluated based on actual responses and predictions from training data. The result was poor correlation between the actual responses and the predictions from the algorithm.

[0092] The XGBoost algorithm was then applied and evaluated similarly to the PLS algorithm. XGBoost is a state of art algorithm for small to mid-sized tabular data prediction tasks. Data from the cooking process of the Kraft process 110 is tabular and nonlinear with size from hundreds to thousands of samples. Thus in one embodiment, XGBoost is employed to solve the white liquor prediction and consumption optimization tasks and in turn overcome limitations of existing linear model approaches. To maintain consistency and compatibility in using XGBoost with existing linear methods, Applicants adopted the following initialization and training techniques: (1) number of estimators, maximum depth, and maximum leaves are proportional to the average dimension obtained from a previous step (detailed above). Empirical criterion is that multiplication of the three parameters is three times the average dimension; and (2) selection of the three parameters is further optimized from maximizing the score of XGBoost.

[0093] In other embodiments, data was evaluated from the Kraft process 110 using a nonlinear XGBoost algorithm from largest component by HDBSCAN as shown in FIG. 13. Each of the three response variable values 1310: Kappa number 1311b and 1312b, total alkaline charge 1311a and 1312a, and residual alkali 1311c and 1312c, were evaluated based on actual responses and predictions from training data. The result revealed a much higher correlation between the actual responses and the predictions from the algorithm, indicating significant improvement from the linear method of FIG. 12.

[0094] In some embodiments, PLS-based prediction model coefficients can be used for contribution analysis and optimization control. For the XGBoost model described in FIG. 13, no reliable methods were available to estimate contribution coefficients prior to Applicant's present invention. Some common approaches, such as permutation feature and Shapely value, cannot be explained well by first principles theory and do not provide consistent results for sensor data acquired from real world applications (e.g. a refinery process). Another drawback from PLS is that the coefficients are fixed after a prediction model is trained.

[0095] That is to say, after selecting a linear model configuration or a non-linear model configuration for the target prediction model, model selection module 554 next configures model coefficients. Specifically, model selection module 554 bases model coefficients on relative importance (importance factors) and optimizes predictor variables such as white liquor consumption in our example case Kraft process 110. In embodiments, the model selection module 554 uses data 551 in an interval (time interval) to estimate the importance factors of contributing variables which can be denoted as {X.sub.i} and {Y.sub.i}. In a semi-continuous or continuous process (such as Kraft process 110), there is a data shift () that is accounted for or otherwise considered for the interval. The model selection module 554 selects the length of according to the data acquisition frequency and chemical reaction response time. In one embodiment, the larger of the two values is chosen to define the length of . The model selection module 554 accordingly shifts the data an interval based on to estimate contributing variables for the same importance factors estimated from the previous step. The contributing variable values are thus moved to {X.sub.i+} and {Y.sub.i+}. In this way, embodiments employ duo-interval values to analyze importance factors and to optimize Kappa number (a response variable Y). Such duo-interval data technique can be used in modeling other semi-continuous or continuous processes and configuring model coefficients for manipulating control parameters to reach desired behaviors. For non-limiting example, the model coefficient configuration can be applied to: reducing toxic waste, wastewater, and certain chemicals in a drying process for pulp and paper; calculating control set input in model predictive control to reach desired output; and/or optimizing distribution system in a short-term supply chain (to increase profit).

[0096] Embodiments use a local PLS model and LASSO (least absolute shrinkage and selection operator) model on a small set of samples in a moving window to calculate coefficients when the prediction model 560 is approximated in a local interval. A subject chemical process, industrial plant process, or otherwise operation of interest is described in Equation 2. Embodiments determine, for predictor-corresponding response (X.sub.t, Y.sub.t), the amount of contribution of each X.sub.t to Y.sub.t. For a new target Y.sub.t+4, embodiments also determine how to change X.sub.t+ to meet Y.sub.t+=(X.sub.t+) as follows:

[00002] $\begin{matrix} Y_{t} = f (X_{t}), a process or operation & Equation 2 \end{matrix}$

[0097] For samples of {X.sub.i} predictor and {Y.sub.i} response, LASSO approximation of the function is expressed as Equation 3. In Equation 3, N is the number of samples for {X.sub.i} and {Y.sub.i}, and {.sub.j} is the solution of Lasso under constraints. The purpose of this regression is for variable selection in terms of which and how much each variable contributes to the target. If function (X.sub.t) is linear, {.sub.j, j=1, . . . , N} may be the exact solution if all the constraints are met. Further, for linear cases, Equation 3 may be an exact solution but may be a local approximation for nonlinear cases.

[00003] $\begin{matrix} \min_{} {{.Math.}_{i = 1}^{N} {({.Math.}_{j} (_{j} X_{i}^{j} +_{0}) - Y_{i})}^{2} + {.Math.}_{j} .Math._{j} .Math.} & Equation 3 \end{matrix}$

[0098] In other non-limiting embodiments, if optimizing a change to response Y.sub.t+ is desired, then the corresponding changes to prediction X.sub.t+ need to be solved. These changes can be solved according to Equation 4. Let custom-character and stand for the desired predictors and response, respectively. Correspondingly, locally assume that and follows the same model as obtained from the previous LASSO regression.

[00004] $\begin{matrix} \min {{.Math.}_{j = 1}^{N} {(B_{j} +_{0} -)}^{2} + .Math. - \overline{\overline{X_{t}}} .Math.} & Equation 4 \end{matrix}$

[0099] In Equation 4, X.sub.t may be the local average for the solution custom-character to be close to the previous states, and may be subject to constraints such as lower/upper bounds.

[0100] In other non-limiting embodiments, alternating the regression of Equation 4 may provide a more accurate estimation as described in Equation 5. Equation 5 moves Y.sub.t to Y.sub.t+ in multiple steps.

[00005] $\begin{matrix} \min {.Math.}_{j = 1}^{N} {({\tilde{}}_{J} + - {\tilde{Y}}_{t +})}^{2} & Equation 5 \end{matrix}$

[0101] Based on validation experiments, the coefficients from PLS and LASSO are similar in term of ranks for the important variables. For simplicity without loss of generality, the coefficients from a nonlinear mathematical function x+y.sup.2z.sup.4 in three different regions of [0.0, 0.1][0.0, 0.1][0.0, 0.1], [0.5, 0.6][0.5, 0.6][0.5, 0.6], and [1.0, 1.1][1.0, 1.1][1.0, 1.1] are illustrated in FIG. 15. In FIG. 14, contribution coefficients from PLS and LASSO with different data conditions 1410 that model agnostic and dynamic data are shown as PLS coefficient (fixed) 1411a, LASSO coefficient (trained) 1411b, LASSO coefficient (tested) 1411c, and LASSO coefficient (local) 1411d. By comparison in FIG. 15, coefficients from LASSO/PLS/Shapley with illustration from a nonlinear math function 1510 are shown as a LASSO coefficient 1511a, a XGBoost coefficient 1511b, and a PLS coefficient 1511c. It is noted that the contribution from Shapley values for XGBoost exposed inconsistency on the first variable x.sub.1. Shapley values do not provide the correct directions of the contribution compared with the PLS and LASSO models. Thus, in some embodiments, coefficients from PLS and LASSO are used to calculate dynamic contribution along a timeline.

[0102] To find the optimized predictor variable X for a modified response Y, an embodiment (model selection module 554) applies or otherwise uses a local LASSO in a predefined window with corresponding constraints (Equation 6):

[00006] $\begin{matrix} \min_{\tilde{X}} {{.Math.}_{j = 1}^{N} {(_{j} {\tilde{X}}^{j} +_{0} - \tilde{Y})}^{2} + .Math. \tilde{X} - \overline{X} .Math., subject to constraints . & Equation 6 \end{matrix}$

here {tilde over (X)} is the average of samples (data 551) of predictor variable {tilde over (X)} in a time window of [t.sub.0, t.sub.1], Y is the desired response at next timestamp of t.sub.1+t, and {tilde over (X)} is the optimal solution for given . Note that such local LASSO may not be applicable if the change from Y to Y is not continuously dependent on the changes from X to {tilde over (X)}. The recommended use case would be within the same process control state. For the contribution analysis during the transition period, a nonlinear approach may be needed to reflect the complicated relationship between the predictors and responses (i.e., variables X, Y).

[0103] In other embodiments, a combination of PLS and LASSO models may be employed in an iterative fashion in calculating the contribution coefficients and performing optimization in a local time window. This novel approach and technique can also be interpretated as a federated gradient-based approach because the iteration is performed on a group of points instead of individual data points in the traditional gradient-based optimization solver. In sum, embodiments of the present invention apply local linear models (e.g., PLS, LASSO, etc.) to address global nonlinear data 551. As shown in the previously mentioned math function of FIG. 15 and evaluations of product data, the PLS and LASSO models provide more consistent results than Shapley values and permutation features.

Computer Support

[0104] FIG. 16 illustrates a computer network or similar digital processing environment in which the present invention may be implemented. Client computer(s)/devices 50 and server computer(s) 60 provide processing, storage, and input/output devices executing application programs and the like. Client computer(s)/devices 50 can also be linked through communications network 70 to other computing devices, including other client devices/processes 50 and server computer(s) 60. Communications network 70 can be part of a remote access network, a global network (e.g., the Internet), cloud computing servers or service, a worldwide collection of computers, Local area or Wide area networks, and gateways that currently use respective protocols (TCP/IP, Bluetooth, etc.) to communicate with one another. Other electronic device/computer network architectures are suitable.

[0105] Client computers/devices 50 and server computer(s) 60 may execute any of the modules, computation steps, or data processes embodying the functionalities and workflows of the present invention as detailed in FIGS. 4A-15. Client computers/devices 50 and server computer(s) 60 may store any of the data utilized in the steps or processes of embodiments of the invention including data 551, 553a,b, visualizations 660, 770, 880, 990, contribution coefficient values 1410, 1510, and prediction models 560, 2110 (FIG. 19). Furthermore, client computers/devices 50 and server computer(s) 60 may be part of system 2140, 3100 (described below in FIGS. 18, 19) for process control modeling and/or simulation, and function as a controller 2122 and/or process modeling system 2130 that generate and apply working models 2110, configure and maintain settings 2132, and control subject chemical process 2124, 110.

[0106] FIG. 17 is a diagram of the internal structure of a computer node (e.g., client processor/device 50 or server computers 60) in the computer system of FIG. 16. Each computer 50, 60 contains system bus 79, where a bus is a set of hardware lines used for data transfer among the components of a computer or processing system. Bus 79 is essentially a shared conduit that connects different elements of a computer system (e.g., processor, disk storage, memory, input/output ports, network ports, etc.) that enables the transfer of information between the elements. Attached to system bus 79 is I/O device interface 82 for connecting various input and output devices (e.g., keyboard, mouse, displays, printers, speakers, etc.) to the computer 50, 60. Network interface 86 allows the computer 50, 60 to connect to various other devices attached to a network (e.g., network 70 of FIG. 16). Memory 90 provides volatile storage for computer software instructions 92 and data 94 used to implement an embodiment of the present invention (e.g., modules, methods 440, 550, 1010, user-interface, and code thereof detailed above). Disk storage 95 provides non-volatile storage for computer software instructions 92 and data 94 used to implement an embodiment of the present invention. Central processor unit 84 is also attached to system bus 79 and provides for the execution of computer instructions.

[0107] In one embodiment, the processor routines 92 and data 94 are a computer program product (generally referenced 92), including a computer readable medium (e.g., a removable storage medium such as one or more DVD-ROM's, CD-ROM's, diskettes, tapes, cloud storage, SD cards, etc.) that provides at least a portion of the software instructions for the invention system. Computer program product 92 can be installed by any suitable software installation procedure, as is well known in the art. In another embodiment, at least a portion of the software instructions may also be downloaded over a cable, communication and/or wireless connection. In other embodiments, the invention programs are a computer program propagated signal product 107 embodied on a propagated signal on a propagation medium (e.g., a radio wave, an infrared wave, a laser wave, a sound wave, or an electrical wave propagated over a global network such as the Internet, or other network(s)). Such carrier medium or signals provide at least a portion of the software instructions for the present invention routines/program 92.

[0108] In alternate embodiments, the propagated signal is an analog carrier wave or digital signal carried on the propagated medium. For example, the propagated signal may be a digitized signal propagated over a global network (e.g., the Internet), a telecommunications network, or other network. In one embodiment, the propagated signal is a signal that is transmitted over the propagation medium over a period of time, such as the instructions for a software application sent in packets over a network over a period of milliseconds, seconds, minutes, or longer. In another embodiment, the computer readable medium of computer program product 92 is a propagation medium that the computer system 50 may receive and read, such as by receiving the propagation medium and identifying a propagated signal embodied in the propagation medium, as described above for computer program propagated signal product.

[0109] Generally speaking, the term carrier medium or transient carrier may encompass the foregoing transient signals, propagated signals, propagated medium, storage medium and the like. In other embodiments, the program product 92 may be implemented as a so-called Software as a Service (SaaS), or other installation or communication supporting end-users.

[0110] It should be understood that the example embodiments described herein may be implemented in many different ways. In some instances, the various methods and machines described herein may each be implemented by a physical, virtual, or hybrid general purpose computer, such as the computer system 50, 60, 2140, 3100, or a computer network environment such as those described below in relation to FIGS. 16-19. The computer system 50, 60, 2140, 3100 may be transformed into the machines that execute the methods described herein, for example, by loading software instructions into either memory 90 or non-volatile storage 95 for execution by the CPU 84. One of ordinary skill in the art should further understand that the system 50, 60, 2140, 3100 and its various components may be configured to carry out any embodiments or combination of embodiments, features, and functions described herein. Further, the system 50, 60, 2140, 3100 may implement the various embodiments described herein utilizing any combination of hardware, software, and firmware modules operatively coupled, internally, or externally, to the system 50, 60, 2140, 3100.

[0111] Turning to FIG. 19, illustrated is a process control (or more generally a modeling and simulation) method and system 2140 embodying the present invention. The process control method and system 2140 employ the mentioned new methods and techniques (functionality) for generating an improved model of a chemical/industrial process 2124 (such as Kraft process 110 of FIG. 1, for non-limiting example). Briefly, an industrial plant (chemical processing plant, refinery, pulp and paper plant, or the like) 2120 performs chemical processes of interest 2124. Non-limiting examples include paper pulp manufacturing, pharmaceuticals production, petroleum refining, polymer processing, and so on. Plant equipment (at 2120) for implementing processes 2124 includes distillation columns, various kinds of reactors and reactor tanks, digesters, boilers, evaporators, pipe systems, valves, heaters, etc. by way of illustration and not limitation. Plant data 2105 represents inputs (monitored sensor data, etc.) and outputs (products, residuals, physical operating characteristics/conditions, etc.) of the chemical process 2124. A controller 2122 employs model process control to configure and maintain settings 2132 (i.e., parameter values, temperature selection, pressure settings, flow rate, other values of variables representing physical characteristics) operating the plant equipment in carrying out the subject chemical process 2124. Controller 2122 may also perform longer term planning and scheduling operations for settings 2132.

[0112] The process control, planning, scheduling, and real-time optimization applications are based on models (of the subject physical, chemical, or engineering process 2124) generated by process modeling system 2130. Plant 2120 may have any number of chemical processes 2124, any number of controllers 2122, and any number of process modeling systems 2130 used to configure and maintain their respective settings 2132.

[0113] In prior art methods, process modeling system 2130 may have utilized controlling an industrial based chemical process. In embodiments of the present invention, the process modeling system 2130 generates and deploys improved models 2110 of the subject chemical process 2124 generated from received and working data 2101a, 2101b, and 2101c (generally 2101) detailing the physical characteristics and operating conditions of chemical process 2124 and from initial models 2102a, 2102b, and 2102c (generally 2102) representing linearity-based prediction models (non-linear, quasi-linear, or linear) of the chemical process 2124. Working data 2101a, 2101b, and 2101c may be received as part of plant data 2105, part of a preexisting dataset, include simulated or otherwise derived data, preprocessed (dimension reduced, outliers removed) data, or any combination of the aforementioned. Models 2110 may be generated from any amount of working data 2101 and initial models 2102 (including global non-linear, and local linear models described above).

[0114] The models 2110 (also referred to herein as resulting models 2110) predict, with improved accuracy, the progress and physical characteristics/conditions of the subject chemical process 2124 (such as white liquor consumption of a Kraft process 110 in FIG. 1 for non-limiting example). The predictions generated by resulting models 2110 also reflect empirically fit or supported aspects of the initial models 2102a, 2102b, and 2102c. The predictions enable improved performance and operability of the subject chemical process 2124 by any of: enabling a process engineer to more effectively troubleshoot the chemical process, enabling debottlenecking of the chemical process, planning and scheduling actions, optimizing consumption of a certain resource of the chemical process at the industrial plant, and optimizing performance of the chemical process at the industrial plant 2120. The predictions of resulting models 2110 may further include indications of any need to update the settings 2132, specific values to quantitatively update the settings 2132, and the desired time to update the settings 2132. FIG. 5 and workflow 550 further details the method by which process control system 2140 (and particularly its process modeling system 2130): (a) constructs Applicant's inventive and advantageous resulting models 2110 utilized to mathematically model a chemical process with increased prediction accuracy accounting for process non-linearities, (b) automatically trains said resulting model 2110, and (c) deploys said model 2110 in a manner controlling the subject chemical process 2124 to optimize consumption of the certain resource.

[0115] In a generalized sense, controller 2122 is an interface between process modeling system 2130 and industrial plant 2120. Other interfaces between process modeling system 2130 and plant 2120 in addition to and/or instead of controller 2122 are suitable and in the purview of one skilled in the art given the disclosure herein. For example, there may be an interface between process modeling system 2130 and plant 2120 systems. There may be a user interface for process modeling system 2130. Process modeling system 2130 may effectively be part of a simulator or optimizer for non-limiting examples. Various such interfaces enable an end user, e.g., process engineer, to utilize model predictions in (a) determining different mathematical relationships (of the chemical process physics-based or chemistry-based characteristics) that are substantially linear, then training the initial model as a linear predictive model, in (b) where the different mathematical relationships are determined to be quasi-linear or non-linear, then (i) training the initial model as a global non-linear predictive model, (ii) combining the trained global non-linear predictive model with one or more local linear models dynamically adjusting control variables of the certain resource consumption in respective windows of time, and (iii) producing a resulting trained prediction model based on the global non-linear predictive model combined with the one or more local linear models. In embodiments, an interface enables a process engineer to utilize the model predictions in optimizing (online or offline) the chemical process 2124 at the plant 2120. In these and other similar ways, embodiments enable various improvements in performance of the chemical process 2124 at the subject plant 2120.

[0116] FIG. 18 illustrates a block diagram depicting an example network environment 3100 for monitoring plant processes in many embodiments. System computers 3101, 3102 may operate as controllers. In some embodiments, each one of the system computers 3101, 3102 may operate in real-time as a controller alone, or the computers 3101, 3102 may operate together as distributed processors contributing to real-time operations as a single controller. In other embodiments, additional system computers 3112 may also operate as distributed processors contributing to the real-time operation as a controller.

[0117] The system computers 3101 and 3102 may communicate with the data server 3103 to access collected data for measurable process variables from a historian database 3111. The data server 3103 may be further communicatively coupled to a distributed control system (DCS) 3104, or any other plant control system, which may be configured with instruments 3109A-3109I, 3106, 3107 that collect data at a regular sampling period (e.g., one sample per minute) for the measurable process variables. Instruments 3106, 3107 are online analyzers (e.g., gas chromatographs) that collect data at a longer sampling period. The instruments 3109A-3109I, 3106, 3107 may communicate the collected data to an instrumentation computer 3105, also configured in the DCS 3104, and the instrumentation computer 3105 may in turn communicate the collected data to the data server 3103 over communications network 3108. The data server 3103 may then archive the collected data in the historian database 3111 for model calibration, inferential model training purposes, and the like. The data collected varies according to the type of target process.

[0118] The collected data may include measurements for various measurable process variables. These measurements may include, for example, a feed stream flow rate as measured by a flow meter 3109B, a feed stream temperature as measured by a temperature sensor 3109C, component feed concentrations as determined by an analyzer 3109A, and reflux stream temperature in a pipe as measured by a temperature sensor 3109D. The collected data may also include measurements for process output stream variables, such as, for example, the concentration of produced materials, as measured by analyzers 3106 and 3107. The collected data may further include measurements for manipulated input variables, such as, for example, reflux flow rate as set by valve 3109F and determined by flow meter 3109H, a re-boiler steam flow rate as set by valve 3109E and measured by flow meter 3109I, and pressure in a column as controlled by a valve 3109G. The collected data reflect the operation conditions of the representative plant during a particular sampling period. The collected data is archived in the historian database 3111 for model calibration and inferential model training purposes. The data collected varies according to the type of target process.

[0119] The system computers 3101 or 3102 may execute various types of process controllers for online deployment purposes. The output values generated by the controller(s) on the system computers 3101 or 3102 may be provided to the instrumentation computer 3105 over the network 108 for an operator to view, or may be provided to automatically program any other component of the DCS 3104, or any other plant control system or processing system coupled to the DCS system 3104. Alternatively, the instrumentation computer 3105 can store the historical data through the data server 3103 in the historian database 3111 and execute the process controller(s) in a stand-alone mode. Collectively, the instrumentation computer 3105, the data server 3103, and various sensors and output drivers (e.g., 3109A-3109I, 3106, 3107) form the DCS 3104 and can work together to implement and run the presented application.

[0120] The example architecture 3100 of the computer system supports the process operation of a representative plant. In this embodiment, the representative plant may be, for non-limiting example, a pulp and paper refinery or a chemical processing plant having a number of measurable process variables, such as, for example, temperature, pressure, and flow rate variables. It should be understood that in other embodiments a wide variety of other types of technological processes or equipment in the useful arts may be used.

[0121] Embodiments or aspects thereof may be implemented in the form of hardware, firmware, or software. If implemented in software, the software may be stored on any non-transient computer readable medium that is configured to enable a processor to load the software or subsets of instructions thereof. The processor then executes the instructions and is configured to operate or cause an apparatus to operate in a manner as described herein.

[0122] Further, firmware, software, routines, or instructions may be described herein as performing certain actions and/or functions of the data processors. However, it should be appreciated that such descriptions contained herein are merely for convenience and that such actions in fact result from computing devices, processors, controllers, or other devices executing the firmware, software, routines, instructions, etc. Likewise, where a digital processor is described as performing certain actions, it is understood that one or more digital processors may be performing the actions.

[0123] It should be understood that the flow diagrams, block diagrams, and network diagrams may include more or fewer elements, be arranged differently, or be represented differently. But it further should be understood that certain implementations may dictate the block and network diagrams and the number of block and network diagrams illustrating the execution of the embodiments be implemented in a particular way.

[0124] Accordingly, further embodiments may also be implemented in a variety of computer architectures, physical, virtual, cloud computers, and/or some combination thereof, and thus, the data processors described herein are intended for purposes of illustration only and not as a limitation of the embodiments.

[0125] While example embodiments have been particularly shown and described, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the embodiments encompassed by the appended claims.

[0126] For example, the foregoing description and details of embodiments in the figures reference Applicant-Assignee AspenTech, tools and platforms, for purposes of illustration and not limitation. Other similar tools and platforms are suitable. In one embodiment, the forgoing methods, techniques, and functions may be integrated into existing software programs or products, such as Applicant-Assignee Aspen Tech ProMV (trademark). Functions are implemented through python-based services (for non-limiting example) to ensure the reusability for on-prem, cloud, and MS Edge (trademark) applications. REST (Represented State Transfer) API may be utilized to unify data input and output to neutralize a specific domain characteristic of different applications, and/or facilitate economic and quick integration across products, solutions, and platforms. Other integration techniques are suitable and in the purview of one skilled in the art given this disclosure.

[0127] Given the above, the present invention provides advantages for process model/algorithm selection based on working data from chemical processes of interest. The determined process model/algorithm optimizes independent and dependent variables and then automatically continues to monitor and adjust the model/algorithm to best fit the working data. In this way, Applicant's approach is (and the methods and systems embodying the present invention are) a data-driven generation of process models of nonlinear, continuous-like industrial (e.g., chemical) processes. The subject chemical process may be continuous, semi-continuous, or continuous in one or more parts or portions. In such industrial (e.g., chemical) processes, the computer-automated data-driven selection of operating conditions and models is unachieved in the prior art. Applicant's computer-automated data-driven selection of operating conditions and models is advantageously more consistent than manual domain knowledge selection in the art. The dynamic calculations of dynamic coefficients and optimization through moving windows of time are advantageous over prior global objective function and solver methods for determining optimized independent variables.

[0128] The non-linear correlations of the present invention help fill the gap of modeling processes that are somewhere between fully continuous and fully batch operations. As described above, the pulp and paper industry is a good example because of the discrete nature of the sequential subprocesses from digesting the wood fibers to extruding the sheets of finished paper. Part of the process is batch; other parts are continuous. Other industries that share similar process traits are reactors that are batch operated as individual process units but also have characteristics of an overall semi-continuous process when one batch can affect the next one in the same production line. Establishing a lag of process conditions from the previous batch to predict the current one may not always linearly model the process with sufficient accuracy. Some polymer reactors and reactor furnaces such as smelting furnaces have this complexity due to less than ideal operating practices where reaction products are not completely recovered between batches. Since the art will continue to be challenged with complex processes, Applicant with the present invention data driven approach provides an alternative way to empirically model such industrial (e.g., chemical) processes.

[0129] Applicant's methods and systems advantageously automatically detect linearity for purposes of digital processor selection of prediction (e.g., process) model. As detailed above, the methods and systems quantify severity of linearity, i.e., quantitatively measure non-linearity, and track over time (across the moving windows of time periods) any gradual changes in linearity/non-linearity of data and relationships therein. Such quantitative measuring of gain in non-linearity and quantitative metrics of linearity/non-linearity severity in embodiments is advantageous in process modeling increasing model accuracy and model performance, and is heretofore not contemplated in the prior art.

Process Modelling Method and System for Non-Linear Continuous-Like Process and Application in Pulp and Paper Industry

Inventors

Cpc classification

Classification Explorer

D21H23/78

TEXTILES; PAPER

Classification Explorer

D21H11/04

TEXTILES; PAPER

International classification

Classification Explorer

D21H23/78

TEXTILES; PAPER

Classification Explorer

D21H11/04

TEXTILES; PAPER

Abstract

Claims

Description