CONTROLLING AND MONITORING A PROCESS TO PRODUCE A CHEMICAL, PHARMACEUTICAL OR BIOTECHNOLOGICAL PRODUCT
20190219992 · 2019-07-18
Assignee
Inventors
- Christian Grimm (Göttingen, DE)
- Mario Becker (Göttingen, DE)
- Lars Böttcher (Göttingen, DE)
- Thorsten Adams (Göttingen, DE)
Cpc classification
C12M41/36
CHEMISTRY; METALLURGY
Y02P90/02
GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
G05B2219/32201
PHYSICS
G05B2219/31265
PHYSICS
International classification
G05B19/418
PHYSICS
Abstract
A computer system and computer-implemented method are described for controlling and monitoring a process to produce a chemical, pharmaceutical or biotechnological product. The method includes providing a database that stores sets of process parameters to control and monitor a plurality of processes performed in order to produce products, receiving a set of characterizing process parameters that characterize the process, identifying a first set process parameters from the stored sets of process parameters, and controlling and monitoring the process using a successful trajectory that includes a time-based profile of measurements.
Claims
1. A computer-implemented method of controlling and monitoring a process to produce a chemical, pharmaceutical or biotechnological product, comprising: providing a database, the database storing sets of process parameters to control and monitor respective ones of a plurality of processes performed in order to produce products, wherein each of the stored sets of process parameters is associated with a successful trajectory of a respective one of the processes performed according to the respective set process parameters, wherein each successful trajectory is a time-based profile of measurements recorded during performance of the respective process; receiving a set of characterizing process parameters that characterize the process; identifying a first set process parameters from the stored sets of process parameters, the first set of process parameters having a specified degree of similarity to the set of characterizing process parameters, wherein the first set of process parameters is associated with a first successful trajectory, controlling and monitoring the process using the first successful trajectory, comprising: recording measurements of the process; and estimating a trajectory of the process based on the recorded measurements.
2. The method of claim 1, wherein controlling and monitoring the process using the first successful trajectory further comprises: comparing the estimated trajectory with the first successful trajectory; when a difference between the estimated trajectory and the first successful trajectory fails a confidence criterion, comparing the recorded measurements and the set of characterizing process parameters to a plurality of the stored sets of process parameters, and determining, based on the comparison, whether a second set of process parameters from the plurality of stored sets of process parameters has a greater degree of similarity to the set of characterizing process parameters and the recorded measurements than the first set of process parameters; when the second set of process parameters is determined, controlling and monitoring the process using a second successful trajectory associated with the second set of process parameters; and when a finishing condition is met, determining that the process is complete.
3. The method of claim 1, wherein the characterizing process parameters and the stored sets of process parameters include numerical values and text based values; wherein identifying the first set of stored process parameters comprises: comparing, via multivariate data analysis, numerical values of the set of characterizing process parameters with numerical values of the sets of stored process parameters.
4. The method of claim 1, wherein identifying the first set of process parameters comprises determining, via multivariate analysis, stored process parameters that have an effect on quality attributes included in the characterizing process parameters, wherein the determining may comprise ranking stored process parameters from those having the most significant effect on the included quality attributes to those having the least significant effect on the included quality attributes, wherein the determined process parameters are not included in the characterizing process parameters, wherein the first set of stored process parameters includes the determined process parameters, and wherein the first set of stored process parameters may include determined process parameters that have a relatively significant effect on the included quality attributes according to the ranking.
5. The method of claim 4, wherein the quality attributes include one or more of the following: misincorporation, Glycosylation, recovery, misfolds, aggregation, product concentration, protein concentration, moisture, foreign particles, total mass, drug release rate, and total drug content.
6. The method of claim 4, wherein the quality attributes include one or more of the following: titer, power consumption, a target duration of the process, cell density, and volumetric productivity, wherein volumetric productivity is measured in volume yield of product per unit of time.
7. The method of claim 2, wherein comparing the recorded measurements and the set of characterizing process parameters to the plurality of stored sets of process parameters comprises: determining a plurality of principal components from the characterizing process parameters and the recorded measurements; and calculating a characterizing numerical description of the process as a function of the plurality of principal components; wherein the first set of process parameters includes a first numerical description of the respective process calculated as a function of a plurality of principal components derived from the first set of process parameters; wherein the second set of process parameters includes a second numerical description of the respective process calculated as a function of a plurality of principal components derived from the second set of process parameters; wherein determining the second set of process parameters comprises determining that the characterizing numerical description is closer to the second numerical description than the first numerical description; and wherein each principal component may be a linear combination of process parameters.
8. The method of claim 2, wherein the first successful trajectory is associated with a standard deviation, wherein the confidence criterion is a function of the standard deviation.
9. The method of claim 1, wherein the first set of process parameters includes a further process parameter and a corresponding value, wherein the further process parameter is included in the characterizing process parameters, wherein the corresponding value is not included with the characterizing process parameters, wherein controlling and monitoring the process further comprises performing the process using the further process parameter and the corresponding value; and wherein the further process parameter may be a control set point.
10. The method of claim 1, wherein the characterizing process parameters and/or the stored process parameters include at least one of the following: a description of equipment for performing the process; a scale of the process; a type of the process; a name of the product; a biological system of the process; quality attributes to measure the quality of the product; and a configuration of the equipment.
11. The method of claim 10, wherein the configuration of the equipment includes one or more of the following: a target duration of the process; a target temperature; a stirrer/agitator speed; a target pH level; a feed rate; a target substrate level; and a target dissolved oxygen level.
12. The method of claim 1, wherein the recorded measurements comprise one or more of the following: a current duration of the process; a partial pressure of carbon dioxide; a cell density; a cell viability; a substrate concentration; a metabolite concentration; a time of infection; and a current temperature.
13. The method of claim 1, wherein the product is a biopharmaceutical product, wherein the product is one of the following: a recombinant protein, a non-recombinant protein, a vaccine, a gene vector, DNA, RNA, an antibiotic, a secondary metabolite, cells for cell therapy or regenerative medicine, an artificial organ.
14. A computer program product comprising computer-readable instructions, which, when loaded and executed on a computer system, cause the computer system to perform operations according to the method of claim 1.
15. A computer system operable to control and monitor a process to produce a chemical, pharmaceutical, or biotechnological product, the computer system comprising: a database, and a control system comprising at least one control device; the database being configured to: store sets of process parameters to control and monitor respective ones of a plurality of processes performed in order to produce products, wherein each of the stored sets of process parameters is associated with a successful trajectory of a respective one of the processes performed according to the respective set of process parameters, wherein each successful trajectory is a time based profile of measurements recorded during performance of the respective process; receive a set of characterizing process parameters that characterize the process, a processor associated with the database configured to: identify a first set of process parameters from the stored sets of process parameters, wherein the first set of process parameters has a specified degree of similarity to the set of characterizing process parameters; the control system being configured to: control and monitor the process using the first successful trajectory, and estimate a trajectory of the process based on the recorded measurements.
16. The computer system of claim 15, wherein when a difference between the estimated trajectory and the first successful trajectory fails a confidence criterion, the control system is further configured to cause the database to compare the recorded measurements and the set of characterizing process parameters to a plurality of the stored sets of process parameters; the database is further configured to determine, based on the comparison, whether a second set of process parameters from the plurality of stored sets of process parameters has a greater degree of similarity to the set of characterizing process parameters and the recorded measurements than the first set of process parameters; the control system is further configured to: when the second set of process parameters is determined, control and monitor the process using a second successful trajectory associated with the second set of process parameters; and determine that the process is complete when a finishing condition is met.
Description
BRIEF DESCRIPTION OF THE FIGURES
[0113]
[0114]
[0115]
[0116]
[0117]
[0118]
[0119]
[0120]
[0121]
[0122]
[0123]
[0124]
[0125]
[0126]
[0127]
[0128]
[0129]
[0130]
DETAILED DESCRIPTION
[0131] In the following text, a detailed description of examples will be given with reference to the drawings. It should be understood that various modifications to the examples may be made. In particular, one or more elements of one example may be combined and used in other examples to form new examples.
[0132]
[0133]
[0134] In addition, techniques described in the present application may be particularly useful for processes in which multivariate data analysis (e.g. of recorded measurements) is carried out. Further, measurement of the quality of the product produced by the process may be laborious and time consuming (e.g. requiring extensive analysis and testing). Accordingly, it may be desirable to estimate the quality of the product produced by the process while the process is still being carried out. Based on the estimation, the process can be controlled and monitored so that the resulting product is more likely to meet specified quality attributes.
[0135] The method begins at step S101.
[0136] At step S103, a plurality of processes may be planned and documented. The processes may be documented as recipes for use in process control. For example, recipes conforming to the ISA-88 standard (discussed above) may be used.
[0137] At step S105, a description and process parameters may be established for each process. Process parameters may also be referred to as variables or process variables. Process parameters may describe the process and specify how the process is executed. Examples of process parameters are as follows: [0138] a. Name of the process [0139] b. Description of the process [0140] c. Platform technology or biological system (e.g. Cellca cell line, Cellca medium, supplements for the medium) [0141] d. Description of process equipment (e.g. reactor, sensors or other measuring devices, software) [0142] e. Scale of the process (e.g. capacity of the reactor in liters) [0143] f. Batch number [0144] i. The process [0145] ii. Components used for the batch [0146] g. Date [0147] h. Type of process [0148] i. Type of product (e.g. monoclonal antibody, vaccine, microbe) [0149] ii. Process operation (e.g. batch, fed batch, continuous) [0150] i. Cell line or stem [0151] j. Medium (e.g. powdered, liquid concentrate) [0152] k. Critical Quality Attributes (CQA) and quality target product profile (QTTP), e.g. product concentration, glycosylation, aggregation [0153] l. critical process parameters (CPP) [0154] i. process duration [0155] ii. temperature [0156] iii. stirring speed [0157] iv. pH [0158] v. dissolved oxygen [0159] vi. partial pressure of carbon dioxide [0160] vii. cell density and cell viability [0161] viii. substrate concentration (e.g. Glucose, Glutamine) [0162] ix. Metabolite concentration (e.g. ethanol, glycerol) [0163] x. Feeding strategy (when, how) [0164] xi. infection time (vaccine)
[0165] The parameters specified above are merely examples. Various combinations of process parameters may be used.
[0166] At step S107, process parameter visibility may be determined. In particular, it may be determined that some of the parameters established in step S105 are public process parameters and others of the parameters established in step S105 are private process parameters. Once the process parameters are stored, public process parameters may be accessible by all users of a database (as shown in
[0167] The critical process parameters described in point l. of step S105 may each be associated with measurements recorded during the course of the process. A numerical description of the process may be derived from the recorded measurements.
[0168] Critical process parameters may be a subset of process parameters. Critical process parameters may have or be associated with numerical values. Critical process parameters may be monitored process parameters that has an effect on one or more quality attributes.
[0169] For example, the critical process parameters may be described using a plurality of principal components. The numerical description of the process may be calculated as a function of the plurality of principal components. The numerical description of the process may also be referred to as a score of the process. Use of principal component analysis to determine the score of a process is described in Statistical Process Control of Multivariate Processes, J. F. MacGregor and T. Kurti, 1995. Other multivariate techniques (e.g. partial least squares analysis) may also be used to determine the numerical description of the process.
[0170] Accordingly, in some cases only the numerical description of the process may be made public. All other process parameters may be kept private.
[0171] In other cases, the following process parameters may be public process parameters: c. platform technology or biological system, d. description of the process equipment, e. scale of the process, h. type of the process, and i. cell line or stem. In addition, the following critical process parameters (under point L) may be made public: i. process duration, ii. temperature, iii. Stirring/agitating speed, iv. pH, v. dissolved oxygen.
[0172] Other process parameters may be kept private, e.g. to improve data security and to prevent other users of the database from determining an owner of the process from the process parameters. In some cases, further process parameters may be made public at the option of the user determining visibility of the process parameters.
[0173] At step S109, the process parameters may be stored in the database according to the parameter visibility determined in step S107. The database may be configured such that the private process parameters are not accessible by users of the database other than the owner of the respective process parameters. The process parameters may be stored in the database in sets. Each set of process parameters stored in the database may include public process parameters and private process parameters. The terms public and private describe the visibility or accessibility of the process parameters to users of the database other than the owner of the process parameters. By storing the process parameters in the database according to visibility, the process parameters may be anonymized. Accordingly, if a set of process parameters is retrieved from the database by a user other than the owner of the set of process parameters, then it may be difficult or impossible for the user to determine the owner of the process parameters or to derive private process parameters from the public process parameters. However, the public process parameters may be accessible to all users of the database and may be useful for controlling and monitoring a process to produce a chemical, pharmaceutical, or biotechnological product. In addition, a successful trajectory may be associated with each set of process parameters. The successful trajectory associated with a set of process parameters may be a time base profile of measurements recorded during performance of a process performed according to the set of process parameters.
[0174] At step S111, a user of the database may decide to perform a process. The process (or a statistically similar process) may have already been performed by at least one of the other users of the database.
[0175] For example, the user may decide to optimize a biopharmaceutical process with the goal of producing a monoclonal antibody in a Chinese Hamster Ovary (CHO) cell line in a bioreactor (e.g., as shown in
[0176] A goal of carrying out the process may be to produce a biopharmaceutical product in the optimal product concentration and quality using the CHO cell line, a particular medium and other specified characterizing process parameters. Accordingly, the user has already determined to produce a particular product conforming to specified quality attributes and has access to process equipment (e.g. the bioreactor), the cell line, and the medium in order to reach this goal. However, the user may not know the optimal way to produce the product (i.e. the monoclonal antibody) meeting the quality attributes, or what kind of measurements to expect during a successful run of the process. For example, the user may be uncertain regarding the duration of the process, temperature settings at different stages of the process, or the substrate concentration at different stages of the process. Accordingly, the user may connect to the database and may send a set of characterizing process parameters that characterize the process to the database. These characterizing process parameters may be received at the database at step S111.
[0177] A multivariate comparison may be carried out in order to identify a first set of process parameters from the set of stored process parameters. The comparison may involve comparing the characterizing process parameters with sets of process parameters stored in the database at step S109.
[0178] As an example, the characterizing process parameters may be compared with all the sets of process parameters stored in the database. Alternatively, the set of characterizing process parameters may be compared with sets of stored process parameters until a set of process parameters is identified from the stored sets of process parameters. In both cases, the identified set of process parameters may have a specified degree of similarity to the set of characterizing process parameters.
[0179] The characterizing process parameters received at step S111 may correspond to one of the sets of stored process parameters stored in the database at step S109. However, it may be that values of the critical process parameters that are included in the sets of stored process parameters are not included in the set of characterizing process parameters. In other words, the user may be unaware of appropriate values for at least one of the critical process parameters. In particular, the user may be unaware of values for the process duration, the process temperature, the stirring/agitating speed, the pH, the dissolved oxygen level, the cell density, the cell viability, and other critical process parameters for the process. The process may have more or different critical process parameters than those listed.
[0180] At step S113, the first set of process parameters may be identified from the stored sets of process parameters based on the comparison carried out at step S111. The first set of process parameters has the specified degree of similarity to the set of characterizing process parameters. The first set of process parameters is associated with a first successful trajectory.
[0181] At step S115, the set of stored process parameters identified in step S113 along with the first successful trajectory may be downloaded from the database. In particular, the public parameters of the stored process parameters may be downloaded from the database. The private parameters of the stored process parameters might not be accessible. In some cases, the downloaded parameters may be limited to text based values (e.g. description of the process and type of product) and a numerical description of the process derived from recorded measurements associated with the critical process parameters. In other cases, further process parameters may have been made public and these parameters along with their associated values may be downloaded at step S115. Various combinations of process parameters may be made public, for example, as discussed in conjunction with step S107. As another example, set points for a plurality of the critical process parameters (e.g. the critical process parameters identified in step S105) may be downloaded from the database.
[0182] In addition to the downloaded parameters, the first successful trajectory, a recipe conforming to the ISA-88 standard, and values for the critical quality attributes discussed in step S111 may also be downloaded.
[0183] Each set point (also referred to as a control set point) may be understood as a target value for control of the process. For example a temperature set point may be target temperature for the process. A control device may be used to regulate the process according to the set points.
[0184] At step S117, performance of the process may begin. Performance of the process may include controlling and monitoring the process using the first successful trajectory. During performance of the process, process data may be continually generated. The process data may be associated with process parameters.
[0185] At step S119, measurements of the process may be recorded. In particular, the process data generated by the process may be measured using measurement devices (e.g. sensors) and recorded or stored.
[0186] A trajectory of the process may be estimated based on the recorded measurements. Estimating the trajectory of the process may include determining numerical descriptions of process measurements at predetermined intervals and plotting a curve connecting the numerical descriptions of the measurements. Estimating the trajectory of the process may also include estimating how the process will behave in the future. For example, if the process has been performed for 15 seconds estimating the trajectory of the process may comprise approximating behavior or output of the process for a specified duration or until the process is complete. Estimating future behavior of the process (i.e. forecasting) may be carried out using principal component analysis, as described above. Other multivariate statistical techniques for estimating the trajectory of the process or forecasting the trajectory of the process may also be used.
[0187] At step S121, the estimated trajectory may be compared with the first successful trajectory. Comparing the estimated trajectory with the first successful trajectory may involve determining whether a difference between the estimated trajectory and the first successful trajectory passes or fails a confidence criterion.
[0188] Whether the difference between the estimated trajectory and the first successful trajectory fails the confidence criterion may be determined based on a standard deviation associated with the first successful trajectory. For example, the first successful trajectory may be the mean of multiple trajectories. Each of the multiple trajectories may be associated with a batch of the process corresponding to the first successful trajectory. Accordingly, the confidence criterion may be understood as an upper and lower interval around the first successful trajectory. The intervals may be a function of the standard deviation of the first successful trajectory. For example, the higher interval may be +3 standard deviations from the first successful trajectory and the lower interval may be 3 standard deviations from the first successful trajectory. Other functions of the standard deviation of the first successful trajectory are also possible. If a point on the estimated trajectory falls outside the higher or lower interval, the difference between the estimated trajectory and the first successful trajectory may fail the confidence criterion.
[0189] Alternatively, the first successful trajectory may be implemented as the trajectory for a single successful batch or a single successful process. Accordingly, the difference between the estimated trajectory and the first successful trajectory may fail the confidence criterion when a point on the estimated trajectory is a specified distance from the first successful trajectory.
[0190] It may be that the difference between the estimated trajectory and the first successful trajectory only fails the confidence criterion when multiple points on the estimated trajectory are the specified distance from the first successful trajectory. For example, each trajectory may be a plot of numerical descriptions of measurements recorded for the respective process over time. If the numerical description of measurements on the estimated trajectory at a particular time differs from the numerical description of measurements on the first successful trajectory at the same time (i.e. measured in seconds from the start of the respective process) by a certain percentage, (e.g. 5% or 10%) the difference between the estimated trajectory and the first successful trajectory may fail the confidence criterion.
[0191] Step S123 may be carried out if the confidence criterion fails. At step S123, a comparison of the recorded measurements and the set of characterizing process parameters to a plurality of the stored sets of process parameters may be carried out. The comparison may be a multivariate comparison (i.e. a comparison of parameters, parameter values and measurements using multivariate statistical techniques) to determine statistical similarity between the process being performed and the processes corresponding to the stored sets of process parameters in the database.
[0192] Accordingly, based on the comparison, it may be determined whether a second set of process parameters from the plurality of stored sets of process parameters has a greater degree of similarity to the set of characterizing process parameters and the recorded measurements than the first set of process parameters. In some cases, the plurality of stored sets of process parameters may include every stored set of process parameters other than the first set of process parameters. In other cases, the plurality of stored sets of process parameters may be limited to sets of candidate process parameters identified from the stored sets of process parameters via text analysis of text values of the characterizing process parameters or based on user input. The sets of candidate process parameters may be a proper subset of the sets of stored process parameters. If a second set of process parameters having a greater degree of similarity to the characterizing process parameters and the recorded measurements than the first set of process parameters cannot be determined, an alarm may be sounded so that the user can decide how to proceed further. Alternatively, a different database storing sets of process parameters may be automatically selected and queried.
[0193] The comparing of the recorded measurements and the set of characterizing process parameters to the plurality of the stored sets of process parameters may be carried out by determining a plurality of principal components from the characterizing process parameters and the recorded measurements. The determination of principal components may be carried out according to principal component analysis, as discussed above.
[0194] A characterizing numerical description of the process may be calculated as a function of the plurality of principal components. The characterizing numerical description may be referred to as a score. The characterizing numerical description may also be calculated according to principal component analysis. In addition, instead of principal component analysis, other multivariate statistical techniques may be used, e.g. partial leased squares with discriminant analysis. Further, the SIMCA software from Umetrics may also be used.
[0195] The process may be continually performed (as well as controlled and monitored) in the steps subsequent to step S117 until it is determined that a finishing condition is met. A determination as to whether the finishing condition is met is carried out at step S125. When the finishing condition is met, a determination is made that the process is complete and the process ends at step S127. If the finishing condition is not met, the method returns to step S117 and the process continues to be performed. During performance of the process, recorded measurements of the process may be continuously uploaded to the database. Visibility of the recorded measurements may be determined according to the procedure discussed in step S107. For example, each recorded measurement may be associated with a process parameter. Accordingly, visibility of the recorded measurement corresponding to a process parameter may be determined according to the visibility of the process parameter. In some cases, only a numerical description of the process or numerical descriptions of the measurements at particular points in time during the process may be visible.
[0196] When the process ends at step S127, the set of characterizing parameters and recorded measurements corresponding to the process may be marked as complete in the database. Once the process is marked as complete, a successful trajectory may be derived from the recorded measurements and the process parameters may be made available to other users of the database as one of the sets of stored process parameters, along with the associated successful trajectory. The availability of the process parameters may be determined according to visibility settings, as discussed above. In some cases, only parameters and trajectories associated with processes marked as complete may be available or visible to other users of the database. This may have the advantage of making it more likely that controlling and monitoring of new processes is only carried out using successful trajectories associated with completed processes. Alternatively, it may be possible for users to access data associated with incomplete processes. This may have the advantage of making more useful data available to users of the database faster.
[0197] The method of controlling and monitoring a process to produce a chemical, pharmaceutical, or biotechnological product as described in the context of
[0198]
[0199] Process parameters 201 specify initial conditions of the process 203. It should be noted that although the description of
[0200] The process parameters 201 may include process metadata and further control set points. In particular, the process metadata may be text values describing the process 203, e.g. cell strain, product name, batch type. The control set points may be used to control or regulate the bioreactor in which the process 203 is performed. The characterizing process parameters and the stored sets of process parameters may also include control set points and process metadata corresponding to the control set points and the process metadata included in the process parameters 201.
[0201] In addition, measurements recorded during performance of the process 203 may also correspond to the process parameters 201. Further, the process parameters 201 may also include quality attributes 205, also referred to as offline process parameters. The quality attributes 205 may include critical quality attributes or all the quality attributes 205 may be critical quality attributes. The quality attributes 205 may be used to determine whether the product produced via the process can be used or should be rejected. The quality attributes may include a batch product titer, a viability (i.e. harvest), and a duration. Other quality attributes (as discussed above) may also be included in the quality attributes 205. The product titer may be measured in grams per liter, e.g. 8 grams per liter. The viability may be provided as a percentage, e.g. 90%. The duration may be measured in seconds, minutes, hours, or days, e.g. 17 days.
[0202]
[0203]
[0204]
[0205] In particular, a database 501 may be provided. The database 501 stores sets of process parameters to control and monitor respective ones of a plurality of processes performed in order to produce products, as described above. The database 501 may be hosted by a service provider, possibly on a virtual machine, and may be accessible by various users from multiple organizations, possibly located in a variety of different geographic locations around the world.
[0206] A process control system may also be provided. The process control system may include a process control device 503, a local control device 505 and possibly further control devices. The process control device 503 may be operable, possibly in conjunction with other control devices in the process control system, to control and monitor the process 203 using the first successful trajectory.
[0207] In particular, the process control device 503 may be operable to receive recorded measurements of the process 203. The process control device 503 may also be operable to estimate a trajectory of the process 203 based on the recorded measurements and compare the estimated trajectory with the first successful trajectory. The process control device 503 may be further operable to perform various further comparisons (e.g. similarity comparisons) and determinations. More specifically, when the difference between the estimated trajectory and the first successful trajectory fails the confidence criterion, the process control device 503 may be operable to compare the recorded measurements and the set of characterizing process parameters to a plurality of stored sets of process parameters. The process control device 503 may be further operable to determine, based on the comparison, whether a second set of process parameters from the plurality of stored sets of process parameters has a greater degree of similarity to the set of characterizing process parameters and the recorded measurements than the first set of process parameters. The process control device 503 may also be operable to control and monitor the process 203 using a second successful trajectory associated with the second set of process parameters when the second set of process parameters is determined. The process control device 503 may be also be operable to determine when a finishing condition is met and to determine that the process 203 is complete when the finishing condition is met.
[0208] The process control device 503 may be located in a control room. The process control device 503 may be connected to the database 501, possibly via a secure connection. Data passed from the process control device 503 to the database 501 may be cryptographically protected, e.g. encrypted. The database 501 may be located in a location that is geographically distant (e.g. on another continent) from the process control device 503.
[0209] The process control device 503 may be connected to the local control device 505. The local control device 505 and the additional components depicted in
[0210] The bioreactor 507 may be implemented as a 10 liter Biostat C bioreactor, manufactured by Sartorius AG, including control and inline measurement capability. In particular, the bioreactor 507 may be capable of controlling and measuring the following: temperature, pH, dissolved oxygen concentration, cell density of the cells 511, near infrared spectroscopy. The bioreactor 507 may be capable of recording a variety of measurements using the measurement devices mentioned above and further measurement devices. The bioreactor 507 may be capable not only of fermentation but also of developing mammalian cell cultures. In conjunction with the local control device 505, the bioreactor 507 may be capable of performing various forms of inline measurement and analysis, in which the medium 513 is measured while remaining in the bioreactor.
[0211] In addition, the analyzer 509 may be used to perform atline or online analysis. In particular, a probe may remove a sample of the medium 513 from the bioreactor in order to perform and record measurements. The results of the measurements may be sent from the analyzer 509 to the local control device 505 or the process control device 503. For example, the analyzer 509 may measure the concentration of the substrate glucose or the metabolite lactate. The measurement of the concentration of the substrate glucose and/or the metabolite lactate may be carried out every 12 hours. The analyzer 509 may be capable of performing and recording various other measurements of the process 203.
[0212] The local control device 505 may connect to the bioreactor 507 and the analyzer 509 in order to control and monitor the process 203 using the first successful trajectory. Via the control loop feedback mechanism, the local control device 505 may use the recorded measurements of the process 203 and the control set points (possibly provided in the characterizing process parameters, in the first set of process parameters or in the second set of process parameters) in order to control the process 203. In particular, the measurements recorded from the bioreactor 507 may be used to maintain the medium 513 according to the control set points. The process control device 503 may be understood as a supervisory system that receives the recorded measurements of the process 203 and performs multivariate data analysis in order to estimate whether the process 203 will result in the production of a product meeting predetermined quality attributes, e.g. the quality attributes 205.
[0213] Further devices may also be used to analyze the medium 513 and determine whether the quality attributes 205 have been met. The quality attributes 205 may include a glycosylation profile and a potency (i.e. biological activity of antibodies). It may be that critical quality parameters can only be measured offline. In other words, measurements of critical quality parameters may not be available until the process 203 is complete.
[0214] The database 501 may be a relational database, and object relational database, or an object oriented database. Various database management systems may be used such as Oracle, MySQL, and Microsoft SQL.
[0215] Transparent data encryption (TDE) may be used to encrypt database files. Other cryptographic database protection mechanisms may also be used. In order to determine process parameter visibility as discussed in step S107, a discretionary access control policy may be used. Other types of access control policies may also be used in order to ensure that users of the database only have access to public process parameters and that private process parameters are only accessible by a limited number of users or the owner of the corresponding set of stored process parameters.
[0216] The database 501 may be accessible from the process control device 503 via the Internet. Communications between the database 501 and the process control device 503 may be secured, e.g. via Internet protocol security (IPSEC) or other security protocols. A virtual private network (VPN) may also be used.
[0217] The database 501 may be implemented using clustering and/or load balancing in order to ensure a high level of availability and fault tolerance. A registration process may be provided for users to register to use the database 501. The registration process may allow individual persons, work groups, companies, or various types of institutions (e.g. universities) to register to use the database 501. An authentication process for accessing the database 501 may also be established. In particular, two factor authentication may be required to access the database, such that a password plus a code provided to a mobile phone via SMS or a password plus a token are required to access the database. The database 501 may be used to control and monitor the process 203. In particular, the database 501 may be used to optimize pharmaceutical production processes.
[0218] According to an example, a user may desire to control and monitor the process 203 using the sufficiently characterized Cellca cell line as a biological system. The Cellca cell line carries the gene for the expression of a monoclonal antibody. A medium, nutrient solution, and supplements corresponding to the Cellca cell line may be used as part of the biological system. Continuing the example, the user may enter the control room in order to use the process control device 503. Accordingly, the user may log in to the process control device 503 in order to prepare a production run and a first batch. The user may enter the following process parameters in the process control device 503: [0219] a. Name of the process [0220] b. Description of process equipment (e.g. a description of the bioreactor 507) [0221] c. Scale of the process: 10 liter reactor [0222] d. Batch number [0223] i. the process [0224] ii. the components used [0225] e. Date [0226] f. Type of process [0227] i. type of product: monoclonal antibody [0228] ii. process strategy: fed batch [0229] g. Biological system (as described above) [0230] i. Cellca cell line [0231] ii. Cellca medium and supplements [0232] h. Critical quality attributes [0233] i. product concentration [0234] ii. glycosylation profile [0235] i. Critical process parameters [0236] i. process duration [0237] ii. temperature [0238] iii. stirring speed [0239] iv. pH [0240] v. dissolved oxygen [0241] vi. cell density [0242] vii. substrate concentration (glucose) [0243] viii. metabolite concentration (lactate)
[0244] According to the example, the critical process parameters may be determined but their values may not be known. The values of the critical cross process parameters may be retrieved from the database. In particular, the values of the critical process parameters may be included in one of the stored sets of process parameters stored in the database 501. After entry of the process parameters specified above, the process control device 503 may connect to the database 501. The connection between the process control device 503 and the database 501 may be a secure connection, e.g. via a VPN or IPSEC. A subset of the parameters entered into the process control device 503 maybe transferred to the database 501. The subset of parameters transferred to the database 501 may be referred to as the set of characterizing process parameters that characterize the process. In the context of the present example, the characterizing process parameters may be as follows: [0245] a. Scale of the process: 10 liter reactor [0246] b. Batch number [0247] i. components used [0248] c. Type of process [0249] i. type of product: monoclonal antibody [0250] ii. process strategy: fed batch [0251] d. Biological system [0252] i. Cellca cell line [0253] ii. Cellca medium and supplements [0254] e. Critical quality attributes [0255] i. product concentration [0256] ii. glycosylation profile [0257] f. Critical process parameters [0258] i. process duration [0259] ii. temperature [0260] iii. stir speed [0261] iv. pH [0262] v. dissolved oxygen [0263] vi. cell density [0264] vii. substrate concentration (e.g. glucose) [0265] viii. metabolite concentration (e.g. lactate)
[0266] The characterizing process parameters are compared with the stored sets of process parameters in order to identify the first set of process parameters, as described above. The first set of process parameters may be identified using multivariate statistical techniques, as discussed above. In addition to the literature discussed above, multivariate statistical techniques relevant to the present application are further described in A User Friendly Guide to Multivariate Calibration and Classification T. Nehers, et al., 2002. Also relevant is Chemometrics, Matthias Otto, 2007.
[0267] A goal of identifying the first set of process parameters having the specified degree of similarity to the set of characterizing process parameters is to provide the user with optimal process parameters that are not yet known to the user. Advantageously, the user receives guidance for determining whether recorded measurements of the process are consistent with what they should be and for determining how to control the process so that it results in the production of a viable product, i.e. a product meeting predetermined quality attributes. The guidance is provided in the form of the first successful trajectory associated with the first set of process parameters. In some cases, the first set of process parameters also includes values for control set points for use in controlling the process. Further, the first successful trajectory may be used to ensure that recorded measurements of the process meet a confidence criterion (e.g. they are within a desired interval) such that the process will result in the production of a viable product meeting quality attributes.
[0268] Continuing the example, once the first set of process parameters has been identified, the first successful trajectory associated with the first set of process parameters may be transferred from the database 501 to the process control device 503. The first set of process parameters may include public process parameters. In particular, the first set of process parameters may include values for control set points specified in the characterizing process parameters. For example, the first set of process parameters may include values for the following control set points: [0269] i. process duration [0270] ii. temperature [0271] iii. stirring speed [0272] iv. pH [0273] v. dissolved oxygen [0274] vi. cell density [0275] vii. substrate (e.g. glucose) concentration [0276] viii. metabolite (e.g. lactate) concentration
[0277] The database 501 may also transmit an ISA-88 recipe (as discussed above) to the process control device 503. The ISA-88 recipe may be used to control the process. In particular, the ISA-88 recipe may define a process model consisting of an ordered set of stages, where each stage comprises operations and each operation comprises actions. In addition, the database 501 may transmit values for quality attributes to the process control device 503. The quality attributes themselves may be specified without values as part of the set of characterizing process parameters that characterize the process.
[0278] In addition, user comments may be associated with the first set of process parameters stored in the database. The user comments may also be transferred from the database 501 to the process control device 503. The user comments may include suggestions as to how to better measure and control the process. In particular, the user comments may specify further process parameters for use in measuring and controlling the process. For example, the user comments may suggest measuring dissolved carbon dioxide concentration in the bioreactor 507 for the process because control of this particular process parameter has been helpful when performing similar processes. Once this information has been transferred to the process control device 503, performance of the process in the bioreactor 507 may begin. Performance of the process may include controlling and monitoring the process using the first successful trajectory. Before performing the process, the user may prepare the bioreactor 507 by sterilizing the bioreactor 507 and filling the bioreactor 507 with the medium 513. In addition, the user may make necessary connections to the bioreactor 507 to prepare the bioreactor 507 for operation. This may include connecting the analyzer 509 and the local control device 505 to the bioreactor 507.
[0279] As the process is being performed, measurements of the process are recorded. In particular, measurements corresponding to process parameters may be recorded inline via the bioreactor and online or atline via the analyzer 509. Based on the recorded measurements, the process control device 503 may estimate a trajectory of the process. The estimated trajectory of the process may be compared with the first successful trajectory. Comparisons of the estimated trajectory with the first successful trajectory may be made throughout performance of the process, as described in the context of
[0280] When the difference between the estimated trajectory and the first successful trajectory fails the confidence criterion, the database 501 may be queried by the process control device 503 in order to determine whether one of the sets of process parameters stored in the database has a greater degree of similarity to the process being performed in comparison to the first set of process parameters. The greater degree of similarity may be determined based on all information regarding the process that is available. In particular, the greater degree of similarity may be determined via a similarity comparison, as discussed above, by comparing the recorded measurements and the set of characterizing process parameters to a plurality of the stored sets of process parameters. The comparison may be a multivariate comparison as discussed in connection with step S123 above. Such multivariate comparisons may generally be carried out with all recorded measurements for the process along with the characterizing process parameters. Accordingly, as more information about the process becomes available during further execution of the process, different sets of stored process parameters may be determined to be the most similar set of stored process parameters to the characterizing process parameters of the process being performed.
[0281] When a second set of process parameters from the plurality of stored sets of process parameters is determined to have a greater degree of similarity to the set of characterizing process parameters and the recorded measurements then the first set of process parameters, then a second successful trajectory associated with the second set of process parameters is used to control and monitor the process.
[0282] During performance of the process, the process may continually be monitored to determine whether the difference between the estimated trajectory of the process and a current (e.g. first or second) successful trajectory corresponding to a set of stored process parameters passes or fails the confidence criterion. If the confidence criterion fails, an attempt will be made to determine a more suitable set of stored process parameters from the process parameters stored in the database 501. In this context, more suitable means a set of stored process parameters having a greater degree of similarity to the parameters and measurements currently available for the process then the set of stored process parameters and corresponding trajectory currently being used to control and monitor the process.
[0283] The multiple database queries carried out during performance of the process may have the advantage of making it possible to take advantage of the latest information in the database 501 and to gradually tailor performance of the process. For example, the first set of process parameters may be selected from among many equally similar sets of stored process parameters. The first set of process parameters may be suitable for initial execution of the process, but after some time, more similar process parameters may be needed in order to ensure that the quality attributes are met. Accordingly, after the process has been performed for some time, there may be more data available for comparison and the second (or third) set of process parameters (each determined according to the similarity comparison) may be much more similar to the process being performed in comparison to the first set of process parameters.
[0284] The finishing condition may be determined to be met according to the ISA-88 recipe. In particular, the ISA-88 recipe may specify when the process is complete. Once the process is complete or a batch of the process is complete, the user may harvest the contents of the bioreactor 507 and the monoclonal antibody that is the product being produced. The user may then analyze the product, i.e. the monoclonal antibody in his example, in order to determine whether the specified quality attributes have been met. In particular, the user may analyze the monoclonal antibody in order to determine whether the glycosylation profile and potency of the monoclonal antibody meet criteria in the quality parameters. If the criteria are met, the batch may be marked as successful in the process control device 503. The process control device 503 may then forward the recorded measurements for the process to the database 501. Recorded measurements and parameters of the process may be stored in the database 501 according to an access policy specified by the user, as discussed above in conjunction with step S107.
[0285] According to the access policy, some of the process parameters and recorded measurements may be public and accessible by other users, whereas some of the process parameters and recorded measurements may be private and inaccessible. Accordingly, resulting data for the finished process including recorded measurements and process parameters may become a further stored set of process parameters available to other users for controlling and monitoring further processes to produce chemical, pharmaceutical, or biotechnological products.
[0286]
[0287]
[0288]
[0289]
[0290]
[0291]
[0292]
[0293]
[0294]
[0295]
[0296]
[0297]
[0298]
[0299] Accordingly, as the process is performed its numerical description may evolve such that it is more similar to the numerical description of a different process stored in the database. Then, a second set of process parameters may be determined from the plurality of stored sets of process parameters that has a greater degree of similarity to the set of characterizing process parameters and the recorded measurements than the first set of process parameters.
[0300]