Rational drug design with computational free energy difference calculation using a modified bond stretch potential
11562808 · 2023-01-24
Assignee
Inventors
Cpc classification
G16B15/00
PHYSICS
G16C20/30
PHYSICS
G16C20/10
PHYSICS
International classification
G01N33/50
PHYSICS
G16C20/30
PHYSICS
G16B15/00
PHYSICS
Abstract
A method and system for calculating the free energy difference between a target state and a reference state. The method includes determining one or more intermediate states using a coupling parameter, performing molecular simulations to obtain ensembles of micro-states for each of the system states, and calculating the free energy difference by an analysis of the ensembles of micro-states of the system states. The method can be particularly suited for calculating physical or non-physical transformation of molecular systems such as ring-opening, ring-closing, and other transformations involving bond breaking and/or formation. A soft bond potential dependent on a bond stretching component of the coupling parameter and different from the conventional harmonic potential is used in the molecular simulations of the system states for the bond being broken or formed during the transformation.
Claims
1. A method of rational drug design comprising: identifying a biomolecular target molecule associated with a pathology; identifying a small molecule candidate for binding to the biomolecular target; calculating, using a computer, a relative binding affinity between the small molecule candidate and the biomolecular target molecule based on a free energy difference between a reference state and a target state, wherein the reference state and target state each correspond to a respective arrangement of the biomolecular target molecule and small molecule candidate that include a common set of atoms P.sub.AB, and wherein the reference state further includes a set of atoms P.sub.A, the target state further includes a set of atoms P.sub.B, the set P.sub.A being present only in the reference state and not in the target state, and the set P.sub.B being present only in the target state and not the reference state, where there exist at least two atoms A.sub.a and A.sub.b, A.sub.a and A.sub.b being either: (1) not valence-bonded to each other in the reference state and valence-bonded in the target state, or (2) valence-bonded to each other in reference state and not valence-bonded to each other in the target state, the method comprising: (a) providing a topology, including the bonded connections between the atoms and the relative spatial arrangements of the atoms, for all the atoms in P.sub.A, P.sub.B, and P.sub.AB; (b) determining one or more intermediate system states along a transformation path between the reference state and the target state, the transformation path defined by a coupling parameter λ that modulates the energies arising from inter-atom interactions for each system state, the coupling parameter λ including a plurality of components each having a value belonging to [0,1] and modulating a different type of interaction energy; (c) performing, using at least one computer processor, molecular simulations to obtain ensembles of micro-states for the reference state, the target state, and the intermediate states, wherein performing molecular simulations for each of the system states includes calculating a bonded stretch interaction energy between the atoms A.sub.a and A.sub.b, the bonded stretch interaction energy being defined by a soft bond potential, wherein the soft bond potential is a function of a bonded stretch component, λ.sub.sbs, of the coupling parameter λ, and does not include any singular regions for all values of λ.sub.sbs within [0,1] and for all values of the distance r between A.sub.a and A.sub.b, the soft bond potential further satisfies the following conditions: when λ.sub.sbs is within (0,1), the soft bond potential is flat when the distance between A.sub.a and A.sub.b approaches infinity; when A.sub.a and A.sub.b are not valence bonded in either the reference state or the target state, the soft bond potential is flat and zero for all distances between A.sub.a and A.sub.b; and when A.sub.a and A.sub.b are valence bonded in either the target state or the reference state, the soft bond potential reverts to a harmonic potential; and wherein the soft bond potential is a function of (r—r.sub.0).sup.2, where r.sub.0 is the equilibrium distance between A.sub.a and A.sub.b, and is expressed by:
ƒ(λ.sub.sbs=0)=0,
ƒ(λ.sub.sbs=1)=1,
g(λ.sub.sbs=0)=1,
g(λ.sub.sbs=1)=0,
α(k,λ.sub.sbs<1)>0; (d) calculating, using at least one computer processor, the free energy difference between the reference state and the target state, by way of an analysis of the ensembles of micro-states obtained at the target state, the reference state, and the intermediate states; and (e) determining a relative binding affinity for the biomolecular target molecule and small molecule candidate based on the calculated free energy difference; synthesizing the small molecule candidate based on the calculation; and further assessing the small molecule candidate for suitability for treating the pathology based on assays using the synthesized small molecule candidate.
2. The method of claim 1, wherein ƒ(λ.sub.sbs)=λ.sub.sbs, g(λ.sub.sbs)=1−λ.sub.sbs, and α(k, λ.sub.sbs)=const.
3. The method of claim 1, wherein performing molecular simulations for each of the system states comprises: if A.sub.a and A.sub.b are valence-bonded to each other in the reference state and not valence-bonded in the target state, using a schedule of λ.sub.sbsA and a corresponding soft bond potential for calculating the bonded stretch interaction energy between A.sub.a and A.sub.b for each of the intermediate states, wherein λ.sub.sbsA is 1 at the reference state, 0 at the target state, and varied from 1 to 0 at each intermediate state along the transformation from the reference state to the target state; if A.sub.a and A.sub.b are not valence-bonded to each other in the reference state and valence-bonded in the target state, using a schedule of λ.sub.sbsB and a corresponding soft bond potential for each of the intermediate states and a soft bond potential corresponding to the λ.sub.sbsB for calculating the bonded stretch interaction between A.sub.a and A.sub.b, wherein λ.sub.sbsB is 0 at the reference state, 1 at the target state, and varied from 0 to 1 at each intermediate state along the transformation from the reference state to the target state.
4. The method of claim 3, wherein performing molecular simulations for each of the system states further comprises: (a) computing a bonded angle interaction, using applicable parameters for bonded angle interactions of a force field, between (i) a bond formed by A.sub.a and another atom A.sub.c, and (ii) the bond between A.sub.a and A.sub.b that is being broken or formed by the transformation from the reference state to the target state; (b) if A.sub.a and A.sub.b are valence-bonded to each other in the reference state and not valence-bonded to each other in the target state, multiplying the computed bonded angle interaction obtained in (a) by a bonded angle coupling parameter λ.sub.baA, wherein λ.sub.baA is 1 at the reference state, 0 at the target state, and varied from 1 to 0 at each intermediate state along the transformation from the reference state to the target state; and if A.sub.a and A.sub.b are not valence-bonded to each other in the reference state and valence-bonded to each other in the target state, multiplying the computed bonded angle interaction obtained in (a) by a bonded angle coupling parameter λ.sub.baB, wherein λ.sub.baB is 0 at the reference state, 1 at the target state, and varied from 0 to 1 at each intermediate state along the transformation from the reference state to the target state; and (c) including the bonded angle interaction obtained in (b) into the total energy of a simulation step of the corresponding system state.
5. The method of claim 4, wherein performing molecular simulations for each of the system states further includes: (a) computing a dihedral angle interaction, using applicable parameters for dihedral interactions of a force field, of a group of four connected atoms {A.sub.i, A.sub.j, A.sub.k, A.sub.l}, the group including both A.sub.a and A.sub.b; (b) if A.sub.a and A.sub.b are valence-bonded to each other in the reference state and not valence-bonded to each other in the target state, multiplying the computed dihedral interaction obtained in (a) by a dihedral angle coupling parameter λ.sub.bdA, wherein λ.sub.bdA is 1 at the reference state, 0 at the target state, and varied from 1 to 0 at each intermediate state along the transformation from the reference state to the target state; and if A.sub.a and A.sub.b are not valence-bonded to each other in the reference state and valence-bonded to each other in the target state, multiplying the computed dihedral interaction obtained in (a) by a dihedral angle coupling parameter λ.sub.bdB, wherein λ.sub.bdB is 0 at the reference state, 1 at the target state, and varied from 0 to 1 at each intermediate state along the transformation from the reference state to the target state; and (c) including the dihedral interaction obtained in (b) into the total energy of the simulation step of the corresponding system state.
6. The method of claim 5, wherein: if A.sub.a and A.sub.b are valence-bonded to each other in the reference state and not valence-bonded to each other in the target state, the bonded angle interaction and the bonded dihedral interaction coupling parameters λ.sub.baA and λ.sub.bdA, are each selected to be 0 when λ.sub.sbsA is smaller than a predefined threshold, and if A.sub.a and A.sub.b are not valence-bonded to each other in the reference state and valence-bonded to each other in the target state, the bonded angle interaction and the bonded dihedral interaction coupling parameters λ.sub.baB and λ.sub.bdB are each selected to be 0 when λ.sub.sbsB is smaller than a predefined threshold.
7. The method of claim 5, wherein performing molecular simulations for all of the states further includes: (a) computing nonbonded electrostatic interactions and van der Waals interactions, using applicable parameters for electrostatic interactions and van der Waals of a force field, between two atoms A.sub.i and A.sub.j and the non-bonded exclusion status of the pair (A.sub.i, A.sub.j) is affected by the transformation from the reference state to the target state; (b) if A.sub.a and A.sub.b are valence-bonded to each other in the reference state and not valence-bonded in the target state, and the nonbonded interactions between A.sub.i and A.sub.j are excluded in the reference state but not excluded in the target state, multiplying the nonbonded electrostatic interactions and van der Waals interactions between A.sub.i and A.sub.j obtained in (a) by coupling parameters λ.sub.elecAex and λ.sub.vdwAex, respectively, wherein both of λ.sub.elecAex and λ.sub.vdwAex are 0 at the reference state, 1 at the target state, and varied from 0 to 1 at each intermediate state along the transformation from the reference state to the target state; if A.sub.a and A.sub.b are not valence-bonded to each other in the reference state and valence-bonded in the target state, and the nonbonded interactions between A.sub.i and A.sub.j are not excluded in the reference state but excluded in the target state, multiplying the nonbonded electrostatic interactions and van der Waals interactions between A.sub.i and A.sub.j obtained in (a) by coupling parameters λ.sub.elecBex and λ.sub.vdwBex, respectively, wherein both of λ.sub.elecBex and λ.sub.vdwBex are 1 at the reference state, 0 at the target state, and varied from 1 to 0 at each intermediate state along the transformation from the reference state to the target state; and (c) including the calculated nonbonded electrostatic interactions and van der Waals interactions obtained in (b) into the total energy of the simulation step of the corresponding system state.
8. The method of claim 7, wherein performing molecular simulations for all of the states further includes: if A.sub.a and A.sub.b are valence-bonded to each other in the reference state and not valence-bonded in the target state, and the nonbonded interactions between A.sub.i and A.sub.j are excluded in the reference state but not excluded in the target state, varying each of λ.sub.elecAex and λ.sub.vdwAex according to a schedule for each of the intermediate states along the transformation from the reference state to the target state such that when λ.sub.vdwAex is smaller than 1 for an intermediate state, λ.sub.elecAex is 0 for that intermediate state; and if A.sub.a and A.sub.b are not valence-bonded to each other in the reference state and valence-bonded in the target state, and the nonbonded interactions between A.sub.i and A.sub.j are not excluded in the reference state but excluded in the target state, varying each of λ.sub.elecBex and λ.sub.vdwBex according to a schedule for each of the intermediate states along the transformation from the reference state to the target state such that when λ.sub.vdwAex is smaller than 1 for an intermediate state, λ.sub.elecAex is 0 for that intermediate state.
9. The method of claim 7, wherein performing molecular simulations for all of the states further includes: (a) computing nonbonded electrostatic 1-4 pair interactions and van der Waals 1-4 pair interactions, using applicable parameters for electrostatic 1-4 pair interactions and van der Waals 1-4 pair interactions of a force field, between two atoms A.sub.i and A.sub.j which together with another two intervening atoms forms a bonded dihedral angle interaction in either the reference state or the target state; (b) if A.sub.a and A.sub.b are valence-bonded to each other in the reference state and not valence-bonded in the target state: if the nonbonded 1-4 pair interactions between A.sub.i and A.sub.j are included in the reference state but not included in the target state, multiplying the nonbonded electrostatic 1-4 pair interactions and van der Waals 1-4 pair interactions between A.sub.i and A.sub.j obtained in (a) by coupling parameters λ.sub.elecA14 and λ.sub.vdwA14, respectively, wherein both of λ.sub.elecA14 and λ.sub.vdwA14 are 1 at the reference state, 0 at the target state, and varied from 1 to 0 at each intermediate state along the transformation from the reference state to the target state, and if the nonbonded 1-4 pair interactions between A.sub.i and A.sub.j are not included in the reference state but included in the target state, multiplying the nonbonded electrostatic 1-4 pair interactions and van der Waals 1-4 pair interactions between A.sub.i and A.sub.j obtained in (a) by coupling parameters λ.sub.elecB14 and λ.sub.vdwB14, respectively, wherein both of λ.sub.elecB14 and λ.sub.vdwB14 are 0 at the reference state, 1 at the target state, and varied from 1 to 0 at each intermediate state along the transformation from the reference state to the target state, and if A.sub.a and A.sub.b are not valence-bonded to each other in the reference state and valence-bonded in the target state: if the nonbonded 1-4 pair interactions between A.sub.i and A.sub.j are not included in the reference state but included in the target state, multiplying the nonbonded electrostatic 1-4 pair interactions and van der Waals 1-4 pair interactions between A.sub.i and A.sub.j obtained in (a) by coupling parameters λ.sub.elecB14 and λ.sub.vdwB14, respectively, wherein both of λ.sub.elecB14 and λ.sub.vdwB14 are 0 at the reference state, 1 at the target state, and varied from 0 to 1 at each intermediate state along the transformation from the reference state to the target state, and if the nonbonded 1-4 pair interactions between A.sub.i and A.sub.j are included in the reference state but not included in the target state, multiplying the nonbonded electrostatic 1-4 pair interactions and van der Waals 1-4 pair interactions between A.sub.i and A.sub.j obtained in (a) by the coupling parameters λ.sub.elecA14 and λ.sub.vdwA14, respectively, wherein both of λ.sub.elecA14 and λ.sub.vdwA14 are 1 at the reference state, 0 at the target state, and varied from 1 to 0 at each intermediate state along the transformation from the reference state to the target state; and (c) including the calculated electrostatic 1-4 pair interactions and van der Waals 1-4 pair interactions obtained in (b) into the total energy of the simulation step of the corresponding system state.
10. The method of claim 9, wherein performing molecular simulations for all of the states further includes: if A.sub.a and A.sub.b are valence-bonded to each other in the reference state and not valence-bonded in the target state, if the nonbonded 1-4 pair interactions between A.sub.i and A.sub.j are included in the reference state but not included in the target state, varying each of λ.sub.elecA14 and λ.sub.vdwA14 according to a schedule for each of the intermediate states along the transformation from the reference state to the target state such that when λ.sub.vdwA14 is smaller than 1 for an intermediate state, λ.sub.elecA14 is 0 for that intermediate state; if the nonbonded 1-4 pair interactions between A.sub.i and A.sub.j are not included in the reference state but included in the target state, varying each of λ.sub.elecB14 and λ.sub.vdwB14 according to a schedule for each of the intermediate states along the transformation from the reference state to the target state such that when λ.sub.vdwB14 is smaller than 1 for an intermediate state, λ.sub.elecB14 is 0 for that intermediate state; and if A.sub.a and A.sub.b are not valence-bonded to each other in the reference state and valence-bonded in the target state, if the nonbonded 1-4 pair interactions between A.sub.i and A.sub.j are not included in the reference state but included in the target state, varying each of λ.sub.elecB14 and λ.sub.vdwB14 according to a schedule for each of the intermediate states along the transformation from the reference state to the target state such that when λ.sub.vdwB14 is smaller than 1 for an intermediate state, λ.sub.elecB14 is 0 for that intermediate state; if the nonbonded 1-4 pair interactions between A.sub.i and A.sub.j are included in the reference state but not included in the target state, varying each of λ.sub.elecA14 and λ.sub.vdwA14 according to a schedule for each of the intermediate states along the transformation from the reference state to the target state such that when λ.sub.vdwA14 is smaller than 1 for an intermediate state, λ.sub.elecA14 is 0 for that intermediate state.
11. The method of claim 7, wherein at least one of the computing of the van der Waals interactions includes using a soft-core LJ interaction potential.
12. The method of claim 1, wherein at least one of the reference state and the target state includes a molecule having a ring structure in which the atoms A.sub.a and A.sub.b are bonded to each other and form a part of the ring structure.
13. The method of claim 1, wherein the molecular simulations include at least one of molecular dynamic simulations and Monte Carlo simulations.
14. The method of claim 1, wherein calculating the free energy difference between the reference state and the target state comprises performing an analysis of the ensembles of micro-states obtained at the target state, the reference state, and the intermediate states by way of a determination and analysis of the work associated with the variation of coupling parameter λ.
15. The method of claim 1, wherein calculating the free energy difference between the reference state and the target state comprises performing an analysis of the ensembles of micro-states obtained at the target state, the reference state, and the intermediate states by way of an analysis of the differences in a thermodynamic property of an ensemble of the micro-states obtained at the target state, the reference state, and the intermediate states as coupling parameter λ is instantaneously varied for the selected ensemble of micro-states.
16. The method of claim 15, wherein the ensemble is selected from an NVT ensemble, a NPT ensemble, a NVE ensemble, and a μVT ensemble.
17. The method of claim 15, wherein performing the analysis of the differences in a thermodynamic property comprises applying an estimator selected from BAR, MBAR, WHAM, Zwanzig average estimators.
18. The method of claim 15, wherein performing the analysis of the differences in a thermodynamic property comprises applying one of an FEP-family estimators.
19. The method of claim 1, wherein calculating the free energy difference between the reference state and the target state comprises performing a thermodynamic integration analysis of the derivative of a thermodynamic property of an ensemble of micro-states obtained for the target state, the reference state, and the intermediate states with respect of the coupling parameter λ.
20. The method of claim 19, wherein the ensemble is selected from an NVT ensemble, a NPT ensemble, a NVE ensemble, and a μVT ensemble.
21. The method of claim 1, wherein at least one of P.sub.A and P.sub.B is null.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The present invention will be better understood by reference to the accompanying drawings, wherein:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
DETAILED DESCRIPTION
(9) The present application discloses computer-implemented methods and systems for computing free energy difference between a reference system state and a target system state. In particular, to address the issues in determining free energy difference arising from bond breaking in a ring structure that transforms a ring structure to a linear structure and bond formation that transforms a linear structure into a ring structure (each of which is further discussed below), the methods and systems disclosed in the present application utilize a functional form for bond stretching that allows a rigorous connection to the harmonic bond functional form to be maintained at any points in the alchemical transformation for calculating the free energy difference. Accordingly, the methods and systems of the present application can advantageously improve numerical stability and accuracy of the free energy calculations. However, it is to be recognized that the general principles of the free energy calculations using such modified bond stretching potentials disclosed herein can be applied generally in any bond formation and breaking situations, and not limited to ring closing or ring opening.
(10) As with traditional free energy difference calculations, the atoms in the system can be categorized into different groups for evaluating the system energy in different system states. The reference state and target state both include a common set of atoms P.sub.AB. The reference state further includes a set of atoms P.sub.A, and the target state further includes a set of atoms P.sub.B. The set of atoms P.sub.A are present only in the reference state and not in the target state, and the set of atoms P.sub.B are present only in the target state and not the reference state. In a ring formation scenario, P.sub.A can be the atoms connected to the two terminal atoms to form a bond. During the course of the transformation, the atoms in P.sub.A and in P.sub.B interact with other atoms within their own set as well as with those in P.sub.AB, but the atoms in P.sub.A do not interact with any atoms in P.sub.B, or vice versa. For example, for a molecule having a structure shown in
(11) As described herein, the coupling Hamiltonian (λ) for alchemical transformation involving ring opening and closing can generally include the following terms,
.sub.bs(λ.sub.bs),
.sub.ba(λ.sub.ba),
.sub.bd(λ.sub.bd) and
.sub.nb(λ.sub.nb), corresponding to the bonded stretch terms, the bonded angle terms, the bonded dihedral angle terms, and the nonbonded exclusion and 1-4 pair interaction terms respectively. To simplify the discussion, in the following the case where a valence bond between two ring atoms is being formed (i.e., the transformation from the reference state to the target state involves ring closing) is described in detail.
(12) As an initial step, the topology of the system is provided, including the bonded connections between the atoms in the system and the relative spatial arrangements of the atoms forming each of P.sub.AB, P.sub.A, and P.sub.B.
(13) One or more, e.g., a plurality of intermediate states between the reference state and the target state can be determined along a path defined by different values of the coupling parameter λ, where the increments of λ in value move the system from the reference state to the target state. While λ can be a scalar variable that varies from 0 to 1, in some embodiments of the present invention, such as those further discussed below, λ can be a vector containing different components for different types of interactions within the system. Computer molecular simulations, such as, but not limited to, molecular dynamics or Monte Carlo simulations, can be performed to obtain ensembles of the micro-states for the reference state, the target state, and each of the intermediate states. The λ values of the intermediate states can be chosen by known techniques such that between each neighboring λ windows on the “reaction pathway” from the reference state to the target state there is substantial overlap between the micro-states in the successive λ windows that are sampled by the molecular simulations.
(14) In performing molecular simulations for all these states, the bonded stretch interaction energy between the two atoms A.sub.a and A.sub.b that are to form a bond (e.g., A.sub.1 and A.sub.3 in
(15) In popular molecular mechanics force fields, such as OPLS, CHARMM, and AMBER, the bonded stretch interactions between two atoms are modeled by a harmonic bond of the following form:
U.sub.bs(λ,r)=½k(r−r.sub.0).sup.2 (5)
where k is the “force constant” or “Hookean constant” which defines the strength (or rigidity) of the bond of the force field used, r is the instantaneous distance between the two atoms and r.sub.0 is the equilibrium distance between the two atoms. In conventional method of linear scaling of the coupling parameter between the Hamiltonians of the reference state and the target state, the bonded stretch term has the following form in the coupling Hamiltonian:
U.sub.bs(λ,r)=½λk(r−r.sub.0).sup.2 (6)
Therefore,
(16)
The integrand in the above equation approaches infinity when r is very large. In the limit when λ approaches 0, there is no bonded stretch interaction between the two atoms, and the distance between the two atoms can be very large, leading to singularity and numerical instability problem in the calculation of the above integral. In practice, the distance between the two atoms is limited by the size of the simulation box (a three-dimensional volume or unit cell in which the simulation is conducted and boundary conditions, such as periodic boundary conditions, can be applied), the singularity problem can be avoided. However, since the integrand in the above equation is unbounded, it can still cause numerical instability and inaccuracy problems in the free energy calculations.
(17) Additionally, using the above conventional coupling Hamiltonian functional form, the potential is undefined for λ=0 when r approaches infinity. The limiting value depends on how λ approaches 0 and how r approaches infinity, i.e.:
(18)
(19) In order to obtain a pathway that allows stable and efficient simulations from which reliable free energies involving the annihilation and/or formation of a bond between two atoms can be determined, the present inventors have discovered the following coupling potential (referred to herein as the soft bond potential) to connect the two physical systems where the harmonic interactions between the two atoms are fully turned on and off when the coupling parameter λ changes between 0 and 1:
(20)
where the functions ƒ, g and α are each continuous functions and simultaneously satisfy the following conditions: f(λ=0)=0; f(λ=1)=1; g(λ=0)=1; g(λ=1)=0; α(k,λ<1)>0. It is noted that for all the discussions herein regarding the soft bond potential for the bonded stretch interactions, λ as used in the equations (from Equation 9 and onwards) refers to the bonded stretch component, λ.sub.sbs, of the coupling parameter λ.
(21) In particular example of the soft bond potential described by Eq. 9, f(λ)=λ, g(λ)=1−λ and α(k,λ)=α=const (a constant number), i.e., the soft bond potential takes the following form:
(22)
(23) It can be seen that the soft bond interaction of Eq. 10 correctly recovers the two physical end states when λ=0 and λ=1:
(24)
(25) The introduction of α(1−λ)(r−r.sub.0).sup.2 in the denominator of Eq. 10 changes the harmonic interaction into a soft bond interaction (the interaction is bounded when r approaches infinity) when λ is smaller than 1, and at λ=1 (the bond is formed) the function has the exact harmonic potential form.
(26) The above functional form removes the singularity and numerical instability problems associated with the conventional harmonic potentials. In the following, some properties of the soft bond interaction functional form as exemplified by Eq. 10 are discussed.
(27) Property 1:
(28) The soft bond interaction functional form does not have any singular regions for all values of λ between 0 and 1 and for all values of r. From the above description, i.e.:
(29)
(30) The soft bond potential for a model system with the force constant k=20 kcal.Math.mol.sup.−1.Math.Å.sup.−2 and the soft bond parameter α=1 at different values of λ are given in
(31) Property 2:
(32)
(33) As discussed in the above section, it is
(34)
which determines the numerical stability and accuracy of the free energy calculations. In the above formulation, when λ∈[0,1), the thermodynamic property to be averaged in the rightmost bracket in Eq. 13 is continuous and bounded for all values of r. When λ=1, the soft bond potential recovers the harmonic potential, so only phase space regions where r is close to r.sub.0 are sampled and taken into the ensemble average. In the phase space regions where r is close to r.sub.0, the integrand in Eq. 13 is also bounded. Therefore, the quantity in the bracket of Eq. 13 is bounded for all values of λ between 0 and 1. Since
(35)
does not have any singular region for all values of λ between 0 and 1, accurate and reliable free energy results can be obtained using the above soft bond interaction functional form.
(36) As an example, the derivative of the soft bond potential of Eq. 10 with respect to the coupling parameter λ at λ=0.5 for a model system with force constant k=20 kcal.Math.mol.sup.−1.Math.Å.sup.−2 and the soft bond parameter α=1 is plotted in
(37)
does not have any singular region for all values of λ and r, allowing reliable and accurate free energy calculations.
Property 3:
(38)
(39) Both the first derivative and the second derivative of the soft bond potential of Eq. 10 with respect to the inter-particle (or inter-atom) distance r are continuous and bounded for all values of λ∈[0,1] and they are short ranged and approaching 0 when r.fwdarw.∞. When λ=1, the soft bond potential reverts to the harmonic potential, and only phase space regions where r is close to r.sub.0 are sampled in the molecular simulations. Thus the first and the second derivatives of the potential with respect to the inter-particle distance r are also continuous and bounded in the physically accessible phase space regions when λ=1. Therefore, the forces and acceleration on the atoms are always continuous and bounded for all values of λ between 0 and 1, allowing stable molecular dynamics simulations to be performed for all values of λ. As an example, the first and second derivative of the soft bond potential with respect to the inter-particle distance r at λ=0.5 for a model system with force constant k=20 kcal.Math.mol.sup.−1.Math.Å.sup.−2 and the soft bond parameter α=1 are plotted in
(40) It is clear from the above description that the soft bond potential removes the singularity and numerical instability problems associated with the traditional methods, and it allows stable molecular dynamics simulations and more convergent Monte Carlo to be performed for all system states, including the reference system state, the intermediate system states, and the target system state. Using the soft bond potential described herein, the free energy difference between the reference system state and the target system state involving breaking and form valance bond can be accurately and reliable calculated. The free energy calculations utilizing such a soft bond potential are particularly advantageous for alchemical transformations involving ring opening, ring closing, or ring rearrangement, where the computational efficiency and convergence of the free energy calculations can be significantly improved over the conventional methods.
(41) As described in above section, in the conventional coupling Hamiltonian, the limit does not exist for the initial state where λ=0 when r approaches infinity, while in the soft bond potential the limit exists for all values of r. The conventional coupling Hamiltonian is a special case of the soft bond potential where α=0. While the conventional coupling Hamiltonian and the soft bond potential reach the same end state when λ=1, the initial states when λ=0 are different depending on the value of the soft bond parameter α. In the following, it is shown that the free energy of the initial state does not depend on the soft bond parameter α.
(42) Consider a Hamiltonian with the following potential energy term (i.e., Eq. 10 where λ=0):
(43)
(44) When α=0, it becomes the conventional harmonic potential, and when α≠0 it becomes the soft bond potential. We need to prove that
F(α=0,λ=0)=F(α=α′>0,λ=0) (16)
Note that
(45)
(46) The ensemble average I(α) is a finite number since
(47)
(48) In view of the above, the initial state for the conventional coupling Hamiltonian and soft bond potential when λ=0 have the same free energy.
(49) In free energy calculations, the convergence of the free energy can be affected by the overlap of phase space regions between neighboring or successive λ windows. Empirically, a suitable path from the initial state to the final state can be achieved when the change of the free energy as a function of λ is continuous and smooth for all values of λ, i.e.:
(50)
(51) Thus, in preferred embodiments, the value of the soft bond parameter α can be obtained when Eq. 21 is satisfied.
(52) As mentioned above, in ring opening and closing free energy calculations, in addition to the bonded stretch interactions, there can be other interaction energy terms which are different in the initial and final states, including the bonded angle terms, the bonded dihedral angle terms, and the nonbonded interactions. These different types of interactions can be treated differently during the transformation process. As described herein, in some embodiments, the coupling parameter λ can take a vector form which includes components for different types of interactions. For example, the coupling vector λ can include the following components for interactions affected by the formation (or the breaking) of the valence bond between the reference state and the target state: a component λ.sub.sbs for bonded stretch energy term; a component λ.sub.ba for the bonded angle energy term; a component λ.sub.bd for bonded dihedral angle energy term; components λ.sub.vdw and λ.sub.elec for the van de Waals and electrostatic energy terms of the nonbonded exclusion and 1-4 pair interaction, respectively. The coupling vector λ can further include components λ.sub.other for other interactions not affected by the formation and/or breaking of the valence bond between the reference state and the target state. Each component of the coupling vector λ belongs to [0, 1]. During the course of the alchemical transformation from the reference state to the target state, the interactions unique in the reference state can be turned off according to a first set of schedules for different λ components, and the interactions unique in the target state can be turned on according to a second set of schedules for different λ components, as will be further described below.
(53) In popular molecular mechanics force fields, the bonded angle and bonded dihedral angle interactions usually have the following potential energy form:
(54)
where θ is the bond angle, θ.sub.0 is the equilibrium bond angle, k.sub.θ is the angle force constant (both θ.sub.0 and k.sub.θ depend on the atoms forming the bond angle); ϕ is the dihedral angle, k.sub.ϕ is the dihedral angle force constant (which depends on the atoms forming the dihedrals). With the opening or closing of a ring, the bonded angle and dihedral angle terms that are affected by the breaking or forming of the bond can be modulated by components λ.sub.ba and λ.sub.bd of the coupling parameter λ, respectively. Although the bonded angle and dihedral interaction terms are bounded for all λ values, the absolute values of these terms can be very large if the molecule is in a very twisted geometry. To improve the accuracy of the free energy calculation, in some embodiments, the bonded stretch interaction can be first turned on to a significant degree (e.g., λ.sub.sbs=0.5) before turning on the bonded angle and bonded dihedral angle interaction (during bond formation). In this way, the bonded stretch interaction will steer the molecule clear from a severely twisted geometry, improving the inaccuracy problem caused by the bonded angle and bonded dihedral angle interactions in the free energy calculations.
(55) The nonbonded electrostatic and van der Waals interactions between two atoms are usually modeled by the following potential energy form in popular molecular mechanics force fields:
(56)
where in Eqs. 24 and 25, r is the inter-atom distance, q.sub.1 and q.sub.2 are the charges of the two atoms, C is a constant, c is the depth of the potential well of U.sub.vdw(r), and σ is the finite distance at which the inter-atom potential U.sub.vdw(r) is zero.
(57) Many force fields, including OPLS, CHARM, AMBER, exclude or modify the nonbonded interactions between atoms separated by one, two, or three bonds. In particular, when two atoms are separated by three bonds (e.g., the two atoms A.sub.1 and A.sub.8 shown in
(58) With the opening or closing of a ring, the nonbonded electrostatic and van der Waals interactions for atoms that span the bond that is broken or forming can be modulated for by components λ.sub.elec and λ.sub.vdw, respectively, which will be further discussed below. As used herein, in some embodiments of the invention, the van der Waals and/or the electrostatic interaction can be described by a soft-core Lennard Jones (LJ) and/or soft-core Coulomb potential that keeps pairwise interaction energies finite for all configurations. By way of example, a soft-core LJ potential may take the following functional form:
(59)
where it α.sub.LJ is the soft core parameter, ε and σ are the standard Lennard Jones interaction parameters, n is the order parameter which usually takes values between 1 and 6. See Beutler et al., “Avoiding singularities and numerical instabilities in free energy calculations based on molecular simulations,” Chem. Phys. Lett., 222 (1994) 529-539, the disclosure of which is incorporated by reference herein. The soft-core LJ interaction recovers the standard Lennard Jones interactions when λ=1 and it becomes 0 when λ=0.
(60) In some embodiments, for ring opening and closing free energy calculations, a λ schedule for all the system states can be used to treat the bonded stretch, bonded angle, bonded dihedral angle, nonbonded exclusion and 1-4 pair interactions, that are affected by the formation and/or annihilation of a bond that results in the opening or the closing of a ring structure. The coupling parameter λ for those interactions affected by the bond formation or breaking may include seven components for bond breaking transformation and seven components for bond formation transformation from the reference state to the target state.
(61) The seven components of the coupling parameter λ applicable for the bond breaking transformation include the following terms: λ.sub.sbsA, which modulates the bonded stretch interactions, respectively, that are present in the initial state (the bond is present and yet to be broken) but not in the final state (the bond is broken); λ.sub.baA and λ.sub.bdA, which modulate the bonded angle and bonded dihedral angle interactions that are present in the initial state but not in the final state; λ.sub.elecA.sub.
(62) Similarly, the seven components for bond formation transformation include the following terms: λ.sub.sbsB, which modulates the bonded stretch interactions that are present in the final state (the bond is formed) but not in the initial state (the bond is yet to be formed), λ.sub.baB and λ.sub.bdB, which modulate the bonded angle and bonded dihedral angle interactions that are present in the final state but not in the initial state, respectively, λ.sub.elecB.sub.
(63) As used herein, the term “modulate” when used in connection with a component of the coupling parameter λ means that the interaction energy for that particular component is calculated using parameters of a conventional force field and the corresponding component coupling parameter. To be specific, for bonded stretch interaction and the LJ interaction, (or the electrostatic interaction) where the soft-core potentials are used, the interaction energies are calculated according to equations 9 and 26 respectively, while for other types of the interactions, the interaction energy is calculated using parameters of a conventional force field multiplied by the corresponding component coupling parameter λ.
(64) One example of λ schedules discussed above is shown in Scheme 1 below, in which the superscript (0, 1, . . . , m, m+1, . . . , n) indicate the indexes of the reference system state, the intermediate states, and the target system state, and λ.sub.comp.sup.i is the respective λ component value for state with index “i” and component “comp”.
(65)
(66) λ.sub.sbsA as shown in Scheme 1 can be varied from 1 to 0 over the bond breaking (e.g., a ring opening) transformation. In some embodiments, the variation of λ.sub.sbsA can be linear and/or monotonic over the transformation. In other embodiments, the variation of λ.sub.sbsA can be non-linear and/or non-monotonic over the transformation. Although it is shown that λ.sub.sbsA=0.5 at the intermediate system state indexed by m, this schedule for λ.sub.sbsA is merely illustrative and non-limiting (e.g., other values smaller or greater than 0.5 can also be used for the intermediate system state indexed by m).
(67) λ.sub.baA as shown in Scheme 1 can be varied from 1 to 0 over the bond breaking (e.g., a ring opening) transformation. In some embodiments, the variation of λ.sub.baA can be linear and/or monotonic over the transformation or a portion of the transformation. In other embodiments, the variation of λ.sub.baA can be non-linear and/or non-monotonic over the transformation or a portion of the transformation.
(68) λ.sub.bdA as shown in Scheme 1 can be varied from 1 to 0 over the bond breaking (e.g., a ring opening) transformation. In some embodiments, the variation of λ.sub.bdA can be linear and/or monotonic over the transformation or a portion of the transformation. In other embodiments, the variation of λ.sub.bdA can be non-linear and/or non-monotonic over the transformation or a portion of the transformation.
(69) Further, in certain embodiments, for improved sampling efficiency in the molecular simulations, the bonded angle and bonded dihedral interactions between two atoms can be turned off more quickly to 0 before the bonded stretch interactions are turned off during the bond breaking transformation (conversely, the bonded angle and bonded dihedral interactions can be turned on only after the bonded stretch interactions are turned on to a predetermined degree). Although it is shown that λ.sub.baA/λ.sub.bdA can be decreased to 0 at the intermediate system state indexed by m (meaning that λ.sub.baA/λ.sub.bdA can be varied from 1 at the initial state to 0 at this intermediate state by a more rapid decrease than that of λ.sub.sbsA), the schedule for λ.sub.baA/λ.sub.bdA are merely illustrative and non-limiting (e.g., λ.sub.baA/λ.sub.bdA can be decreased to 0 more rapidly or slowly from the initial state). Also, the λ.sub.baA and λ.sub.bdA can be varied separately according to their own respective schedules and do not need to be synchronized.
(70) λ.sub.elecA.sub.
(71) λ.sub.vdwA.sub.
(72) λ.sub.elecA.sub.
(73) λ.sub.vdwA.sub.
(74) The schedules of the λ components for the bond formation transformation, λ.sub.sbsB, λ.sub.baB and λ.sub.bdB, λ.sub.elecB.sub.
(75) In the following, energy calculations in a molecular simulation of the transformation shown in
(76) The other bonded and nonbonded interactions that are not affected by the formation and annihilation of bonds for the ring opening or closing transformation can be modulated by a regular λ schedule as in conventional free energy perturbations not involving ring opening and closing (e.g., incrementing λ.sub.other from the initial state to the final state, where the interactions between P.sub.AB and P.sub.A is scaled by 1−λ.sub.other, and the interactions between P.sub.AB and P.sub.B is scaled by λ.sub.other). Further, all the interactions involving atoms that only appear in the initial state but missing from the final state (i.e., those atoms that become “dummy” atoms in the final state) will also be treated by a normal λ schedule. Conversely, for the reverse (bond formation) transformation, all the interactions involving atoms that appear only in the final state and are missing from the initial state (e.g., A.sub.11 and A.sub.12 in
(77) With the energy terms defined by a suitable λ schedule (such as the one depicted in Scheme 1) for all the system states in the transformation from the initial state to the final state, molecular simulations can be run to sample the ensembles of micro-states obtained at the reference state, the target state, and the intermediate states according to the λ schedule. For each λ window, the free energy difference can be calculated between all the neighboring lambda windows ΔF.sub.λi.fwdarw.λi+1 and/or between any pair of lambda windows ΔF.sub.i.fwdarw.j, including between the reference state and the target state. The total free energy difference between the reference state and the target state can be obtained by adding the free energy differences between each two successive state along the transformation path defined by the λ schedule or directly obtained by analyzing the data from all the sampled states.
(78) The free energy difference between neighboring λ windows or generally between any pair of states including initial state and the final state can be calculated by a variety of ways. For example, e.g., by the use of internal energy difference (FEP NVT ensemble), the enthalpy difference (FEP NPT ensemble), or other related thermodynamic property difference (FEP other ensembles, such as the NVE ensemble), of a suitable ensemble of the micro-states obtained at the target state, the reference state, and the intermediate states as coupling parameter λ is instantaneously varied for the selected ensemble of micro-states. The analysis can be further performed, for example, by way of Bannet Acceptance Ratio (BAR), Multistate Bannet Acceptance Ratio (MBAR), Weighted Histogram Analysis Method (WHAM), Zwanzig averaging, or other similar FEP-family estimators. Alternatively, the free energy difference between neighboring λ windows or generally between any pairs of states including the initial state and the final state can be calculated by way of an analysis the derivative of the energy with respect of the coupling vector λ (TI NVT ensemble), the derivative of the enthalpy with respect to the coupling vector λ (TI NPT ensemble), or the derivative of other related thermodynamic properties with respect to the coupling vector λ (TI other ensembles, such as the TI NVE ensemble), for each microscopic state obtained. In other embodiments, the free energy difference of each λ window can be calculated by way of an analysis of the potential of mean force (PMF) associated with sampling of the coupling vector λ as a dynamical variable that can dynamically transition between the reference state, the target state and intermediate states for example and without loss of generality via the λ-dynamics, the principle of which is generally described in Knight et al., λ-dynamics free energy simulation methods, J. Comput. Chem., 2009, 30: 1692-1700, the disclosure of which is incorporated by reference herein. λ-dynamics based sampling methods include, but are not limited to, λ-Monte Carlo, λ-metadynamics, λ-OSRW, and other λ PMF sampling family methods.
(79) The methods for free energy difference calculations described herein can be applied to a number of highly useful applications, which include, for example: Relative protein-ligand binding affinity and/or relative solvation free energy calculations between congeneric ligands with ring opening or closing (see
(80) Embodiments of the method for the free energy calculations of the disclosed subject matter can be implemented in a computer program, which can take the form of a software component of a suitable hardware platform, for example, a standalone computer, a networked computer, a network server computer, a handheld device, or the like. Different aspects of the disclosed methods may be implemented in different software modules and executed by one processor or different processors, sequentially or in parallel, depending on how the software is designed. The apparatus on which the program can be executed can include one or more processors, one or more memory devices (such as ROM, RAM, flash memory, hard drive, optical drive, etc.), input/output devices, network interfaces, and other peripheral devices. A computer readable non-transitory media storing the program is also provided.
(81) The disclosed subject matter is not to be limited in scope by the specific embodiments described herein. Indeed, various modifications of the disclosed subject matter in addition to those described herein will become apparent to those skilled in the art from the foregoing description and the accompanying figures. Such modifications therefore fall within the scope of the appended claims.