Molecular Biology 03: 'Chemical kinetics and enzymes'

These are my notes from lecture 03 of Harvard’s BCMP 200: Molecular Biology course, delivered by Joe Loparo on September 8, 2014.

Review

Thermodynamics tells you whether a reaction will occur spontaneously, kinetics tells you how quickly it will occur. Today’s lecture will cover kinetics; last time we covered thermodynamics. Here is a quick review of some key points from last time, and their implications for today’s lecture.

Recall the tradeoff between enthalpy and entropy in note that solvation of urea is an endothermic reaction driven by increasing entropy which overcomes an increase in enthalpy, making the environment colder. Mixing of epoxy is an exothermic reaction where entropy decreases but is more than compensated by a decrease in enthalpy, making the environment hotter.

Levinthal’s paradox is as follows. Imagine a 100-residue peptide with 99 peptide bonds and a total of 198 Ψ and Φ angles. If each angle has three stable orientations, there are 3¹⁹⁸ possible conformations of the polypeptide. If a protein sampled one conformation per picosecond (a reasonable time scale of molecular movements), it would take 10^71 seconds to sample all of them. So how can proteins ever fold in a reasonable amount of time? Levinthal concluded that sampling is non-random - it occurs on a rugged high-dimensional energy landscape where the protein moves downhill.

This requires that thermal energy be on a scale comparable to other biological energies. Thermal energy is equal to kT. In terms of kJ/mol energy content, thermal energy is less than one order of magnitude less than a noncovalent bond, and only a bit more than an order of magnitude less than one ATP molecule’s hydrolysis. This thermal energy is crucial for not getting stuck at a local minimum as you traverse the rugged energy landscape.

Reaction kinetics

For the generic reaction A + B → C to occur, A and B must diffuse, collide with each other, and then fuse into C. Imagine the reaction occurring in a discretized grid, where a collision occurs if a unit of A and a unit of B are found in adjacent grid cells.

N_A is the number of A molecules
N_B is the number of B molecules
n_total is the number of grid cells
P_A := N_A / n_total
P_B := N_B / n_total

The probability of A and B being adjacent to one another is proportional to both P_A and P_B. This explains why the rate of a reaction should be proportional to the product of the concentrations of all reactants.

reaction type	reaction equation	rate
first order	A → C	d[A]/dt = -K[A]
second order	A + B → C	d[A]/dt = -K[A][B]
third order	2A + B → C	&half;d[A]/dt = d[B]/dt = -K[A]²[B]
zero order	z catalyzes A → C, where z is limiting and A is in abundance	d[A]/dt = -K

In practice, most experiments depend upon reactant concentration as a function of time, rather than on the time derivative of the concentration per se, so it is more useful to integrate all of the above formulae.For instance, for a zero order reaction where d[A]/dt = -K, integration yields [A] - [A]₀ = -Kt. In a first-order reaction, integration of d[A]/dt -K[A] yields [A] = [A]₀ e^-Kt, which means exponential decay of [A] over time. In a second order reaction, integration yields [A] = [A]₀/(1 + [A]₀Kt).

If you plot these equations, you will see that the higher-order a reaction, the more rapidly it slows down over time as reactants are consumed.

The above equations are approximations relevant to cases where ΔG « 0, i.e. irreversible reactions. In practice many reactions in biology are reversible.

Consider the reversible reaction A ↔ B, where the forward rate constant is K_f and the reverse is K_r. Then:

d[A]/dt = -K_f[A] + K_r[B]
d[B]/dt = K_f[A] - K_r[B]

If you integrate these under the constraint that d([A] + [B])/dt = 0 (i.e. conservation of total concentration) you get:

[A] = ([A]₀ - [A]_eq)e^{-(Kf + Kr)t} + [A]_eq

An implication of this is that [A] approaches [A]_eq asymptotically over time. We can obtain a similar equation for B. If we do this for both A and B we find that [B]_eq / [A]_eq = K_f / K_r = K_eq. Therefore the ratio of [A] and [B] at equilibrium tells you the thermodynamic equilibrium constant K_eq.

How does one observe the equilibrium state of a biological system in practice? For some reactions, it is easy to mix two things and watch the reaction proceed to equilibrium. For other systems it is impossible - for instance, for proteins that are always found in a dimeric form, it can be impossible to purify monomers in order to observe the rate of dimerization. The solution to this difficulty is to use relaxation methods. You perturb the system, using a change in pH or temperature so that the system is no longer at equilibrium, and then watch the “relax” into the new equilibrium. This is often done with an electric shock or laser blast to suddenly increase temperature.

Consider the binding reaction between a protein P and a ligand L to form a protein-ligand complex P·L:

P + L ↔ P·L

The forward rate is called K_on and the backwards rate is called K_off. We define the association constant K_A as follows:

K_A := K_eq = [P·L]/([P][L])

It has units of inverse molar (M^-1), which is non-intuitive. Therefore we instead by convention refer to a dissociation constant, K_D, which has units of molar (M):

K_D := [P][L]/[P·L]

Complexs with smaller K_D have higher affinity.

If we assume [L] » [P], then [L] ≈ [L]₀. Therefore, we can define K′_on = K_on[L]₀, and this simplifies the overall formula for derivative of [P] to d[P]/dt = -K′_on[P] + K_off[P·L]. Integrating this we can get:

[P] = ([P]₀ - [P]_eq)e^{-(K′on+Koff)t + [P]_eq}

Thus if we fix [L], then run the reaction and plot [P] as a function of time, we can estimate the quantity K_obs = (K′_on + K_off). Then we do this for a variety of [L] values - L always has to be in vast excess to P, so we could chose values of [L] that are in 100-fold excess, 1,000-fold excess, 10,000-fold excess, and so on. If we plot [L] on the x-axis and K_obs on the y axis, then the slope will be K_on and the intercept will be K_off.

One technique to do these measurements in practice is white light interferometry. White light is emitted down a needle towards its tip which is a functionalized surface. Some light will be reflected by the tip itself and some by the molecules attached to the tip. You can then plot amplitude vs. wavelength of reflected light (i.e. which colors are most intense) - this is called an interferogram. When a protein binds to the tip’s surface, you get a change in the interferogram. This spectral shift can be measured in real time to derive the association constant. The extent of interferogram response is related to protein size, so for example you get a bigger spectral change from an IgG antibody than from a Fab.

In class, we went through an experiment where protein A (a HA-tagged Staphylococcus virulence factor) is conjugated to the tip, and it is dipped into a solution containing an antibody (HA) that will bind it. You can observe association, and then after you reach equilibrium, you can dip the tip into a vast reservoir of buffer without antibody, and observe the exponential decay of dissociation. The instrument measures “binding” in units of nanometers (nm) of spectral shift, and the software can calculate K_on and K_off from the association and dissociation curves.

Other experimental approaches include the filter binding assay and electrophoretic mobility shift assay (EMSA). Both of these measure the bound and unbound fractions of DNA. The fraction f of A bound to B is defined as f := [A·B] / ([A] + [A·B]). It can be shown that [A·B] = [A][B]/K_D. Therefore f = [B] / (K_D + [B]). When [B] == K_D, then f = 1/2, so you can just find the value of [B] that gives you 50% of A bound, and that’s your K_D.

k, the rate constant, depends on both:

The rate of collisions between A and B
The proportion of collisions that produce a product

That’s right, not all collisions lead to product. K_obs even for a “fast” reaction is usually four orders of magnitude smaller than what would be predicted from the collision rate, implying that only 1 in 10,000 collisions is productive. This is largely because only those collisions which provide enough energy to overcome the higher-energy transition state will lead to a product. For instance, hydroloysis of ATP requires that H₂O collide with ATP with sufficient energy such that it forms a pentavalent transition state complex which is highly unfavorable - only if this energy hill is mounted with the molecule proceed downhill to ADP.

We define the activation energy E_A as the difference in energy between the transition state (the highest energy point in the curve) and the starting state.

The fraction of molecules that have energy equal to E_A is given by the Boltzmann distribution, such that we can derive the Arrhenius equation:

k = Ae^-E_A/(RT)

Where A is a factor that captures collision rate and orientation effects. The kinetics, therefore, are dictated by the barrier height and not by ΔG.

Michaelis and Menten proposed that an enzyme E and its substrate S form a complex E·S which has higher energy before yielding products E + P:

E + S ↔ E·S → E + P

Where the forward rate in step 1 is K₁ and the backwards rate is K_-1, while the second step is irreversible and has only a forward rate, K₂.

We can measure the initial velocity of the reaction as:

V₀ = d[P]/dt = K₂[E·S]

At the steady state,

d[E·S]/dt = 0 = K₁[E][S] - K_-1[E·S]_SS - K₂[E·S]_SS

Rearranging this we get:

[E·S]_SS = K₁[E][S] / (K_-1 + K₂)

And plugging this into the equation for V₀ we get:

V₀ = K₁K₂[E][S] / (K_-1 + K₂)

We then do further substitution and rearrangement, and we define the Michaelis constant (K_m) as:

K_m := (K_-1 + K₂)/K₁

And thus we get the Michaelis-Menten equation:

V₀ = V_max / (1 + K_m/[S])

The Michaelis-Menten equation represents the simplest possible explanation of enzyme kinetics that is still a remotely good model of reality for some (but still not all) reactions.

Continuing the derivation from last time, we can get that:

V_max = k₂[E]₀

Substituting this in, we get:

V₀ = V_max / (1 + K_m/[S])

If we plot [S] (x-axis) vs. V₀ we find that at low [S] where [S] « K_m, the curve is nearly linear. In this range, the slope is approximated by K₂ / K_m, because:

V₀ = V_max / (1 + K_m/[S]) ≈ V_max / (K_m/[S]) = (K₂/K_m)[E]₀[S]

The meaning of these constants may be understood as follows:

K_m is a ratio of rate constants, (K_-1 + K₂)/K₁. If K₂ « K_-1, then K_m ≈ K_-1/K₁ which is simply the K_D from the E + S ↔ E·S reaction. Thus K_m describes how much enzyme has substrate bound to it.

The apparent rate constant for an enzyme functioning at its maximum possible rate is called the catalytic rate constant, denoted K_cat. This is the velocity when the enzyme is saturated:

V_max = K_cat[E]₀

The rate constant has E_A in its exponent, thus even small increases in E_A can cause a large increase in reaction rate. Consider for instance, myosin, which stabilizes (reduces the free energy of) the transition state in ATP hydrolysis, lowering E_A for what is otherwise a spontaneous, but very slow, reaction. Enzymes can lower these energy barriers by (1) stabilizing the transition state through additional interactions (hydrogen bonds, salt bridges) with the substrate and by (2) promoting the correct orientation of substrates with respect to each other. Another example is DNA polymerase, which overcomes the electromagnetic repulsion of a negatively charged DNA backbone and a new triphosphate group through interactions with metal cations.

Nomenclature clarification

Rate constants should be written with lowercase k.

Thermodynamic equilibrium constants should be written with capital K.

K_m is technically a rate constant, but is capitalized because in the limit discussed above it becomes K_D.

The above notes have been updated to include the portion of this material which was finished on 9/10