Cooperative binding

This article has been published in PLOS Computational Biology and can now be edited on the English Wikipedia.

Molecular binding is an interaction between molecules that results in a stable association between those molecules. Cooperative binding occurs if the number of binding sites of a macromolecule that are occupied by a specific type of ligand is a non-linear function of this ligand's concentration. This can be due, for instance, to an affinity for the ligand that depends on the amount of ligand bound. Cooperativity can be positive (supra-linear) or negative (infra-linear). Cooperative binding is most often observed in proteins but nucleic acids can also exhibit cooperative binding, for instance of transcription factors. Cooperative binding has been shown to be the mechanism underlying a large range of biochemical and physiological processes.

Christian Bohr and the concept of cooperative binding
In 1904, Christian Bohr studied hemoglobin binding to oxygen under different conditions. When plotting hemoglobin saturation with oxygen as a function of the partial pressure of oxygen, he obtained a sigmoidal (or "S-shaped") curve, see. This indicates that the more oxygen is bound to hemoglobin, the easier it is for more oxygen to bind - until all binding sites are saturated. In addition, Bohr noticed that increasing CO2 pressure shifted this curve to the right - i.e. higher concentrations of CO2 make it more difficult for hemoglobin to bind oxygen. This latter phenomenon, together with the observation that hemoglobin's affinity for oxygen increases with increasing pH, is known as the Bohr effect.



A receptor molecule is said to exhibit cooperative binding if its binding to ligand scales non-linearly with ligand concentration. Cooperativity can be positive (if binding of a ligand molecule increases the receptor's apparent affinity, and hence increases the chance of another ligand molecule binding) or negative (if binding of a ligand molecule decreases affinity and hence makes binding of other ligand molecules less likely). Figure 1 is a chart of the "fractional occupancy" $$\bar{Y}$$ of a receptor with a given ligand, which is defined as the quantity of ligand-bound binding sites divided by the total quantity of ligand binding sites:

$$ \bar{Y}=\frac{[\text{bound sites}]}{[\text{bound sites}]+[\text{unbound sites}]} = \frac{[\text{bound sites}]}{[\text{total sites}]} $$

If $$\bar{Y}=0$$, then the protein is completely unbound, and if $$\bar{Y}=1$$, it is completely saturated. If the plot of $$\bar{Y}$$ at equilibrium as a function of ligand concentration is sigmoidal in shape, as observed by Bohr for hemoglobin, this indicates positive cooperativity. If it is not, no statement can be made about cooperativity from looking at this plot alone.

The concept of cooperative binding only applies to molecules or complexes with more than one ligand binding sites. If several ligand binding sites exist, but ligand binding to any one site does not affect the others, the receptor is said to be non-cooperative. Cooperativity can be homotropic, if a ligand influences the binding of ligands of the same kind, or heterotropic, if it influences binding of other kinds of ligands. In the case of hemoglobin, Bohr observed homotropic positive cooperativity (binding of oxygen facilitates binding of more oxygen) and heterotropic negative cooperativity (binding of CO2 reduces hemoglobin's facility to bind oxygen.)

Throughout the 20th century, various frameworks have been developed to describe the binding of a ligand to a protein with more than one binding site and the cooperative effects observed in this context (reviewed by Wyman, J. and Gill, 1990 ).

The Hill equation
The first description of cooperative binding to a multi-site protein was developed by A.V. Hill. Drawing on observations of oxygen binding to hemoglobin and the idea that cooperativity arose from the aggregation of hemoglobin molecules, each one binding one oxygen molecule, Hill suggested a phenomenological equation that has since been named after him:

$$ \bar{Y} = \frac{K\cdot{}[X]^n}{1+ K\cdot{}[X]^n} = \frac{[X]^n}{K* + [X]^n} = \frac{[X]^n}{K_d^n + [X]^n} $$

here $$n$$ is the "Hill coefficient", $$[X]$$ denotes ligand concentration, $$K$$ denotes an apparent association constant (used in the original form of the equation), $$K*$$ is an apparent dissociation constant (equivalent to an$$EC_{50}$$) and $$K_d$$ a microscopic dissociation constant (used in modern forms of the equation). If $$n<1$$, the system exhibits negative cooperativity, whereas cooperativity is positive if $$n>1$$. The total number of ligand binding sites is an upper bound for $$n$$. The Hill equation can be linearized as:

$$ \log \frac{\bar{Y}}{1-\bar{Y}} = n\cdot{}\log [X] - \log K_d $$

The "Hill plot" is obtained by plotting $$\log \frac{\bar{Y}}{1-\bar{Y}}$$ versus $$\log [X]$$. In the case of the Hill equation, it is a line with slope $$n_H$$ and intercept $$log(K_d)$$ (see ). This means that cooperativity is assumed to be fixed, i.e. it does not change with saturation. It also means that binding sites always exhibit the same affinity, and cooperativity does not arise from an affinity increasing with ligand concentration.

The Adair equation
G.S. Adair found that the Hill plot for hemoglobin was not a straight line, and hypothesized that cooperativity was not a fixed term, but dependent on ligand saturation. Having demonstrated that hemoglobin contained four hemes (and therefore binding sites for oxygen), he worked from the assumption that fully saturated hemoglobin is formed in stages, with intermediate forms with one, two, or three bound oxygen molecules. The formation of each intermediate stage from unbound hemoglobin can be described using an apparent macroscopic association constant $$K_i$$. The resulting fractional occupancy can be expressed as:

$$ \bar{Y} = \frac{1}{4}\cdot{}\frac{K_I[X]+2K_{II}[X]^2+3K_{III}[X]^3+4K_{IV}[X]^4}{1+K_I[X]+K_{II}[X]^2+K_{III}[X]^3+K_{IV}[X]^4} $$

Or, for any protein with n ligand binding sites:

$$ \bar{Y}=\frac{1}{n}\frac{K_I[X] + 2K_{II}[X]^2 + \ldots + nK_{n} [X]^n}{1+K_I[X]+K_{II}[X]^2+ \ldots +K_n[X]^n} $$

where n denotes the number of binding sites and each $$K_i$$ is a combined association constant, describing the binding of i ligand molecules.

The Klotz equation
Working on calcium binding proteins, Irving Klotz deconvoluted Adair's association constants by considering stepwise formation of the intermediate stages, and tried to express the cooperative binding in terms of elementary processes governed by mass action law. In his framework, $$K_1$$ is the association constant governing binding of the first ligand molecule, $$K_2$$ the association constant governing binding of the second ligand molecule (once the first is already bound) etc. For $$\bar{Y}$$, this gives:

$$ \bar{Y}=\frac{1}{n}\frac{K_1[X] + 2K_1K_2[X]^2 + \ldots + n\left(K_1K_2 \ldots K_n\right)[X]^n}{1+K_1[X]+K_1K_2[X]^2+ \ldots +\left(K_1K_2 \ldots K_n\right)[X]^n} $$

It is worth noting that the constants $$K_1$$, $$K_2$$ and so forth do not relate to individual binding sites. They describe how many binding sites are occupied, rather than which ones. This form has the advantage that cooperativity is easily recognised when considering the association constants. If all ligand binding sites are identical with a microscopic association constant $$K$$, one would expect $$K_1=nK, K_2=\frac{n-1}{2}K, \ldots K_n=\frac{1}{n}K$$ (that is $$K_i=\frac{n-i+1}{i}K$$) in the absence of cooperativity. We have positive cooperativity if $$K_i$$ lies above these expected values for $$i>1$$.

The Klotz equation (which is sometimes also called the Adair-Klotz equation) is still often used in the experimental literature to describe measurements of ligand binding in terms of sequential apparent binding constants.

Pauling equation
By the middle of the 20th century, there was an increased interest in models that would not only describe binding curves phenomenologically, but offer an underlying biochemical mechanism. Linus Pauling reinterpreted the equation provided by Adair, assuming that his constants were the combination of the binding constant for the ligand ($$K$$ in the equation below) and energy coming from the interaction between subunits of the cooperative protein ($$\alpha$$ below). Pauling actually derived several equations, depending on the degree of interaction between subunits. Based on wrong assumptions about the localisation of hemes, he opted for the wrong one to describe oxygen binding by hemoglobin, assuming the subunit were arranged in a square. The equation below provides the equation for a tetrahedral structure, which would be more accurate in the case of hemoglobin:

$$ \bar{Y} = \frac{K[X]+3\alpha{}K^2[X]^2+3\alpha{}^3K^3[X]^3+\alpha{}^6K^4[X]^4}{1+4K[X]+6\alpha{}K^2[X]^2+4\alpha{}^3K^3[X]^3+\alpha{}^6K^4[X]^4} $$

The KNF model
Based on results showing that the structure of cooperative proteins changed upon binding to their ligand, Daniel Koshland and colleagues refined the biochemical explanation of the mechanism described by Pauling. The Koshland-Némethy-Filmer (KNF) model assumes that each subunit can exist in one of two conformations: active or inactive. Ligand binding to one subunit would induce an immediate conformational change of that subunit from the inactive to the active conformation, a mechanism described as "induced fit". Cooperativity, according to the KNF model, would arise from interactions between the subunits, the strength of which varies depending on the relative conformations of the subunits involved. For a tetrahedric structure (they also considered linear and square structures), they proposed the following formula:

$$ \bar{Y} = \frac{K_{AB}^3(K_XK_t[X])+3K_{AB}^4K_{BB}(K_XK_t[X])^2+3K_{AB}^3K_{BB}^3(K_XK_t[X])^3+K_{BB}^6(K_XK_t[X])^4}{1+4K_{AB}^3(K_XK_t[X])+6K_{AB}^4K_{BB}(K_XK_t[X])^2+4K_{AB}^3K_{BB}^3(K_XK_t[X])^3+K_{BB}^6(K_XK_t[X])^4} $$

Where $$K_X$$ is the constant of association for X, $$K_t$$ is the ratio of B and A states in the absence of ligand ("transition"), $$K_{AB}$$ and $$K_{BB}$$ are the relative stabilities of pairs of neighbouring subunits relative to a pair where both subunits are in the A state (Note that the KNF paper actually presents $$N_s$$, the number of occupied sites, which is here 4 times $$\bar{Y}$$).

The MWC model
The Monod-Wyman-Changeux (MWC) model for concerted allosteric transitions went a step further by exploring cooperativity based on thermodynamics and three-dimensional conformations. It was originally formulated for oligomeric proteins with symmetrically arranged, identical subunits, each of which has one ligand binding site. According to this framework, two (or more) interconvertible conformational states of an allosteric protein coexist in a thermal equilibrium. The states - often termed tense (T) and relaxed (R) - differ in affinity for the ligand molecule. The ratio between the two states is regulated by the binding of ligand molecules that stabilizes the higher-affinity state. Importantly, all subunits of a molecule change states at the same time, a phenomenon known as "concerted transition". The MWC model is illustrated in.

The allosteric isomerisation constant L describes the equilibrium between both states when no ligand molecule is bound: $$L=\frac{\left[T_0\right]}{\left[R_0\right]}$$. If L is very large, most of the protein exists in the T state in the absence of ligand. If L is small (close to one), the R state is nearly as populated as the T state. The ratio of dissociation constants for for the ligand from the T and R states is described by the constant c: $$c = \frac{K_d^R}{K_d^T}$$. If $$c=1$$, both R and T states have the same affinity for the ligand and the ligand does not affect isomerisation. The value of c also indicates how much the equilibrium between T and R states changes upon ligand binding: the smaller c, the more the equilibrium shifts towards the R state after one binding. With $$\alpha = \frac{[X]}{K_d^R}$$, fractional occupancy is described as:

$$ \bar{Y} = \frac{\alpha(1+\alpha)^{n-1}+Lc\alpha(1+c\alpha)^{n-1}}{(1+\alpha)^n+L(1+c\alpha)^n} $$

The sigmoid Hill plot of allosteric proteins (shown in ) can then be analysed as a progressive transition from the T state (low affinity) to the R state (high affinity) as the saturation increases (see ). The Hill coefficient also depends on saturation, with a maximum value at the inflexion point. The intercepts between the two asymptotes and and the y-axis allow to determine the affinities of both states for the ligand.

In proteins, conformational change is often associated with activity, or activity towards specific targets. Such activity is often what is physiologically relevant or what is experimentally measured. The degree of conformational change is described by the state function $$\bar{R}$$, which denotes the fraction of protein present in the $$R$$ state. As the energy diagram illustrates, $$\bar{R}$$ increases as more ligand molecules bind. The expression for $$\bar{R}$$ is:

$$ \bar{R}=\frac{(1+\alpha)^n}{(1+\alpha)^n+L(1+c\alpha)^n} $$

A crucial aspect of the MWC model is that the curves for $$\bar{Y}$$ and $$\bar{R}$$ do not coincide, i.e. fractional saturation is not a direct indicator of conformational state (and hence, of activity). Moreoever, the extents of the cooperativity of binding and the cooperativity of activation can be very different: an extreme case is provide by the bacteria flagella motor with a Hill coefficient of 1.7 for the binding and 10.3 for the activation. The supra-linearity of the response is sometimes called ultrasensitivity.

If an allosteric protein binds to a target that also has a higher affinity for the R state, then target binding further stabilises the R state, hence increasing ligand affinity. If, on the other hand, a target preferentially binds to the T state, then target binding will have a negative effect on ligand affinity. Such targets are called allosteric modulators.

Since its inception, the MWC framework has been extended and generalised. Variations have been proposed, for example to cater for proteins with more than two states, proteins that bind to several types of ligands or several types of allosteric modulators and proteins with non-identical subunits or ligand-binding sites.

Examples of cooperative binding
The list of molecular assemblies that exhibit cooperative binding of ligands is very large, but some examples are particularly notable for their historical interest, their unusual properties, or their physiological importance.

As described in the historical section, the most famous example of cooperative binding is hemoglobin. Its quaternary structure, solved by Max Perutz using X-ray diffraction, exhibits a pseudo-symmetrical tetrahedron carrying four binding sites (hemes) for oxygen (see ). Many other molecular assemblies exhibiting cooperative binding have been studied in great detail.

Multimeric enzymes
The activity of many enzymes is regulated by allosteric effectors. Some of these enzymes are multimeric and carry several binding sites for the regulators.

Threonine deaminase was one of the first enzymes suggested to behave like hemoglobin and shown to bind ligands cooperatively. It was later shown to be a tetrameric protein.

Another enzyme that has been suggested early to bind ligands cooperatively is aspartate trans-carbamylase. Although initial models were consistent with four binding sites, its structure was later shown to be hexameric by William Lipscomb and colleagues.

Ion Channels
Most ion channels are formed of several identical or pseudo-identical monomers or domains, arranged symmetrically in biological membranes. Several classes of such channels whose opening is regulated by ligands exhibit cooperative binding of these ligands.

It was suggested as early as 1967 (when the exact nature of those channels was still unknown) that the nicotinic acetylcholine receptors bound acetylcholine in a cooperative manner due to the existence of several binding sites. The purification of the receptor and its characterization demonstrated a pentameric structure with binding sites located at the interfaces between subunits, confirmed by the structure of the receptor binding domain.

Inositol triphosphate (IP3) receptors form another class of ligand-gated ion channels exhibiting cooperative binding. The structure of those receptors shows four IP3 binding sites symmetrically arranged.

Multi-site molecules
Although most proteins showing cooperative binding are multimeric complexes of homologous subunits, some proteins carry several binding sites for the same ligand on the same polypeptide. One such example is calmodulin. One molecule of calmodulin binds four calcium ions cooperatively. Its structure presents four EF-hand domains, each one binding one calcium ion. Interestingly, the molecule does not display a square or tetrahedron structure, but is formed of two lobes, each carrying two EF-hand domains.

Transcription factors
Cooperative binding of proteins onto nucleic acids has also been shown. A classical example is the binding of the lambda phage repressor to its operators, which occurs cooperatively. Other examples of transcription factors exhibit positive cooperativity when binding their target, such as the repressor of the TtgABC pumps (n=1.6).

Conversely, examples of negative cooperativity for the binding of transcription factors were also documented, as for the homodimeric repressor of the Pseudomonas putida cytochrome P450cam hydroxylase operon (n=0.56).

Conformational spread and binding cooperativity
Early on, it has been argued that some proteins, especially those consisting of many subunits, could be regulated by a generalized MWC mechanism, in which the transition between R and T state is not necessarily synchronized across the entire protein. In 1969, Wyman proposed such a model with "mixed conformations" (i.e. some protomers in the R state, some in the T state) for respiratory proteins in invertebrates.

Following a similar idea, the conformational spread model by Duke and colleagues subsumes both the KNF and the MWC model as special cases. In this model, a subunit does not automatically change conformation upon ligand binding (as in the KNF model), nor do all subunits in a complex change conformations together (as in the MWC model). Conformational changes are stochastic with the likelihood of a subunit switching states depending on whether or not it is ligand bound and on the conformational state of neighbouring subunits. Thus, conformational states can "spread" around the entire complex.