Two-Locus Selection Models

The human genome is thought to have about 100,000 genes, Drosophila about 10,000 genes, and even bacteria contain thousands of genes per cell!

Clearly, one-locus models of selection are highly simplified depictions of how populations evolve over time.

Exact analyses with multiple loci are, however, extremely difficult if not impossible to obtain.

Even with just two loci, the dynamics are complicated and not completely understood. Results are generally limited in scope, focusing on particular fitness schemes.

Nevertheless, results from two-locus models are very important in determining what properties of the one-locus model are unique and so might not apply to the real-world situation in which a number of loci collectively interact to guide the formation of the individual.

We will consider a model with two loci, A and B, with two alleles each: A₁, A₂ and B₁, B₂ respectively.

There are four possible combinations of these alleles on a chromosome:

Chromosome type: A₁ B₁ A₁ B₂ A₂ B₁ A₂ B₂

Frequency: x₁ x₂ x₃ x₄

[Note: x₁ + x₂ + x₃ + x₄ = 1]

Two-Locus Selection Models

As you know from genetics, recombination can occur during meiosis in sexual organisms to generate gametes that carry new combinations of the alleles not present in the parent:

We specify the rate of recombination between two loci by r.

[Note: Recombination may occur in any individual but it only changes the type of gametes produced if the parent was a double heterozygote.]

Two-Locus Selection Models

We will consider a diploid life cycle as follows:

where the survival of a diploid individual depends on its genotype:

Two-Locus Selection Models

We census the gamete frequencies at the beginning of a generation (x_i).

These gametes unite at random, creating all possible combinations according to their frequencies. Selection then acts on these diploid genotypes producing a new generation of adults. These adults undergo meiosis to produce the next generation of gametes.

These processes are illustrated in the following mating table:

Two-Locus Selection Models

To calculate gamete frequencies in the next generation, multiply the column "Frequency after selection" (= adult frequencies) by the appropriate gamete column and sum down the column.

For instance,

where we define

as the marginal fitness of allele i (the fitness that the allele experiences averaged over all genetics backgrounds) and where

is the mean fitness of all members of the current population.

Two-Locus Selection Models

The most common simplifications made are that the fitness of the two types of double heterozygotes (cis and trans) are equally fit, ie w₁₄=w₂₃, and that fitness does not depend on which parent contributed a chromosome, ie w_ij=w_ji.

The equations describing selection in the two-locus diploid model are then:

Notice that recombination only enters into these equations when multiplied by (x₁ x₄- x₂x₃). This term is known as the linkage disequilibrium, D.

Technically, D measures the difference in frequency between "coupling" (A₁B₁ and A₂B₂) and "repulsion" (A₁B₂ and A₂B₁) gametes.

More intuitively, D measures the difference between the observed and expected gamete frequencies:

x₁ = p_A₁ p_B₁ + D

x₂ = p_A₁ p_B₂ - D

x₃ = p_A₂ p_B₁ - D

x₄ = p_A₂ p_B₂ + D

where p_A₁ is the frequency of allele A₁ in the population (p_A₁ = x₁+x₂).

Two-Locus Selection Models

We will not analyse this model in detail, but will simply illustrate some interesting results and how they are obtained.

No Selection

If all genotypes are equally fit, the recursions become:

Let's first find out what happens to the disequilbrium over time.

What is D' = x'₁x'₄ - x'₂x'₃ ?

Linkage disequilibrium decays at a rate r every generation.

After an amount of time t, the expected amount of disequilibrium is D[t]=(1-r)^t D[0].

Therefore, after enough time has passed, we expect to see little linkage disequilibrium between two neutral (=not selected) loci unless they are very tightly linked.

What is p'_A₁ = x'₁+x'₂ ?

In the absence of selection, allele frequencies remain constant.

Two-Locus Selection Models

If selection acts on both loci, the simplest question to answer is:

If one chromosome is fixed within the population, when will other chromosomes invade the population?

For comparison, remember that in the one-locus model, a new allele would invade if it had a higher fitness.

Let's say that the population is initially fixed on A₁B₁ (x₁=1) and that the system is perturbed such that some A₁B₂ (x₂=epsilon₂), A₂B₁ (x₃=epsilon₃), and A₂B₂ (x₄=epsilon₄) chromosomes exist within the population.

If we were to do a Taylor series of the equation for x'₂ and keep only linear terms in epsilon, we would get:

Repeating this for x'₃ and x'₄, leads to three linear equations in three variables (the perturbations), which we can write in matrix form:

Two-Locus Selection Models

Using the equations for x'₂, x'₃, and x'₄:

we can determine what all the partial derivatives are to get:

This is the local stability matrix for the two-locus selection model near the equilibrium with one chromosome fixed.

Two-Locus Selection Models

Fixation on A₁B₁ is stable if and only if all eigenvalues of this matrix are less than one.

Since the eigenvalues of a triangular matrix are simply the diagonal elements:

we learn that the fixation state with A₁B₁ present is unstable if ANY of the following hold:

w₁₂ > w₁₁ (A₁B₂ A₁B₁ more fit than A₁B₁ A₁B₁)
w₁₃ > w₁₁ (A₂B₁ A₁B₁ more fit than A₁B₁ A₁B₁)
w₁₄ > (1-r) w₁₁ (A₂B₂ A₁B₁ more fit AND recombination low)

The first two conditions are expected on the basis of the one locus model, but the last condition has no analogy.

This last condition tells us that beneficial gene combinations will invade only if there is tight enough linkage between the two loci (unless the new alleles are also beneficial on their own).

Back to biology 301 home page.