Skip to main content
Advertisement
  • Loading metrics

FST and genetic diversity in an island model with background selection

Abstract

Background selection, by which selection on deleterious alleles reduces diversity at linked neutral sites, influences patterns of total neutral diversity, πT, and genetic differentiation, FST, in structured populations. The theory of background selection may be split into two regimes: the background selection regime, where selection pressures are strong and mutation rates are sufficiently low such that deleterious alleles are at a deterministic mutation-selection balance, and the interference selection regime, where selection pressures are weak and mutation rates are sufficiently high that deleterious alleles accumulate and interfere with another, leading to selective interference. Previous work has quantified the effects of background selection on πT and FST only for deleterious alleles in the background selection regime. Furthermore, there is evidence to suggest that migration reduces the effects of background selection on FST, but this has not been fully explained. Here, we derive novel theory to predict the effects of migration on background selection experienced by a subpopulation and extend previous theory from the interference selection regime to make predictions in an island model. Using simulations, we show that this theory best predicts FST and πT. Moreover, we demonstrate that background selection may generate minimal increases in FST under sufficiently high migration rates, because migration reduces correlated effects on fitness over generations within subpopulations. However, we show that background selection may still cause substantial reductions in πT, particularly for metapopulations with a larger effective population size. Our work further extends the theory of background selection into structured populations, and suggests that background selection will minimally confound locus-to-locus FST scans.

Author summary

Most mutations that affect fitness incur deleterious effects and are ultimately removed via natural selection. Consequently, nearby neutral variants may also experience the effects of selection; this is termed background selection. Background selection greatly influences patterns of genetic diversity both between and within populations among virtually all extant species, and is therefore of great interest to geneticists. Previous models of background selection have been primarily restricted to populations with completely random mating. However, it is well known that most natural populations exhibit some form of spatial structure. Here, we explore the effects of background selection in spatially structured populations, and we find that migration between subpopulations may attenuate the effects of background selection acting to increase genetic differentiation among populations. We derive novel theory to account for this effect by considering that individuals with deleterious alleles may migrate out of a local subpopulation prior to being removed by the population via selection. Our work demonstrates that, when migration rates are high, background selection does not substantially influence genetic differentiation among populations. Despite this, we find that background selection may greatly decrease genetic diversity within subpopulations and in the whole metapopulation.

Introduction

Background selection is the process by which selection on deleterious alleles also affects diversity at linked sites [12]. Background selection reduces genetic diversity, as individuals carrying deleterious mutations are selected out of the population, and ultimately do not contribute to neutral diversity [1,34]. Due to the ubiquity of deleterious mutations [56], background selection occurs in all species and its effects have been estimated in several species, including humans [711], fruit flies [1214], and various other eukaryotes [15]. Given that background selection influences patterns of genetic variation and the site frequency spectrum [1618], it may bias demographic inference or inferences of other selective processes [14,19]. Thus, there is a general recognition of the effects of background selection on patterns of diversity, and efforts are underway to incorporate background selection into null models of genomic diversity [9,13,14,20].

The strength of background selection is approximated using the metric B, which can be defined as the ratio of the effective population size under background selection and the effective population size without background selection. In other words , where Ne is the effective population size with background selection, and Ne0 is the effective population size without background selection. With the correct effective population size, B predicts the genetic diversity (π) relative to that expected without background selection (π0≈4Ne0μ in a diploid population under an infinite sites model [21], where μ is the neutral per base-pair mutation rate). It is important to note that because this approximation simply rescales the effective population size, it does not fully capture all the features of background selection. In particular, it does not capture the influence of background selection on the site frequency spectrum due to distortions in genealogies [2,18,2223]. However, this approximation works reasonably well under most conditions [18].

Generally, we can think of purifying selection as introducing positive correlations between generations in deleterious allele frequency change, amplifying the variance in reproductive success (and therefore genetic drift) experienced by linked neutral sites, further reducing their Ne beyond a simple mutation-drift model [4,2426]. Individuals with deleterious mutations are less likely to have offspring, and so are their offspring as they may also carry deleterious mutations. B therefore captures the amplification of variance in reproductive success of neutral alleles due to their multigenerational association with deleterious alleles.

Most natural populations exhibit some degree of population structure [27], and this influences patterns of genetic diversity. In structured populations, FST [28] is a widely used population genetic metric among plant/animal breeders, conservationists, and others [29] to identify sites subject to spatially varying selection (in the case of FST outlier tests), the demographic histories of subpopulations, rates of local inbreeding, and for various other reasons. FST has various definitions. It can be defined as , where πS is the average proportion of pairwise differences between haplotypes within a subpopulation and πB is the average proportion of pairwise differences between haplotypes from different subpopulations [30; the same quantity is estimated by ref. [31] θ; see ref. [32]. FST can also be defined in terms of the standardized variance in allele frequencies among subpopulations, or in terms of mean coalescence times for alleles within subpopulations versus the whole population [33].

Background selection influences neutral diversity patterns in populations, [3437], and this effect varies across the genome. Variation in recombination rates, mutation rates, and the density of functionally conserved regions generate locus-to-locus variation in Ne and FST, which has been observed in human [8] and fruit fly [32] populations. Locally beneficial alleles are expected to exhibit greater FST values relative to neutrality, as they exhibit greater variance in frequency between subpopulations relative to neutral alleles. However, locus-to-locus variation in background selection can increase FST values for neutral loci as well, confounding scans for locally adapted alleles and potentially increasing the false-positive rate. Work by Matthey-Doret and Whitlock has shown this effect to be weak, particularly when mean FST values are low [37]. This work also suggested an effect of migration to increase B (reduce the effect of background selection on Ne and π). In other words, when migration rates are sufficiently high, it is possible some of the effects of background selection are not realized in a local population before the allele is lost to selection, potentially weakening the confounding effect of background selection on FST. However, this effect remains analytically unexplored. The central objective of this paper is therefore to test and improve the approximations used to predict the effects of background selection on FST and total diversity, πT, of a metapopulation.

Classical models of background selection estimated B under the assumption that the effective strength of selection was not weak (|Net|> 1, where t is the heterozygous selection coefficient in diploids) and mutations are sufficiently rare, such that the average frequency of deleterious alleles is well predicted by a deterministic mutation-selection balance [1,34]. This parameter range has become known as the background selection regime. In the background selection regime, the dynamics at deleterious loci can be assumed to be independent from one another, and genetic drift can be ignored. Background selection was first modeled assuming an infinitely large diploid population where deleterious loci are at mutation-selection balance in a non-recombining genome. In this case the fraction of deleterious mutation-free gametes could be modeled using a Poisson distribution with mean , where U is the gametic genomic deleterious mutation rate [1]. Soon after, this model was extended to consider recombination [34]. From the very beginning [1], these papers acknowledged that background selection with weaker selection would behave differently; in particular weak selection was shown to have differing effects on the number of segregating sites [38].

Later work extended our understanding of cases where selection against deleterious alleles is weak (i.e., |Net| < 1), recombination rates approach zero (r → 0), and the per base-pair mutation rate, μ, is sufficiently high such that deleterious alleles are introduced frequently and inefficiently removed by selection [11,23,26,3940]. Here, weakly deleterious alleles accumulate along the genome, and the effects of selection at a single site are influenced by selection at other linked sites, leading to the “Hill-Robertson effect” or selective interference [4041]. In this interference selection regime, assumptions of a deterministic mutation-selection balance fail to predict the quantitative effects of background selection, as deleterious alleles (|Net|< 1) exhibit stochastic fluctuations in frequency due to the weak efficacy of selection and correlations between deleterious sites. Contrary to theory from the background selection regime, models of selective interference cannot assume independence among selected sites and must consider multiple selected sites in tight linkage with one another [39,40] as well as the effects of genetic drift [42]. There has been some work to address this issue, by instead considering the total variance in fitness experienced by a neutral allele under background selection in a region under selection, circumventing the issue of disentangling individual selection pressures. For instance, it has been shown that the effects of many weakly deleterious alleles in this interference selection regime can be approximated by fewer, strongly deleterious alleles in the background selection regime that generate equal variance in fitness [23]. Similarly, other work has shown that B can be approximated for weakly deleterious alleles by considering their fixation at an appreciable rate by genetic drift, as we expect substituted mutations to no longer contribute to fitness variance [11,26] These models effectively circumvent the issue of selective interference to predict B in panmictic populations for both the background selection and interference selection regimes (see refs.[2, 43] for reviews on background selection). However, we still lack theory to make predictions about patterns of genetic diversity in structured populations under background selection using these models, and they have not been extended to consider the potential effects of migration.

In models of structured populations under background selection, the equilibrium effects of background selection on FST have been approximated by rescaling the local population size of a subpopulation by B [36]. This is intuitive, as B represents the effects of background selection on the rate of genetic drift on neutral loci, with smaller values of B corresponding to greater amounts of genetic drift; thus, a greater effect of background selection on neutral loci will lead to a reduced local effective population size, Ne,local. In a haploid two-island model, Zeng and Corcoran [36] predicted that (1) where Nlocal is the local population size, and m is the total migration rate. Note that BNlocal = Ne, local, and that this equation is consistent with the more general diploid finite island model prediction of , where d is the number of demes [32]. This approximation has been shown to work well to predict FST in the deterministic background selection regime, in cases with strong selection and low migration rates [36]. However, no previous study provides a solution to the effects of background selection on FST for weakly deleterious alleles in the interference selection regime. This represents a gap in the literature, as both the theory [44] and empirical estimates of the distribution of fitness effects (reviewed in Ref. [5]) suggest that a non-trivial portion of coding and non-coding mutations incur weakly deleterious effects on fitness (see also ref.[6]). Furthermore, we lack theory predicting the effects of migration on background selection or the effects of background selection on total diversity, πT, in structured populations. Given that spatial structure is a common feature of natural populations, we require theory to better account for its role in shaping πT.

The overall goal of this paper is therefore to better understand the effects of background selection in structured populations. We find that some previous approximations fail under common biological conditions, and we develop new approximate theory. We use simulations to test and compare the accuracy of these approximate theories in predicting πT and FST under background selection in an island model. Our simulations show that the effects of background selection from weakly deleterious alleles (in the interference selection regime) on FST are relatively weak, and that the effects of background selection on FST are attenuated by migration; with high migration, FST values approach those expected under neutrality. We derive novel background selection theory to account for this migration effect. We also combine theory about background selection from the interference selection regime in panmictic populations [11,23,26] with what is known about how background selection affects FST [36] to best estimate FST and πT in an island model for both the interference selection and background selection regimes under various strengths of selection, recombination rates, and migration rates. Lastly, we demonstrate that, while the effects of background selection from weakly deleterious alleles on FST are weak, background selection from weakly deleterious alleles can substantially reduce total πT in a large metapopulation, and that we can predict this effect reasonably well. Our work extends the theory of background selection from panmictic populations to structured populations and lends further support for an insignificant confounding role of background selection in FST scans for populations where FST is relatively low on average. Our results also suggest a potentially under-appreciated role for background selection from weakly deleterious alleles in reducing total diversity in sizable metapopulations.

Theory

Zeng and Corcoran [36] suggested that the effects of FST in an island model would be predictable from modifying the effective size of a subpopulation, Nlocal, by multiplying times B, the Ne/N ratio created by background selection. We show here that migration lessens the effects of background selection on the amount of genetic differentiation caused by local drift relative to predictions of B based on earlier work (e.g. Ref [4]). Then, we derive the expected FST in an island model with background selection and a finite number of demes, when migration follows genetic drift.

Derivation of B with ‘migration effect’

In this section, we extend the quantitative genetic approach to modeling background selection, following the framework of Santiago and Caballero [2426] and Buffalo and Kern [11], to account for how migration influences the effects of background selection, B, in an island model consisting of diploid individuals. In brief, these models consider how the additive genetic fitness variance experienced by a neutral allele due to its associations with deleterious alleles further increase its variance in reproductive success, therefore reducing its effective population size beyond that of a mutation-drift model. Given that this additional increase in the variance of reproductive success of neutral alleles is due to selection at linked sites, this has been termed the fitness-effective population size, Nf [11]. First, we describe the framework under panmixia; then, we extend it to predict the geographically local effects of background selection in an island model, Blocal (where Blocal = ), by deriving its local fitness-effective population size Nf, local. [24,45].

In a panmictic population, Santiago & Caballero [2526] suggest that, in order to understand the effects of background selection on Nf at a neutral allele, we must consider both the magnitude of additive genetic fitness variance created by associated deleterious alleles, VA, as well as the cumulative effects of this variance on the reproductive success of a neutral allele over time, estimated by the linkage inflation factor Q2. The product of these two, VAQ2, therefore represents the total variance in reproductive success over time experienced by a neutral allele due to the effects of purifying selection at linked sites. This is summarized by the following equation:

Assuming a multiplicative polygenic basis to fitness, we can approximate Nf at a neutral site linked to S regions under selection as NfN . Here, we follow Buffalo and Kern [11] and ignore short-lived associations of neutral sites with sites on homologous chromosomes, hence the division by 2.

For our derivation, we now consider an island model with d subpopulations, each with Nlocal diploid individuals. Deleterious mutations appear at site i with per-base pair mutation rate μi and heterozygous selection coefficient ti. Recombination between the focal neutral site and deleterious allele i occurs at rate ri. In order to predict the effects of background selection on FST in an island model, we now consider the effects of background selection on the fitness-effective population size of each geographically local subpopulation, Nf, local.

First, we will solve for Qi2, the linkage inflation factor. Qi2 captures the cross-generational associations between a focal neutral site and deleterious site i within a focal subpopulation. For now, we will consider the case of strong selection; the estimation of Qi2 is later extended to address weak selection. The magnitude of Qi2 is contingent on the heterozygous selection coefficient of the deleterious allele, ti, and the recombination rate between the neutral and deleterious allele, ri. This is intuitive, as we expect the association between a neutral and deleterious allele within a focal subpopulation to be ended if the individual is selected out of the population or if the neutral allele recombines off of the background of the deleterious allele. Here, we account for the fact that the association between a neutral and deleterious allele also ends within a focal subpopulation if the individual carrying the haplotype emigrates out of the subpopulation, which will occur with rate m. Thus, the autocorrelation function for a neutral allele associated with deleterious allele i over time is , where τ is the number of generations. In order to estimate the net effect of this autocorrelation, , we sum its effects over time:

Next, we turn to solving VA,i, the additive genetic variance contributed by deleterious alleles in segment i. Here, we simply follow the results of Buffalo and Kern [11], who show that VA,i is well approximated in a region of Li selected sites each with heterozygous selection coefficient ti and mutation rate μi under mutation-selection balance to be VA,iUit, where Ui = 2Liμi is the total mutation rate in the segment. Thus, for S selected segments, we get the following equation for Nf,local:

As pointed out by previous work, classic background selection theory assuming mutation-selection balance only serves as a reasonable approximation under strong selection and/or weak mutation (see ref. [23]). When alleles are weakly deleterious, we expect genetic drift to substantially affect their frequency dynamics such that they are no longer well predicted by classic mutation-selection balance theory. Santiago and Caballero [2526] demonstrated that Nf can be well approximated under weak selection by considering that some fraction of weakly deleterious alleles fix due to genetic drift, thus no longer contributing to the additive genetic variance, VA, experienced by linked neutral alleles.

Under this weak selection regime in diploids, where deleterious alleles have a non-trivial rate of fixation, VA may be approximated as VA = (U−2R)t [11], where R is the rate of fixation of deleterious alleles (the factor of two is due to our modeling of diploids). R is estimated by considering the probability of fixation of deleterious alleles [46], pfix, as well as their rate of introduction to the population, NU, the product of the population size and diploid genomic deleterious mutation rate. Thus, we get the following system of non-linear equations, which can be numerically solved for Nf,local and R: (2) (3)

Under weak selection, we must also modify Q2 to account for the loss of additive genetic variance [11, 26]. Under the strong selection regime, it was assumed that the rate of decay in association between a focal neutral allele and linked deleterious alleles due to reductions in fitness variance from selection was equal to the heterozygous selection coefficient t. This is derived by considering that the factor estimating the decay in fitness variance due to selection, Z, is estimated as , where VM is the mutational variance. A more in-depth explanation is provided in the supplemental materials of Buffalo and Kern [11]. Briefly, Buffalo and Kern [11] demonstrate that VMUt2 and VA = (U−2R)t. In the case of strong selection, where R ≅ 0, we recover , matching the term described in the original, strong selection derivation of Q2 above. Under weak selection, where R > 0, we instead get . Thus, more generally, . Buffalo & Kern [11] use a continuous approximation to derive Q2 over a segment under selection of map length Mi. We also follow this method with addition of our (1−m) term, finding that

(cf. ref.[11], supplement Eq 33), where Mi is the map length in morgans. This derivation relies on a few assumptions: ri is small such that it is additive over sites, ti is constant over the region, and the neutral site is embedded in the center of the region.

For our predictions of π and FST in an island model, we numerically solve for Nf,local and Rlocal for our parameters of interest to get Blocal (), thus allowing us to make predictions across both weak and strong deleterious selection coefficients. One caveat to this model, however, is the lack of an explicit treatment of the effects of interference. Under weak selection and low recombination, we expect correlations in frequency to arise between segregating deleterious variation, leading to selective interference among deleterious alleles. Here, selection at a deleterious site is no longer independent of selection at nearby sites. Instead, deleterious sites themselves also experience a reduction in fitness-effective population size due to selection at linked sites. To account for this, we follow the methodology of Buffalo & Kern [11] and rescale Nlocal at a region x as Nlocal,x = Blocal,xN. We then use this re-scaled Nlocal,x instead of Nlocal to numerically solve for Nf,local and R.

Santiago and Caballero [26] showed that the effective population size for calculating fixation rates (“fixation effective size”, approximated by Nf) with background selection is different from the effective population size for predicting neutral diversity (“heterozygosity effective size”, NeH). To predict π, we must slightly modify the above theory to instead estimate NeH. We may do so by considering that it is not the asymptotic linkage inflation factor, , that is most relevant to prediction of metrics of genetic diversity [11 supp., 26] Instead, at equilibrium, there is variation in the time in which neutral alleles have arisen in the past and therefore the degree to which they have experienced the effect of background selection. Thus, we expect NeH to be less than Nlocal but greater than Nf, the effective population size estimated from using . More details may be found in the Methods.

Prediction of FST

Previous work on background selection in structured populations has demonstrated that, in some cases, the effects of background selection in an island model can be approximated as a reduction in local effective population sizes proportional to the strength of background selection [34, 36]. In order to estimate and predict FST, we use the definition of Hudson et al. [30], where

We show in Eq A4 in S1 Appendix that, using the neutral expectation of FST in an island model, predicted FST under background selection is: (4)

Here, m is the total proportion of immigrants into a deme per generation, d is the number of demes, and Blocal is a measure of background selection experienced by the subpopulation. Our equation differs from that of Ref. [32] because, in SLiM (the evolutionary simulation framework used; ref. [47]), the order of events is such that offspring production occurs first (genetic drift) followed by migration and then FST estimation [48], and because we consider terms that are m2 or higher.

We note that there are multiple definitions of FST that have been previously used. For instance, the equation used to estimate FST in our simulations uses the ratio of πS and πB, but an alternative definition uses the ratio of πS and πT, the total population diversity. This is equivalent to GST, and is defined as:

[49]. Using the neutral expectation, GST in a finite island model experiencing background selection with migration following drift and then FST estimation is derived in Eq A5 in S1 Appendix to be: (5)

Results

We use a simulation-based approach to investigate the accuracy of our theoretical derivation of background selection in the interference selection and background selection regimes under varying migration rates. In SLiM v3.7.1 [47], we run simulations of a metapopulation using an island model consisting of Nglobal diploid individuals split equally into d demes. Population size is held constant, and generations are non-overlapping. Each generation, a total fraction of m individuals in each deme are replaced by immigrants, equally partitioned from each other deme. We simulate a genomic region composed of Lneutral neutral sites, surrounded on each side by Lselected /2 selected sites for a total of Lselected sites subject to deleterious mutation. Recombination occurs between the neutral region and selected sites and between selected sites at total rate M (denoting the map length). Deleterious mutations are introduced to gametes at fixed rate U, with fixed selection coefficients against heterozygotes t. Fitness is multiplicative across sites and additive within sites. More details are given in the Methods section.

Background selection in structured populations and prediction by previous models

For our high migration cases (Fig 1A and 1B), FST under neutrality was 0.004, representing populations with extremely low genetic differentiation and high gene flow. For our intermediate migration cases (Fig 1C and 1D), FST under neutrality was ~0.05, representing populations with moderate genetic differentiation. For our low migration cases (Fig 1E and 1F), FST under neutrality was ~0.50, thus representing populations with very high genetic differentiation and low gene flow. Hereafter, we consider deleterious alleles with Nlocalt ≤ 1 to be weakly deleterious, 1 < Nlocalt < 7.5 to be moderately deleterious, and Nlocalt ≥ 7.5 to be strongly deleterious. Given previous work on the theory of background selection in structured populations [36], we expected the greatest increase in FST due to background selection at moderate strengths of selection. This is intuitive, as we expect for strongly deleterious mutations to be rapidly purged from the population, thus generating weak total variance in fitness due to their short duration in the population. Weakly deleterious mutations, on the other hand, may persist for many more generations in the population, but simply generate little total variance in fitness due to their individually weak selective effects. Moderately deleterious alleles, however, generate moderate variance in fitness and are not rapidly purged from the population; thus, we expected an intermediate maximum of FST with respect to strength of selection against deleterious alleles–in other words, an inverted U-shaped curve.

thumbnail
Fig 1. Accuracy of various methods in predicting FST with respect to the scaled heterozygous strength of selection, Nlocalt.

Forward-time simulations run with (A) Nlocalm = 45 and M = 0, (B) Nlocalm = 45 and M = 0.01, (C) Nlocalm = 4.5 and M = 0, (D) Nlocalm = 4.5 and M = 0, (E) Nlocalm = 0.225 and M = 0, and (F) Nlocalm = 0.225 and M = 0.01. The gray dots connected by a line represent FST values from forward-time simulations, with 95% CIs. (These were quite small, and not always visible.) FST was calculated using Eq 4, which requires a value for Blocal. The pink dots represent theoretical predictions of Blocal from our novel derivation accounting for the effects of migration (Eq 2) with re-scaling of Nlocal to approximate the effects of selective interference, and the purple dots represent theoretical predictions of Blocal from classic background selection theory (Eq 7 in our Method section; refs. [4, 36]). The insets show the performance of classic background selection theory to structured populations following ref. [36] for all simulated Net. All parameter combinations were simulated 750 times in a 10-deme island model with local population size Nlocal = 500, locally scaled selection coefficients Nlocalt = {0, 0.25, 0.5, 1, 2.5, 5, 7.5, 15}, Nlocalm = {45, 4.5, 0.225}, U = 7x10-3, Lselected = 700, and for 250,000 generations. In many cases (especially in part D), the plot points for the simulations (gray dots) are hidden behind the points shown for theoretical expectations. Note the different scales on the y-axis between plots.

https://doi.org/10.1371/journal.pgen.1011225.g001

Indeed, we observed an inverted U-shape curve of FST under background selection, but only for our low migration cases (Fig 1E and 1F), similar to the effect on overall diversity from background selection [50]. We later discuss the intermediate and high migration cases. Considering only our low migration cases, we observed that the effects of background selection on FST from weakly deleterious alleles are relatively weak (Fig 1E and 1F, gray dots when Nlocalt ≤ 1). As expected, we also observed that the effect of background selection on FST is attenuated by recombination (cf. Fig 1E to 1F); recombination introduces the opportunity for a neutral allele to recombine off of the background of a deleterious allele, thus rescuing it from eventual loss via linked selection.

As expected, classic theory from the background selection regime (refs. [1,34]; Fig 1, purple dots) substantially overestimates the effects of background selection (underestimates B) for weakly deleterious alleles (see ref. [23]) and therefore overestimates the effects on FST [36], with the discrepancy increasing for weaker strengths of selection (Fig 1 insets, cf. purple to gray dots). As t approaches 0, the classical formula for B inaccurately predicts deleterious allele frequencies, and indeed those papers [1,34] deriving the predictions are clear that the derivation applies to stronger effect mutations. The discrepancy between observation and prediction, however, is somewhat attenuated in the M = 0.01 cases (cf. Fig 1E inset to Fig 1F inset), presumably due to recombination reducing the magnitude of selective interference. Nevertheless, classic theory poorly predicts the effects of background selection on FST for weakly deleterious alleles, as expected.

On the other hand, our novel derivation of the effects of background selection (Fig 1, pink dots) estimates FST well when alleles are weakly deleterious. However, the theory does not accurately predict FST when Nlocalt ≈ 2.5 (Fig 1E and 1F). Our inability to predict B in this regime previously been pointed out by Buffalo and Kern [11], and the accurate estimation of B when 2Net ≈ 1 remains an outstanding problem in the background selection literature. Thus, we expect some inaccuracy from the theory here.

Next, we turn to our case of high migration rates. Here, we observe no effects of background selection on FST for all simulated values of Nlocalt (Fig 1A and 1B). This lack of effect of background selection on FST under higher rates of migration has been observed previously [37]. However, classic background selection theory still predicts an exponential decline in FST with respect to Nlocalt (Fig 1A and 1B, purple dots). Additionally, note that background selection theory with consideration of interference but not migration (following the method of ref. [23]) erroneously predicts an inverted U-shaped curve with respect to Nlocalt (see Fig A in S1 Text green dots) under high migration rates.

In our intermediate migration case (Fig 1C and 1D), we observe some elevation of FST at moderate strengths of selection, although quite weak relative to our low migration case due to the effects of moderate levels of migration (cf. Fig 1C and 1D to Fig 1E and 1F). Our theoretical derivation of Blocal is also able to predict this effect well. In our derivation above, we hypothesized that migration acts to reduce the effect of background selection by presenting a neutral allele the opportunity to end its association with deleterious alleles within a subpopulation, thus reducing its total variance in fitness within the focal subpopulation. We further discuss these predictions below.

‘Migration effect’ theory accurately predicts the effects of background selection on FST

The application of our derivation of Blocal with the ‘migration effect’ (Fig 1, pink dots) accurately predicts FST under various fixed strengths of selection, recombination rates (Fig 1), and migration rates (Fig 2). Moreover, we find that this accuracy extends well into the interference selection regime, predicting FST well under lower strengths of selection. As expected, the ‘migration effect’ is particularly prominent at higher migration rates. Under our higher migration rate parameter set (Fig 1A and 1B), we observed no effect of background selection on FST, despite marked reductions in πT (Fig 3A and 3B). Thus, migration appears to attenuate the effects of background selection by exporting deleterious alleles to different subpopulations prior to their removal via selection, resulting in similar reductions in πT and πS such that FST is not elevated. In contrast, with low migration, background selection occurs primarily within subpopulations to reduce πS, with some effect on πT [35,37].

thumbnail
Fig 2. Inclusion and exclusion of migration effect in estimation of FST under background selection.

Forward-time simulations under varying rates of migration for (A) M = 0 and (B) M = 0.01. The gray dots represent observed FST and are connected by a line. The pink dots represent predictions from using our novel derivation of Blocal (Eq 2) to estimate FST (Eq 4), the turquoise dots represent predictions of Blocal without accounting for the effects of migration (Eq 2; m = 0), and the dark blue dots are FST from simulations run without selection (representing FST under mutation-drift balance). The insets compare the simulations run with and without selection. Note the different scales for the x-axis on the main figures and insets. All simulations were run 750 times in a 10-deme island model with Nlocal = 500, U = 7x10-3, Nlocalt = 7.5, Lselected = 700, Nlocalm = {0.5, 1.5, 5, 7.5, 10, 15, 25}, and for 250,000 generations.

https://doi.org/10.1371/journal.pgen.1011225.g002

thumbnail
Fig 3. Predicted πT versus observed πT in a metapopulation.

Forward-time simulations run with (A) Nlocalm = 45 and M = 0, (B) Nlocalm = 45 and M = 0.01, (C) Nlocalm = 4.5 and M = 0, (D) Nlocalm = 4.5 and M = 0, (E) Nlocalm = 0.225 and M = 0, and (F) Nlocalm = 0.225 and M = 0.01. The gray dots represent πT from forward-time simulations using a 10-deme island model with Nlocal = 500, Nlocalt = {0, 0.25, 0.5, 1, 2.5, 5, 7.5, 15}, Nlocalm = {45, 4.5, 0.225}, and are connected by a line. The pink dots represent theoretical predictions of total diversity (see Methods) using predictions of Blocal with incorporation of the migration effect (Eq 2) to estimate GST, (Eq 5), Ne, global, and therefore πT.

https://doi.org/10.1371/journal.pgen.1011225.g003

We explored the attenuating effect of background selection on FST across multiple migration rates (Fig 2). We find that incorporation of the migration effect to estimate the geographically local effects of background selection, Blocal, predicts FST more accurately than our estimates without incorporation of the migration effect (cf. Fig 2 pink & turquoise dots). For migration rates of m = 0.01 and higher, we also find that FST values strongly resemble FST values expected under no background selection (Fig 2 inset, cf. gray to dark blue dots). Thus, our results suggest that the alignment of observed FST value to neutral prediction at low FST values may not be sufficient to predict the absence of background selection in empirical systems, as it may otherwise be explained by the presence of both background selection and high migration rates. Regardless, our theoretical predictions further support the expectation of background selection weakly confounding locus-to-locus FST scans when gene flow is sufficiently common [37]. Our simulations suggest that this is true in an island model when mt (Fig 2 inset).

Background selection may substantially reduce πT

In this section we highlight some general effects of background selection on genetic diversity, π, in our simulations and discuss the performance of our theoretical predictions. We found that background selection can cause substantial reductions in πT (Fig 3), despite weak effects on FST. Under our parameter sets, this effect is most pronounced for lower strengths of selection against deleterious alleles under both low and high migration parameter sets, and it is weakly influenced by the introduction of low rates of recombination. Previous work has suggested that selective interference is more common in larger populations (reviewed by Ref. [51]); given that Ne, global is on the order of 104 in our simulations, this is likely the case. We investigated this further and found that this agrees with predictions under previous background selection theory derived for the interference selection regime [23]; this theory predicts a greater reduction in B at lower strengths of selection for increasing population sizes (S1 Fig).

We find that the effects of background selection on πS and πT are dependent on the migration rate, which is to be expected given that migration influences FST both directly and through its effects on background selection, and that FST influences the effective population size of the metapopulation [27] (see Methods Calculation of FST). Under our low migration rate parameter set, we found that πS was reduced by a fractionally larger margin than πT (cf. Fig 3 to S2 Fig), resulting in the elevation of FST. However, in our high migration rate parameter set, πS and πT were similarly reduced for all simulated values of Nglobalt. For instance, we observed that, at Nglobalt = 50 with m = 0.00045 and M = 0, πT was reduced by a fraction of ~0.30 and πS by a fraction of ~0.45. At Nglobalt = 50 with m = 0.09 and M = 0, on the other hand, πT and πS were both reduced by a fraction of ~0.47. Overall, the results from our parameter sets suggest that background selection may cause substantial reductions in πT compared to neutrality, even despite having no effect on FST (cf. Fig 1A and 1B to Fig 3A and 3B).

Moreover, we demonstrate that we are able to predict the effects of background selection on πT with reasonable accuracy under strong selection (Fig 3, Nglobalt ≥ 50). Under lower strengths of selection, Nglobalt ≤ 50, however, our theoretical derivation overpredicts both πT and πS (S2 Fig). We considered that this may be either due to inaccuracy of the theory when interference is common, which is expected to be substantial under smaller t in our parameter set, or an unconsidered artifact of population structure. To investigate this, we ran panmictic simulations with N = {103, 104} and M = {0, 0.01} (S3 Fig), and found that the theory also poorly predicts π in a panmictic population under smaller strengths of selection t for when N = 104 and M = 0 (S3C Fig), parameters under which we expect the relatively greatest amounts of interference. On the other hand, the theory more accurately predicts π under weaker strengths of selection t for N = 1000 and M = 0.01 (S3D Fig), for which we expect the relatively lowest amounts of interference in our set of parameters. Thus, we have reasonable evidence to believe that inaccurate predictions of πT under Nglobalt ≤ 50 are due to inaccuracy in the theory in accounting for the effects of interference, which has been previously noted [11].

For all of the simulations presented above, we also test predictions of the interference selection model of Good et al. [23], modified to consider the effects of migration (S1 Text). Overall, we find that the Good et al. [23] model performs similarly to the quantitative genetic model presented in the main text.

Discussion

Prior to this work, models of background selection have been restricted to the case of populations that are panmictic [1,34,2326] or have low migration rates [2,3536]. This represents a substantial gap in the literature, as we expect many, if not most, natural populations to exhibit low to moderate spatial subdivision. Here, we have extended quantitative genetic models of background selection [11,2526] to account for the effects of migration in an island model, showing that migration, in addition to selection and recombination, acts to end the cross-generational association between a neutral allele and linked deleterious alleles. A useful property of these models is that predictions for B may be made under weak selection, by considering that weakly deleterious alleles have an appreciable rate of fixation via genetic drift. We tested the predictions of our novel derivation using simulations and found that it best predicts FST and πT under varying strengths of selection, recombination rates, and migration rates. Our work provides a mechanistic explanation for the previously observed lack of effect of background selection on FST under high migration rates [37], and highlights parameter space under which we expect background selection theory to be inaccurate.

Our work suggests that the relative importance of the ‘migration effect’ in influencing Blocal depends on the magnitude of the heterozygous selection coefficient, t, and the migration rate, m. In cases where t >> m, we expect the effect of migration to be relatively weak, as selection will likely act to remove deleterious variants prior to their emigration from a population. Under this relatively low migration regime, we expect predictions of background selection to be accurate even without consideration of the migration effect, given that migration is sufficiently infrequent. This is indeed what has been previously observed. For instance, Zeng & Corcoran [36] show that they are able to predict FST well using classic background selection theory in simulations with Nlocalm = {0.05, 0.25} and Nlocalt = {50, 15}. This is also aptly described by Charlesworth et al. [35], where it is stated that the effects of background selection on FST are primarily through reductions in diversity, πS, within populations; this conclusion was evidenced by simulations with Nlocalm = 1. In cases where migration rates are comparable to or stronger than the strength of selection against deleterious alleles, that is, mt, we expect deleterious alleles to emigrate from local populations at an appreciable rate prior to their removal via selection. Under this regime of non-trivial migration rates, we expect the effects of background selection on FST to be significantly attenuated, as we expect greater similarity in reductions in πS and πT.

We now highlight some limitations of our study and avenues for future work. Our study only looked at an island model. However, the effects of background selection on FST and πT remain unexplored for other population subdivision models, such as ‘isolation-by-distance’ models with varying dimensionality. We know that FST interacts with global Ne differently for non-island model demographics [27]. Additionally, the modulating effect of migration on the effect of background selection and FST remains unexplored in these other population subdivision models. Thus, we caution against the application of our model to more complex demographic scenarios. We also use various other simplifying assumptions, including fixed selection coefficients and a simple genomic architecture. We did not explore the effects of varying deleterious selection coefficients on B, but it is possible that the combination of both weakly and moderately deleterious alleles may result in more complex dynamics of background selection. Furthermore, we do not modify our theory to be applied to a continuous distribution of selection coefficients. However, this problem is addressed by the work of Santiago & Caballero [26], which may be extended to our present model. Additionally, we note that our model is restricted to the case of semi-dominance of deleterious mutations. Under low recombination regions with substantial selective interference and recessivity of deleterious mutations, there is evidence that associative overdominance may act oppositely to background selection, instead leading to increased genetic diversity [5253].

In summary, we have provided a novel theoretical derivation of the effects of background selection in an island model; this model accounts for the effects of migration providing an opportunity to lessen the effects of background selection by truncating cross-generational associations between neutral and deleterious alleles in their reproductive success, and presents a first step into the application of background selection theory in systems with more complex demography. We test the accuracy of the theory under various parameter sets, demonstrating that it predicts FST accurately across a wide range of parameters and predicts πT well when we expect selective interference to be weak.

Methods

Simulations

Our primary objective was to study how well the approximations used in the theory of background selection allow the theory to make accurate predictions of B, and to identify parameter regimes where the theory is less accurate. Thus, although the parameter sets used for this study were biologically relevant, in each simulation run the selection coefficient was held fixed and our parameters were not representative of any one study system nor a particular gene model.

We ran simulations in SLiM v3.7.1 [47] using an island model consisting of 10 demes with a local diploid population size, Nlocal = 500 individuals, for a total of Nglobal = 5,000 individuals. To delineate the effects of background selection on FST and πT under varying strengths of selection, we ran simulations under the following parameter set: Nlocalt = {0, 0.25, 0.5, 1, 2.5, 5, 7.5, 15}, μ = 10−5, U = 7x10-3, r = {0, 1.414x10-5}, M = {0, 0.01}, and m = {0.00045, 0.009, 0.09}, t is the selection coefficient against heterozygotes with a deleterious allele, μ is the deleterious mutation rate per base-pair, U is the total gametic deleterious mutation rate, r is the per base-pair recombination rate, M is the total recombination rate (in Morgans), and m is the total migration rate (the fraction of individuals in each deme that are replaced by immigrants from all other demes). To delineate the effects of varying migration rates on the effect of background selection on FST, we also ran simulations under the following parameter set: Nlocalt = 7.5, U = 7x10-3, M = {0, 0.01}, and m = {0.001, 0.003, 0.01, 0.015, 0.02, 0.03, 0.05}.

Fitness effects were multiplicative, such that wi = (1—t)k, where wi is the survival probability of individual i, t is the selection coefficient against heterozygotes, and k is the number of deleterious alleles carried by the individual. The genomic region consisted of Lneutral = 104 neutral sites embedded in a region surrounded on each side by 350 sites subject to deleterious mutation, adding up to a total of Lselected = 700 selected sites. This is not intended to reflect the neutral mutation rate of a particular gene model. Rather, we chose to simulate a large neutral region to minimize measurement error in our estimation of π, allowing us to draw more accurate comparison to theoretical predictions.

Our simulated region is equivalent to a region consisting of a number of sites under selection upon which recombination occurred at total rate M, with a non-recombining neutral site embedded in the center, similar to genomic regions used in previous theoretic work (e.g. refs. [3, 23, 36]. Each simulation was run 750 times to derive 95% CIs and for 250,000 (50Nglobal) generations to achieve equilibrium. Data from the simulations were saved as tree sequence files, and neutral mutations were overlaid using msprime. Tree sequence data was analyzed using the pyslim and tskit packages in Python 3.9.12 to output FST, πS, and πT.

Calculation of FST

FST in the simulations was estimated as:

[30,54], where πS is the average proportion of pairwise differences between haplotypes sampled within a deme at a neutral locus and πB is the average proportion of pairwise differences between haplotypes sampled between demes at a neutral locus. We filtered neutral alleles for which the minor allele frequency was less than 0.05 prior to calculating FST.

Calculation of B

For the case where deleterious mutations behave deterministically (i.e. the background selection regime), Nordborg et al. [4] predict B: (6)

For a neutral mutation embedded in a large region of Lselected sites subject to deleterious mutation with fixed t and ri and being sufficiently small that it is additive over contiguous sites, we may use a continuous approximation for a region under selection, yielding the following formula: (7)

[3], where U is the gametic deleterious mutation rate for the region, t is the heterozygous strength of selection against deleterious alleles, and M = riLselected is the total recombination rate. The above models assume that genetic drift is insignificant; thus, B = Blocal for Eq 6 and Eq 7.

To predict πlocal instead of Nf,local, (i.e. , where πlocal is the geographically local estimate of diversity in a subpopulation experiencing background selection and π0,local is the neutral expectation) under our quantitative genetic model of background selection, we must consider that neutral alleles that contribute to diversity patterns arose at different times in the past, and have therefore experienced different degrees of the effects of background selection. An in-depth explanation may be found in ref. [26] but we describe this phenomenon in brief below.

Neutral mutations that have occurred further back in time are expected to have more strongly experienced reductions in Nf,local due to greater accumulation of autocorrelation in variance in reproductive success with deleterious alleles. On the other hand, we expect newer neutral alleles to experience weaker reductions in Nf, local due to the relatively short time they are likely to have been associated with deleterious alleles. Thus, it is incorrect to use the asymptotic linkage inflation factor, , to estimate πlocal at equilibrium. Instead, we must consider that Q is a function of time, and that not all neutral mutations in association with deleterious alleles have arrived at their asymptotic limit, , with regards to the effects of background selection. Once we have accounted for variation in the time of linkage between existing neutral and deleterious variants, we are able to estimate the relevant effective population size for estimates of πlocal, termed the local heterozygosity-effective population size NeH,local [26].

The methodology is as follows. First, we find that . Then, we estimate the local fitness-effective population size of neutral sites that arose τ generations in the past: Nf,local,i(τ) = N ], for neutral sites arising from τ = 0 to τ = 10Nlocal generations in the past; this is somewhat arbitrary, but we follow the convention of [11]. Then, we are able to estimate Blocal to predict πlocal as [11,26]. Given that we are considering variation in the number of generations of autocorrelation between neutral and deleterious alleles instead of using the asymptotic limit, this quantity is greater than the asymptotic fitness-effective population size, Nf,local.

It has been noted that the use of the asymptotic linkage factor, (and therefore Nf), to predict diversity is reasonable under most human parameters [11], but we expect the deviation in prediction of Blocal using to be greatest under weak selection and tight linkage [26], a common feature of our simulations. These are parameters under which we expect the longest cross-generation associations between neutral and deleterious alleles, as there is a lower rate of association-ending events. Thus, we opt for the more precise estimate of NeH for the purposes of our predictions of π for our simulations.

Calculation and prediction of πT

In order to calculate πT in a metapopulation, we must consider its global effective population size, Ne, global. Ne, global is directly influenced by the degree of population structure, and, in the case of an island model, may be estimated as [27,55]. GST is the appropriate measure of population structure here, because it is standardized by total diversity due to its definitional use of πT as required by the theory [27]. For our predictions of πT under background selection, we calculated GST, predicted using background selection theory (see S1 Appendix) and then computed Ne, global. Using Ne, global, we were then able to calculate the global effect of background selection, Bglobal, on πT in the metapopulation. Then, we calculated predicted total diversity πT, pred using the tskit package in Python 3.9.12 as:

[56].

Supporting information

S1 Appendix. FST with a finite number of demes and migration immediately preceding measurement.

https://doi.org/10.1371/journal.pgen.1011225.s001

(PDF)

S1 Text. Derivation of B with migration effect and application to the Good et al. [23] method.

https://doi.org/10.1371/journal.pgen.1011225.s002

(PDF)

S1 Fig. Predicted B in a panmictic population according to the Good et al. [23] method.

B was computed for haploid deleterious selection coefficients of s = {0.0005, 0.001, 0.002, 0.005, 0.01, 0.015, 0.03}, haploid population sizes of N = {100, 1000, 10000, 100000}, U = 7x10-3, and M = 0. Here we see that, as N increases, the theory predicts greater reductions in B for weaker selection coefficients (s ≤ 0.005).

https://doi.org/10.1371/journal.pgen.1011225.s003

(TIFF)

S2 Fig. Predicted πS versus observed πS in a metapopulation.

Forward-time simulations run with (A) Nlocalm = 45 and M = 0, (B) Nlocalm = 45 and M = 0.01, (C) Nlocalm = 4.5 and M = 0, (D) Nlocalm = 4.5 and M = 0, (E) Nlocalm = 0.225 and M = 0, and (F) Nlocalm = 0.225 and M = 0.01. The gray dots represent πS from forward-time simulations using a 10-deme island model with Nlocal = 500, Nlocalt = {0, 0.25, 0.5, 1, 2.5, 5, 7.5, 15}, m = {5x10-5, 1x10-2}, and are connected by a line. The pink dots represent theoretical predictions of total diversity (see Methods) using predictions of Blocal with incorporation of the migration effect (Eq 2) to estimate GST, (Eq 5), Ne, global, and therefore πS.

https://doi.org/10.1371/journal.pgen.1011225.s004

(TIFF)

S3 Fig. Predictions for π in panmictic populations from Buffalo & Kern [11] and Hudson and Kaplan [3].

Forward-time simulations of a panmictic population run with (A) N = 1000 and M = 0, (B) N = 1000 and M = 0.01, (C) N = 10 000 and M = 0, (D) N = 10 000 and M = 0.01, The gray dots represent observed genetic diversity, π, from our panmictic simulations. The pink dots represent predicted π using the model of Buffalo & Kern (2024) (Eq 2), equivalent to our model when m = 0, and the purple dots represent predicted π using classic B (Eq 7).

https://doi.org/10.1371/journal.pgen.1011225.s005

(TIFF)

Acknowledgments

We thank Tom Booker, Tyler Kent, Brian Charlesworth, and Tianlin Duan for helpful discussions and comments. Simulations for this study were conducted using the Zoology Computing Cluster supported by the Zoology Computing Unit at the University of British Columbia.

References

  1. 1. Charlesworth B, Morgan MT, Charlesworth D. The effect of deleterious mutations on neutral molecular variation. Genetics. 1993;134(4):1289–303. pmid:8375663
  2. 2. Charlesworth B, Jensen JD. Effects of selection at linked sites on patterns of genetic variability. Annu Rev Ecol Evol Syst. 2021 Nov;52(1):177–97. pmid:37089401
  3. 3. Hudson RR, Kaplan NL. Deleterious background selection with recombination. Genetics. 1995 Dec;141(4):1605–17. pmid:8601498
  4. 4. Nordborg M, Charlesworth B, Charlesworth D. The effect of recombination on background selection. Genet Res. 1996 Apr;67(2):159–74. pmid:8801188
  5. 5. Eyre-Walker A, Keightley PD. The distribution of fitness effects of new mutations. Nat Rev Genet. 2007 Aug;8(8):610–8. pmid:17637733
  6. 6. Kim BY, Huber CD, and Lohmueller KE. Inference of the distribution of selection coefficients for new nonsynonymous mutations using large samples. Genetics 2017 206(1):345–361. pmid:28249985
  7. 7. McVicker G, Gordon D, Davis C, Green P. Widespread genomic signatures of natural selection in hominid evolution. PLoS Genet. 2009 May;5(5):e1000471. pmid:19424416
  8. 8. Torres R, Szpiech ZA, Hernandez RD. Human demographic history has amplified the effects of background selection across the genome. PLoS Genet. 2018 Jun;14(6):e1007387. pmid:29912945
  9. 9. Murphy DA, Elyashiv E, Amster G, Sella G. Broad-scale variation in human genetic diversity levels is predicted by purifying selection on coding and non-coding elements. Elife [Internet]. 2023 Jun 23;12. Available from: pmid:36196994
  10. 10. Pouyet F, Aeschbacher S, Thiéry A, Excoffier L. Background selection and biased gene conversion affect more than 95% of the human genome and bias demographic inferences. Elife [Internet]. 2018 Aug 23;7. Available from: pmid:30125248
  11. 11. Buffalo V, Kern AD. A quantitative genetic model of background selection in 719 humans. PLoS Genet. 2024 Mar;20(3):e1011144.
  12. 12. Charlesworth B. Background selection and patterns of genetic diversity in Drosophila melanogaster. Genet Res. 1996 Oct;68(2):131–49. pmid:8940902
  13. 13. Comeron JM. Background selection as baseline for nucleotide variation across the Drosophila genome. PLoS Genet. 2014 Jun;10(6):e1004434. pmid:24968283
  14. 14. Johri P, Charlesworth B, Jensen JD. Toward an Evolutionarily Appropriate Null Model: Jointly Inferring Demography and Purifying Selection. Genetics. 2020 May;215(1):173–92. pmid:32152045
  15. 15. Corbett-Detig RB, Hartl DL, Sackton TB. Natural selection constrains neutral diversity across a wide range of species. PLoS Biol. 2015 Apr;13(4):e1002112. pmid:25859758
  16. 16. Nicolaisen LE, Desai MM. Distortions in genealogies due to purifying selection. Mol Biol Evol. 2012 Nov;29(11):3589–600. pmid:22729750
  17. 17. Nicolaisen LE, Desai MM. Distortions in genealogies due to purifying selection and recombination. Genetics. 2013 Sep;195(1):221–30. pmid:23821597
  18. 18. Cvijović I, Good BH, Desai MM. The Effect of Strong Purifying Selection on Genetic Diversity. Genetics. 2018 Aug;209(4):1235–78. pmid:29844134
  19. 19. Ewing GB, Jensen JD. The consequences of not accounting for background selection in demographic inference. Mol Ecol. 2016 Jan;25(1):135–41. pmid:26394805
  20. 20. Cutter AD, Payseur BA. Genomic signatures of selection at linked sites: unifying the disparity among species. Nat Rev Genet. 2013 Apr;14(4):262–74. pmid:23478346
  21. 21. Kimura M. Theoretical foundation of population genetics at the molecular level. Theor Popul Biol. 1971 Jun;2(2):174–208. pmid:5162686
  22. 22. Good BH, Desai MM. Fluctuations in fitness distributions and the effects of weak linked selection on sequence evolution. Theor Popul Biol. 2013 May;85:86–102. pmid:23337315
  23. 23. Good BH, Walczak AM, Neher RA, Desai MM. Genetic diversity in the interference selection limit. PLoS Genet. 2014 Mar;10(3):e1004222. pmid:24675740
  24. 24. Santiago E, Caballero A. Effective size of populations under selection. Genetics. 1995 Feb;139(2):1013–30. pmid:7713405
  25. 25. Santiago E, Caballero A. Effective size and polymorphism of linked neutral loci in populations under directional selection. Genetics. 1998 Aug;149(4):2105–17. pmid:9691062
  26. 26. Santiago E, Caballero A. Joint Prediction of the Effective Population Size and the Rate of Fixation of Deleterious Mutations. Genetics. 2016 Nov;204(3):1267–79. pmid:27672094
  27. 27. Whitlock MC, Barton NH. The effective size of a subdivided population. Genetics. 1997 May;146(1):427–41. pmid:9136031
  28. 28. Wright S. The genetical structure of populations. Ann Eugen. 1951 Mar;15(4):323–54. pmid:24540312
  29. 29. Meirmans PG, Hedrick PW. Assessing population structure: F(ST) and related measures. Mol Ecol Resour. 2011 Jan;11(1):5–18. pmid:21429096
  30. 30. Hudson RR, Boos DD, Kaplan NL. A statistical test for detecting geographic subdivision. Mol Biol Evol. 1992 Jan;9(1):138–51. pmid:1552836
  31. 31. Weir BS, Cockerham CC. Estimating F-statistics for the analysis of population structure. Evolution. 1984 Nov;38(6):1358–70. pmid:28563791
  32. 32. Charlesworth B. Measures of divergence between populations and the effect of forces that reduce variability. Mol Biol Evol. 1998 May;15(5):538–43. pmid:9580982
  33. 33. Slatkin M. Inbreeding coefficients and coalescence times. Genet Res. 1991 Dec;89(5–6):479–87. pmid:1765264
  34. 34. Nordborg M. Structured coalescent processes on different time scales. Genetics. 1997 Aug;146(4):1501–14. pmid:9258691
  35. 35. Charlesworth B, Nordborg M, Charlesworth D. The effects of local selection, balanced polymorphism and background selection on equilibrium patterns of genetic diversity in subdivided populations. Genet Res. 1997 Oct;70(2):155–74. pmid:9449192
  36. 36. Zeng K, Corcoran P. The Effects of Background and Interference Selection on Patterns of Genetic Variation in Subdivided Populations. Genetics. 2015 Dec;201(4):1539–54. pmid:26434720
  37. 37. Matthey-Doret R, Whitlock MC. Background selection and FST: Consequences for detecting local adaptation. Mol Ecol. 2019 Sep;28(17):3902–14. pmid:31349365
  38. 38. Charlesworth D, Charlesworth B, Morgan MT. The pattern of neutral molecular variation under the background selection model. Genetics. 1995 Dec;141(4):1619–32. pmid:8601499
  39. 39. McVean GA, Charlesworth B. The effects of Hill-Robertson interference between weakly selected mutations on patterns of molecular evolution and variation. Genetics. 2000 Jun;155(2):929–44. pmid:10835411
  40. 40. Comeron JM, Kreitman M. Population, evolutionary and genomic consequences of interference selection. Genetics. 2002 May;161(1):389–410. pmid:12019253
  41. 41. Hill WG, Robertson A. The effect of linkage on limits to artificial selection. Genet Res. 1966;89(5–6):311–36. pmid:5980116
  42. 42. Devi A, Speyer G, Lynch M. The divergence of mean phenotypes under persistent directional selection. Genetics. 2023 Jul 6;224(3):iyad091. pmid:37200616
  43. 43. Charlesworth B. The effects of deleterious mutations on evolution at linked sites. Genetics. 2012 Jan;190(1):5–22. pmid:22219506
  44. 44. Ohta T. Slightly deleterious mutant substitutions in evolution. Nature. 1973 Nov 9;246(5428):96–8. pmid:4585855
  45. 45. Robertson A. Inbreeding in artificial selection programmes. Genet Res (Camb). 1961 Jul;2(2):189–94.
  46. 46. Kimura M. On the probability of fixation of mutant genes in a population. Genetics. 1962 Jun;47(6):713–9. pmid:14456043
  47. 47. Haller BC, Messer PW. SLiM 3: Forward Genetic Simulations Beyond the Wright–Fisher Model. Mol Biol Evol. 2019 Jan 18;36(3):632–7. pmid:30517680
  48. 48. Sved JA, Latter BD. Migration and mutation in stochastic models of gene frequency change. I. The island model. J Math Biol. 1977 Dec 27;5(1):61–73. pmid:753908
  49. 49. Nei M. Analysis of gene diversity in subdivided populations. Proc Natl Acad Sci U S A. 1973 Dec;70(12):3321–3. pmid:4519626
  50. 50. Gordo I, Navarro A, Charlesworth B. Muller’s ratchet and the pattern of variation at a neutral locus. Genetics. 2002 Jun;161(2):835–48. pmid:12072478
  51. 51. Otto SP. Selective Interference and the Evolution of Sex. J Hered. 2021 Mar 12;112(1):9–18. pmid:33047117
  52. 52. Gilbert KJ, Pouyet F, Excoffier L, Peischl S. Transition from background selection to associative overdominance promotes diversity in regions of low recombination. Curr Biol. 2020 Jan 6;30(1):101–107.e3. pmid:31866368
  53. 53. Becher H, Jackson BC, Charlesworth B. Patterns of genetic variability in genomic regions with low rates of recombination. Curr Biol. 2020 Jan 6;30(1):94–100.e3. pmid:31866366
  54. 54. Bhatia G, Patterson N, Sankararaman S, Price AL. Estimating and interpreting FST: the impact of rare variants. Genome Res. 2013 Sep;23(9):1514–21. pmid:23861382
  55. 55. Wright S. Statistical Genetics in Relation to Evolution. Hermann; 1939. 63 p.
  56. 56. Lande R, Barrowclough GF. Effective population size, genetic variation, and their use in population management. Viable Populations for Conservation. 1987;87–124.