Learn more: PMC Disclaimer | PMC Copyright Notice
The evolution within us
Abstract
The B-cell immune response is a remarkable evolutionary system found in jawed vertebrates. B-cell receptors, the membrane-bound form of antibodies, are capable of evolving high affinity to almost any foreign protein. High germline diversity and rapid evolution upon encounter with antigen explain the general adaptability of B-cell populations, but the dynamics of repertoires are less well understood. These dynamics are scientifically and clinically important. After highlighting the remarkable characteristics of naive and experienced B-cell repertoires, especially biased usage of genes encoding the B-cell receptors, we contrast methods of sequence analysis and their attempts to explain patterns of B-cell evolution. These phylogenetic approaches are currently unlinked to explicit models of B-cell competition, which analyse repertoire evolution at the level of phenotype, the affinities and specificities to particular antigenic sites. The models, in turn, suggest how chance, infection history and other factors contribute to different patterns of immunodominance and protection between people. Challenges in rational vaccine design, specifically vaccines to induce broadly neutralizing antibodies to HIV, underscore critical gaps in our understanding of B cells' evolutionary and ecological dynamics.
1. Introduction
By the time many of us give serious thought to passing on our genes, the pathogens circulating on the day of our birth have evolved for thousands of generations—and yet most people are fortunate not to greet parenthood at death's door. We owe this good fortune to the adaptive immune system, and especially B cells. B cells evolve on the time scale of pathogen populations and secrete pathogen-specific antibodies that protect us from infection. But despite their importance to our survival and the unique circumstances of their evolution, the dynamics of B-cell populations remain largely unexplored.
Vaccines and antibody-based therapies saved millions of lives, even before their mechanisms of action were known, and continue to be an essential part of medicine. Edward Jenner is often credited with developing the first vaccine in 1796, when he demonstrated that inoculation with cowpox protected against smallpox infection [1]. Centuries earlier, a similar practice known as ‘variolation’—the induction of immunity by a deliberate, attenuated smallpox infection—had spread throughout Central and East Asia [2]. Smallpox vaccination became compulsory in parts of Europe and the USA by the mid-nineteenth century. Serum therapy against diphtheria and tetanus was first demonstrated by Emil von Behring and Shibasaburo Kitasato in 1890 [3]. Several years later, Paul Ehrlich conceived of the side-chain theory, proposing that chemical structures in circulating cells could protect against pathogens [4]. These discoveries led to the rapid adoption of serum therapy against diphtheria in the 1890s and earned Behring, Kitasato and Ehrlich Nobel Prizes [5]. More recently, transfer of specific antibodies to patients infected with respiratory syncytial virus has demonstrated the feasibility of monoclonal antibody therapy during infection [6,7]. Monoclonal antibody therapy has also shown high efficacy against tumour-specific antigens in some cancers [8–12].
The success of antibodies against infections and cancers is offset by a persistent and poorly understood risk of antibody-mediated autoimmunity. Autoreactive antibodies, or ‘horror autotoxicus' as termed by Paul Ehrlich over a hundred years ago [13], are involved in chronic autoimmune disorders that affect 3–4% of the population [14]. The autoimmune pathology is determined by which self-antigen is targeted. Autoantibodies to thyroid-stimulating hormone receptor cause hyperthyroidism in Graves' disease, whereas autoantibodies to thyroid peroxidase or thyroglobulin cause hypothyroidism in Hashimoto's thyroiditis [15]. Ribonucleoprotein and other nuclear antigens are frequent targets in primary Sjögren's syndrome and systemic lupus erythematosus [16,17], conditions associated with widespread inflammation. A frustrating feature of autoimmune disorders involving B cells is that autoantibodies appear without warning, often after infections [18,19]. Although there are predisposing genetic factors [14,18,20], why some people develop autoreactive antibodies while others do not is still mostly a mystery.
Other unexplained differences between individuals' antibody repertoires can have clinical consequences. For instance, in a small fraction of people infected with the human immunodeficiency virus (HIV) [21–23] or vaccinated to influenza H1N1 [24–26], broadly neutralizing antibodies (antibodies effective against a wide diversity of viral strains) dominate the immune response. In HIV, these antibodies can protect animals against challenge [27,28], but their densities are uncorrelated with viral load in humans already infected with the virus [29]. Broadly neutralizing antibodies to influenza may confer protection to potentially pandemic strains [24,30–32]. It is not yet clear whether such antibodies are inducible in everyone. It is also not obvious if broadly neutralizing antibodies will persist over time, or if they will be outcompeted by other responses. Antibody repertoires show influences of infection history [33–35], host age [36–38], host genotype [39,40], and genotypic or phenotypic convergence [41–45], but the relative importance of each is not yet clear.
The diversity of antibody repertoires tracks the ongoing dynamics of B-cell populations in each individual. Clonal selection theory, the idea that cell populations with unique receptors undergo selective expansion upon encounter with antigen, was proposed independently by David Talmage [46] and MacFarlane Burnet [47] in the late 1950s. These hypotheses predate the first observations of affinity maturation [48], or even the observation that B cells are a distinct lineage of lymphocytes [49]. Yet clonal selection is key to the adaptability and survival of vertebrate hosts that frequently encounter new and familiar pathogens.
This extraordinary adaptability arises from B cells' large population sizes, high mutation rates, competition in structured metapopulations, and evolutionary ‘checkpoints' [50,51]. It is also enhanced by many generations of natural selection on naive B-cell repertoires. Naive B cells, which have not encountered antigen, show extensive diversity arising from several sources: polymorphism in the receptor-coding genes, recombination between these genes, insertions and deletions [52,53]. The receptor genes themselves show selection for mutational hotspots and coldspots in structurally beneficial areas [54,55]. Precursors of naive B cells are further selected for non-autoreactivity [56]. In sum, B cells are an extraordinary case of evolved evolvability, and thus they pose challenging questions for evolutionary biology.
Recent technical and conceptual advances have enabled detailed investigation of B-cell dynamics. The wide variety of high-throughput sequencing platforms provide different options trading off depth, read length, and accuracy; new methods exist to pair the heavy and light chain sequences that comprise receptors and assess antigen binding properties [57–59]. New methods of B-cell sorting and cloning allow multiple functional assays to be performed on individual cells [60]. Neutralization assays measure the effectiveness of antibodies at prohibiting infections in cell culture, and cross-competition assays reveal the targets of and steric interference with other antibodies. The movements of individual B cells are now directly observable through fluorescent microscopy [61]. As reviewed in detail below, molecular evolutionary models of the evolution of B-cell populations, and ecological models of their competition, are being tested with these new genetic and phenotypic observations.
This review describes what is known about the dynamics of antibody repertoires, and the uncertainties that stand in the way of predictive models and more effective clinical interventions. We first review the antibody response to primary infections and then jump scales, summarizing the patterns reported in natural repertoires. We next survey methods to address the difficult problem of inferring the evolutionary histories of B-cell populations. We also describe studies of competing B-cell populations that take a different approach and offer preliminary explanations for the dynamics of responses over multiple exposures. These concepts converge in the challenge of vaccine design, an area that we predict will benefit from a deeper understanding of B-cell repertoire dynamics.
2. Brief overview of B cells and their evolution
B cells evolve in each individual through their receptors, which are secreted in soluble form as antibodies by some classes of B cells. These receptors, also known as immunoglobulins, are Y-shaped proteins composed of four polypeptides: two identical light chains and two identical heavy chains. The distal parts of these chains (the arms of the Ys) are composed of variable and constant regions, with the variable regions of each heavy and light chain pair directly binding to antibody. The site at which binding occurs on the antibody or B-cell receptor is the paratope; the site bound on the antigen is the epitope. The variable region of the heavy chain is approximately 100 amino acids in length. Paratopes are thought to be much smaller, roughly one to two dozen amino acids, and can be located in a variety of locations in the variable region. The tail end of an antibody is called the Fc region: this portion does not bind to antigens but rather interacts with the cellular and complement portions of the immune system. Important discoveries have recently been made concerning the Fc region, including its potentially important role in HIV antibodies that were previously thought to operate by blocking viral entry [62,63]. In general, antibody efficacy is not determined solely by the identity of the variable region, although the variable region will be the focus of this issue.
Naive B-cell receptors have extraordinary diversity because of a rearrangement process that incorporates recombination, insertions and deletions. Susumu Tonegawa was awarded the Nobel Prize in 1987 for deciphering the molecular mechanism driving B-cell repertoire diversity [64]. Genes from two loci, the immunoglobulin loci kappa and lambda, encode light chains. Only one of the two light chain loci is used in a single B cell, whereas heavy chains are encoded by a single locus. The variable regions of both chains arise from recombination at their respective loci. In humans, each locus (kappa, lambda and heavy) contains several dozen variable (V) genes and four to six joining (J) genes. The heavy chain locus also contains roughly two dozen diversity (D) genes. B-cell receptors are constructed by rearrangement of randomly chosen V and J genes in the light chain and V, D and J genes in the heavy chain, yielding approximately 2 × 106 unique potential VDJ combinations. Variable addition and subtraction of nucleotides at the junctions between gene segments increases the number of unique potential antibodies to 1011, although the number of unique circulating naive B cells at any time is closer to 106–107 [52,65]. Many immature B cells do not develop to naive cells because they do not generate functional antibodies, or they fail checkpoints for self-reactivity. These cells either further rearrange their receptors, or they apoptose. Germline receptor diversity may arise from slightly different processes, including gene conversion, in some animals [66,67].
B cells further evolve upon exposure to antigen [48,68,69]. This process, known as affinity maturation, involves strong competition and selection for B-cell receptor binding to antigen [70]. Thus, in contrast to the diversity of the naive repertoire, which arises primarily by recombination, B cells exposed to antigen evolve primarily by point mutations (although insertion–deletion mutations occur too [71]). These so-called somatic hypermutations are induced by activation-induced cytidine deaminase (AID) [72,73], which preferentially mutates cytosines to uracils [74]. These mutations induce repair activities by DNA polymerases, which may introduce additional mutations while repairing the initial error. The fact that AID recognizes cytosine, particularly in certain motifs, led to the discovery of hotspots and coldspots of mutational activity in variable regions [75]. Hotspots tend to lie in complementarity-determining regions (CDRs), which are involved in binding, and coldspots in framework regions (FWRs), which are thought to be structurally conserved [76]. Antibody variable regions also have biased codon usage, with cytosines favoured in silent sites in CDRs and potential terminal codons in FWRs [76,77]. The B cells descending via somatic hypermutation from a single naive B cell are said to form a set of expanded clones, although different definitions exist [78].
Most B cells undergo affinity maturation in germinal centres, which are aggregations of immune cells that form several days after immunization or the start of an infection. The number of germinal centres that forms after infection is unknown but appears to be highly variable [79]. High-affinity naive B cells enter each germinal centre and compete for antigens presented on the surface of follicular dendritic cells. After binding and removing antigen, B cells present digested antigen peptides to a class of helper T cells known as follicular helper T cells. B cells that are activated by follicular helper T cells undergo replication under the influence of AID. On average, B-cell receptors accumulate one mutation per 103 bases, or roughly one mutation per round of replication [80]. These cells with mutated receptors then compete for antigens, and cells that fail to bind to antigens or to receive T cell help apoptose. Each generation takes approximately 8–12 h, and germinal centres probably persist for several weeks (reviewed in [81]).
Responses to primary infection can be complicated by feedback from antibodies [82], competition between different lineages of B cells in different germinal centres, competition between memory and naive responses [83], and B-cell activation and affinity maturation outside germinal centres [84]. Mature B cells can also differentiate or class switch to classes that are distinguished by the isotype of antibody produced: B cells secreting the IgG and IgA forms of antibodies, for instance, maintain the affinity of the original B-cell receptor but differ in their constant regions and avidity for antigen (i.e. overall binding rate). Before considering the potential effects of these dynamics, it is worth looking at the final result: naturally occurring antibody repertoires.
3. Observing the natural repertoire
The naive B-cell repertoire deviates significantly from what one would find given a uniform distribution on all of the formational probabilistic events, which is called bias in the literature ([39], reviewed in [44]). For example, individual heavy chain variable genes range in usage frequency from 0.1% to 10% of rearrangements in a repertoire [39,85]. The usage distributions of heavy chain D and J genes and light chain V and J genes are likewise skewed; there is also preferential usage of particular alleles within each gene [65]. These patterns are roughly conserved across individuals and may be intrinsic to rearrangement, although Collins et al. [86] show in this issue that VDJ usage may vary within different strains of a species. Some of this bias is explained by differences in recombination signal sequences [87,88] and variation in the number of gene copies between individuals [89,90]. For example, among 35 individuals, copies of particular IGHV1-69 variants ranged from 0 to 4, while the variants' frequencies in repertoires varied correspondingly from 0 to 11% [91]. Another striking and unexplained feature is bias in the DJ pairings on heavy chains [65,92,93]. There is less evidence of intrinsic bias in the pairing of heavy and light chains [44], although negative selection of naive B cells for autoreactivity suggests a mechanism by which such bias might arise [56].
Identifying the sources of these patterns is challenging because of the large number of functional genes, closely related allelic variants and pseudogenes at receptor loci. Although individual heavy chain variable loci contain only a few dozen genes, researchers have identified over 100 putative heavy chain variable genes, pseudogenes, and open reading frames so far, and together they have over 200 alleles [65,89,90]. Bulk high-throughput sequencing provides a useful perspective on natural repertoires. As described in this issue, other techniques, such as hybridomas and B-cell cloning, allow direct experimentation on B cells with a receptor of interest [78].
Antigen-experienced B-cell populations are similarly not a uniform draw from the naive pool. This pattern holds for the long-lived memory B cells produced during affinity maturation and plasmablasts, the precursors to antibody-secreting plasma cells showing the active response to infection. Adult recipients of the influenza vaccine have memory responses dominated by a few clonal lineages [37]. Some HIV-infected patients show very old lineages, over 20% diverged from their naive B-cell ancestors, that appear to have coevolved with the founding virus [94,95]. Memory B-cell clones to influenza also show differences in size and divergence from the germline, suggesting that they re-enter germinal centres and undergo further affinity maturation upon exposure to related viral strains [96]. These observations suggest that chronic and repeat infections might amplify a few clones that were randomly activated early on, leading to uneven distributions of clone sizes.
Three articles in this issue provide further detail on B-cell repertoire development. Although it has been observed that particular genes, like IGHV1–69, frequently appear in broadly neutralizing antibodies to HIV and influenza, Dunand & Wilson [97] argue that responses to many different antigens may be ‘canonical.’ Antibodies specific to HIV, influenza, Streptococcus pneumoniae, rotavirus, self-antigens and even B-cell tumours with unknown specificities are strongly associated with particular receptor genes, implying that particular clonal expansions may be more expected than others. Also in this issue, Martin et al. [98] present evidence that gene frequencies change with age. Intriguingly, although age-related changes in antigen exposure and other selective pressures could explain these differences in experienced cells, naive repertoires also have shifts in bias. Disease and other abnormalities might partially explain these changes [98].
While high-throughput sequencing and other technologies are broadening our view of B-cell populations, the resulting data have simultaneously underscored our ignorance. Why are some lineages more frequently seen than others? How can we recognize the members of a clonal family? What selective pressures operate on different B-cell types? Increasingly sophisticated methods of sequence analysis have begun to answer these kinds of questions.
4. Inference under statistical models of B-cell sequence evolution
Statistical models of B-cell evolution use sequences to reconstruct the unobserved evolutionary trajectories of populations. The complexity of B-cell evolution, however, limits the utility of simple methods for two reasons. First, inferring rearrangements in heavy and light chain genes is difficult. The number of heavy chain variable genes on each chromosome is unknown, and some of these genes may be duplicates or previously unreported alleles. Junctional diversity is high from random insertions and deletions. Second, somatic hypermutation during affinity maturation is biased by the presence of motifs that direct mutations to particular sites in different genes [75]. Moreover, measuring the precise effects of these motifs is difficult because of mutational saturation and epistasis [55]. High-throughput B-cell sequencing further requires methods that tolerate large collections of sequences and unpaired heavy and light chain sequences.
Identifying the VDJ rearrangement that formed a naive B-cell receptor is important for measuring recombinatorial biases and total allelic diversity, but this task is complicated by the potential for mutation, deletion and insertion to give identical results, as well as by the similarity of some germline genes. Although general-purpose tools can be used productively and efficiently to solve this problem [99,100], others, including iHMMune-Align [101], SoDA2 [102], and two in this issue [103,104], evaluate B-cell rearrangement events statistically. Frost et al. [103] present Immunoglobulin Subtype Classification Using Evolutionary Algorithms (IgSCUEAL), which uses maximum likelihood to assign heavy chain V and J regions to germline alleles, taking into account the phylogenies of V and J gene families and uncertainty from undetected alleles. This method tolerates B cells that have undergone additional somatic mutation [105,106]. In a complementary vein, Elhanati et al. [104] use a large dataset of non-productive naive B-cell sequences to infer the probabilities of specific VDJ rearrangements and segment-specific rates of insertions and deletions. These detailed models could in principle be used for the next generation of VDJ inferential procedures.
A related challenge is how to infer clonal families, i.e. the members of an expanded clone. As a starting point for clonal family inference, one could use the best nucleotide-by-nucleotide annotations inferred by methods in the previous paragraph. However, the resulting estimates are uncertain because there are many ways to make a given B-cell receptor sequence. Current practice is to use single-linkage clustering [37,107,108] or a distance-based cut-off to an inferred ancestral sequence [109]. The development of statistical models to identify clonal families, with corresponding uncertainty estimates, would help to quantify the structure of B-cell populations and test hypotheses about their evolution.
The peculiar features of somatic hypermutation and B-cell sampling make inference of evolutionary characteristics of B-cell populations a non-trivial task. McCoy et al. [110] developed a new method for inferring per-residue selection pressure in the presence of context-sensitive mutation and applied it to a dataset of 15 million B-cell receptor sequences. Like Elhanati et al. [104], they used non-productive sequences as a neutral model to infer rates of mutation without the confounding effects of selection for affinity maturation. Elhanati et al. [104] also find support for an evolutionary model with heterogeneous substitution rates across sites that takes into account context, such as the presence of certain sequence motifs. This context appears vital to explain patterns of substitutions across variable regions [55,75,76].
These analyses have unveiled surprising features of B-cell evolution. In particular, the studies of both [104,110] find that evolutionary parameters are highly non-uniform and consistent between individuals: in the former paper for rearrangement (e.g. gene deletion lengths) and in the latter for per-residue natural selection estimates. Remarkably, there is a high correlation between the probability that a sequence is generated by recombination and its probability, once generated, of being selected for the naive repertoire. This result suggests a mechanism by which evolution has increased the chance that useful B-cell receptor sequences will be generated in the first place.
We predict that the development of statistical and molecular evolutionary methods will clarify the dynamics of affinity maturation. These methods could in principle be used to estimate the number of unique clones, which is currently estimated with heuristics rather than statistics [78]; the ages of lineages; and the neutral and selective forces shaping the evolution of different B-cell types. For instance, examining clonal phylogenies, Yaari et al. [111] propose that the phylogenies of B-cell families reflect multiple time scales of positive selection. Following trends in infectious disease research [112,113], models could eventually integrate phenotypic information, such as the affinities and specificities of receptors for different epitopes. Another possible direction is to use structured coalescent approaches to infer rates of idiotype switching [114]. For now, there is a gap between repertoire analyses that are sequence-based and others that model the competitive dynamics and phenotypic evolution of clonal populations.
5. Growth and competition among the clones
Why are some B-cell lineages more abundant than others, and why are some epitopes targeted more often? This is known as the problem of immunodominance—the preferential targeting by the immune system of certain antigenic sites over others (ch. 6 of [115]), and its importance was recognized long before the development of modern methods of repertoire profiling (e.g. [116–118]). Differences in immunodominance may explain variation in susceptibility between people to the same pathogen. But how does it arise?
Repertoire diversity in the first exposure to antigen is regulated by opposing forces of poorly known strength. Affinity maturation acts on only a subset of the naive repertoire, but from this initial pool, somatic hypermutation introduces mutations that increase genetic and phenotypic variation (phenotype here refers to the affinity and specificity of B cells to various epitopes). Strong competition between B cells for both antigen and T cell help inside germinal centres discards many mutations and promotes the growth of high-affinity cells. In this issue, Childs et al. [119] show that parallel evolution in multiple germinal centres promotes adaptation to multiple epitopes. More complex antigens can compromise this response. In addition to providing a null model for immunodominance, Childs et al. [119] show that competition between antibodies and B cells inside germinal centres attenuates the bias in epitope targeting and increases the breadth of the repertoire. This model thus suggests that the immune system may have evolved to optimize a trade-off between mounting an effective response to any one epitope versus many, a trade-off that may be exploitable by pathogens. In related work, Mayer et al. [53] propose that diverse repertoires are especially advantageous when pathogens have high growth rates, and early recognition of any epitope can attenuate the severity of an infection.
Several examples suggest that intrinsic differences in epitopes' attractiveness can influence immunodominance. The antibody response to Staphylococcus aureus is highly constricted: the vast majority of plasmablasts target SpA, a bacterial virulence factor [120]. Because antibodies to SpA are non-neutralizing, this immunogenic and immunodominant epitope, or ‘superantigen,’ may be an effective means for S. aureus to escape immunity. Immunodominant responses to influenza may arises similarly. Epitopes on the globular head of the influenza haemagglutinin are more accessible and thus more immunogenic than epitopes on the recessed stalk, potentially explaining the preponderance of antibodies to the head despite broader protection associated with anti-stalk antibodies [121]. It has been speculated that influenza may evolve immunogenic decoy epitopes, far from neutralizing epitopes, that may sterically interfere with neutralizing antibodies [122]. Pauli et al. [120] observed that most anti-SpA antibodies in their experiments derived from the IGVH3 idiotype, which suggests that epitopes' immunogenicity might account for not only the phenotypic but also the genotypic diversity of the repertoire.
Studies of repeated exposures to similar antigens reveal contrasting observations: repertoires show signatures of both contingent and convergent evolution. In contingent evolution, chance events, like mutations or infections with other pathogens, affect the induced repertoire. In 1960, Thomas Francis Jr., proposed that contingent evolution is at work in adaptive immunity: that a first infection with influenza might indelibly shape responses to later strains, a phenomenon he dubbed ‘Original Antigenic Sin’ [123]. This hypothesis is consistent with recent research: people indeed have especially high antisera to influenza strains circulating in early childhood [36], and researchers have investigated potential mechanisms [124]. These early exposures may entrain responses in ways that are clinically and epidemiologically important. For example, two recent studies have shown that the specific epitopes targeted on H1N1 strains depend on hosts' year of birth [26,35]. People born between 1965 and 1979, for instance, frequently have antibodies targeting a site that mutated in 2013. Linderman et al. [35] showed that this mutation leads to loss of antibody binding, potentially explaining the increased incidence of influenza-like illness in this cohort in the 2013–2014 influenza season. These age-specific patterns of epitope binding were recapitulated by sequentially immunizing ferrets with different H1N1 strains, demonstrating that the order in which strains are encountered can explain some of the differences between cohorts [35]. Interestingly, ferrets with the same sequence of immunizations did not always develop immunodominant responses to the same sites: a minority of ferrets (three of eight) had the particular antibody under investigation. Similarly, only 40% of humans born between 1965 and 1979 had this antibody. Thus, even though similar infection histories may lead to similar patterns of immunodominance between individuals, differences remain. By contrast, seemingly diverse responses may eventually converge via evolution, as seen in dengue [42], HIV [43] and influenza [41,45,96,125]. Zarnitsyna et al. [126] in this issue investigate competition between B cells in a setting that models antigenic drift and shift in influenza, including multiple antigenic sites. The free variables in this model are population sizes of free and bound antigen on one hand and B cells with various affinities on the other. This set-up allows them to quantify the effect of epitope masking and derive a ratio quantifying original antigenic sin.
It is tempting to assume that experienced repertoires to fast-mutating pathogens reflect intimate coevolution, with adaptation by B cells followed by pathogen escape. Although individual B-cell clones show traces of coevolution with HIV [95], Luo & Perelson [127] argue in this issue that there is little evidence linking repertoires to specific selective pressures or to viral adaptation. Also in this issue, Hoehn et al. [128] find no observable impact of antiretroviral therapy on the genetic structure of B-cell populations in HIV-positive patients over time. This is surprising, as one might expect shifts in clonal size distributions to accompany therapy-induced changes in the viral population. In the same vein, Luo & Perelson [127] note that the emergence of detectable broadly neutralizing antibodies to HIV has not been correlated with bottlenecks in the viral population.
Predicting immunodominance, or the outcome of competition between evolving B-cell populations, appears limited by the data available to fit models. Although changes in population size and selective pressures can to some extent be inferred from sequences, and sequences can be related to antigen-specific serum responses [57], it is currently impossible to obtain high-throughput information about receptors' precise affinities to different epitopes [129]. Highly related sequences may have similar affinities [130], but it is difficult to imagine how a thorough understanding of clonal competition and evolution could be acquired without more precise information on B cells' phenotypes. Rates of antibody secretion and clearance, which vary between tissues, also affect the measured concentration of serum antibodies. Fitting models to these kinds of data could shed light on dynamics that are difficult to observe directly, such as the rules for memory B-cell and plasmablast generation, the rate at which memory B cells pre-empt naive B cells during repeated exposures, and the kinds of competition between secreted antibodies and evolving B-cell clones.
6. Vaccines to direct evolution
It is notoriously difficult to effectively vaccinate against antigenically variable pathogens. Such pathogens may evolve escape mutations, or vaccination may select for escape mutants already present at low frequencies. We have also seen in the previous sections that vaccine design is not a simple process of injecting an antigen and getting out a high-affinity antibody. Challenges in vaccine design are fundamental questions about the ecological and evolutionary dynamics of B cells and their coevolving pathogens.
One such challenge is to evolve high-affinity broadly neutralizing antibodies to an antigen that is poorly bound by naive B cells. For example, broadly neutralizing antibodies in the VRC01 group bind to the gp120 protein of HIV and descend from the IGHV1–69 family; although VRC01 antibodies are diverse on the level of amino acids, they form similar binding sites [43]. These broadly neutralizing antibodies tend to be highly diverged from their unmutated ancestors, requiring months to years to appear [95]. Gao et al. [131] showed that mutations in HIV induced by one B-cell lineage enabled binding by another lineage, which ultimately developed broadly neutralizing properties. Given the vast space of possible mutations, it is perhaps unsurprising that diverged antibodies with specific properties appear to arise rarely, and under seemingly contrived circumstances. Identifying the antigens that bind precursors of broadly neutralizing antibodies, and incrementally evolving lineages through sequential immunizations with these antigens, is an active area of research [132,133]. A more tactical evolutionary approach might be to infer the mutations that limit adaptation, such as those with high fitness cost, and to focus immunogen design on selecting viable solutions. If crucial mutations are rare, it may also be beneficial to increase the size of the relevant B-cell population. Finally, the probability of evolving an antibody with particular properties might depend on host genotype: would people without particular IGVH1–69 alleles not benefit from vaccines to induce VRC01, or can some B-cell phenotypes be evolved from many different gene families?
Another challenge is to ensure that protective B-cell populations induced by vaccination remain at protective levels. McGuire et al. [134] investigated how narrowly neutralizing antibodies outcompete broadly neutralizing antibodies after immunization with a specially engineered HIV antigen. Paradoxically, they found that the narrowly neutralizing antibodies had germline ancestors that bound a wider array of antigens and were thus more likely to be stimulated. Does their result imply that once induced, broadly neutralizing antibodies—which by definition bind diverse antigens—should dominate? Broadly neutralizing antibodies can be thought of as having, on average, more available antigen than narrowly neutralizing antibodies. The idea that antibodies to abundant antigens, such as conserved epitopes, should proliferate more than antibodies to specific epitopes has been used to explain patterns of immunodominance in response to malaria [135] and is consistent with basic principles of population growth. However, the subdominance of anti-stalk antibodies in influenza suggests that structure complicates simple assessments of antigen availability. In addition, the factors determining antigen presentation of viral proteins to B cells [136], and how this presentation shapes immunogenicity [137], are not well understood. Considering likely differences in epitope immunogenicity, a larger concern is thus whether the immediate response to the vaccine might eventually be eroded by competition with pre-existing or other induced responses. It is unclear what strategies may help avoid this, or even anticipate its occurrence.
B-cell repertoires lie at the intersection of immunology and evolution, supplying intriguing questions for researchers in both, and other, fields (box 1). Addressing these questions would go a long way to measuring protection and predicting response. Understanding the dynamics of antibody repertoires is a first step.
Acknowledgements
We thank Greg Dwyer and Duncan Ralph for comments.
Authors' contributions
All the authors wrote the paper and approved the final version.
Competing interests
We declare we have no competing interests.
Funding
S.C. is supported by NIH award DP2AI117921 and the James S. McDonnell Foundation. P.W. acknowledges funding from NIH/NIAID grants no. 2U19AI082724;, 1P01AI097092;, 2U19AI057266;, 1U19AI109946;, 5U19AI090023;, HHSN272201400005C, and the James S. McDonnell Foundation. F.M. is supported by NIH R01 GM113246-01, by the University of Washington eScience Institute through its Seed Grants programme in Translational Health Sciences, and by the University of Washington Center for AIDS Research (CFAR), an NIH funded programme under award number P30AI027757 which is supported by the following NIH Institutes and Centers (NIAID, NCI, NIMH, NIDA, NICHD, NHLBI, NIA, NIGMS, NIDDK).