Quantifying bacterial evolution in the wild: A birthday problem for Campylobacter lineages

Jessica K Calland, Ben Pascoe, Sion C Bayliss, Evangelos Mourkas, Elvire Berthenet, Harry A Thorpe, Matthew D Hitchings, Edward J Feil, Jukka Corander, Martin J Blaser, Daniel Falush, Samuel K Sheppard

Research output: Contribution to journalArticlepeer-review

12 Citations (SciVal)

Abstract

Measuring molecular evolution in bacteria typically requires estimation of the rate at which nucleotide changes accumulate in strains sampled at different times that share a common ancestor. This approach has been useful for dating ecological and evolutionary events that coincide with the emergence of important lineages, such as outbreak strains and obligate human pathogens. However, in multi-host (niche) transmission scenarios, where the pathogen is essentially an opportunistic environmental organism, sampling is often sporadic and rarely reflects the overall population, particularly when concentrated on clinical isolates. This means that approaches that assume recent common ancestry are not applicable. Here we present a new approach to estimate the molecular clock rate in Campylobacter that draws on the popular probability conundrum known as the 'birthday problem'. Using large genomic datasets and comparative genomic approaches, we use isolate pairs that share recent common ancestry to estimate the rate of nucleotide change for the population. Identifying synonymous and non-synonymous nucleotide changes, both within and outside of recombined regions of the genome, we quantify clock-like diversification to estimate synonymous rates of nucleotide change for the common pathogenic bacteria Campylobacter coli (2.4 x 10-6 s/s/y) and Campylobacter jejuni (3.4 x 10-6 s/s/y). Finally, using estimated total rates of nucleotide change, we infer the number of effective lineages within the sample time-frame-analogous to a shared birthday-and assess the rate of turnover of lineages in our sample set over short evolutionary timescales. This provides a generalizable approach to calibrating rates in populations of environmental bacteria and shows that multiple lineages are maintained, implying that large-scale clonal sweeps may take hundreds of years or more in these species.

Original languageEnglish
Article numbere1009829
JournalPlos Genetics
Volume17
Issue number9
DOIs
Publication statusPublished - 28 Sept 2021

Bibliographical note

SKS is funded by the Medical Research Council (MR/L015080/1, MR/S009264/1, MR/T030062/1, MR/V001213/1). JKC is supported by a Biotechnology and Biological Sciences Research Council (BBSRC)-CASE studentship (BB/P504750/1). DF was supported by an MRC senior research fellowship (MR/M501608/1) and currently by Shanghai Municipal Science and Technology Major Project No. 2019SHZDZX02. SKS and BP are supported by funding from the National Institute of Allergy and Infectious Diseases (1R01AI158576-01). The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Funding

SKS is funded by the Medical Research Council (MR/L015080/1, MR/S009264/1, MR/T030062/1, MR/V001213/1). JKC is supported by a Biotechnology and Biological Sciences Research Council (BBSRC)-CASE studentship (BB/P504750/1). DF was supported by an MRC senior research fellowship (MR/M501608/1) and currently by Shanghai Municipal Science and Technology Major Project No. 2019SHZDZX02. SKS and BP are supported by funding from the National Institute of Allergy and Infectious Diseases (1R01AI158576-01). The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Fingerprint

Dive into the research topics of 'Quantifying bacterial evolution in the wild: A birthday problem for Campylobacter lineages'. Together they form a unique fingerprint.

Cite this