Comparative analyses of selection operating on non-translated intergenic regions of diverse bacterial species

Harry A. Thorpe, Sion C. Bayliss, Laurence D. Hurst, Edward J. Feil

Research output: Contribution to journalArticle

13 Citations (Scopus)

Abstract

Non-translated intergenic regions (IGRs) comprise 10-15% of bacterial genomes, and containmany regulatory elements with key functions. Despite this, there are few systematic studies on thestrength and direction of selection operating on IGRs in bacteria using whole genome sequencedatasets. Here we exploit representative whole genome datasets from six diverse bacterialspecies;​ Staphylococcus aureus, ​ Streptococcus pneumoniae , ​ Mycobacteriumtuberculosis,Salmonellaenterica, ​ Klebsiella pneumoniae and ​ Escherichia coli . We comparepatterns of selection operating on IGRs using two independent methods; the proportion ofsingleton mutations, and the dI/dS ratio; where dI is the number of intergenic SNPs per intergenicsite. We find that the strength of purifying selection operating over all intergenic sites is consistentlyintermediate between that operating on synonymous and non-synonymous sites. Ribosomebinding sites and non-coding RNAs tend to be under stronger selective constraint than promotersand rho-independent terminators. Strikingly, a clear signal of purifying selection remains evenwhen all these major categories of regulatory elements are excluded, and this constraint is highestimmediately upstream of genes. Whilst a paucity of variation means that the data for ​ M.tuberculosis are more equivocal than for the other species, we find strong evidence for positiveselection within​ promoters of​ this species. This points to a key adaptive role for regulatory changesin this important pathogen. Our study underlines the feasibility and utility of gauging the selectiveforces operating on bacterial IGRs from whole genome sequence data, and suggests that ourcurrent understanding of the functionality of these sequences is far from complete.
Original languageEnglish
Article number195784
Pages (from-to)363-376
JournalGenetics
Volume206
Issue number1
Early online date9 Mar 2017
DOIs
Publication statusPublished - 5 May 2017

Fingerprint

Intergenic DNA
Genome
Staphylococcal Pneumonia
Bacterial Genomes
Untranslated RNA
Klebsiella pneumoniae
Feasibility Studies
Streptococcus pneumoniae
Single Nucleotide Polymorphism
Tuberculosis
Escherichia coli
Bacteria
Mutation
Genes

Cite this

Comparative analyses of selection operating on non-translated intergenic regions of diverse bacterial species. / Thorpe, Harry A.; Bayliss, Sion C.; Hurst, Laurence D.; Feil, Edward J.

In: Genetics, Vol. 206, No. 1, 195784, 05.05.2017, p. 363-376.

Research output: Contribution to journalArticle

@article{12cab71c92054006a3a6fee5f8c9d4d4,
title = "Comparative analyses of selection operating on non-translated intergenic regions of diverse bacterial species",
abstract = "Non-translated intergenic regions (IGRs) comprise 10-15{\%} of bacterial genomes, and containmany regulatory elements with key functions. Despite this, there are few systematic studies on thestrength and direction of selection operating on IGRs in bacteria using whole genome sequencedatasets. Here we exploit representative whole genome datasets from six diverse bacterialspecies;​ Staphylococcus aureus, ​ Streptococcus pneumoniae , ​ Mycobacteriumtuberculosis,Salmonellaenterica, ​ Klebsiella pneumoniae and ​ Escherichia coli . We comparepatterns of selection operating on IGRs using two independent methods; the proportion ofsingleton mutations, and the dI/dS ratio; where dI is the number of intergenic SNPs per intergenicsite. We find that the strength of purifying selection operating over all intergenic sites is consistentlyintermediate between that operating on synonymous and non-synonymous sites. Ribosomebinding sites and non-coding RNAs tend to be under stronger selective constraint than promotersand rho-independent terminators. Strikingly, a clear signal of purifying selection remains evenwhen all these major categories of regulatory elements are excluded, and this constraint is highestimmediately upstream of genes. Whilst a paucity of variation means that the data for ​ M.tuberculosis are more equivocal than for the other species, we find strong evidence for positiveselection within​ promoters of​ this species. This points to a key adaptive role for regulatory changesin this important pathogen. Our study underlines the feasibility and utility of gauging the selectiveforces operating on bacterial IGRs from whole genome sequence data, and suggests that ourcurrent understanding of the functionality of these sequences is far from complete.",
author = "Thorpe, {Harry A.} and Bayliss, {Sion C.} and Hurst, {Laurence D.} and Feil, {Edward J.}",
year = "2017",
month = "5",
day = "5",
doi = "10.1534/genetics.116.195784",
language = "English",
volume = "206",
pages = "363--376",
journal = "Genetics",
issn = "0016-6731",
publisher = "Genetics Society of America",
number = "1",

}

TY - JOUR

T1 - Comparative analyses of selection operating on non-translated intergenic regions of diverse bacterial species

AU - Thorpe, Harry A.

AU - Bayliss, Sion C.

AU - Hurst, Laurence D.

AU - Feil, Edward J.

PY - 2017/5/5

Y1 - 2017/5/5

N2 - Non-translated intergenic regions (IGRs) comprise 10-15% of bacterial genomes, and containmany regulatory elements with key functions. Despite this, there are few systematic studies on thestrength and direction of selection operating on IGRs in bacteria using whole genome sequencedatasets. Here we exploit representative whole genome datasets from six diverse bacterialspecies;​ Staphylococcus aureus, ​ Streptococcus pneumoniae , ​ Mycobacteriumtuberculosis,Salmonellaenterica, ​ Klebsiella pneumoniae and ​ Escherichia coli . We comparepatterns of selection operating on IGRs using two independent methods; the proportion ofsingleton mutations, and the dI/dS ratio; where dI is the number of intergenic SNPs per intergenicsite. We find that the strength of purifying selection operating over all intergenic sites is consistentlyintermediate between that operating on synonymous and non-synonymous sites. Ribosomebinding sites and non-coding RNAs tend to be under stronger selective constraint than promotersand rho-independent terminators. Strikingly, a clear signal of purifying selection remains evenwhen all these major categories of regulatory elements are excluded, and this constraint is highestimmediately upstream of genes. Whilst a paucity of variation means that the data for ​ M.tuberculosis are more equivocal than for the other species, we find strong evidence for positiveselection within​ promoters of​ this species. This points to a key adaptive role for regulatory changesin this important pathogen. Our study underlines the feasibility and utility of gauging the selectiveforces operating on bacterial IGRs from whole genome sequence data, and suggests that ourcurrent understanding of the functionality of these sequences is far from complete.

AB - Non-translated intergenic regions (IGRs) comprise 10-15% of bacterial genomes, and containmany regulatory elements with key functions. Despite this, there are few systematic studies on thestrength and direction of selection operating on IGRs in bacteria using whole genome sequencedatasets. Here we exploit representative whole genome datasets from six diverse bacterialspecies;​ Staphylococcus aureus, ​ Streptococcus pneumoniae , ​ Mycobacteriumtuberculosis,Salmonellaenterica, ​ Klebsiella pneumoniae and ​ Escherichia coli . We comparepatterns of selection operating on IGRs using two independent methods; the proportion ofsingleton mutations, and the dI/dS ratio; where dI is the number of intergenic SNPs per intergenicsite. We find that the strength of purifying selection operating over all intergenic sites is consistentlyintermediate between that operating on synonymous and non-synonymous sites. Ribosomebinding sites and non-coding RNAs tend to be under stronger selective constraint than promotersand rho-independent terminators. Strikingly, a clear signal of purifying selection remains evenwhen all these major categories of regulatory elements are excluded, and this constraint is highestimmediately upstream of genes. Whilst a paucity of variation means that the data for ​ M.tuberculosis are more equivocal than for the other species, we find strong evidence for positiveselection within​ promoters of​ this species. This points to a key adaptive role for regulatory changesin this important pathogen. Our study underlines the feasibility and utility of gauging the selectiveforces operating on bacterial IGRs from whole genome sequence data, and suggests that ourcurrent understanding of the functionality of these sequences is far from complete.

UR - https://doi.org/10.1534/genetics.116.195784

U2 - 10.1534/genetics.116.195784

DO - 10.1534/genetics.116.195784

M3 - Article

VL - 206

SP - 363

EP - 376

JO - Genetics

JF - Genetics

SN - 0016-6731

IS - 1

M1 - 195784

ER -