Abstract
Nucleotides across a genome do not mutate at equal frequencies. Instead, specific nucleotide positions can exhibit much higher mutation rates than the genomic average due to their immediate nucleotide neighbors. These “mutational hotspots” can play a prominent role in adaptive evolution, yet we lack knowledge of which short nucleotide sequences drive hotspots. In this work, we employ a combination of experimental evolution with Pseudomonas fluorescens and bioinformatic analysis of various Salmonella species to characterize a short nucleotide motif (≥8 bp) that can drive T:A→G:C mutation rates >1000-fold higher than the baseline T→G rate in bacteria. First, we experimentally confirm previous analysis showing that homopolymeric tracts (≥3) of G with a 3′ T frequently mutate so that the T is replaced with a G, resulting in an extension of the guanine tract, i.e. GGGT → GGGG. We then demonstrate that the potency of this T:A→G:C hotspot is dependent on the nucleotides immediately flanking the G nT sequence. We find that the dinucleotide immediately 5′ to a G 4 tract and the dinucleotide immediately 3′ to the T strongly affect the T:A→G:C mutation rate, which ranges from ∼5-fold higher than the typical rate to over 1000-fold higher depending on the flanking elements. G nT motifs are therefore comprised of several modular nucleotide components which each exert a significant, quantifiable effect on the mutation rate. This work advances our ability to accurately identify the position and quantify the mutagenicity of hotspot motifs predicated on short nucleotide sequences.
| Original language | English |
|---|---|
| Article number | msaf183 |
| Journal | Molecular Biology and Evolution |
| Volume | 42 |
| Issue number | 8 |
| Early online date | 4 Aug 2025 |
| DOIs | |
| Publication status | Published - 31 Aug 2025 |
Data Availability Statement
Sequencing data and the custom R script generated for this work are available on the Open Science Framework, accessible via DOI: 10.17605/OSF.IO/HSYFX.Funding
This project was supported by a BBSRC NI grant (BB/T012994/1; awarded to T.B.T.) supporting J.S.H. and G.W., and a Royal Society Dorothy Hodgkin Research Fellowship (DH150169) awarded to and supporting T.B.T. This work was also supported in part by the Division of Intramural Research (DIR) of the National Library of Medicine (NLM), National Institutes of Health. The opinions expressed in this article are those of the author and do not reflect the view of the National Institutes of Health, the Department of Health and Human Services, or the US government. The authors would like to extend their thanks to Dr. Peter Lind, who kindly provided feedback on the manuscript and provided examples of GnT motifs documented in other research. This project was supported by a BBSRC NI grant (BB/ T012994/1; awarded to T.B.T.) supporting J.S.H. and G.W., and a Royal Society Dorothy Hodgkin Research Fellowship (DH150169) awarded to and supporting T.B.T. This work was also supported in part by the Division of Intramural Research (DIR) of the National Library of Medicine (NLM), National Institutes of Health. The opinions expressed in this article are those of the author and do not reflect the view of the National Institutes of Health, the Department of Health and Human Services, or the US government.
| Funders | Funder number |
|---|---|
| Biotechnology and Biological Sciences Research Council | BB/T012994/1 |
Keywords
- homopolymeric tract
- local nucleotide context
- mutagenic nucleotide motif
- mutation hotspot
- transversion mutation
ASJC Scopus subject areas
- Ecology, Evolution, Behavior and Systematics
- Molecular Biology
- Genetics
