High-resolution sweep metagenomics using fast probabilistic inference [version 2; peer review: 2 approved]

Tommi Mäklin, Teemu Kallonen, Sophia David, Christine J. Boinett, Ben Pascoe, Guillaume Méric, David M. Aanensen, Edward J. Feil, Stephen Baker, Julian Parkhill, Samuel K. Sheppard, Jukka Corander, Antti Honkela

Research output: Contribution to journalArticlepeer-review

2 Citations (SciVal)

Abstract

Determining the composition of bacterial communities beyond the level of a genus or species is challenging because of the considerable overlap between genomes representing close relatives. Here, we present the mSWEEP pipeline for identifying and estimating the relative sequence abundances of bacterial lineages from plate sweeps of enrichment cultures. mSWEEP leverages biologically grouped sequence assembly databases, applying probabilistic modelling, and provides controls for false positive results. Using sequencing data from major pathogens, we demonstrate significant improvements in lineage quantification and detection accuracy. Our pipeline facilitates investigating cultures comprising mixtures of bacteria, and opens up a new field of plate sweep metagenomics.

Original languageEnglish
Article number14
Number of pages20
JournalWellcome Open Research
Volume5
Early online date30 Jan 2020
DOIs
Publication statusPublished - 8 Oct 2021

Bibliographical note

Funding Information:
Flagship programme (Finnish Center for Artificial Intelligence FCAI; to JC and AH). TK, JC, DA and EJF are supported by the JPI-AMR consortium SpARK (MR/R00241X/1). JC was funded by the ERC (grant no. 742158). TK was funded by the Norwegian Research Council JPIAMR (grant no. 144501). SB is a Sir Henry Dale Fellow, jointly funded by the Wellcome Trust and the Royal Society [100087]. Sequencing of the Vietnamese E. coli samples was supported by the Wellcome Trust [098051]. Computational resources were provided by the 'Finnish Grid and Cloud Infrastructure' (persistent identifier urn:nbn:fi:research-infras-2016072533).

Funding Information:
Grant information: This work was supported by the Academy of Finland (grants no. 259440 and 310261; to TM and AH) as well as the

Publisher Copyright:
© 2020. Mäklin T et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Keywords

  • bacterial strain identification
  • metagenomics
  • microbial communities
  • plate sweeps
  • probabilistic modeling

ASJC Scopus subject areas

  • Medicine (miscellaneous)
  • General Biochemistry,Genetics and Molecular Biology

Fingerprint

Dive into the research topics of 'High-resolution sweep metagenomics using fast probabilistic inference [version 2; peer review: 2 approved]'. Together they form a unique fingerprint.

Cite this