Resolving the complex Bordetella pertussis genome using barcoded nanopore sequencing

Research output: Contribution to journalArticle

Abstract

The genome of Bordetella pertussis is complex, with high G+C content and many repeats, each longer than 1000 bp. Long-read sequencing offers the opportunity to produce single-contig B. pertussis assemblies using sequencing reads which are longer than the repetitive sections, with the potential to reveal genomic features which were previously unobservable in multi-contig assemblies produced by short-read sequencing alone. We used an R9.4 MinION flow cell and barcoding to sequence five B. pertussis strains in a single sequencing run. We then trialled combinations of the many nanopore user community-built long-read analysis tools to establish the current optimal assembly pipeline for B. pertussis genome sequences. This pipeline produced closed genome sequences for four strains, allowing visualization of inter-strain genomic rearrangement. Read mapping to the Tohama I reference genome suggests that the remaining strain contains an ultra-long duplicated region (almost 200 kbp), which was not resolved by our pipeline; further investigation also revealed that a second strain that was seemingly resolved by our pipeline may contain an even longer duplication, albeit in a small subset of cells. We have therefore demonstrated the ability to resolve the structure of several B. pertussis strains per single barcoded nanopore flow cell, but the genomes with highest complexity (e.g. very large duplicated regions) remain only partially resolved using the standard library preparation and will require an alternative library preparation method. For full strain characterization, we recommend hybrid assembly of long and short reads together; for comparison of genome arrangement, assembly using long reads alone is sufficient.

Original languageEnglish
Pages (from-to)1-13
Number of pages13
JournalMicrobial Genomics
Volume4
Issue number11
DOIs
Publication statusPublished - 28 Nov 2018

Keywords

  • Bordetella pertussis
  • Oxford nanopore
  • benchmarking
  • duplications
  • genome assembly
  • long-read sequencing

ASJC Scopus subject areas

  • Medicine(all)

Fingerprint Dive into the research topics of 'Resolving the complex <i>Bordetella pertussis</i> genome using barcoded nanopore sequencing'. Together they form a unique fingerprint.

Cite this