Skip to main navigation Skip to search Skip to main content

Inferring the Deep Past from Molecular Data

Tom A. Williams, Dominik Schrempf, Gergely J. Szöllosi, Cymon J. Cox, Peter G. Foster, T. Martin Embley

Research output: Contribution to journalArticlepeer-review

25   Link opens in a new tab Citations (SciVal)

Abstract

There is an expectation that analyses of molecular sequences might be able to distinguish between alternative hypotheses for ancient relationships, but the phylogenetic methods used and types of data analyzed are of critical importance in any attempt to recover historical signal. Here, we discuss some common issues that can influence the topology of trees obtained when using overly simple models to analyze molecular data that often display complicated patterns of sequence heterogeneity. To illustrate our discussion, we have used three examples of inferred relationships which have changed radically as models and methods of analysis have improved. In two of these examples, the sister-group relationship between thermophilic Thermus and mesophilic Deinococcus, and the position of long-branch Microsporidia among eukaryotes, we show that recovering what is now generally considered to be the correct tree is critically dependent on the fit between model and data. In the third example, the position of eukaryotes in the tree of life, the hypothesis that is currently supported by the best available methods is fundamentally different from the classical view of relationships between major cellular domains. Since heterogeneity appears to be pervasive and varied among all molecular sequence data, and even the best available models can still struggle to deal with some problems, the issues we discuss are generally relevant to phylogenetic analyses. It remains essential to maintain a critical attitude to all trees as hypotheses of relationship that may change with more data and better methods.

Original languageEnglish
Article numberevab067
JournalGenome biology and evolution
Volume13
Issue number5
Early online date27 Mar 2021
DOIs
Publication statusPublished - 7 May 2021

Bibliographical note

Publisher Copyright:
© 2021 The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Acknowledgements

T.A.W. is supported by a Royal Society University Fellowship and NERC (Grant No. NE/P00251X/1). C.J.C. received Portuguese national funds from Foundation for Science and Technology (FCT) through project UIDB/04326/2020, and from the operational programs CRESC Algarve 2020 and COMPETE 2020 through projects EMBRC.PT ALG-01-0145-FEDER-022121 and BIODATA.PT ALG-01-0145-FEDER-022231. D.S. and G.J.S. received funding from the European Research Council under the European Union’s Horizon 2020 research and innovation program under Grant Agreement No. 714774.

Keywords

  • eukaryote origins
  • microbial evolution
  • phylogenetics
  • substitution models
  • tree of life

ASJC Scopus subject areas

  • General Medicine

Fingerprint

Dive into the research topics of 'Inferring the Deep Past from Molecular Data'. Together they form a unique fingerprint.

Cite this