Addressing the shortcomings of three recent bayesian methods for detecting interspecific recombination in DNA sequence alignments

Dirk Husmeier, Alexander Vassilios Mantzaris

Research output: Contribution to journalArticle

7 Citations (Scopus)
97 Downloads (Pure)

Abstract

We address a potential shortcoming of three probabilistic models for detecting interspecific recombination in DNA sequence alignments: the multiple change-point model (MCP) of Suchard et al. (2003), the dual multiple change-point model (DMCP) of Minin et al. (2005), and the phylogenetic factorial hidden Markov model (PFHMM) of Husmeier (2005). These models are based on the Bayesian paradigm, which requires the solution of an integral over the space of branch lengths. To render this integration analytically tractable, all three models make the same assumption that the vectors of branch lengths of the phylogenetic tree are independent among sites. While this approximation reduces the computational complexity considerably, we show that it leads to the systematic prediction of spurious topology changes in the Felsenstein zone, that is, the area in the branch lengths configuration space where maximum parsimony consistently infers the wrong topology due to long-branch attraction. We apply two Bayesian hypothesis tests, based on an inter- and an intra-model approach to estimating the marginal likelihood. We then propose a revised model that addresses these shortcomings, and compare it with the aforementioned models on a set of synthetic DNA sequence alignments systematically generated around the Felsenstein zone.
Original languageEnglish
JournalStatistical Applications in Genetics and Molecular Biology
Volume7
Issue number1
DOIs
Publication statusPublished - Nov 2008

    Fingerprint

Keywords

  • Markov chain Monte Carlo
  • incidental parameters
  • Phylogenetics
  • recombination
  • Bayesian modelling
  • consistency
  • Felsenstein zone
  • hidden Markov model
  • change-point model

Cite this