Combining phylogenomic and supermatrix approaches, and a time-calibrated phylogeny for squamate reptiles (lizards and snakes) based on 52 genes and 4162 species

Yuchi Zheng, John J Wiens

Research output: Contribution to journalArticle

172 Citations (Scopus)

Abstract

Two common approaches for estimating phylogenies in species-rich groups are to: (i) sample many loci for few species (e.g. phylogenomic approach), or (ii) sample many species for fewer loci (e.g. supermatrix approach). In theory, these approaches can be combined to simultaneously resolve both higher-level relationships (with many genes) and species-level relationships (with many taxa). However, fundamental questions remain unanswered about this combined approach. First, will higher-level relationships more closely resemble those estimated from many genes or those from many taxa? Second, will branch support increase for higher-level relationships (relative to the estimate from many taxa)? Here, we address these questions in squamate reptiles. We combined two recently published datasets, one based on 44 genes for 161 species, and one based on 12 genes for 4161 species. The likelihood-based tree from the combined matrix (52 genes, 4162 species) shared more higher-level clades with the 44-gene tree (90% vs. 77% shared). Branch support for higher level-relationships was marginally higher than in the 12-gene tree, but lower than in the 44-gene tree. Relationships were apparently not obscured by the abundant missing data (92% overall). We provide a time-calibrated phylogeny based on extensive sampling of genes and taxa as a resource for comparative studies.

Original languageEnglish (US)
Pages (from-to)537-547
Number of pages11
JournalMolecular Phylogenetics and Evolution
Volume94
DOIs
StatePublished - Jan 1 2016

Fingerprint

Reptiles
Lizards
Snakes
Squamata
Phylogeny
snake
reptile
lizard
snakes
reptiles
lizards
phylogeny
gene
Genes
genes
loci
sampling
comparative study
matrix

Keywords

  • Missing data
  • Phylogenomic
  • Phylogeny
  • Squamata
  • Supermatrix

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • Molecular Biology
  • Genetics

Cite this

@article{650f09579a944f2cab11be1d56f550fa,
title = "Combining phylogenomic and supermatrix approaches, and a time-calibrated phylogeny for squamate reptiles (lizards and snakes) based on 52 genes and 4162 species",
abstract = "Two common approaches for estimating phylogenies in species-rich groups are to: (i) sample many loci for few species (e.g. phylogenomic approach), or (ii) sample many species for fewer loci (e.g. supermatrix approach). In theory, these approaches can be combined to simultaneously resolve both higher-level relationships (with many genes) and species-level relationships (with many taxa). However, fundamental questions remain unanswered about this combined approach. First, will higher-level relationships more closely resemble those estimated from many genes or those from many taxa? Second, will branch support increase for higher-level relationships (relative to the estimate from many taxa)? Here, we address these questions in squamate reptiles. We combined two recently published datasets, one based on 44 genes for 161 species, and one based on 12 genes for 4161 species. The likelihood-based tree from the combined matrix (52 genes, 4162 species) shared more higher-level clades with the 44-gene tree (90{\%} vs. 77{\%} shared). Branch support for higher level-relationships was marginally higher than in the 12-gene tree, but lower than in the 44-gene tree. Relationships were apparently not obscured by the abundant missing data (92{\%} overall). We provide a time-calibrated phylogeny based on extensive sampling of genes and taxa as a resource for comparative studies.",
keywords = "Missing data, Phylogenomic, Phylogeny, Squamata, Supermatrix",
author = "Yuchi Zheng and Wiens, {John J}",
year = "2016",
month = "1",
day = "1",
doi = "10.1016/j.ympev.2015.10.009",
language = "English (US)",
volume = "94",
pages = "537--547",
journal = "Molecular Phylogenetics and Evolution",
issn = "1055-7903",
publisher = "Academic Press Inc.",

}

TY - JOUR

T1 - Combining phylogenomic and supermatrix approaches, and a time-calibrated phylogeny for squamate reptiles (lizards and snakes) based on 52 genes and 4162 species

AU - Zheng, Yuchi

AU - Wiens, John J

PY - 2016/1/1

Y1 - 2016/1/1

N2 - Two common approaches for estimating phylogenies in species-rich groups are to: (i) sample many loci for few species (e.g. phylogenomic approach), or (ii) sample many species for fewer loci (e.g. supermatrix approach). In theory, these approaches can be combined to simultaneously resolve both higher-level relationships (with many genes) and species-level relationships (with many taxa). However, fundamental questions remain unanswered about this combined approach. First, will higher-level relationships more closely resemble those estimated from many genes or those from many taxa? Second, will branch support increase for higher-level relationships (relative to the estimate from many taxa)? Here, we address these questions in squamate reptiles. We combined two recently published datasets, one based on 44 genes for 161 species, and one based on 12 genes for 4161 species. The likelihood-based tree from the combined matrix (52 genes, 4162 species) shared more higher-level clades with the 44-gene tree (90% vs. 77% shared). Branch support for higher level-relationships was marginally higher than in the 12-gene tree, but lower than in the 44-gene tree. Relationships were apparently not obscured by the abundant missing data (92% overall). We provide a time-calibrated phylogeny based on extensive sampling of genes and taxa as a resource for comparative studies.

AB - Two common approaches for estimating phylogenies in species-rich groups are to: (i) sample many loci for few species (e.g. phylogenomic approach), or (ii) sample many species for fewer loci (e.g. supermatrix approach). In theory, these approaches can be combined to simultaneously resolve both higher-level relationships (with many genes) and species-level relationships (with many taxa). However, fundamental questions remain unanswered about this combined approach. First, will higher-level relationships more closely resemble those estimated from many genes or those from many taxa? Second, will branch support increase for higher-level relationships (relative to the estimate from many taxa)? Here, we address these questions in squamate reptiles. We combined two recently published datasets, one based on 44 genes for 161 species, and one based on 12 genes for 4161 species. The likelihood-based tree from the combined matrix (52 genes, 4162 species) shared more higher-level clades with the 44-gene tree (90% vs. 77% shared). Branch support for higher level-relationships was marginally higher than in the 12-gene tree, but lower than in the 44-gene tree. Relationships were apparently not obscured by the abundant missing data (92% overall). We provide a time-calibrated phylogeny based on extensive sampling of genes and taxa as a resource for comparative studies.

KW - Missing data

KW - Phylogenomic

KW - Phylogeny

KW - Squamata

KW - Supermatrix

UR - http://www.scopus.com/inward/record.url?scp=84949033709&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84949033709&partnerID=8YFLogxK

U2 - 10.1016/j.ympev.2015.10.009

DO - 10.1016/j.ympev.2015.10.009

M3 - Article

C2 - 26475614

AN - SCOPUS:84949033709

VL - 94

SP - 537

EP - 547

JO - Molecular Phylogenetics and Evolution

JF - Molecular Phylogenetics and Evolution

SN - 1055-7903

ER -