Abstract
The reliability of phylogenies reconstructed from data on multigene families is investigated via simulation. The evolutionary scenario used is a character-based model of a two-gene family in four species in which clocklike divergence is postulated but neither convergence nor reversal is allowed except as a result of recombination and gene conversion. Thus, any homoplasy emerging from parsimony reconstructions from the simulated data matrices can be attributed to concerted evolution. The probabilities of correctly reconstructing two standard trees are estimated by replicate runs of the simulation. One standard tree (the OP or "orthology/ paralogy" tree) reflects the true gene genealogy in the absence of concerted evolution; the other (the CE or "concerted evolution" tree) depicts gene relationships under complete homogenization of the gene family. The probability of correct reconstruction of the OP tree declines quickly as concerted evolution increases, but above an intermediate level of concerted evolution the probability of correctly inferring the CE tree increases rapidly. Trees similar but not identical to the correct trees can be reconstructed above or below the critical intermediate level of concerted evolution. Levels of homoplasy and numbers of equally parsimonious minimal trees are maximized, and bootstrap confidence levels are minimized, near this intermediate level of concerted evolution. When reconstructing the correct gene tree is the goal, both consistency indices and bootstrap levels will show misleadingly high values when concerted evolution is high. However, because the correct species tree can be inferred from either the OP or CE tree (in the absence of homoplasy from sources other than concerted evolution), these same measures correlate well with fidelity of reconstructing the species tree.
Original language | English (US) |
---|---|
Pages (from-to) | 4-17 |
Number of pages | 14 |
Journal | Systematic Biology |
Volume | 41 |
Issue number | 1 |
State | Published - Mar 1992 |
Fingerprint
Keywords
- Concerted evolution
- Homoplasy
- Multigene family
- Parsimony
- Phylogeny
ASJC Scopus subject areas
- Ecology, Evolution, Behavior and Systematics
Cite this
Reconstruction of organismal and gene phylogenies from data on multigene families : Concerted evolution, homoplasy, and confidence. / Sanderson, Michael; Doyle, Jeff J.
In: Systematic Biology, Vol. 41, No. 1, 03.1992, p. 4-17.Research output: Contribution to journal › Article
}
TY - JOUR
T1 - Reconstruction of organismal and gene phylogenies from data on multigene families
T2 - Concerted evolution, homoplasy, and confidence
AU - Sanderson, Michael
AU - Doyle, Jeff J.
PY - 1992/3
Y1 - 1992/3
N2 - The reliability of phylogenies reconstructed from data on multigene families is investigated via simulation. The evolutionary scenario used is a character-based model of a two-gene family in four species in which clocklike divergence is postulated but neither convergence nor reversal is allowed except as a result of recombination and gene conversion. Thus, any homoplasy emerging from parsimony reconstructions from the simulated data matrices can be attributed to concerted evolution. The probabilities of correctly reconstructing two standard trees are estimated by replicate runs of the simulation. One standard tree (the OP or "orthology/ paralogy" tree) reflects the true gene genealogy in the absence of concerted evolution; the other (the CE or "concerted evolution" tree) depicts gene relationships under complete homogenization of the gene family. The probability of correct reconstruction of the OP tree declines quickly as concerted evolution increases, but above an intermediate level of concerted evolution the probability of correctly inferring the CE tree increases rapidly. Trees similar but not identical to the correct trees can be reconstructed above or below the critical intermediate level of concerted evolution. Levels of homoplasy and numbers of equally parsimonious minimal trees are maximized, and bootstrap confidence levels are minimized, near this intermediate level of concerted evolution. When reconstructing the correct gene tree is the goal, both consistency indices and bootstrap levels will show misleadingly high values when concerted evolution is high. However, because the correct species tree can be inferred from either the OP or CE tree (in the absence of homoplasy from sources other than concerted evolution), these same measures correlate well with fidelity of reconstructing the species tree.
AB - The reliability of phylogenies reconstructed from data on multigene families is investigated via simulation. The evolutionary scenario used is a character-based model of a two-gene family in four species in which clocklike divergence is postulated but neither convergence nor reversal is allowed except as a result of recombination and gene conversion. Thus, any homoplasy emerging from parsimony reconstructions from the simulated data matrices can be attributed to concerted evolution. The probabilities of correctly reconstructing two standard trees are estimated by replicate runs of the simulation. One standard tree (the OP or "orthology/ paralogy" tree) reflects the true gene genealogy in the absence of concerted evolution; the other (the CE or "concerted evolution" tree) depicts gene relationships under complete homogenization of the gene family. The probability of correct reconstruction of the OP tree declines quickly as concerted evolution increases, but above an intermediate level of concerted evolution the probability of correctly inferring the CE tree increases rapidly. Trees similar but not identical to the correct trees can be reconstructed above or below the critical intermediate level of concerted evolution. Levels of homoplasy and numbers of equally parsimonious minimal trees are maximized, and bootstrap confidence levels are minimized, near this intermediate level of concerted evolution. When reconstructing the correct gene tree is the goal, both consistency indices and bootstrap levels will show misleadingly high values when concerted evolution is high. However, because the correct species tree can be inferred from either the OP or CE tree (in the absence of homoplasy from sources other than concerted evolution), these same measures correlate well with fidelity of reconstructing the species tree.
KW - Concerted evolution
KW - Homoplasy
KW - Multigene family
KW - Parsimony
KW - Phylogeny
UR - http://www.scopus.com/inward/record.url?scp=11944255719&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=11944255719&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:11944255719
VL - 41
SP - 4
EP - 17
JO - Systematic Biology
JF - Systematic Biology
SN - 1063-5157
IS - 1
ER -