TY - GEN
T1 - Approximation algorithms for multiple sequence alignment under a fixed evolutionary tree
AU - Ravi, R.
AU - Kececioglu, John D
PY - 1995
Y1 - 1995
N2 - We consider the problem of aligning sequences related by a given evolutionary tree: given a fixed tree with its leaves labeled with sequences, find ancestral sequences to label the internal nodes so as to minimize the total cost of all the edges in the tree. The cost of an edge is the edit distance between the sequences labeling its endpoints. In this paper, we consider the case when the given tree is a regular d-ary tree for some fixed d and provide a d+1/d-1-approximation algorithm for this problem that runs in time O(d(2kn)d+ n2k2d) where k is the number of leaves in the tree and n is the maximum length of any of the sequences labeling the leaves. We also consider a new bottleneck objective in labeling the internal nodes. In this version, we wish to find the labeling of the internal nodes that minimizes the maximum cost of any edge in the tree. For this problem we provide a simple 2δ + 1-approximation algorithm where 8 is the depth of the given undirected tree defined as the maximum over all internal nodes of the number of edges from the internal node to a closest leaf. For phylogenetic trees on n nodes that have no internal nodes of degree two, δ ≤ lg n.
AB - We consider the problem of aligning sequences related by a given evolutionary tree: given a fixed tree with its leaves labeled with sequences, find ancestral sequences to label the internal nodes so as to minimize the total cost of all the edges in the tree. The cost of an edge is the edit distance between the sequences labeling its endpoints. In this paper, we consider the case when the given tree is a regular d-ary tree for some fixed d and provide a d+1/d-1-approximation algorithm for this problem that runs in time O(d(2kn)d+ n2k2d) where k is the number of leaves in the tree and n is the maximum length of any of the sequences labeling the leaves. We also consider a new bottleneck objective in labeling the internal nodes. In this version, we wish to find the labeling of the internal nodes that minimizes the maximum cost of any edge in the tree. For this problem we provide a simple 2δ + 1-approximation algorithm where 8 is the depth of the given undirected tree defined as the maximum over all internal nodes of the number of edges from the internal node to a closest leaf. For phylogenetic trees on n nodes that have no internal nodes of degree two, δ ≤ lg n.
UR - http://www.scopus.com/inward/record.url?scp=84948977657&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84948977657&partnerID=8YFLogxK
U2 - 10.1007/3-540-60044-2_52
DO - 10.1007/3-540-60044-2_52
M3 - Conference contribution
AN - SCOPUS:84948977657
SN - 3540600442
SN - 9783540600442
VL - 937
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 330
EP - 339
BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PB - Springer Verlag
T2 - 6th Annual Symposium on Combinatorial Pattern Matching, CPM 1995
Y2 - 5 July 1995 through 7 July 1995
ER -