Boosting alignment accuracy by adaptive local realignment

Dan DeBlasio, John D Kececioglu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

While mutation rates can vary markedly over the residues of a protein, multiple sequence alignment tools typically use the same values for their scoring-function parameters across a protein’s entire length. We present a new approach, called adaptive local realignment, that in contrast automatically adapts to the diversity of mutation rates along protein sequences. This builds upon a recent technique known as parameter advising that finds global parameter settings for aligners, to adaptively find local settings. Our approach in essence identifies local regions with low estimated accuracy, constructs a set of candidate realignments using a carefully-chosen collection of parameter settings, and replaces the region if a realignment has higher estimated accuracy. This new method of local parameter advising, when combined with prior methods for global advising, boosts alignment accuracy as much as 26% over the best default setting on hard-to-align protein benchmarks, and by 6.4% over global advising alone. Adaptive local realignment, implemented within the Opal aligner using the Facet accuracy estimator, is available at facet.cs.arizona.edu.

Original languageEnglish (US)
Title of host publicationResearch in Computational Molecular Biology - 21st Annual International Conference, RECOMB 2017, Proceedings
PublisherSpringer Verlag
Pages1-17
Number of pages17
Volume10229 LNCS
ISBN (Print)9783319569697
DOIs
StatePublished - 2017
Event21st Annual International Conference on Research in Computational Molecular Biology, RECOMB 2017 - Hong Kong, China
Duration: May 3 2017May 7 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10229 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other21st Annual International Conference on Research in Computational Molecular Biology, RECOMB 2017
CountryChina
CityHong Kong
Period5/3/175/7/17

Fingerprint

Boosting
Alignment
Proteins
Protein
Facet
Mutation
Multiple Sequence Alignment
Protein Sequence
Scoring
Entire
Vary
Benchmark
Estimator

Keywords

  • Alignment accuracy
  • Iterative refinement
  • Local mutation rates
  • Multiple sequence alignment
  • Parameter advising

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

DeBlasio, D., & Kececioglu, J. D. (2017). Boosting alignment accuracy by adaptive local realignment. In Research in Computational Molecular Biology - 21st Annual International Conference, RECOMB 2017, Proceedings (Vol. 10229 LNCS, pp. 1-17). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10229 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-319-56970-3_1

Boosting alignment accuracy by adaptive local realignment. / DeBlasio, Dan; Kececioglu, John D.

Research in Computational Molecular Biology - 21st Annual International Conference, RECOMB 2017, Proceedings. Vol. 10229 LNCS Springer Verlag, 2017. p. 1-17 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10229 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

DeBlasio, D & Kececioglu, JD 2017, Boosting alignment accuracy by adaptive local realignment. in Research in Computational Molecular Biology - 21st Annual International Conference, RECOMB 2017, Proceedings. vol. 10229 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 10229 LNCS, Springer Verlag, pp. 1-17, 21st Annual International Conference on Research in Computational Molecular Biology, RECOMB 2017, Hong Kong, China, 5/3/17. https://doi.org/10.1007/978-3-319-56970-3_1
DeBlasio D, Kececioglu JD. Boosting alignment accuracy by adaptive local realignment. In Research in Computational Molecular Biology - 21st Annual International Conference, RECOMB 2017, Proceedings. Vol. 10229 LNCS. Springer Verlag. 2017. p. 1-17. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-56970-3_1
DeBlasio, Dan ; Kececioglu, John D. / Boosting alignment accuracy by adaptive local realignment. Research in Computational Molecular Biology - 21st Annual International Conference, RECOMB 2017, Proceedings. Vol. 10229 LNCS Springer Verlag, 2017. pp. 1-17 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{cbfa30b1515649d9acc9c80d90253de2,
title = "Boosting alignment accuracy by adaptive local realignment",
abstract = "While mutation rates can vary markedly over the residues of a protein, multiple sequence alignment tools typically use the same values for their scoring-function parameters across a protein’s entire length. We present a new approach, called adaptive local realignment, that in contrast automatically adapts to the diversity of mutation rates along protein sequences. This builds upon a recent technique known as parameter advising that finds global parameter settings for aligners, to adaptively find local settings. Our approach in essence identifies local regions with low estimated accuracy, constructs a set of candidate realignments using a carefully-chosen collection of parameter settings, and replaces the region if a realignment has higher estimated accuracy. This new method of local parameter advising, when combined with prior methods for global advising, boosts alignment accuracy as much as 26{\%} over the best default setting on hard-to-align protein benchmarks, and by 6.4{\%} over global advising alone. Adaptive local realignment, implemented within the Opal aligner using the Facet accuracy estimator, is available at facet.cs.arizona.edu.",
keywords = "Alignment accuracy, Iterative refinement, Local mutation rates, Multiple sequence alignment, Parameter advising",
author = "Dan DeBlasio and Kececioglu, {John D}",
year = "2017",
doi = "10.1007/978-3-319-56970-3_1",
language = "English (US)",
isbn = "9783319569697",
volume = "10229 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "1--17",
booktitle = "Research in Computational Molecular Biology - 21st Annual International Conference, RECOMB 2017, Proceedings",
address = "Germany",

}

TY - GEN

T1 - Boosting alignment accuracy by adaptive local realignment

AU - DeBlasio, Dan

AU - Kececioglu, John D

PY - 2017

Y1 - 2017

N2 - While mutation rates can vary markedly over the residues of a protein, multiple sequence alignment tools typically use the same values for their scoring-function parameters across a protein’s entire length. We present a new approach, called adaptive local realignment, that in contrast automatically adapts to the diversity of mutation rates along protein sequences. This builds upon a recent technique known as parameter advising that finds global parameter settings for aligners, to adaptively find local settings. Our approach in essence identifies local regions with low estimated accuracy, constructs a set of candidate realignments using a carefully-chosen collection of parameter settings, and replaces the region if a realignment has higher estimated accuracy. This new method of local parameter advising, when combined with prior methods for global advising, boosts alignment accuracy as much as 26% over the best default setting on hard-to-align protein benchmarks, and by 6.4% over global advising alone. Adaptive local realignment, implemented within the Opal aligner using the Facet accuracy estimator, is available at facet.cs.arizona.edu.

AB - While mutation rates can vary markedly over the residues of a protein, multiple sequence alignment tools typically use the same values for their scoring-function parameters across a protein’s entire length. We present a new approach, called adaptive local realignment, that in contrast automatically adapts to the diversity of mutation rates along protein sequences. This builds upon a recent technique known as parameter advising that finds global parameter settings for aligners, to adaptively find local settings. Our approach in essence identifies local regions with low estimated accuracy, constructs a set of candidate realignments using a carefully-chosen collection of parameter settings, and replaces the region if a realignment has higher estimated accuracy. This new method of local parameter advising, when combined with prior methods for global advising, boosts alignment accuracy as much as 26% over the best default setting on hard-to-align protein benchmarks, and by 6.4% over global advising alone. Adaptive local realignment, implemented within the Opal aligner using the Facet accuracy estimator, is available at facet.cs.arizona.edu.

KW - Alignment accuracy

KW - Iterative refinement

KW - Local mutation rates

KW - Multiple sequence alignment

KW - Parameter advising

UR - http://www.scopus.com/inward/record.url?scp=85018377751&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85018377751&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-56970-3_1

DO - 10.1007/978-3-319-56970-3_1

M3 - Conference contribution

AN - SCOPUS:85018377751

SN - 9783319569697

VL - 10229 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 1

EP - 17

BT - Research in Computational Molecular Biology - 21st Annual International Conference, RECOMB 2017, Proceedings

PB - Springer Verlag

ER -