A genome-wide pairwise-identity-based proposal for the classification of viruses in the genus Mastrevirus (family Geminiviridae)

Brejnev Muhire, Darren P. Martin, Judith K Brown, Jesús Navas-Castillo, Enrique Moriones, F. Murilo Zerbini, Rafael Rivera-Bustamante, V. G. Malathi, Rob W. Briddon, Arvind Varsani

Research output: Contribution to journalArticle

137 Citations (Scopus)

Abstract

Recent advances in the ease with which the genomes of small circular single-stranded DNA viruses can be amplified, cloned, and sequenced have greatly accelerated the rate at which full genome sequences of mastreviruses (family Geminiviridae, genus Mastrevirus) are being deposited in public sequence databases. Although guidelines currently exist for species-level classification of newly determined, complete mastrevirus genome sequences, these are difficult to apply to large sequence datasets and are permissive enough that, effectively, a high degree of leeway exists for the proposal of new species and strains. The lack of a standardised and rigorous method for testing whether a new genome sequence deserves such a classification is resulting in increasing numbers of questionable mastrevirus species proposals. Importantly, the recommended sequence alignment and pairwise identity calculation protocols of the current guidelines could easily be modified to make the classification of newly determined mastrevirus genome sequences significantly more objective. Here, we propose modified versions of these protocols that should substantially minimise the degree of classification inconsistency that is permissible under the current system. To facilitate the objective application of these guidelines for mastrevirus species demarcation, we additionally present a user-friendly computer program, SDT (species demarcation tool), for calculating and graphically displaying pairwise genome identity scores. We apply SDT to the 939 full genome sequences of mastreviruses that were publically available in May 2012, and based on the distribution of pairwise identity scores yielded by our protocol, we propose mastrevirus species and strain demarcation thresholds of >78 % and >94 % identity, respectively.

Original languageEnglish (US)
Pages (from-to)1411-1424
Number of pages14
JournalArchives of Virology
Volume158
Issue number6
DOIs
StatePublished - Jun 2013

Fingerprint

Geminiviridae
Genome
Viruses
Guidelines
Circular DNA
DNA Viruses
Sequence Alignment
Single-Stranded DNA
Software
Databases

ASJC Scopus subject areas

  • Virology

Cite this

A genome-wide pairwise-identity-based proposal for the classification of viruses in the genus Mastrevirus (family Geminiviridae). / Muhire, Brejnev; Martin, Darren P.; Brown, Judith K; Navas-Castillo, Jesús; Moriones, Enrique; Zerbini, F. Murilo; Rivera-Bustamante, Rafael; Malathi, V. G.; Briddon, Rob W.; Varsani, Arvind.

In: Archives of Virology, Vol. 158, No. 6, 06.2013, p. 1411-1424.

Research output: Contribution to journalArticle

Muhire, B, Martin, DP, Brown, JK, Navas-Castillo, J, Moriones, E, Zerbini, FM, Rivera-Bustamante, R, Malathi, VG, Briddon, RW & Varsani, A 2013, 'A genome-wide pairwise-identity-based proposal for the classification of viruses in the genus Mastrevirus (family Geminiviridae)', Archives of Virology, vol. 158, no. 6, pp. 1411-1424. https://doi.org/10.1007/s00705-012-1601-7
Muhire, Brejnev ; Martin, Darren P. ; Brown, Judith K ; Navas-Castillo, Jesús ; Moriones, Enrique ; Zerbini, F. Murilo ; Rivera-Bustamante, Rafael ; Malathi, V. G. ; Briddon, Rob W. ; Varsani, Arvind. / A genome-wide pairwise-identity-based proposal for the classification of viruses in the genus Mastrevirus (family Geminiviridae). In: Archives of Virology. 2013 ; Vol. 158, No. 6. pp. 1411-1424.
@article{30e6bb140de14630807a412420032ca8,
title = "A genome-wide pairwise-identity-based proposal for the classification of viruses in the genus Mastrevirus (family Geminiviridae)",
abstract = "Recent advances in the ease with which the genomes of small circular single-stranded DNA viruses can be amplified, cloned, and sequenced have greatly accelerated the rate at which full genome sequences of mastreviruses (family Geminiviridae, genus Mastrevirus) are being deposited in public sequence databases. Although guidelines currently exist for species-level classification of newly determined, complete mastrevirus genome sequences, these are difficult to apply to large sequence datasets and are permissive enough that, effectively, a high degree of leeway exists for the proposal of new species and strains. The lack of a standardised and rigorous method for testing whether a new genome sequence deserves such a classification is resulting in increasing numbers of questionable mastrevirus species proposals. Importantly, the recommended sequence alignment and pairwise identity calculation protocols of the current guidelines could easily be modified to make the classification of newly determined mastrevirus genome sequences significantly more objective. Here, we propose modified versions of these protocols that should substantially minimise the degree of classification inconsistency that is permissible under the current system. To facilitate the objective application of these guidelines for mastrevirus species demarcation, we additionally present a user-friendly computer program, SDT (species demarcation tool), for calculating and graphically displaying pairwise genome identity scores. We apply SDT to the 939 full genome sequences of mastreviruses that were publically available in May 2012, and based on the distribution of pairwise identity scores yielded by our protocol, we propose mastrevirus species and strain demarcation thresholds of >78 {\%} and >94 {\%} identity, respectively.",
author = "Brejnev Muhire and Martin, {Darren P.} and Brown, {Judith K} and Jes{\'u}s Navas-Castillo and Enrique Moriones and Zerbini, {F. Murilo} and Rafael Rivera-Bustamante and Malathi, {V. G.} and Briddon, {Rob W.} and Arvind Varsani",
year = "2013",
month = "6",
doi = "10.1007/s00705-012-1601-7",
language = "English (US)",
volume = "158",
pages = "1411--1424",
journal = "Archives of Virology",
issn = "0304-8608",
publisher = "Springer Wien",
number = "6",

}

TY - JOUR

T1 - A genome-wide pairwise-identity-based proposal for the classification of viruses in the genus Mastrevirus (family Geminiviridae)

AU - Muhire, Brejnev

AU - Martin, Darren P.

AU - Brown, Judith K

AU - Navas-Castillo, Jesús

AU - Moriones, Enrique

AU - Zerbini, F. Murilo

AU - Rivera-Bustamante, Rafael

AU - Malathi, V. G.

AU - Briddon, Rob W.

AU - Varsani, Arvind

PY - 2013/6

Y1 - 2013/6

N2 - Recent advances in the ease with which the genomes of small circular single-stranded DNA viruses can be amplified, cloned, and sequenced have greatly accelerated the rate at which full genome sequences of mastreviruses (family Geminiviridae, genus Mastrevirus) are being deposited in public sequence databases. Although guidelines currently exist for species-level classification of newly determined, complete mastrevirus genome sequences, these are difficult to apply to large sequence datasets and are permissive enough that, effectively, a high degree of leeway exists for the proposal of new species and strains. The lack of a standardised and rigorous method for testing whether a new genome sequence deserves such a classification is resulting in increasing numbers of questionable mastrevirus species proposals. Importantly, the recommended sequence alignment and pairwise identity calculation protocols of the current guidelines could easily be modified to make the classification of newly determined mastrevirus genome sequences significantly more objective. Here, we propose modified versions of these protocols that should substantially minimise the degree of classification inconsistency that is permissible under the current system. To facilitate the objective application of these guidelines for mastrevirus species demarcation, we additionally present a user-friendly computer program, SDT (species demarcation tool), for calculating and graphically displaying pairwise genome identity scores. We apply SDT to the 939 full genome sequences of mastreviruses that were publically available in May 2012, and based on the distribution of pairwise identity scores yielded by our protocol, we propose mastrevirus species and strain demarcation thresholds of >78 % and >94 % identity, respectively.

AB - Recent advances in the ease with which the genomes of small circular single-stranded DNA viruses can be amplified, cloned, and sequenced have greatly accelerated the rate at which full genome sequences of mastreviruses (family Geminiviridae, genus Mastrevirus) are being deposited in public sequence databases. Although guidelines currently exist for species-level classification of newly determined, complete mastrevirus genome sequences, these are difficult to apply to large sequence datasets and are permissive enough that, effectively, a high degree of leeway exists for the proposal of new species and strains. The lack of a standardised and rigorous method for testing whether a new genome sequence deserves such a classification is resulting in increasing numbers of questionable mastrevirus species proposals. Importantly, the recommended sequence alignment and pairwise identity calculation protocols of the current guidelines could easily be modified to make the classification of newly determined mastrevirus genome sequences significantly more objective. Here, we propose modified versions of these protocols that should substantially minimise the degree of classification inconsistency that is permissible under the current system. To facilitate the objective application of these guidelines for mastrevirus species demarcation, we additionally present a user-friendly computer program, SDT (species demarcation tool), for calculating and graphically displaying pairwise genome identity scores. We apply SDT to the 939 full genome sequences of mastreviruses that were publically available in May 2012, and based on the distribution of pairwise identity scores yielded by our protocol, we propose mastrevirus species and strain demarcation thresholds of >78 % and >94 % identity, respectively.

UR - http://www.scopus.com/inward/record.url?scp=84878598727&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84878598727&partnerID=8YFLogxK

U2 - 10.1007/s00705-012-1601-7

DO - 10.1007/s00705-012-1601-7

M3 - Article

C2 - 23340592

AN - SCOPUS:84878598727

VL - 158

SP - 1411

EP - 1424

JO - Archives of Virology

JF - Archives of Virology

SN - 0304-8608

IS - 6

ER -