RiTE database: A resource database for genus-wide rice genomics and evolutionary biology

Dario Copetti, Jianwei Zhang, Moaine El Baidouri, Dongying Gao, Jun Wang, Elena Barghini, Rosa M. Cossu, Angelina Angelova, Carlos E. Maldonado L., Stefan Roffler, Hajime Ohyanagi, Thomas Wicker, Chuanzhu Fan, Andrea Zuccolo, Mingsheng Chen, Antonio Costa de Oliveira, Bin Han, Robert Henry, Yue ie Hsing, Nori Kurata, Wen Wang, Scott A. Jackson, Olivier Panaud, Rod A. Wing

Research output: Contribution to journal (Article)

23 Citations (Scopus)

Abstract

Background: Comparative evolutionary analysis of whole genomes requires not only accurate annotation of gene space, but also proper annotation of the repetitive fraction, which is often the largest component of most if not all genomes larger than 50 kb in size.

Results: Here we present the Rice TE database (RiTE-db) - a genus-wide collection of transposable elements and repeated sequences across 11 diploid species of the genus Oryza and the closely-related out-group Leersia perrieri. The database consists of more than 170,000 entries divided into three main types: (i) a classified and curated set of publicly-available repeated sequences; (ii) a set of consensus assemblies of highly-repetitive sequences obtained from genome sequencing surveys of 12 species; and (iii) a set of full-length TEs, identified and extracted from 12 whole genome assemblies.

Conclusions: This is the first report of a repeat dataset that spans the majority of repeat variability within an entire genus, and one that includes complete elements as well as unassembled repeats. The database allows sequence browsing, downloading, and similarity searches. Because of the strategy adopted, the RiTE-db opens a new path to unprecedented direct comparative studies that span the entire nuclear repeat content of 15 million years of Oryza diversity.

Original language: English (US)
Article number: 538
Journal: BMC Genomics
Volume: 16
Issue number: 1
DOI: https://doi.org/10.1186/s12864-015-1762-3
State: Published - Jul 22 2015

Fingerprint

Genomics
Genome
Databases
Molecular Sequence Annotation
DNA Transposable Elements
Nucleic Acid Repetitive Sequences
Diploidy
Oryza

Keywords

  • Genome
  • Oryza
  • Repeats
  • Rice
  • RiTE-db
  • Transposable elements

ASJC Scopus subject areas

  • Biotechnology
  • Genetics

Cite this

RiTE database: A resource database for genus-wide rice genomics and evolutionary biology. / Copetti, Dario; Zhang, Jianwei; El Baidouri, Moaine; Gao, Dongying; Wang, Jun; Barghini, Elena; Cossu, Rosa M.; Angelova, Angelina; Maldonado L., Carlos E.; Roffler, Stefan; Ohyanagi, Hajime; Wicker, Thomas; Fan, Chuanzhu; Zuccolo, Andrea; Chen, Mingsheng; Costa de Oliveira, Antonio; Han, Bin; Henry, Robert; Hsing, Yue ie; Kurata, Nori; Wang, Wen; Jackson, Scott A.; Panaud, Olivier; Wing, Rod A.

In: BMC Genomics, Vol. 16, No. 1, 538, 22.07.2015.

Copetti, D, Zhang, J, El Baidouri, M, Gao, D, Wang, J, Barghini, E, Cossu, RM, Angelova, A, Maldonado L., CE, Roffler, S, Ohyanagi, H, Wicker, T, Fan, C, Zuccolo, A, Chen, M, Costa de Oliveira, A, Han, B, Henry, R, Hsing, YI, Kurata, N, Wang, W, Jackson, SA, Panaud, O & Wing, RA 2015, 'RiTE database: A resource database for genus-wide rice genomics and evolutionary biology', BMC Genomics, vol. 16, no. 1, 538. https://doi.org/10.1186/s12864-015-1762-3
Copetti, Dario ; Zhang, Jianwei ; El Baidouri, Moaine ; Gao, Dongying ; Wang, Jun ; Barghini, Elena ; Cossu, Rosa M. ; Angelova, Angelina ; Maldonado L., Carlos E. ; Roffler, Stefan ; Ohyanagi, Hajime ; Wicker, Thomas ; Fan, Chuanzhu ; Zuccolo, Andrea ; Chen, Mingsheng ; Costa de Oliveira, Antonio ; Han, Bin ; Henry, Robert ; Hsing, Yue ie ; Kurata, Nori ; Wang, Wen ; Jackson, Scott A. ; Panaud, Olivier ; Wing, Rod A. / RiTE database: A resource database for genus-wide rice genomics and evolutionary biology. In: BMC Genomics. 2015 ; Vol. 16, No. 1.
@article{6984795aa39b4d998acae64d70e196d5,
title = "RiTE database: A resource database for genus-wide rice genomics and evolutionary biology",
abstract = "Comparative evolutionary analysis of whole genomes requires not only accurate annotation of gene space, but also proper annotation of the repetitive fraction which is often the largest component of most if not all genomes larger than 50 kb in size. Results: Here we present the Rice TE database (RiTE-db) - a genus-wide collection of transposable elements and repeated sequences across 11 diploid species of the genus Oryza and the closely-related out-group Leersia perrieri. The database consists of more than 170,000 entries divided into three main types: (i) a classified and curated set of publicly-available repeated sequences, (ii) a set of consensus assemblies of highly-repetitive sequences obtained from genome sequencing surveys of 12 species; and (iii) a set of full-length TEs, identified and extracted from 12 whole genome assemblies. Conclusions: This is the first report of a repeat dataset that spans the majority of repeat variability within an entire genus, and one that includes complete elements as well as unassembled repeats. The database allows sequence browsing, downloading, and similarity searches. Because of the strategy adopted, the RiTE-db opens a new path to unprecedented direct comparative studies that span the entire nuclear repeat content of 15 million years of Oryza diversity.",
keywords = "Genome, Oryza, Repeats, Rice, RiTE-db, Transposable elements",
author = "Dario Copetti and Jianwei Zhang and {El Baidouri}, Moaine and Dongying Gao and Jun Wang and Elena Barghini and Cossu, {Rosa M.} and Angelina Angelova and {Maldonado L.}, {Carlos E.} and Stefan Roffler and Hajime Ohyanagi and Thomas Wicker and Chuanzhu Fan and Andrea Zuccolo and Mingsheng Chen and {Costa de Oliveira}, Antonio and Bin Han and Robert Henry and Hsing, {Yue ie} and Nori Kurata and Wen Wang and Jackson, {Scott A.} and Olivier Panaud and Wing, {Rod A.}",
year = "2015",
month = "7",
day = "22",
doi = "10.1186/s12864-015-1762-3",
language = "English (US)",
volume = "16",
journal = "BMC Genomics",
issn = "1471-2164",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR

T1 - RiTE database

T2 - A resource database for genus-wide rice genomics and evolutionary biology

AU - Copetti, Dario

AU - Zhang, Jianwei

AU - El Baidouri, Moaine

AU - Gao, Dongying

AU - Wang, Jun

AU - Barghini, Elena

AU - Cossu, Rosa M.

AU - Angelova, Angelina

AU - Maldonado L., Carlos E.

AU - Roffler, Stefan

AU - Ohyanagi, Hajime

AU - Wicker, Thomas

AU - Fan, Chuanzhu

AU - Zuccolo, Andrea

AU - Chen, Mingsheng

AU - Costa de Oliveira, Antonio

AU - Han, Bin

AU - Henry, Robert

AU - Hsing, Yue ie

AU - Kurata, Nori

AU - Wang, Wen

AU - Jackson, Scott A.

AU - Panaud, Olivier

AU - Wing, Rod A.

PY - 2015/7/22

Y1 - 2015/7/22

AB - Comparative evolutionary analysis of whole genomes requires not only accurate annotation of gene space, but also proper annotation of the repetitive fraction which is often the largest component of most if not all genomes larger than 50 kb in size. Results: Here we present the Rice TE database (RiTE-db) - a genus-wide collection of transposable elements and repeated sequences across 11 diploid species of the genus Oryza and the closely-related out-group Leersia perrieri. The database consists of more than 170,000 entries divided into three main types: (i) a classified and curated set of publicly-available repeated sequences, (ii) a set of consensus assemblies of highly-repetitive sequences obtained from genome sequencing surveys of 12 species; and (iii) a set of full-length TEs, identified and extracted from 12 whole genome assemblies. Conclusions: This is the first report of a repeat dataset that spans the majority of repeat variability within an entire genus, and one that includes complete elements as well as unassembled repeats. The database allows sequence browsing, downloading, and similarity searches. Because of the strategy adopted, the RiTE-db opens a new path to unprecedented direct comparative studies that span the entire nuclear repeat content of 15 million years of Oryza diversity.

KW - Genome

KW - Oryza

KW - Repeats

KW - Rice

KW - RiTE-db

KW - Transposable elements

UR - http://www.scopus.com/inward/record.url?scp=84937232446&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84937232446&partnerID=8YFLogxK

U2 - 10.1186/s12864-015-1762-3

DO - 10.1186/s12864-015-1762-3

M3 - Article

C2 - 26194356

AN - SCOPUS:84937232446

VL - 16

JO - BMC Genomics

JF - BMC Genomics

SN - 1471-2164

IS - 1

M1 - 538

ER -