Big genomes facilitate the comparative identification of regulatory elements

Brant K. Peterson, Emily E. Hare, Venky N. Iyer, Steven Storage, Laura Conner, Daniel R Papaj, Rick Kurashima, Eric Jang, Michael B. Eisen

Research output: Contribution to journalArticle

37 Citations (Scopus)

Abstract

The identification of regulatory sequences in animal genomes remains a significant challenge. Comparative genomic methods that use patterns of evolutionary conservation to identify non-coding sequences with regulatory function have yielded many new vertebrate enhancers. However, these methods have not contributed significantly to the identification of regulatory sequences in sequenced invertebrate taxa. We demonstrate here that this differential success, which is often attributed to fundamental differences in the nature of vertebrate and invertebrate regulatory sequences, is instead primarily a product of the relatively small size of sequenced invertebrate genomes. We sequenced and compared loci involved in early embryonic patterning from four species of true fruit flies (family Tephritidae) that have genomes four to six times larger than those of Drosophila melanogaster. Unlike in Drosophila, where virtually all non-coding DNA is highly conserved, blocks of conserved non-coding sequence in tephritids are flanked by large stretches of poorly conserved sequence, similar to what is observed in vertebrate genomes. We tested the activities of nine conserved non-coding sequences flanking the even-skipped gene of the teprhitid Ceratis capitata in transgenic D. melanogaster embryos, six of which drove patterns that recapitulate those of known D. melanogaster enhancers. In contrast, none of the three non-conserved tephritid non-coding sequences that we tested drove expression in D. melanogaster embryos. Based on the landscape of non-coding conservation in tephritids, and our initial success in using conservation in tephritids to identify D. melanogaster regulatory sequences, we suggest that comparison of tephritid genomes may provide a systematic means to annotate the non-coding portion of the D. melanogaster genome. We also propose that large genomes be given more consideration in the selection of species for comparative genomics projects, to provide increased power to detect functional non-coding DNAs and to provide a less biased view of the evolution and function of animal genomes.

Original languageEnglish (US)
Article numbere4688
JournalPLoS One
Volume4
Issue number3
DOIs
StatePublished - Mar 4 2009

Fingerprint

Genes
Drosophila melanogaster
Genome
regulatory sequences
genome
Tephritidae
Invertebrates
Vertebrates
invertebrates
Conservation
vertebrates
intergenic DNA
embryo (animal)
Embryonic Structures
Animals
genomics
conserved sequences
Conserved Sequence
DNA
fruit flies

ASJC Scopus subject areas

  • Agricultural and Biological Sciences(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Medicine(all)

Cite this

Peterson, B. K., Hare, E. E., Iyer, V. N., Storage, S., Conner, L., Papaj, D. R., ... Eisen, M. B. (2009). Big genomes facilitate the comparative identification of regulatory elements. PLoS One, 4(3), [e4688]. https://doi.org/10.1371/journal.pone.0004688

Big genomes facilitate the comparative identification of regulatory elements. / Peterson, Brant K.; Hare, Emily E.; Iyer, Venky N.; Storage, Steven; Conner, Laura; Papaj, Daniel R; Kurashima, Rick; Jang, Eric; Eisen, Michael B.

In: PLoS One, Vol. 4, No. 3, e4688, 04.03.2009.

Research output: Contribution to journalArticle

Peterson, BK, Hare, EE, Iyer, VN, Storage, S, Conner, L, Papaj, DR, Kurashima, R, Jang, E & Eisen, MB 2009, 'Big genomes facilitate the comparative identification of regulatory elements', PLoS One, vol. 4, no. 3, e4688. https://doi.org/10.1371/journal.pone.0004688
Peterson, Brant K. ; Hare, Emily E. ; Iyer, Venky N. ; Storage, Steven ; Conner, Laura ; Papaj, Daniel R ; Kurashima, Rick ; Jang, Eric ; Eisen, Michael B. / Big genomes facilitate the comparative identification of regulatory elements. In: PLoS One. 2009 ; Vol. 4, No. 3.
@article{d0f9dd9d5dfd4bb6bacae7a2ec3af950,
title = "Big genomes facilitate the comparative identification of regulatory elements",
abstract = "The identification of regulatory sequences in animal genomes remains a significant challenge. Comparative genomic methods that use patterns of evolutionary conservation to identify non-coding sequences with regulatory function have yielded many new vertebrate enhancers. However, these methods have not contributed significantly to the identification of regulatory sequences in sequenced invertebrate taxa. We demonstrate here that this differential success, which is often attributed to fundamental differences in the nature of vertebrate and invertebrate regulatory sequences, is instead primarily a product of the relatively small size of sequenced invertebrate genomes. We sequenced and compared loci involved in early embryonic patterning from four species of true fruit flies (family Tephritidae) that have genomes four to six times larger than those of Drosophila melanogaster. Unlike in Drosophila, where virtually all non-coding DNA is highly conserved, blocks of conserved non-coding sequence in tephritids are flanked by large stretches of poorly conserved sequence, similar to what is observed in vertebrate genomes. We tested the activities of nine conserved non-coding sequences flanking the even-skipped gene of the teprhitid Ceratis capitata in transgenic D. melanogaster embryos, six of which drove patterns that recapitulate those of known D. melanogaster enhancers. In contrast, none of the three non-conserved tephritid non-coding sequences that we tested drove expression in D. melanogaster embryos. Based on the landscape of non-coding conservation in tephritids, and our initial success in using conservation in tephritids to identify D. melanogaster regulatory sequences, we suggest that comparison of tephritid genomes may provide a systematic means to annotate the non-coding portion of the D. melanogaster genome. We also propose that large genomes be given more consideration in the selection of species for comparative genomics projects, to provide increased power to detect functional non-coding DNAs and to provide a less biased view of the evolution and function of animal genomes.",
author = "Peterson, {Brant K.} and Hare, {Emily E.} and Iyer, {Venky N.} and Steven Storage and Laura Conner and Papaj, {Daniel R} and Rick Kurashima and Eric Jang and Eisen, {Michael B.}",
year = "2009",
month = "3",
day = "4",
doi = "10.1371/journal.pone.0004688",
language = "English (US)",
volume = "4",
journal = "PLoS One",
issn = "1932-6203",
publisher = "Public Library of Science",
number = "3",

}

TY - JOUR

T1 - Big genomes facilitate the comparative identification of regulatory elements

AU - Peterson, Brant K.

AU - Hare, Emily E.

AU - Iyer, Venky N.

AU - Storage, Steven

AU - Conner, Laura

AU - Papaj, Daniel R

AU - Kurashima, Rick

AU - Jang, Eric

AU - Eisen, Michael B.

PY - 2009/3/4

Y1 - 2009/3/4

N2 - The identification of regulatory sequences in animal genomes remains a significant challenge. Comparative genomic methods that use patterns of evolutionary conservation to identify non-coding sequences with regulatory function have yielded many new vertebrate enhancers. However, these methods have not contributed significantly to the identification of regulatory sequences in sequenced invertebrate taxa. We demonstrate here that this differential success, which is often attributed to fundamental differences in the nature of vertebrate and invertebrate regulatory sequences, is instead primarily a product of the relatively small size of sequenced invertebrate genomes. We sequenced and compared loci involved in early embryonic patterning from four species of true fruit flies (family Tephritidae) that have genomes four to six times larger than those of Drosophila melanogaster. Unlike in Drosophila, where virtually all non-coding DNA is highly conserved, blocks of conserved non-coding sequence in tephritids are flanked by large stretches of poorly conserved sequence, similar to what is observed in vertebrate genomes. We tested the activities of nine conserved non-coding sequences flanking the even-skipped gene of the teprhitid Ceratis capitata in transgenic D. melanogaster embryos, six of which drove patterns that recapitulate those of known D. melanogaster enhancers. In contrast, none of the three non-conserved tephritid non-coding sequences that we tested drove expression in D. melanogaster embryos. Based on the landscape of non-coding conservation in tephritids, and our initial success in using conservation in tephritids to identify D. melanogaster regulatory sequences, we suggest that comparison of tephritid genomes may provide a systematic means to annotate the non-coding portion of the D. melanogaster genome. We also propose that large genomes be given more consideration in the selection of species for comparative genomics projects, to provide increased power to detect functional non-coding DNAs and to provide a less biased view of the evolution and function of animal genomes.

AB - The identification of regulatory sequences in animal genomes remains a significant challenge. Comparative genomic methods that use patterns of evolutionary conservation to identify non-coding sequences with regulatory function have yielded many new vertebrate enhancers. However, these methods have not contributed significantly to the identification of regulatory sequences in sequenced invertebrate taxa. We demonstrate here that this differential success, which is often attributed to fundamental differences in the nature of vertebrate and invertebrate regulatory sequences, is instead primarily a product of the relatively small size of sequenced invertebrate genomes. We sequenced and compared loci involved in early embryonic patterning from four species of true fruit flies (family Tephritidae) that have genomes four to six times larger than those of Drosophila melanogaster. Unlike in Drosophila, where virtually all non-coding DNA is highly conserved, blocks of conserved non-coding sequence in tephritids are flanked by large stretches of poorly conserved sequence, similar to what is observed in vertebrate genomes. We tested the activities of nine conserved non-coding sequences flanking the even-skipped gene of the teprhitid Ceratis capitata in transgenic D. melanogaster embryos, six of which drove patterns that recapitulate those of known D. melanogaster enhancers. In contrast, none of the three non-conserved tephritid non-coding sequences that we tested drove expression in D. melanogaster embryos. Based on the landscape of non-coding conservation in tephritids, and our initial success in using conservation in tephritids to identify D. melanogaster regulatory sequences, we suggest that comparison of tephritid genomes may provide a systematic means to annotate the non-coding portion of the D. melanogaster genome. We also propose that large genomes be given more consideration in the selection of species for comparative genomics projects, to provide increased power to detect functional non-coding DNAs and to provide a less biased view of the evolution and function of animal genomes.

UR - http://www.scopus.com/inward/record.url?scp=62249204977&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=62249204977&partnerID=8YFLogxK

U2 - 10.1371/journal.pone.0004688

DO - 10.1371/journal.pone.0004688

M3 - Article

C2 - 19259274

AN - SCOPUS:62249204977

VL - 4

JO - PLoS One

JF - PLoS One

SN - 1932-6203

IS - 3

M1 - e4688

ER -