Illuminating structural proteins in viral "dark matter" with metaproteomics

Jennifer R. Brum, J. Cesar Ignacio-Espinoza, Eun Hae Kim, Gareth Trubl, Robert M. Jones, Simon Roux, Nathan C. VerBerkmoes, Virginia I Rich, Matthew Sullivan

Research output: Contribution to journalArticle

24 Citations (Scopus)

Abstract

Viruses are ecologically important, yet environmental virology is limited by dominance of unannotated genomic sequences representing taxonomic and functional "viral dark matter." Although recent analytical advances are rapidly improving taxonomic annotations, identifying functional darkmatter remains problematic. Here, we apply paired metaproteomics and dsDNA-targeted metagenomics to identify 1,875 virion-associated proteins from the ocean. Over onehalf of these proteins were newly functionally annotated and represent abundant and widespread viral metagenome-derived protein clusters (PCs). One primarily unannotated PC dominated the dataset, but structural modeling and genomic context identified this PC as a previously unidentified capsid protein from multiple uncultivated tailed virus families. Furthermore, four of the five most abundant PCs in the metaproteome represent capsid proteins containing the HK97-like protein fold previously found in many viruses that infect all three domains of life. The dominance of these proteins within our dataset, as well as their global distribution throughout the world's oceans and seas, supports prior hypotheses that this HK97-like protein fold is the most abundant biological structure on Earth. Together, these culture-independent analyses improve virion-associated protein annotations, facilitate the investigation of proteins within natural viral communities, and offer a high-throughput means of illuminating functional viral dark matter.

Original languageEnglish (US)
Pages (from-to)2436-2441
Number of pages6
JournalProceedings of the National Academy of Sciences of the United States of America
Volume113
Issue number9
DOIs
StatePublished - Mar 1 2016

Fingerprint

Viral Structural Proteins
Proteins
Capsid Proteins
Viruses
Oceans and Seas
Virion
Metagenome
Molecular Sequence Annotation
Metagenomics
Virology

Keywords

  • Marine
  • Proteins
  • Viruses

ASJC Scopus subject areas

  • General

Cite this

Illuminating structural proteins in viral "dark matter" with metaproteomics. / Brum, Jennifer R.; Ignacio-Espinoza, J. Cesar; Kim, Eun Hae; Trubl, Gareth; Jones, Robert M.; Roux, Simon; VerBerkmoes, Nathan C.; Rich, Virginia I; Sullivan, Matthew.

In: Proceedings of the National Academy of Sciences of the United States of America, Vol. 113, No. 9, 01.03.2016, p. 2436-2441.

Research output: Contribution to journalArticle

Brum, Jennifer R. ; Ignacio-Espinoza, J. Cesar ; Kim, Eun Hae ; Trubl, Gareth ; Jones, Robert M. ; Roux, Simon ; VerBerkmoes, Nathan C. ; Rich, Virginia I ; Sullivan, Matthew. / Illuminating structural proteins in viral "dark matter" with metaproteomics. In: Proceedings of the National Academy of Sciences of the United States of America. 2016 ; Vol. 113, No. 9. pp. 2436-2441.
@article{377b89fb3bb94f90ab561a43d0dab5d5,
title = "Illuminating structural proteins in viral {"}dark matter{"} with metaproteomics",
abstract = "Viruses are ecologically important, yet environmental virology is limited by dominance of unannotated genomic sequences representing taxonomic and functional {"}viral dark matter.{"} Although recent analytical advances are rapidly improving taxonomic annotations, identifying functional darkmatter remains problematic. Here, we apply paired metaproteomics and dsDNA-targeted metagenomics to identify 1,875 virion-associated proteins from the ocean. Over onehalf of these proteins were newly functionally annotated and represent abundant and widespread viral metagenome-derived protein clusters (PCs). One primarily unannotated PC dominated the dataset, but structural modeling and genomic context identified this PC as a previously unidentified capsid protein from multiple uncultivated tailed virus families. Furthermore, four of the five most abundant PCs in the metaproteome represent capsid proteins containing the HK97-like protein fold previously found in many viruses that infect all three domains of life. The dominance of these proteins within our dataset, as well as their global distribution throughout the world's oceans and seas, supports prior hypotheses that this HK97-like protein fold is the most abundant biological structure on Earth. Together, these culture-independent analyses improve virion-associated protein annotations, facilitate the investigation of proteins within natural viral communities, and offer a high-throughput means of illuminating functional viral dark matter.",
keywords = "Marine, Proteins, Viruses",
author = "Brum, {Jennifer R.} and Ignacio-Espinoza, {J. Cesar} and Kim, {Eun Hae} and Gareth Trubl and Jones, {Robert M.} and Simon Roux and VerBerkmoes, {Nathan C.} and Rich, {Virginia I} and Matthew Sullivan",
year = "2016",
month = "3",
day = "1",
doi = "10.1073/pnas.1525139113",
language = "English (US)",
volume = "113",
pages = "2436--2441",
journal = "Proceedings of the National Academy of Sciences of the United States of America",
issn = "0027-8424",
number = "9",

}

TY - JOUR

T1 - Illuminating structural proteins in viral "dark matter" with metaproteomics

AU - Brum, Jennifer R.

AU - Ignacio-Espinoza, J. Cesar

AU - Kim, Eun Hae

AU - Trubl, Gareth

AU - Jones, Robert M.

AU - Roux, Simon

AU - VerBerkmoes, Nathan C.

AU - Rich, Virginia I

AU - Sullivan, Matthew

PY - 2016/3/1

Y1 - 2016/3/1

N2 - Viruses are ecologically important, yet environmental virology is limited by dominance of unannotated genomic sequences representing taxonomic and functional "viral dark matter." Although recent analytical advances are rapidly improving taxonomic annotations, identifying functional darkmatter remains problematic. Here, we apply paired metaproteomics and dsDNA-targeted metagenomics to identify 1,875 virion-associated proteins from the ocean. Over onehalf of these proteins were newly functionally annotated and represent abundant and widespread viral metagenome-derived protein clusters (PCs). One primarily unannotated PC dominated the dataset, but structural modeling and genomic context identified this PC as a previously unidentified capsid protein from multiple uncultivated tailed virus families. Furthermore, four of the five most abundant PCs in the metaproteome represent capsid proteins containing the HK97-like protein fold previously found in many viruses that infect all three domains of life. The dominance of these proteins within our dataset, as well as their global distribution throughout the world's oceans and seas, supports prior hypotheses that this HK97-like protein fold is the most abundant biological structure on Earth. Together, these culture-independent analyses improve virion-associated protein annotations, facilitate the investigation of proteins within natural viral communities, and offer a high-throughput means of illuminating functional viral dark matter.

AB - Viruses are ecologically important, yet environmental virology is limited by dominance of unannotated genomic sequences representing taxonomic and functional "viral dark matter." Although recent analytical advances are rapidly improving taxonomic annotations, identifying functional darkmatter remains problematic. Here, we apply paired metaproteomics and dsDNA-targeted metagenomics to identify 1,875 virion-associated proteins from the ocean. Over onehalf of these proteins were newly functionally annotated and represent abundant and widespread viral metagenome-derived protein clusters (PCs). One primarily unannotated PC dominated the dataset, but structural modeling and genomic context identified this PC as a previously unidentified capsid protein from multiple uncultivated tailed virus families. Furthermore, four of the five most abundant PCs in the metaproteome represent capsid proteins containing the HK97-like protein fold previously found in many viruses that infect all three domains of life. The dominance of these proteins within our dataset, as well as their global distribution throughout the world's oceans and seas, supports prior hypotheses that this HK97-like protein fold is the most abundant biological structure on Earth. Together, these culture-independent analyses improve virion-associated protein annotations, facilitate the investigation of proteins within natural viral communities, and offer a high-throughput means of illuminating functional viral dark matter.

KW - Marine

KW - Proteins

KW - Viruses

UR - http://www.scopus.com/inward/record.url?scp=84959474801&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84959474801&partnerID=8YFLogxK

U2 - 10.1073/pnas.1525139113

DO - 10.1073/pnas.1525139113

M3 - Article

C2 - 26884177

AN - SCOPUS:84959474801

VL - 113

SP - 2436

EP - 2441

JO - Proceedings of the National Academy of Sciences of the United States of America

JF - Proceedings of the National Academy of Sciences of the United States of America

SN - 0027-8424

IS - 9

ER -