A tutorial of diverse genome analysis tools found in the CoGe web-platform using Plasmodium spp. as a model

Andreina I. Castillo, Andrew D.L. Nelson, Asher K. Haug-Baltzell, Eric H. Lyons

Research output: Contribution to journal, Article

2 Citations (Scopus)

Abstract

Integrated platforms for storage, management, analysis and sharing of large quantities of omics data have become fundamental to comparative genomics. CoGe (https://genomevolution.org/coge/) is an online platform designed to manage and study genomic data, enabling both data- and hypothesis-driven comparative genomics. CoGe's tools and resources can be used to organize and analyse both publicly available and private genomic data from any species. Here, we demonstrate the capabilities of CoGe through three example workflows using 17 Plasmodium genomes as a model. Plasmodium genomes present unique challenges for comparative genomics due to their rapidly evolving and highly variable genomic AT/GC content. These example workflows are intended to serve as templates to help guide researchers who would like to use CoGe to examine diverse aspects of genome evolution. In the first workflow, trends in genome composition and amino acid usage are explored. In the second, changes in genome structure and the distribution of synonymous (Ks) and non-synonymous (Kn) substitution values are evaluated across species with different levels of evolutionary relatedness. In the third workflow, microsyntenic analyses of multigene families' genomic organization are conducted using two Plasmodium-specific gene families - serine repeat antigen, and cytoadherence-linked asexual gene - as models. In general, these example workflows show how to achieve quick, reproducible and shareable results using the CoGe platform. We were able to replicate previously published results, as well as leverage CoGe's tools and resources to gain additional insight into various aspects of Plasmodium genome evolution. Our results highlight the usefulness of the CoGe platform, particularly in understanding complex features of genome evolution.

Original language: English (US)
Journal: Database
Volume: 2018
Issue number: 2018
DOI: 10.1093/database/bay030
State: Published - Jan 1 2018

Fingerprint

Plasmodium
Workflow
Genes
Genome
genomics
genome
Genomics
Base Composition
Storage management
Multigene Family
multigene family
cell adhesion
serine
Serine
Antigens
genes
researchers
Research Personnel
antigens
Amino acids

ASJC Scopus subject areas

  • Information Systems
  • Biochemistry, Genetics and Molecular Biology(all)
  • Agricultural and Biological Sciences(all)

Cite this

A tutorial of diverse genome analysis tools found in the CoGe web-platform using Plasmodium spp. as a model. / Castillo, Andreina I.; Nelson, Andrew D.L.; Haug-Baltzell, Asher K.; Lyons, Eric H.

In: Database, Vol. 2018, No. 2018, 01.01.2018.

Castillo, Andreina I.; Nelson, Andrew D.L.; Haug-Baltzell, Asher K.; Lyons, Eric H. / A tutorial of diverse genome analysis tools found in the CoGe web-platform using Plasmodium spp. as a model. In: Database. 2018; Vol. 2018, No. 2018.
@article{8b73ceff202b4b1a8ae43eecf06c2a2a,
title = "A tutorial of diverse genome analysis tools found in the CoGe web-platform using Plasmodium spp. as a model",
abstract = "Integrated platforms for storage, management, analysis and sharing of large quantities of omics data have become fundamental to comparative genomics. CoGe (https://genomevolution.org/coge/) is an online platform designed to manage and study genomic data, enabling both data- and hypothesis-driven comparative genomics. CoGe's tools and resources can be used to organize and analyse both publicly available and private genomic data from any species. Here, we demonstrate the capabilities of CoGe through three example workflows using 17 Plasmodium genomes as a model. Plasmodium genomes present unique challenges for comparative genomics due to their rapidly evolving and highly variable genomic AT/GC content. These example workflows are intended to serve as templates to help guide researchers who would like to use CoGe to examine diverse aspects of genome evolution. In the first workflow, trends in genome composition and amino acid usage are explored. In the second, changes in genome structure and the distribution of synonymous (Ks) and non-synonymous (Kn) substitution values are evaluated across species with different levels of evolutionary relatedness. In the third workflow, microsyntenic analyses of multigene families' genomic organization are conducted using two Plasmodium-specific gene families - serine repeat antigen, and cytoadherence-linked asexual gene - as models. In general, these example workflows show how to achieve quick, reproducible and shareable results using the CoGe platform. We were able to replicate previously published results, as well as leverage CoGe's tools and resources to gain additional insight into various aspects of Plasmodium genome evolution. Our results highlight the usefulness of the CoGe platform, particularly in understanding complex features of genome evolution.",
author = "Castillo, {Andreina I.} and Nelson, {Andrew D.L.} and Haug-Baltzell, {Asher K.} and Lyons, {Eric H.}",
year = "2018",
month = "1",
day = "1",
doi = "10.1093/database/bay030",
language = "English (US)",
volume = "2018",
journal = "Database : the journal of biological databases and curation",
issn = "1758-0463",
publisher = "Oxford University Press",
number = "2018",
}

TY - JOUR

T1 - A tutorial of diverse genome analysis tools found in the CoGe web-platform using Plasmodium spp. as a model

AU - Castillo, Andreina I.

AU - Nelson, Andrew D.L.

AU - Haug-Baltzell, Asher K.

AU - Lyons, Eric H.

PY - 2018/1/1

Y1 - 2018/1/1

N2 - Integrated platforms for storage, management, analysis and sharing of large quantities of omics data have become fundamental to comparative genomics. CoGe (https://genomevolution.org/coge/) is an online platform designed to manage and study genomic data, enabling both data- and hypothesis-driven comparative genomics. CoGe's tools and resources can be used to organize and analyse both publicly available and private genomic data from any species. Here, we demonstrate the capabilities of CoGe through three example workflows using 17 Plasmodium genomes as a model. Plasmodium genomes present unique challenges for comparative genomics due to their rapidly evolving and highly variable genomic AT/GC content. These example workflows are intended to serve as templates to help guide researchers who would like to use CoGe to examine diverse aspects of genome evolution. In the first workflow, trends in genome composition and amino acid usage are explored. In the second, changes in genome structure and the distribution of synonymous (Ks) and non-synonymous (Kn) substitution values are evaluated across species with different levels of evolutionary relatedness. In the third workflow, microsyntenic analyses of multigene families' genomic organization are conducted using two Plasmodium-specific gene families - serine repeat antigen, and cytoadherence-linked asexual gene - as models. In general, these example workflows show how to achieve quick, reproducible and shareable results using the CoGe platform. We were able to replicate previously published results, as well as leverage CoGe's tools and resources to gain additional insight into various aspects of Plasmodium genome evolution. Our results highlight the usefulness of the CoGe platform, particularly in understanding complex features of genome evolution.

UR - http://www.scopus.com/inward/record.url?scp=85057256247&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85057256247&partnerID=8YFLogxK

U2 - 10.1093/database/bay030

DO - 10.1093/database/bay030

M3 - Article

VL - 2018

JO - Database : the journal of biological databases and curation

JF - Database : the journal of biological databases and curation

SN - 1758-0463

IS - 2018

ER -