Tackling the provenance challenge one layer at a time

Carlos Eduardo Scheidegger, David Koop, Emanuele Santos, Huy Vo, Steven Callahan, Juliana Freire, Cláudio Silva

Research output: Contribution to journal › Article

72 Citations (Scopus)

Abstract

VisTrails is a new workflow and provenance management system that provides support for scientific data exploration and visualization. Whereas workflows have been traditionally used to automate repetitive tasks, for applications that are exploratory in nature, change is the norm. VisTrails uses a new change-based provenance mechanism, which was designed to handle rapidly evolving workflows. It uniformly and automatically captures provenance information for data products and for the evolution of the workflows used to generate these products. In this paper, we describe how the VisTrails provenance data are organized in layers and present a first approach for querying this data that we developed to tackle the Provenance Challenge queries.
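
To illustrate the change-based provenance idea summarized above, here is a minimal Python sketch. It is not the VisTrails implementation or its API. It assumes that each workflow version is stored only as one change relative to its parent version, and that a concrete workflow is recovered by replaying those changes from the root. All names (`Action`, `VersionTree`, `commit`, `materialize`, and the module names) are hypothetical.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass(frozen=True)
class Action:
    """One recorded change to a workflow (hypothetical schema)."""
    version: int
    parent: Optional[int]   # None marks a change applied to the empty workflow
    op: str                 # "add_module", "delete_module", or "set_param"
    target: str             # "module" or "module.param"
    value: Any = None


class VersionTree:
    """Stores changes, not workflows; each version is a node in a tree."""

    def __init__(self) -> None:
        self.actions: Dict[int, Action] = {}

    def commit(self, parent: Optional[int], op: str, target: str,
               value: Any = None) -> int:
        """Record one change on top of `parent` and return the new version id."""
        if parent is not None and parent not in self.actions:
            raise KeyError(f"unknown parent version {parent}")
        version = len(self.actions) + 1
        self.actions[version] = Action(version, parent, op, target, value)
        return version

    def materialize(self, version: int) -> Dict[str, Dict[str, Any]]:
        """Rebuild a workflow by replaying the changes from the root to `version`."""
        chain = []
        current: Optional[int] = version
        while current is not None:
            action = self.actions[current]
            chain.append(action)
            current = action.parent

        workflow: Dict[str, Dict[str, Any]] = {}
        for action in reversed(chain):
            if action.op == "add_module":
                workflow[action.target] = {}
            elif action.op == "delete_module":
                workflow.pop(action.target, None)
            elif action.op == "set_param":
                module, param = action.target.split(".", 1)
                workflow[module][param] = action.value
            else:
                raise ValueError(f"unknown operation {action.op!r}")
        return workflow


# Two exploratory branches that share a common prefix of changes.
tree = VersionTree()
v1 = tree.commit(None, "add_module", "reader")
v2 = tree.commit(v1, "set_param", "reader.file", "scan.vtk")
v3 = tree.commit(v2, "add_module", "isosurface")
v4 = tree.commit(v2, "add_module", "volume_render")  # branches off v2

print(tree.materialize(v3))
print(tree.materialize(v4))
```

In this sketch the tree of changes is itself the record of how the workflow evolved, so every earlier version, including abandoned branches, stays reproducible without storing a full copy of each workflow.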

Original language: English (US)
Pages (from-to): 473-483
Number of pages: 11
Journal: Concurrency Computation Practice and Experience
Volume: 20
Issue number: 5
DOI: https://doi.org/10.1002/cpe.1237
State: Published - Apr 10 2008
Externally published: Yes

Keywords

  • Provenance
  • Visualization
  • Workflow evolution

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Software
  • Theoretical Computer Science
  • Computational Theory and Mathematics

Cite this

Tackling the provenance challenge one layer at a time. / Scheidegger, Carlos Eduardo; Koop, David; Santos, Emanuele; Vo, Huy; Callahan, Steven; Freire, Juliana; Silva, Cláudio. In: Concurrency Computation Practice and Experience, Vol. 20, No. 5, 10.04.2008, p. 473-483.

Scheidegger, CE, Koop, D, Santos, E, Vo, H, Callahan, S, Freire, J & Silva, C 2008, 'Tackling the provenance challenge one layer at a time', Concurrency Computation Practice and Experience, vol. 20, no. 5, pp. 473-483. https://doi.org/10.1002/cpe.1237
@article{2e572b2521ca4f1c9c86ba219e55258c,
title = "Tackling the provenance challenge one layer at a time",
abstract = "VisTrails is a new workflow and provenance management system that provides support for scientific data exploration and visualization. Whereas workflows have been traditionally used to automate repetitive tasks, for applications that are exploratory in nature, change is the norm. VisTrails uses a new change-based provenance mechanism, which was designed to handle rapidly evolving workflows. It uniformly and automatically captures provenance information for data products and for the evolution of the workflows used to generate these products. In this paper, we describe how the VisTrails provenance data are organized in layers and present a first approach for querying this data that we developed to tackle the Provenance Challenge queries.",
keywords = "Provenance, Visualization, Workflow evolution",
author = "Scheidegger, {Carlos Eduardo} and David Koop and Emanuele Santos and Huy Vo and Steven Callahan and Juliana Freire and Cl{\'a}udio Silva",
year = "2008",
month = "4",
day = "10",
doi = "10.1002/cpe.1237",
language = "English (US)",
volume = "20",
pages = "473--483",
journal = "Concurrency Computation Practice and Experience",
issn = "1532-0626",
publisher = "John Wiley and Sons Ltd",
number = "5",

}

TY  - JOUR
T1  - Tackling the provenance challenge one layer at a time
AU  - Scheidegger, Carlos Eduardo
AU  - Koop, David
AU  - Santos, Emanuele
AU  - Vo, Huy
AU  - Callahan, Steven
AU  - Freire, Juliana
AU  - Silva, Cláudio
PY  - 2008/4/10
Y1  - 2008/4/10
AB  - VisTrails is a new workflow and provenance management system that provides support for scientific data exploration and visualization. Whereas workflows have been traditionally used to automate repetitive tasks, for applications that are exploratory in nature, change is the norm. VisTrails uses a new change-based provenance mechanism, which was designed to handle rapidly evolving workflows. It uniformly and automatically captures provenance information for data products and for the evolution of the workflows used to generate these products. In this paper, we describe how the VisTrails provenance data are organized in layers and present a first approach for querying this data that we developed to tackle the Provenance Challenge queries.
KW  - Provenance
KW  - Visualization
KW  - Workflow evolution
UR  - http://www.scopus.com/inward/record.url?scp=41149134435&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=41149134435&partnerID=8YFLogxK
U2  - 10.1002/cpe.1237
DO  - 10.1002/cpe.1237
M3  - Article
AN  - SCOPUS:41149134435
VL  - 20
SP  - 473
EP  - 483
JO  - Concurrency Computation Practice and Experience
JF  - Concurrency Computation Practice and Experience
SN  - 1532-0626
IS  - 5
ER  -