Understanding the semantics of data provenance to support active conceptual modeling

Sudha Ram, Jun Liu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

36 Citations (Scopus)

Abstract

Data Provenance refers to the lineage of data including its origin, key events that occur over the course of its lifecycle, and other details associated with data creation, processing, and archiving. We believe that tracking provenance enables users to share, discover, and reuse the data, thus streamlining collaborative activities, reducing the possibility of repeating dead ends, and facilitating learning. It also provides a mechanism to transition from static to active conceptual modeling. The primary goal of our research is to investigate the semantics or meaning of data provenance. We describe the W7 model that represents different components of provenance and their relationships to each other. We conceptualize provenance as a combination of seven interconnected elements including "what", "when", "where", "how", "who", "which" and "why". Each of these components may be used to track events that affect data during its lifetime. A homeland security example illustrates how current conceptual models can be extended to embed provenance.

Original languageEnglish (US)
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages17-29
Number of pages13
Volume4512 LNCS
DOIs
StatePublished - 2008
Event1st International Active Conceptual Modeling of Learning Workshop - Tucson, AZ, United States
Duration: Nov 8 2006Nov 8 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4512 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other1st International Active Conceptual Modeling of Learning Workshop
CountryUnited States
CityTucson, AZ
Period11/8/0611/8/06

Fingerprint

Conceptual Modeling
Provenance
Semantics
Learning
National security
Research
Homeland Security
Conceptual Model
Life Cycle
Reuse
W 7
Lifetime

ASJC Scopus subject areas

  • Computer Science(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Theoretical Computer Science

Cite this

Ram, S., & Liu, J. (2008). Understanding the semantics of data provenance to support active conceptual modeling. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4512 LNCS, pp. 17-29). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4512 LNCS). https://doi.org/10.1007/978-3-540-77503-4_3

Understanding the semantics of data provenance to support active conceptual modeling. / Ram, Sudha; Liu, Jun.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4512 LNCS 2008. p. 17-29 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4512 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Ram, S & Liu, J 2008, Understanding the semantics of data provenance to support active conceptual modeling. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 4512 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 4512 LNCS, pp. 17-29, 1st International Active Conceptual Modeling of Learning Workshop, Tucson, AZ, United States, 11/8/06. https://doi.org/10.1007/978-3-540-77503-4_3
Ram S, Liu J. Understanding the semantics of data provenance to support active conceptual modeling. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4512 LNCS. 2008. p. 17-29. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-540-77503-4_3
Ram, Sudha ; Liu, Jun. / Understanding the semantics of data provenance to support active conceptual modeling. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4512 LNCS 2008. pp. 17-29 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{be6f8e499beb4fff97f25c60018cd8e9,
title = "Understanding the semantics of data provenance to support active conceptual modeling",
abstract = "Data Provenance refers to the lineage of data including its origin, key events that occur over the course of its lifecycle, and other details associated with data creation, processing, and archiving. We believe that tracking provenance enables users to share, discover, and reuse the data, thus streamlining collaborative activities, reducing the possibility of repeating dead ends, and facilitating learning. It also provides a mechanism to transition from static to active conceptual modeling. The primary goal of our research is to investigate the semantics or meaning of data provenance. We describe the W7 model that represents different components of provenance and their relationships to each other. We conceptualize provenance as a combination of seven interconnected elements including {"}what{"}, {"}when{"}, {"}where{"}, {"}how{"}, {"}who{"}, {"}which{"} and {"}why{"}. Each of these components may be used to track events that affect data during its lifetime. A homeland security example illustrates how current conceptual models can be extended to embed provenance.",
author = "Sudha Ram and Jun Liu",
year = "2008",
doi = "10.1007/978-3-540-77503-4_3",
language = "English (US)",
isbn = "3540775021",
volume = "4512 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "17--29",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Understanding the semantics of data provenance to support active conceptual modeling

AU - Ram, Sudha

AU - Liu, Jun

PY - 2008

Y1 - 2008

N2 - Data Provenance refers to the lineage of data including its origin, key events that occur over the course of its lifecycle, and other details associated with data creation, processing, and archiving. We believe that tracking provenance enables users to share, discover, and reuse the data, thus streamlining collaborative activities, reducing the possibility of repeating dead ends, and facilitating learning. It also provides a mechanism to transition from static to active conceptual modeling. The primary goal of our research is to investigate the semantics or meaning of data provenance. We describe the W7 model that represents different components of provenance and their relationships to each other. We conceptualize provenance as a combination of seven interconnected elements including "what", "when", "where", "how", "who", "which" and "why". Each of these components may be used to track events that affect data during its lifetime. A homeland security example illustrates how current conceptual models can be extended to embed provenance.

AB - Data Provenance refers to the lineage of data including its origin, key events that occur over the course of its lifecycle, and other details associated with data creation, processing, and archiving. We believe that tracking provenance enables users to share, discover, and reuse the data, thus streamlining collaborative activities, reducing the possibility of repeating dead ends, and facilitating learning. It also provides a mechanism to transition from static to active conceptual modeling. The primary goal of our research is to investigate the semantics or meaning of data provenance. We describe the W7 model that represents different components of provenance and their relationships to each other. We conceptualize provenance as a combination of seven interconnected elements including "what", "when", "where", "how", "who", "which" and "why". Each of these components may be used to track events that affect data during its lifetime. A homeland security example illustrates how current conceptual models can be extended to embed provenance.

UR - http://www.scopus.com/inward/record.url?scp=49949112653&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=49949112653&partnerID=8YFLogxK

U2 - 10.1007/978-3-540-77503-4_3

DO - 10.1007/978-3-540-77503-4_3

M3 - Conference contribution

AN - SCOPUS:49949112653

SN - 3540775021

SN - 9783540775027

VL - 4512 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 17

EP - 29

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -