DBMS metrology: Measuring query time

Sabah Currim, Richard Thomas Snodgrass, Young Kyoon Suh, Rui Zhang, Matthew Wong Johnson, Cheng Yi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

It is surprisingly hard to obtain accurate and precise measurements of the time spent executing a query. We review relevant process and overall measures obtainable from the Linux kernel and introduce a structural causal model relating these measures. A thorough correlational analysis provides strong support for this model. Using this model, we developed a timing protocol, which (1) performs sanity checks to ensure validity of the data, (2) drops some query executions via clearly motivated predicates, (3) drops some entire queries at a cardinality, again via clearly motivated predicates, (4) for those that remain, for each computes a single measured time by a carefully justified formula over the underlying measures of the remaining query executions, and (5) performs post-analysis sanity checks. The resulting query time measurement procedure, termed the Tucson Protocol, applies to proprietary and open-source DBMSes.

Original languageEnglish (US)
Title of host publicationProceedings of the ACM SIGMOD International Conference on Management of Data
Pages421-432
Number of pages12
DOIs
StatePublished - 2013
Event2013 ACM SIGMOD Conference on Management of Data, SIGMOD 2013 - New York, NY, United States
Duration: Jun 22 2013Jun 27 2013

Other

Other2013 ACM SIGMOD Conference on Management of Data, SIGMOD 2013
CountryUnited States
CityNew York, NY
Period6/22/136/27/13

Fingerprint

Time measurement
Linux

Keywords

  • Accuracy
  • Repeatability
  • Tucson Protocol

ASJC Scopus subject areas

  • Software
  • Information Systems

Cite this

Currim, S., Snodgrass, R. T., Suh, Y. K., Zhang, R., Johnson, M. W., & Yi, C. (2013). DBMS metrology: Measuring query time. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 421-432) https://doi.org/10.1145/2463676.2465331

DBMS metrology : Measuring query time. / Currim, Sabah; Snodgrass, Richard Thomas; Suh, Young Kyoon; Zhang, Rui; Johnson, Matthew Wong; Yi, Cheng.

Proceedings of the ACM SIGMOD International Conference on Management of Data. 2013. p. 421-432.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Currim, S, Snodgrass, RT, Suh, YK, Zhang, R, Johnson, MW & Yi, C 2013, DBMS metrology: Measuring query time. in Proceedings of the ACM SIGMOD International Conference on Management of Data. pp. 421-432, 2013 ACM SIGMOD Conference on Management of Data, SIGMOD 2013, New York, NY, United States, 6/22/13. https://doi.org/10.1145/2463676.2465331
Currim S, Snodgrass RT, Suh YK, Zhang R, Johnson MW, Yi C. DBMS metrology: Measuring query time. In Proceedings of the ACM SIGMOD International Conference on Management of Data. 2013. p. 421-432 https://doi.org/10.1145/2463676.2465331
Currim, Sabah ; Snodgrass, Richard Thomas ; Suh, Young Kyoon ; Zhang, Rui ; Johnson, Matthew Wong ; Yi, Cheng. / DBMS metrology : Measuring query time. Proceedings of the ACM SIGMOD International Conference on Management of Data. 2013. pp. 421-432
@inproceedings{7f5f58dc31ab44e79ec48eb4ac89948c,
title = "DBMS metrology: Measuring query time",
abstract = "It is surprisingly hard to obtain accurate and precise measurements of the time spent executing a query. We review relevant process and overall measures obtainable from the Linux kernel and introduce a structural causal model relating these measures. A thorough correlational analysis provides strong support for this model. Using this model, we developed a timing protocol, which (1) performs sanity checks to ensure validity of the data, (2) drops some query executions via clearly motivated predicates, (3) drops some entire queries at a cardinality, again via clearly motivated predicates, (4) for those that remain, for each computes a single measured time by a carefully justified formula over the underlying measures of the remaining query executions, and (5) performs post-analysis sanity checks. The resulting query time measurement procedure, termed the Tucson Protocol, applies to proprietary and open-source DBMSes.",
keywords = "Accuracy, Repeatability, Tucson Protocol",
author = "Sabah Currim and Snodgrass, {Richard Thomas} and Suh, {Young Kyoon} and Rui Zhang and Johnson, {Matthew Wong} and Cheng Yi",
year = "2013",
doi = "10.1145/2463676.2465331",
language = "English (US)",
isbn = "9781450320375",
pages = "421--432",
booktitle = "Proceedings of the ACM SIGMOD International Conference on Management of Data",

}

TY - GEN

T1 - DBMS metrology

T2 - Measuring query time

AU - Currim, Sabah

AU - Snodgrass, Richard Thomas

AU - Suh, Young Kyoon

AU - Zhang, Rui

AU - Johnson, Matthew Wong

AU - Yi, Cheng

PY - 2013

Y1 - 2013

N2 - It is surprisingly hard to obtain accurate and precise measurements of the time spent executing a query. We review relevant process and overall measures obtainable from the Linux kernel and introduce a structural causal model relating these measures. A thorough correlational analysis provides strong support for this model. Using this model, we developed a timing protocol, which (1) performs sanity checks to ensure validity of the data, (2) drops some query executions via clearly motivated predicates, (3) drops some entire queries at a cardinality, again via clearly motivated predicates, (4) for those that remain, for each computes a single measured time by a carefully justified formula over the underlying measures of the remaining query executions, and (5) performs post-analysis sanity checks. The resulting query time measurement procedure, termed the Tucson Protocol, applies to proprietary and open-source DBMSes.

AB - It is surprisingly hard to obtain accurate and precise measurements of the time spent executing a query. We review relevant process and overall measures obtainable from the Linux kernel and introduce a structural causal model relating these measures. A thorough correlational analysis provides strong support for this model. Using this model, we developed a timing protocol, which (1) performs sanity checks to ensure validity of the data, (2) drops some query executions via clearly motivated predicates, (3) drops some entire queries at a cardinality, again via clearly motivated predicates, (4) for those that remain, for each computes a single measured time by a carefully justified formula over the underlying measures of the remaining query executions, and (5) performs post-analysis sanity checks. The resulting query time measurement procedure, termed the Tucson Protocol, applies to proprietary and open-source DBMSes.

KW - Accuracy

KW - Repeatability

KW - Tucson Protocol

UR - http://www.scopus.com/inward/record.url?scp=84880547780&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84880547780&partnerID=8YFLogxK

U2 - 10.1145/2463676.2465331

DO - 10.1145/2463676.2465331

M3 - Conference contribution

AN - SCOPUS:84880547780

SN - 9781450320375

SP - 421

EP - 432

BT - Proceedings of the ACM SIGMOD International Conference on Management of Data

ER -