Collaborative visual analysis with RCloud

Stephen North, Carlos Eduardo Scheidegger, Simon Urbanek, Gordon Woodhull

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Consider the emerging role of data science teams embedded in larger organizations. Individual analysts work on loosely related problems, and must share their findings with each other and the organization at large, moving results from exploratory data analyses (EDA) into automated visualizations, diagnostics and reports deployed for wider consumption. There are two problems with the current practice. First, there are gaps in this workflow: EDA is performed with one set of tools, and automated reports and deployments with another. Second, these environments often assume a single-developer perspective, while data scientist teams could get much benefit from easier sharing of scripts and data feeds, experiments, annotations, and automated recommendations, which are well beyond what traditional version control systems provide. We contribute and justify the following three requirements for systems built to support current data science teams and users: discoverability, technology transfer, and coexistence. In addition, we contribute the design and implementation of RCloud, a system that supports the requirements of collaborative data analysis, visualization and web deployment. About 100 people used RCloud for two years. We report on interviews with some of these users, and discuss design decisions, tradeoffs and limitations in comparison to other approaches.

Original languageEnglish (US)
Title of host publication2015 IEEE Conference on Visual Analytics Science and Technology, VAST 2015 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages25-32
Number of pages8
ISBN (Electronic)9781467397834
DOIs
StatePublished - Dec 4 2015
Event10th IEEE Conference on Visual Analytics Science and Technology, VAST 2015 - Chicago, United States
Duration: Oct 25 2015Oct 30 2015

Other

Other10th IEEE Conference on Visual Analytics Science and Technology, VAST 2015
CountryUnited States
CityChicago
Period10/25/1510/30/15

Fingerprint

Visualization
Technology transfer
Control systems
Experiments

Keywords

  • collaboration
  • computer-supported cooperative work
  • provenance
  • visual analytics process
  • visualization

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Computer Science Applications

Cite this

North, S., Scheidegger, C. E., Urbanek, S., & Woodhull, G. (2015). Collaborative visual analysis with RCloud. In 2015 IEEE Conference on Visual Analytics Science and Technology, VAST 2015 - Proceedings (pp. 25-32). [7347627] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/VAST.2015.7347627

Collaborative visual analysis with RCloud. / North, Stephen; Scheidegger, Carlos Eduardo; Urbanek, Simon; Woodhull, Gordon.

2015 IEEE Conference on Visual Analytics Science and Technology, VAST 2015 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2015. p. 25-32 7347627.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

North, S, Scheidegger, CE, Urbanek, S & Woodhull, G 2015, Collaborative visual analysis with RCloud. in 2015 IEEE Conference on Visual Analytics Science and Technology, VAST 2015 - Proceedings., 7347627, Institute of Electrical and Electronics Engineers Inc., pp. 25-32, 10th IEEE Conference on Visual Analytics Science and Technology, VAST 2015, Chicago, United States, 10/25/15. https://doi.org/10.1109/VAST.2015.7347627
North S, Scheidegger CE, Urbanek S, Woodhull G. Collaborative visual analysis with RCloud. In 2015 IEEE Conference on Visual Analytics Science and Technology, VAST 2015 - Proceedings. Institute of Electrical and Electronics Engineers Inc. 2015. p. 25-32. 7347627 https://doi.org/10.1109/VAST.2015.7347627
North, Stephen ; Scheidegger, Carlos Eduardo ; Urbanek, Simon ; Woodhull, Gordon. / Collaborative visual analysis with RCloud. 2015 IEEE Conference on Visual Analytics Science and Technology, VAST 2015 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2015. pp. 25-32
@inproceedings{f9a6e76764564631bb85b3b12450452c,
title = "Collaborative visual analysis with RCloud",
abstract = "Consider the emerging role of data science teams embedded in larger organizations. Individual analysts work on loosely related problems, and must share their findings with each other and the organization at large, moving results from exploratory data analyses (EDA) into automated visualizations, diagnostics and reports deployed for wider consumption. There are two problems with the current practice. First, there are gaps in this workflow: EDA is performed with one set of tools, and automated reports and deployments with another. Second, these environments often assume a single-developer perspective, while data scientist teams could get much benefit from easier sharing of scripts and data feeds, experiments, annotations, and automated recommendations, which are well beyond what traditional version control systems provide. We contribute and justify the following three requirements for systems built to support current data science teams and users: discoverability, technology transfer, and coexistence. In addition, we contribute the design and implementation of RCloud, a system that supports the requirements of collaborative data analysis, visualization and web deployment. About 100 people used RCloud for two years. We report on interviews with some of these users, and discuss design decisions, tradeoffs and limitations in comparison to other approaches.",
keywords = "collaboration, computer-supported cooperative work, provenance, visual analytics process, visualization",
author = "Stephen North and Scheidegger, {Carlos Eduardo} and Simon Urbanek and Gordon Woodhull",
year = "2015",
month = "12",
day = "4",
doi = "10.1109/VAST.2015.7347627",
language = "English (US)",
pages = "25--32",
booktitle = "2015 IEEE Conference on Visual Analytics Science and Technology, VAST 2015 - Proceedings",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

TY - GEN

T1 - Collaborative visual analysis with RCloud

AU - North, Stephen

AU - Scheidegger, Carlos Eduardo

AU - Urbanek, Simon

AU - Woodhull, Gordon

PY - 2015/12/4

Y1 - 2015/12/4

N2 - Consider the emerging role of data science teams embedded in larger organizations. Individual analysts work on loosely related problems, and must share their findings with each other and the organization at large, moving results from exploratory data analyses (EDA) into automated visualizations, diagnostics and reports deployed for wider consumption. There are two problems with the current practice. First, there are gaps in this workflow: EDA is performed with one set of tools, and automated reports and deployments with another. Second, these environments often assume a single-developer perspective, while data scientist teams could get much benefit from easier sharing of scripts and data feeds, experiments, annotations, and automated recommendations, which are well beyond what traditional version control systems provide. We contribute and justify the following three requirements for systems built to support current data science teams and users: discoverability, technology transfer, and coexistence. In addition, we contribute the design and implementation of RCloud, a system that supports the requirements of collaborative data analysis, visualization and web deployment. About 100 people used RCloud for two years. We report on interviews with some of these users, and discuss design decisions, tradeoffs and limitations in comparison to other approaches.

AB - Consider the emerging role of data science teams embedded in larger organizations. Individual analysts work on loosely related problems, and must share their findings with each other and the organization at large, moving results from exploratory data analyses (EDA) into automated visualizations, diagnostics and reports deployed for wider consumption. There are two problems with the current practice. First, there are gaps in this workflow: EDA is performed with one set of tools, and automated reports and deployments with another. Second, these environments often assume a single-developer perspective, while data scientist teams could get much benefit from easier sharing of scripts and data feeds, experiments, annotations, and automated recommendations, which are well beyond what traditional version control systems provide. We contribute and justify the following three requirements for systems built to support current data science teams and users: discoverability, technology transfer, and coexistence. In addition, we contribute the design and implementation of RCloud, a system that supports the requirements of collaborative data analysis, visualization and web deployment. About 100 people used RCloud for two years. We report on interviews with some of these users, and discuss design decisions, tradeoffs and limitations in comparison to other approaches.

KW - collaboration

KW - computer-supported cooperative work

KW - provenance

KW - visual analytics process

KW - visualization

UR - http://www.scopus.com/inward/record.url?scp=84983189224&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84983189224&partnerID=8YFLogxK

U2 - 10.1109/VAST.2015.7347627

DO - 10.1109/VAST.2015.7347627

M3 - Conference contribution

SP - 25

EP - 32

BT - 2015 IEEE Conference on Visual Analytics Science and Technology, VAST 2015 - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

ER -