Who does what: Collaboration patterns in the Wikipedia and their impact on data quality

Jun Liu, Sudha Ram

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

15 Citations (Scopus)

Abstract

Data quality in the Wikipedia is debatable. On the one hand, existing research indicates that not only are people willing to contribute articles but the quality of those articles is close to that found in conventional encyclopedias. On the other hand, the public has never stopped criticizing the quality of Wikipedia articles, and critics never have trouble finding low quality Wikipedia articles. Why do Wikipedia articles vary widely in quality? We investigate the relationship between collaboration and data quality. We show that the quality of Wikipedia articles is not only dependent on the different types of contributors but also on how they collaborate. Based on an empirical study, we classify contributors based on their roles in editing individual Wikipedia articles. We identify various patterns of collaboration based on the provenance or, more specifically, who does what to Wikipedia articles. Our research helps identify collaboration patterns that are preferable or detrimental for data quality, thus providing insights for improving data quality in Wikipedia.

Original language: English (US)
Title of host publication: 19th Workshop on Information Technologies and Systems, WITS 2009
Publisher: Social Science Research Network
Pages: 175-180
Number of pages: 6
State: Published - 2009
Event: 19th Workshop on Information Technologies and Systems, WITS 2009 - Phoenix, AZ, United States
Duration: Dec 14 2009 to Dec 15 2009

Other

Other: 19th Workshop on Information Technologies and Systems, WITS 2009
Country: United States
City: Phoenix, AZ
Period: 12/14/09 to 12/15/09

Keywords

  • Collaboration pattern
  • Data provenance
  • Data quality
  • Wikipedia

ASJC Scopus subject areas

  • Information Systems
  • Control and Systems Engineering

Cite this

Liu, J., & Ram, S. (2009). Who does what: Collaboration patterns in the Wikipedia and their impact on data quality. In 19th Workshop on Information Technologies and Systems, WITS 2009 (pp. 175-180). Social Science Research Network.

@inproceedings{c8875fad12274c1683667b27db5ee818,
title = "Who does what: Collaboration patterns in the Wikipedia and their impact on data quality",
abstract = "Data quality in the Wikipedia is debatable. On the one hand, existing research indicates that not only are people willing to contribute articles but the quality of those articles is close to that found in conventional encyclopedias. On the other hand, the public has never stopped criticizing the quality of Wikipedia articles, and critics never have trouble finding low quality Wikipedia articles. Why do Wikipedia articles vary widely in quality? We investigate the relationship between collaboration and data quality. We show that the quality of Wikipedia articles is not only dependent on the different types of contributors but also on how they collaborate. Based on an empirical study, we classify contributors based on their roles in editing individual Wikipedia articles. We identify various patterns of collaboration based on the provenance or, more specifically, who does what to Wikipedia articles. Our research helps identify collaboration patterns that are preferable or detrimental for data quality, thus providing insights for improving data quality in Wikipedia.",
keywords = "Collaboration pattern, Data provenance, Data quality, Wikipedia",
author = "Jun Liu and Sudha Ram",
year = "2009",
language = "English (US)",
pages = "175--180",
booktitle = "19th Workshop on Information Technologies and Systems, WITS 2009",
publisher = "Social Science Research Network",
}

TY - GEN

T1 - Who does what

T2 - Collaboration patterns in the Wikipedia and their impact on data quality

AU - Liu, Jun

AU - Ram, Sudha

PY - 2009

Y1 - 2009

AB - Data quality in the Wikipedia is debatable. On the one hand, existing research indicates that not only are people willing to contribute articles but the quality of those articles is close to that found in conventional encyclopedias. On the other hand, the public has never stopped criticizing the quality of Wikipedia articles, and critics never have trouble finding low quality Wikipedia articles. Why do Wikipedia articles vary widely in quality? We investigate the relationship between collaboration and data quality. We show that the quality of Wikipedia articles is not only dependent on the different types of contributors but also on how they collaborate. Based on an empirical study, we classify contributors based on their roles in editing individual Wikipedia articles. We identify various patterns of collaboration based on the provenance or, more specifically, who does what to Wikipedia articles. Our research helps identify collaboration patterns that are preferable or detrimental for data quality, thus providing insights for improving data quality in Wikipedia.

KW - Collaboration pattern

KW - Data provenance

KW - Data quality

KW - Wikipedia

UR - http://www.scopus.com/inward/record.url?scp=79952983202&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79952983202&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:79952983202

SP - 175

EP - 180

BT - 19th Workshop on Information Technologies and Systems, WITS 2009

PB - Social Science Research Network

ER -