ProcData: An R Package for Process Data Analysis

Xueying Tang, Susu Zhang, Zhi Wang, Jingchen Liu, Zhiliang Ying

Research output: Contribution to journal › Article › peer-review

Abstract

Process data refer to data recorded in the log files of computer-based items. These data, represented as timestamped action sequences, keep track of respondents’ problem-solving behaviors. Process data analysis aims to improve the accuracy of educational assessment and to serve other assessment purposes by using the rich information contained in response processes. The R package ProcData presented in this article provides tools for inspecting, processing, and analyzing process data. We define an S3 class ‘proc’ for organizing process data and extend the generic methods summary and print to ‘proc’ objects. Feature extraction methods implemented in the package compress the information in irregular response processes into regular numeric vectors. ProcData also provides functions for making predictions from neural-network-based sequence models. In addition, the package includes a real dataset of response processes from the climate control item in the 2012 Programme for International Student Assessment.
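To make the idea of compressing irregular response processes into regular numeric vectors concrete, here is a minimal sketch in Python. It is not the ProcData API and does not use the package's own feature extraction methods (the keywords point to multidimensional scaling and autoencoders). It uses simple action counts as a stand-in. The respondent IDs, action names, timestamps, and function names are all hypothetical.

```python
from collections import Counter

# Hypothetical toy data: one timestamped action sequence per respondent.
processes = {
    "r1": {"actions": ["start", "move_top", "apply", "end"],
           "times": [0.0, 1.2, 2.5, 4.0]},
    "r2": {"actions": ["start", "apply", "reset", "move_top", "apply", "end"],
           "times": [0.0, 0.8, 1.5, 3.1, 3.9, 6.2]},
    "r3": {"actions": ["start", "end"],
           "times": [0.0, 0.5]},
}


def action_vocabulary(processes):
    """Return the sorted list of distinct actions across all respondents."""
    return sorted({a for p in processes.values() for a in p["actions"]})


def extract_features(process, vocab):
    """Map one variable-length process to a fixed-length numeric vector:
    [sequence length, total elapsed time, count of each action in vocab]."""
    counts = Counter(process["actions"])
    total_time = process["times"][-1] - process["times"][0]
    return [len(process["actions"]), total_time] + [counts[a] for a in vocab]


vocab = action_vocabulary(processes)
features = {rid: extract_features(p, vocab) for rid, p in processes.items()}

for rid, vec in features.items():
    print(rid, vec)
```

Every respondent ends up with a vector of the same length regardless of how many actions they took, which is the property that lets standard statistical models be applied downstream.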

Original language: English (US)
Journal: Psychometrika
State: Accepted/In press - 2021
Externally published: Yes

Keywords

  • autoencoder
  • multidimensional scaling
  • process data analysis
  • sequence model

ASJC Scopus subject areas

  • Psychology (all)
  • Applied Mathematics
