Mining smart card data for transit riders' travel patterns

Xiaolei Ma, Yao-jan Wu, Yinhai Wang, Feng Chen, Jianfeng Liu

Research output: Contribution to journalArticle

225 Citations (Scopus)

Abstract

To mitigate the congestion caused by the ever increasing number of privately owned automobiles, public transit is highly promoted by transportation agencies worldwide. A better understanding of travel patterns and regularity at the "magnitude" level will enable transit authorities to evaluate the services they offer, adjust marketing strategies, retain loyal customers and improve overall transit performance. However, it is fairly challenging to identify travel patterns for individual transit riders in a large dataset. This paper proposes an efficient and effective data-mining procedure that models the travel patterns of transit riders in Beijing, China. Transit riders' trip chains are identified based on the temporal and spatial characteristics of their smart card transaction data. The Density-based Spatial Clustering of Applications with Noise (DBSCAN) algorithm then analyzes the identified trip chains to detect transit riders' historical travel patterns and the K-Means++ clustering algorithm and the rough-set theory are jointly applied to cluster and classify travel pattern regularities. The performance of the rough-set-based algorithm is compared with those of other prevailing classification algorithms. The results indicate that the proposed rough-set-based algorithm outperforms other commonly used data-mining algorithms in terms of accuracy and efficiency.

Original languageEnglish (US)
Pages (from-to)1-12
Number of pages12
JournalTransportation Research Part C: Emerging Technologies
Volume36
DOIs
StatePublished - Nov 2013

Fingerprint

Smart cards
travel
Data mining
regularity
transit authority
Rough set theory
set theory
Clustering algorithms
Automobiles
Marketing
motor vehicle
performance
transaction
Smart card
marketing
customer
efficiency
China

Keywords

  • Automatic Fare Collection System
  • K-Means algorithm
  • Rough set theory
  • Smart card
  • Transit travel pattern

ASJC Scopus subject areas

  • Computer Science Applications
  • Management Science and Operations Research
  • Automotive Engineering
  • Transportation

Cite this

Mining smart card data for transit riders' travel patterns. / Ma, Xiaolei; Wu, Yao-jan; Wang, Yinhai; Chen, Feng; Liu, Jianfeng.

In: Transportation Research Part C: Emerging Technologies, Vol. 36, 11.2013, p. 1-12.

Research output: Contribution to journalArticle

Ma, Xiaolei ; Wu, Yao-jan ; Wang, Yinhai ; Chen, Feng ; Liu, Jianfeng. / Mining smart card data for transit riders' travel patterns. In: Transportation Research Part C: Emerging Technologies. 2013 ; Vol. 36. pp. 1-12.
@article{54d3cf593d474d16a94cef09cb7c086e,
title = "Mining smart card data for transit riders' travel patterns",
abstract = "To mitigate the congestion caused by the ever increasing number of privately owned automobiles, public transit is highly promoted by transportation agencies worldwide. A better understanding of travel patterns and regularity at the {"}magnitude{"} level will enable transit authorities to evaluate the services they offer, adjust marketing strategies, retain loyal customers and improve overall transit performance. However, it is fairly challenging to identify travel patterns for individual transit riders in a large dataset. This paper proposes an efficient and effective data-mining procedure that models the travel patterns of transit riders in Beijing, China. Transit riders' trip chains are identified based on the temporal and spatial characteristics of their smart card transaction data. The Density-based Spatial Clustering of Applications with Noise (DBSCAN) algorithm then analyzes the identified trip chains to detect transit riders' historical travel patterns and the K-Means++ clustering algorithm and the rough-set theory are jointly applied to cluster and classify travel pattern regularities. The performance of the rough-set-based algorithm is compared with those of other prevailing classification algorithms. The results indicate that the proposed rough-set-based algorithm outperforms other commonly used data-mining algorithms in terms of accuracy and efficiency.",
keywords = "Automatic Fare Collection System, K-Means algorithm, Rough set theory, Smart card, Transit travel pattern",
author = "Xiaolei Ma and Yao-jan Wu and Yinhai Wang and Feng Chen and Jianfeng Liu",
year = "2013",
month = "11",
doi = "10.1016/j.trc.2013.07.010",
language = "English (US)",
volume = "36",
pages = "1--12",
journal = "Transportation Research Part C: Emerging Technologies",
issn = "0968-090X",
publisher = "Elsevier Limited",

}

TY - JOUR

T1 - Mining smart card data for transit riders' travel patterns

AU - Ma, Xiaolei

AU - Wu, Yao-jan

AU - Wang, Yinhai

AU - Chen, Feng

AU - Liu, Jianfeng

PY - 2013/11

Y1 - 2013/11

N2 - To mitigate the congestion caused by the ever increasing number of privately owned automobiles, public transit is highly promoted by transportation agencies worldwide. A better understanding of travel patterns and regularity at the "magnitude" level will enable transit authorities to evaluate the services they offer, adjust marketing strategies, retain loyal customers and improve overall transit performance. However, it is fairly challenging to identify travel patterns for individual transit riders in a large dataset. This paper proposes an efficient and effective data-mining procedure that models the travel patterns of transit riders in Beijing, China. Transit riders' trip chains are identified based on the temporal and spatial characteristics of their smart card transaction data. The Density-based Spatial Clustering of Applications with Noise (DBSCAN) algorithm then analyzes the identified trip chains to detect transit riders' historical travel patterns and the K-Means++ clustering algorithm and the rough-set theory are jointly applied to cluster and classify travel pattern regularities. The performance of the rough-set-based algorithm is compared with those of other prevailing classification algorithms. The results indicate that the proposed rough-set-based algorithm outperforms other commonly used data-mining algorithms in terms of accuracy and efficiency.

AB - To mitigate the congestion caused by the ever increasing number of privately owned automobiles, public transit is highly promoted by transportation agencies worldwide. A better understanding of travel patterns and regularity at the "magnitude" level will enable transit authorities to evaluate the services they offer, adjust marketing strategies, retain loyal customers and improve overall transit performance. However, it is fairly challenging to identify travel patterns for individual transit riders in a large dataset. This paper proposes an efficient and effective data-mining procedure that models the travel patterns of transit riders in Beijing, China. Transit riders' trip chains are identified based on the temporal and spatial characteristics of their smart card transaction data. The Density-based Spatial Clustering of Applications with Noise (DBSCAN) algorithm then analyzes the identified trip chains to detect transit riders' historical travel patterns and the K-Means++ clustering algorithm and the rough-set theory are jointly applied to cluster and classify travel pattern regularities. The performance of the rough-set-based algorithm is compared with those of other prevailing classification algorithms. The results indicate that the proposed rough-set-based algorithm outperforms other commonly used data-mining algorithms in terms of accuracy and efficiency.

KW - Automatic Fare Collection System

KW - K-Means algorithm

KW - Rough set theory

KW - Smart card

KW - Transit travel pattern

UR - http://www.scopus.com/inward/record.url?scp=84884728365&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84884728365&partnerID=8YFLogxK

U2 - 10.1016/j.trc.2013.07.010

DO - 10.1016/j.trc.2013.07.010

M3 - Article

AN - SCOPUS:84884728365

VL - 36

SP - 1

EP - 12

JO - Transportation Research Part C: Emerging Technologies

JF - Transportation Research Part C: Emerging Technologies

SN - 0968-090X

ER -