An empirical study of cost-sensitive learning in cultural modeling

Peng Su, Wenji Mao, Dajun Zeng

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

Cultural modeling aims at developing behavioral models of groups and analyzing the impact of culture factors on group behavior using computational methods. Machine learning methods and in particular classification, play a central role in such applications. In modeling cultural data, it is expected that standard classifiers yield good performance under the assumption that different classification errors have uniform costs. However, this assumption is often violated in practice. Therefore, the performance of standard classifiers is severely hindered. To handle this problem, this paper empirically studies cost-sensitive learning in cultural modeling. We consider cost factor when building the classifiers, with the aim of minimizing total misclassification costs. We conduct experiments to investigate four typical cost-sensitive learning methods, combine them with six standard classifiers and evaluate their performance under various conditions. Our empirical study verifies the effectiveness of cost-sensitive learning in cultural modeling. Based on the experimental results, we gain a thorough insight into the problem of non-uniform misclassification costs, as well as the selection of cost-sensitive methods, base classifiers and method-classifier pairs for this domain. Furthermore, we propose an improved algorithm which outperforms the best method-classifier pair using the benchmark cultural datasets.

Original languageEnglish (US)
Pages (from-to)437-455
Number of pages19
JournalInformation Systems and e-Business Management
Volume11
Issue number3
DOIs
StatePublished - 2013

Fingerprint

Classifiers
Costs
Computational methods
Data structures
Learning systems
Experiments

Keywords

  • Behavior modeling and prediction
  • Class imbalance problem
  • Cost-sensitive learning
  • Cultural modeling
  • Misclassification cost

ASJC Scopus subject areas

  • Information Systems

Cite this

An empirical study of cost-sensitive learning in cultural modeling. / Su, Peng; Mao, Wenji; Zeng, Dajun.

In: Information Systems and e-Business Management, Vol. 11, No. 3, 2013, p. 437-455.

Research output: Contribution to journalArticle

@article{9fb411d77f3249269f1e71c21289f0c3,
title = "An empirical study of cost-sensitive learning in cultural modeling",
abstract = "Cultural modeling aims at developing behavioral models of groups and analyzing the impact of culture factors on group behavior using computational methods. Machine learning methods and in particular classification, play a central role in such applications. In modeling cultural data, it is expected that standard classifiers yield good performance under the assumption that different classification errors have uniform costs. However, this assumption is often violated in practice. Therefore, the performance of standard classifiers is severely hindered. To handle this problem, this paper empirically studies cost-sensitive learning in cultural modeling. We consider cost factor when building the classifiers, with the aim of minimizing total misclassification costs. We conduct experiments to investigate four typical cost-sensitive learning methods, combine them with six standard classifiers and evaluate their performance under various conditions. Our empirical study verifies the effectiveness of cost-sensitive learning in cultural modeling. Based on the experimental results, we gain a thorough insight into the problem of non-uniform misclassification costs, as well as the selection of cost-sensitive methods, base classifiers and method-classifier pairs for this domain. Furthermore, we propose an improved algorithm which outperforms the best method-classifier pair using the benchmark cultural datasets.",
keywords = "Behavior modeling and prediction, Class imbalance problem, Cost-sensitive learning, Cultural modeling, Misclassification cost",
author = "Peng Su and Wenji Mao and Dajun Zeng",
year = "2013",
doi = "10.1007/s10257-012-0198-4",
language = "English (US)",
volume = "11",
pages = "437--455",
journal = "Information Systems and e-Business Management",
issn = "1617-9846",
publisher = "Springer Verlag",
number = "3",

}

TY - JOUR

T1 - An empirical study of cost-sensitive learning in cultural modeling

AU - Su, Peng

AU - Mao, Wenji

AU - Zeng, Dajun

PY - 2013

Y1 - 2013

N2 - Cultural modeling aims at developing behavioral models of groups and analyzing the impact of culture factors on group behavior using computational methods. Machine learning methods and in particular classification, play a central role in such applications. In modeling cultural data, it is expected that standard classifiers yield good performance under the assumption that different classification errors have uniform costs. However, this assumption is often violated in practice. Therefore, the performance of standard classifiers is severely hindered. To handle this problem, this paper empirically studies cost-sensitive learning in cultural modeling. We consider cost factor when building the classifiers, with the aim of minimizing total misclassification costs. We conduct experiments to investigate four typical cost-sensitive learning methods, combine them with six standard classifiers and evaluate their performance under various conditions. Our empirical study verifies the effectiveness of cost-sensitive learning in cultural modeling. Based on the experimental results, we gain a thorough insight into the problem of non-uniform misclassification costs, as well as the selection of cost-sensitive methods, base classifiers and method-classifier pairs for this domain. Furthermore, we propose an improved algorithm which outperforms the best method-classifier pair using the benchmark cultural datasets.

AB - Cultural modeling aims at developing behavioral models of groups and analyzing the impact of culture factors on group behavior using computational methods. Machine learning methods and in particular classification, play a central role in such applications. In modeling cultural data, it is expected that standard classifiers yield good performance under the assumption that different classification errors have uniform costs. However, this assumption is often violated in practice. Therefore, the performance of standard classifiers is severely hindered. To handle this problem, this paper empirically studies cost-sensitive learning in cultural modeling. We consider cost factor when building the classifiers, with the aim of minimizing total misclassification costs. We conduct experiments to investigate four typical cost-sensitive learning methods, combine them with six standard classifiers and evaluate their performance under various conditions. Our empirical study verifies the effectiveness of cost-sensitive learning in cultural modeling. Based on the experimental results, we gain a thorough insight into the problem of non-uniform misclassification costs, as well as the selection of cost-sensitive methods, base classifiers and method-classifier pairs for this domain. Furthermore, we propose an improved algorithm which outperforms the best method-classifier pair using the benchmark cultural datasets.

KW - Behavior modeling and prediction

KW - Class imbalance problem

KW - Cost-sensitive learning

KW - Cultural modeling

KW - Misclassification cost

UR - http://www.scopus.com/inward/record.url?scp=84880768121&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84880768121&partnerID=8YFLogxK

U2 - 10.1007/s10257-012-0198-4

DO - 10.1007/s10257-012-0198-4

M3 - Article

VL - 11

SP - 437

EP - 455

JO - Information Systems and e-Business Management

JF - Information Systems and e-Business Management

SN - 1617-9846

IS - 3

ER -