When is the right time to refresh knowledge discovered from data?

Xiao Fang, Olivia R.Liu Sheng, Paulo Goes

Research output: Contribution to journalArticle

11 Scopus citations

Abstract

Knowledge discovery in databases (KDD) techniques have been extensively employed to extract knowledge from massive data stores to support decision making in a wide range of critical applications. Maintaining the currency of discovered knowledge over evolving data sources is a fundamental challenge faced by all KDD applications. This paper addresses the challenge from the perspective of deciding the right times to refresh knowledge. We define the knowledge-refreshing problem and model it as a Markov decision process. Based on the identified properties of the Markov decision process model, we establish that the optimal knowledge-refreshing policy is monotonically increasing in the system state within every appropriate partition of the state space. We further show that the problem of searching for the optimal knowledgerefreshing policy can be reduced to the problem of finding the optimal thresholds and propose a method for computing the optimal knowledge-refreshing policy. The effectiveness and the robustness of the computed optimal knowledge-refreshing policy are examined through extensive empirical studies addressing a real-world knowledge-refreshing problem. Our method can be applied to refresh knowledge for KDD applications that employ major data-mining models.

Original languageEnglish (US)
Pages (from-to)32-44
Number of pages13
JournalOperations Research
Volume61
Issue number1
DOIs
StatePublished - Jan 1 2013

Keywords

  • Data mining
  • Knowledge discovery in databases
  • Knowledge refreshing
  • Markov decision process

ASJC Scopus subject areas

  • Computer Science Applications
  • Management Science and Operations Research

Fingerprint Dive into the research topics of 'When is the right time to refresh knowledge discovered from data?'. Together they form a unique fingerprint.

Cite this