Range query estimation with data skewness for top-k retrieval

Anteneh Ayanso, Paulo B. Goes, Kumar Mehta

Research output: Contribution to journalArticle

2 Scopus citations

Abstract

Top-k querying can significantly improve the performance of web-based business intelligence applications such as price comparison and product recommendation systems. Top-k retrieval involves finding a limited number of records in a relational database that are most similar to user-specified attribute-value pairs. This paper extends the cost-based query-mapping method for top-k retrieval by incorporating data skewness in range estimation. Experiments on real world and synthetic multi-attribute data sets show that incorporating data skewness provides a robust performance across different types of data sets, query sets, distance functions, and histograms.

Original languageEnglish (US)
Pages (from-to)258-273
Number of pages16
JournalDecision Support Systems
Volume57
Issue number1
DOIs
StatePublished - Jan 1 2014

Keywords

  • Cost model
  • Query processing
  • Query-mapping
  • RDBMSs
  • Top-k query

ASJC Scopus subject areas

  • Management Information Systems
  • Information Systems
  • Developmental and Educational Psychology
  • Arts and Humanities (miscellaneous)
  • Information Systems and Management

Fingerprint Dive into the research topics of 'Range query estimation with data skewness for top-k retrieval'. Together they form a unique fingerprint.

  • Cite this