Voting experts: An unsupervised algorithm for segmenting sequences

Paul Cohen, Niall Adams, Brent Heeringa

Research output: Contribution to journalArticlepeer-review

41 Scopus citations

Abstract

We describe a statistical signature of chunks and an algorithm for finding chunks. While there is no formal definition of chunks, they may be reliably identified as configurations with low internal entropy or unpredictability and high entropy at their boundaries. We show that the log frequency of a chunk is a measure of its internal entropy. The Voting-Experts exploits the signature of chunks to find word boundaries in text from four languages and episode boundaries in the activities of a mobile robot.

Original languageEnglish (US)
Pages (from-to)607-625
Number of pages19
JournalIntelligent Data Analysis
Volume11
Issue number6
StatePublished - Dec 1 2007

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Fingerprint Dive into the research topics of 'Voting experts: An unsupervised algorithm for segmenting sequences'. Together they form a unique fingerprint.

Cite this