Clustering main concepts from e-mails

Jesús S. Aguilar-Ruiz, Domingo S. Rodriguez-Baena, Paul R. Cohen, Jose Cristóbal Riquelme

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

E–mail is one of the most common ways to communicate, assuming, in some cases, up to 75% of a company’s communication, in which every employee spends about 90 minutes a day in e–mail tasks such as filing and deleting. This paper deals with the generation of clusters of relevant words from E–mail texts. Our approach consists of the application of text mining techniques and, later, data mining techniques, to obtain related concepts extracted from sent and received messages. We have developed a new clustering algorithm based on neighborhood, which takes into account similarity values among words obtained in the text mining phase. The potential of these applications is enormous and only a few companies, mainly large organizations, have invested in this project so far, taking advantage of employees’s knowledge in future decisions.

Original languageEnglish (US)
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
EditorsRicardo Conejo, Jose-Luis Perez-de-la-Cruz, Maite Urretavizcaya
PublisherSpringer-Verlag
Pages231-240
Number of pages10
ISBN (Print)3540222189, 9783540222187
StatePublished - Jan 1 2004
Event10th Conference of the Spanish Association for Artificial Intelligence, CAEPIA 2003 and 5th Conference on Technology Transfer, TTIA 2003 - San Sebastian, Spain
Duration: Nov 12 2003Nov 14 2003

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3040
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference10th Conference of the Spanish Association for Artificial Intelligence, CAEPIA 2003 and 5th Conference on Technology Transfer, TTIA 2003
CountrySpain
CitySan Sebastian
Period11/12/0311/14/03

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'Clustering main concepts from e-mails'. Together they form a unique fingerprint.

  • Cite this

    Aguilar-Ruiz, J. S., Rodriguez-Baena, D. S., Cohen, P. R., & Riquelme, J. C. (2004). Clustering main concepts from e-mails. In R. Conejo, J-L. Perez-de-la-Cruz, & M. Urretavizcaya (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 231-240). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 3040). Springer-Verlag.