IEDs in the dark web: Lexicon expansion and genre classification

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Citations (Scopus)

Abstract

Improvised explosive device web pages represent a significant source of knowledge for security organizations. In this paper, we present significant improvements to our approach to the discovery and classification of IED related web pages in the Dark Web. We present a statistical feature ranking approach to the expansion of the keyword lexicon used to discover IED related web pages, which identified new relevant terms for inclusion. Additionally, we present an improved web page feature representation designed to better capture the structural and stylistic cues revealing of genres of communication, and a series of experiments comparing the classification performance of the new representation with our existing approach.

Original languageEnglish (US)
Title of host publication2009 IEEE International Conference on Intelligence and Security Informatics, ISI 2009
Pages173-175
Number of pages3
DOIs
StatePublished - 2009
Event2009 IEEE International Conference on Intelligence and Security Informatics, ISI 2009 - Dallas, TX, United States
Duration: Jun 8 2009Jun 11 2009

Other

Other2009 IEEE International Conference on Intelligence and Security Informatics, ISI 2009
CountryUnited States
CityDallas, TX
Period6/8/096/11/09

Fingerprint

Websites
Communication
Experiments

Keywords

  • Dark web
  • Genre classification
  • Improvised explosive device

ASJC Scopus subject areas

  • Artificial Intelligence
  • Information Systems
  • Software

Cite this

Chen, H. (2009). IEDs in the dark web: Lexicon expansion and genre classification. In 2009 IEEE International Conference on Intelligence and Security Informatics, ISI 2009 (pp. 173-175). [5137293] https://doi.org/10.1109/ISI.2009.5137293

IEDs in the dark web : Lexicon expansion and genre classification. / Chen, Hsinchun.

2009 IEEE International Conference on Intelligence and Security Informatics, ISI 2009. 2009. p. 173-175 5137293.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chen, H 2009, IEDs in the dark web: Lexicon expansion and genre classification. in 2009 IEEE International Conference on Intelligence and Security Informatics, ISI 2009., 5137293, pp. 173-175, 2009 IEEE International Conference on Intelligence and Security Informatics, ISI 2009, Dallas, TX, United States, 6/8/09. https://doi.org/10.1109/ISI.2009.5137293
Chen H. IEDs in the dark web: Lexicon expansion and genre classification. In 2009 IEEE International Conference on Intelligence and Security Informatics, ISI 2009. 2009. p. 173-175. 5137293 https://doi.org/10.1109/ISI.2009.5137293
Chen, Hsinchun. / IEDs in the dark web : Lexicon expansion and genre classification. 2009 IEEE International Conference on Intelligence and Security Informatics, ISI 2009. 2009. pp. 173-175
@inproceedings{61e773620b9e4985a9922c16e58273a8,
title = "IEDs in the dark web: Lexicon expansion and genre classification",
abstract = "Improvised explosive device web pages represent a significant source of knowledge for security organizations. In this paper, we present significant improvements to our approach to the discovery and classification of IED related web pages in the Dark Web. We present a statistical feature ranking approach to the expansion of the keyword lexicon used to discover IED related web pages, which identified new relevant terms for inclusion. Additionally, we present an improved web page feature representation designed to better capture the structural and stylistic cues revealing of genres of communication, and a series of experiments comparing the classification performance of the new representation with our existing approach.",
keywords = "Dark web, Genre classification, Improvised explosive device",
author = "Hsinchun Chen",
year = "2009",
doi = "10.1109/ISI.2009.5137293",
language = "English (US)",
isbn = "9781424441730",
pages = "173--175",
booktitle = "2009 IEEE International Conference on Intelligence and Security Informatics, ISI 2009",

}

TY - GEN

T1 - IEDs in the dark web

T2 - Lexicon expansion and genre classification

AU - Chen, Hsinchun

PY - 2009

Y1 - 2009

N2 - Improvised explosive device web pages represent a significant source of knowledge for security organizations. In this paper, we present significant improvements to our approach to the discovery and classification of IED related web pages in the Dark Web. We present a statistical feature ranking approach to the expansion of the keyword lexicon used to discover IED related web pages, which identified new relevant terms for inclusion. Additionally, we present an improved web page feature representation designed to better capture the structural and stylistic cues revealing of genres of communication, and a series of experiments comparing the classification performance of the new representation with our existing approach.

AB - Improvised explosive device web pages represent a significant source of knowledge for security organizations. In this paper, we present significant improvements to our approach to the discovery and classification of IED related web pages in the Dark Web. We present a statistical feature ranking approach to the expansion of the keyword lexicon used to discover IED related web pages, which identified new relevant terms for inclusion. Additionally, we present an improved web page feature representation designed to better capture the structural and stylistic cues revealing of genres of communication, and a series of experiments comparing the classification performance of the new representation with our existing approach.

KW - Dark web

KW - Genre classification

KW - Improvised explosive device

UR - http://www.scopus.com/inward/record.url?scp=70350064283&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=70350064283&partnerID=8YFLogxK

U2 - 10.1109/ISI.2009.5137293

DO - 10.1109/ISI.2009.5137293

M3 - Conference contribution

AN - SCOPUS:70350064283

SN - 9781424441730

SP - 173

EP - 175

BT - 2009 IEEE International Conference on Intelligence and Security Informatics, ISI 2009

ER -