Detecting cyber threats in non-english hacker forums: An adversarial cross-lingual knowledge transfer approach

Mohammadreza Ebrahimi, Sagar Samtani, Yidong Chai, Hsinchun Chen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The regularity of devastating cyber-attacks has made cybersecurity a grand societal challenge. Many cybersecurity professionals are closely examining the international Dark Web to proactively pinpoint potential cyber threats. Despite its potential, the Dark Web contains hundreds of thousands of non-English posts. While machine translation is the prevailing approach to process non-English text, applying MT on hacker forum text results in mistranslations. In this study, we draw upon Long-Short Term Memory (LSTM), Cross-Lingual Knowledge Transfer (CLKT), and Generative Adversarial Networks (GANs) principles to design a novel Adversarial CLKT (A-CLKT) approach. A-CLKT operates on untranslated text to retain the original semantics of the language and leverages the collective knowledge about cyber threats across languages to create a language invariant representation without any manual feature engineering or external resources. Three experiments demonstrate how A-CLKT outperforms state-of-the-art machine learning, deep learning, and CLKT algorithms in identifying cyber-threats in French and Russian forums.

Original languageEnglish (US)
Title of host publicationProceedings - 2020 IEEE Symposium on Security and Privacy Workshops, SPW 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages20-26
Number of pages7
ISBN (Electronic)9781728193465
DOIs
StatePublished - May 2020
Event2020 IEEE Symposium on Security and Privacy Workshops, SPW 2020 - Virtual, San Francisco, United States
Duration: May 21 2020 → …

Publication series

NameProceedings - 2020 IEEE Symposium on Security and Privacy Workshops, SPW 2020

Conference

Conference2020 IEEE Symposium on Security and Privacy Workshops, SPW 2020
Country/TerritoryUnited States
CityVirtual, San Francisco
Period5/21/20 → …

Keywords

  • Adversarial learning
  • Cross-lingual knowledge transfer
  • Generative adversarial networks
  • Hacker forums
  • Long short-term memory

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Civil and Structural Engineering
  • Safety, Risk, Reliability and Quality
  • Analysis

Fingerprint

Dive into the research topics of 'Detecting cyber threats in non-english hacker forums: An adversarial cross-lingual knowledge transfer approach'. Together they form a unique fingerprint.

Cite this