Developing understanding of hacker language through the use of lexical semantics

Victor Benjamin, Hsinchun Chen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

The need for more research scrutinizing online hacker communities is a common suggestion in recent years. However, researchers and practitioners face many challenges when attempting to do so. In particular, they may encounter hacking-specific terms, concepts, tools, and other items that are unfamiliar and may be challenging to understand. For these reasons, we are motivated to develop an automated method for developing understanding of hacker language. We utilize the latest advancements in recurrent neural network language models (RNNLMs) to develop an unsupervised machine learning technique for learning hacker language. The selected RNNLM produces state-of-the-art word embeddings that are useful for understanding the relations between different hacker terms and concepts. We evaluate our work by testing the RNNLMs ability to learn relevant relations between known hacker terms. Results suggest that the latest work in RNNLMs can aid in modeling hacker language, providing promising direction for future research.

Original languageEnglish (US)
Title of host publication2015 IEEE International Conference on Intelligence and Security Informatics: Securing the World through an Alignment of Technology, Intelligence, Humans and Organizations, ISI 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages79-84
Number of pages6
ISBN (Print)9781479998883
DOIs
StatePublished - Jul 23 2015
Event13th IEEE International Conference on Intelligence and Security Informatics, ISI 2015 - Baltimore, United States
Duration: May 27 2015May 29 2015

Other

Other13th IEEE International Conference on Intelligence and Security Informatics, ISI 2015
CountryUnited States
CityBaltimore
Period5/27/155/29/15

Fingerprint

hacker
Recurrent neural networks
Semantics
semantics
neural network
language
Learning systems
Testing
learning
ability

Keywords

  • Cybersecurity
  • Hacker community
  • Language model
  • Recurrent neural network

ASJC Scopus subject areas

  • Artificial Intelligence
  • Law
  • Computer Networks and Communications
  • Safety, Risk, Reliability and Quality

Cite this

Benjamin, V., & Chen, H. (2015). Developing understanding of hacker language through the use of lexical semantics. In 2015 IEEE International Conference on Intelligence and Security Informatics: Securing the World through an Alignment of Technology, Intelligence, Humans and Organizations, ISI 2015 (pp. 79-84). [7165943] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ISI.2015.7165943

Developing understanding of hacker language through the use of lexical semantics. / Benjamin, Victor; Chen, Hsinchun.

2015 IEEE International Conference on Intelligence and Security Informatics: Securing the World through an Alignment of Technology, Intelligence, Humans and Organizations, ISI 2015. Institute of Electrical and Electronics Engineers Inc., 2015. p. 79-84 7165943.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Benjamin, V & Chen, H 2015, Developing understanding of hacker language through the use of lexical semantics. in 2015 IEEE International Conference on Intelligence and Security Informatics: Securing the World through an Alignment of Technology, Intelligence, Humans and Organizations, ISI 2015., 7165943, Institute of Electrical and Electronics Engineers Inc., pp. 79-84, 13th IEEE International Conference on Intelligence and Security Informatics, ISI 2015, Baltimore, United States, 5/27/15. https://doi.org/10.1109/ISI.2015.7165943
Benjamin V, Chen H. Developing understanding of hacker language through the use of lexical semantics. In 2015 IEEE International Conference on Intelligence and Security Informatics: Securing the World through an Alignment of Technology, Intelligence, Humans and Organizations, ISI 2015. Institute of Electrical and Electronics Engineers Inc. 2015. p. 79-84. 7165943 https://doi.org/10.1109/ISI.2015.7165943
Benjamin, Victor ; Chen, Hsinchun. / Developing understanding of hacker language through the use of lexical semantics. 2015 IEEE International Conference on Intelligence and Security Informatics: Securing the World through an Alignment of Technology, Intelligence, Humans and Organizations, ISI 2015. Institute of Electrical and Electronics Engineers Inc., 2015. pp. 79-84
@inproceedings{86fe289f8d334fb69ffcfa38fa5aed8a,
title = "Developing understanding of hacker language through the use of lexical semantics",
abstract = "The need for more research scrutinizing online hacker communities is a common suggestion in recent years. However, researchers and practitioners face many challenges when attempting to do so. In particular, they may encounter hacking-specific terms, concepts, tools, and other items that are unfamiliar and may be challenging to understand. For these reasons, we are motivated to develop an automated method for developing understanding of hacker language. We utilize the latest advancements in recurrent neural network language models (RNNLMs) to develop an unsupervised machine learning technique for learning hacker language. The selected RNNLM produces state-of-the-art word embeddings that are useful for understanding the relations between different hacker terms and concepts. We evaluate our work by testing the RNNLMs ability to learn relevant relations between known hacker terms. Results suggest that the latest work in RNNLMs can aid in modeling hacker language, providing promising direction for future research.",
keywords = "Cybersecurity, Hacker community, Language model, Recurrent neural network",
author = "Victor Benjamin and Hsinchun Chen",
year = "2015",
month = "7",
day = "23",
doi = "10.1109/ISI.2015.7165943",
language = "English (US)",
isbn = "9781479998883",
pages = "79--84",
booktitle = "2015 IEEE International Conference on Intelligence and Security Informatics: Securing the World through an Alignment of Technology, Intelligence, Humans and Organizations, ISI 2015",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - Developing understanding of hacker language through the use of lexical semantics

AU - Benjamin, Victor

AU - Chen, Hsinchun

PY - 2015/7/23

Y1 - 2015/7/23

N2 - The need for more research scrutinizing online hacker communities is a common suggestion in recent years. However, researchers and practitioners face many challenges when attempting to do so. In particular, they may encounter hacking-specific terms, concepts, tools, and other items that are unfamiliar and may be challenging to understand. For these reasons, we are motivated to develop an automated method for developing understanding of hacker language. We utilize the latest advancements in recurrent neural network language models (RNNLMs) to develop an unsupervised machine learning technique for learning hacker language. The selected RNNLM produces state-of-the-art word embeddings that are useful for understanding the relations between different hacker terms and concepts. We evaluate our work by testing the RNNLMs ability to learn relevant relations between known hacker terms. Results suggest that the latest work in RNNLMs can aid in modeling hacker language, providing promising direction for future research.

AB - The need for more research scrutinizing online hacker communities is a common suggestion in recent years. However, researchers and practitioners face many challenges when attempting to do so. In particular, they may encounter hacking-specific terms, concepts, tools, and other items that are unfamiliar and may be challenging to understand. For these reasons, we are motivated to develop an automated method for developing understanding of hacker language. We utilize the latest advancements in recurrent neural network language models (RNNLMs) to develop an unsupervised machine learning technique for learning hacker language. The selected RNNLM produces state-of-the-art word embeddings that are useful for understanding the relations between different hacker terms and concepts. We evaluate our work by testing the RNNLMs ability to learn relevant relations between known hacker terms. Results suggest that the latest work in RNNLMs can aid in modeling hacker language, providing promising direction for future research.

KW - Cybersecurity

KW - Hacker community

KW - Language model

KW - Recurrent neural network

UR - http://www.scopus.com/inward/record.url?scp=84963758658&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84963758658&partnerID=8YFLogxK

U2 - 10.1109/ISI.2015.7165943

DO - 10.1109/ISI.2015.7165943

M3 - Conference contribution

SN - 9781479998883

SP - 79

EP - 84

BT - 2015 IEEE International Conference on Intelligence and Security Informatics: Securing the World through an Alignment of Technology, Intelligence, Humans and Organizations, ISI 2015

PB - Institute of Electrical and Electronics Engineers Inc.

ER -