A method to evaluate CFG comparison algorithms

Patrick P F Chan, Christian S Collberg

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

Control-Flow Graph (CFG) similarity is a core technique in many areas, including malware detection and software plagiarism detection. While many algorithms have been proposed in the literature, their relative strengths and weaknesses have not been previously studied. Moreover, it is not even clear how to perform such an evaluation. In this paper we therefore propose the first methodology for evaluating CFG similarity algorithms with respect to accuracy and efficiency. At the heart of our methodology is a technique to automatically generate benchmark graphs, CFGs of known edit distances. We show the result of applying our methodology to four popular algorithms. Our results show that an algorithm proposed by Hu et al. is most efficient both in terms of running time and accuracy.

Original languageEnglish (US)
Title of host publicationProceedings - International Conference on Quality Software
PublisherIEEE Computer Society
Pages95-104
Number of pages10
ISBN (Print)9781479971978
DOIs
StatePublished - Nov 14 2014
Event14th International Conference on Quality Software, QSIC 2014 - Dallas, United States
Duration: Oct 2 2014Oct 3 2014

Other

Other14th International Conference on Quality Software, QSIC 2014
CountryUnited States
CityDallas
Period10/2/1410/3/14

Fingerprint

Flow graphs

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Chan, P. P. F., & Collberg, C. S. (2014). A method to evaluate CFG comparison algorithms. In Proceedings - International Conference on Quality Software (pp. 95-104). [06958392] IEEE Computer Society. https://doi.org/10.1109/QSIC.2014.28

A method to evaluate CFG comparison algorithms. / Chan, Patrick P F; Collberg, Christian S.

Proceedings - International Conference on Quality Software. IEEE Computer Society, 2014. p. 95-104 06958392.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chan, PPF & Collberg, CS 2014, A method to evaluate CFG comparison algorithms. in Proceedings - International Conference on Quality Software., 06958392, IEEE Computer Society, pp. 95-104, 14th International Conference on Quality Software, QSIC 2014, Dallas, United States, 10/2/14. https://doi.org/10.1109/QSIC.2014.28
Chan PPF, Collberg CS. A method to evaluate CFG comparison algorithms. In Proceedings - International Conference on Quality Software. IEEE Computer Society. 2014. p. 95-104. 06958392 https://doi.org/10.1109/QSIC.2014.28
Chan, Patrick P F ; Collberg, Christian S. / A method to evaluate CFG comparison algorithms. Proceedings - International Conference on Quality Software. IEEE Computer Society, 2014. pp. 95-104
@inproceedings{27d8d3a0d8034ea1aa1ba8061d620a90,
title = "A method to evaluate CFG comparison algorithms",
abstract = "Control-Flow Graph (CFG) similarity is a core technique in many areas, including malware detection and software plagiarism detection. While many algorithms have been proposed in the literature, their relative strengths and weaknesses have not been previously studied. Moreover, it is not even clear how to perform such an evaluation. In this paper we therefore propose the first methodology for evaluating CFG similarity algorithms with respect to accuracy and efficiency. At the heart of our methodology is a technique to automatically generate benchmark graphs, CFGs of known edit distances. We show the result of applying our methodology to four popular algorithms. Our results show that an algorithm proposed by Hu et al. is most efficient both in terms of running time and accuracy.",
author = "Chan, {Patrick P F} and Collberg, {Christian S}",
year = "2014",
month = "11",
day = "14",
doi = "10.1109/QSIC.2014.28",
language = "English (US)",
isbn = "9781479971978",
pages = "95--104",
booktitle = "Proceedings - International Conference on Quality Software",
publisher = "IEEE Computer Society",

}

TY - GEN

T1 - A method to evaluate CFG comparison algorithms

AU - Chan, Patrick P F

AU - Collberg, Christian S

PY - 2014/11/14

Y1 - 2014/11/14

N2 - Control-Flow Graph (CFG) similarity is a core technique in many areas, including malware detection and software plagiarism detection. While many algorithms have been proposed in the literature, their relative strengths and weaknesses have not been previously studied. Moreover, it is not even clear how to perform such an evaluation. In this paper we therefore propose the first methodology for evaluating CFG similarity algorithms with respect to accuracy and efficiency. At the heart of our methodology is a technique to automatically generate benchmark graphs, CFGs of known edit distances. We show the result of applying our methodology to four popular algorithms. Our results show that an algorithm proposed by Hu et al. is most efficient both in terms of running time and accuracy.

AB - Control-Flow Graph (CFG) similarity is a core technique in many areas, including malware detection and software plagiarism detection. While many algorithms have been proposed in the literature, their relative strengths and weaknesses have not been previously studied. Moreover, it is not even clear how to perform such an evaluation. In this paper we therefore propose the first methodology for evaluating CFG similarity algorithms with respect to accuracy and efficiency. At the heart of our methodology is a technique to automatically generate benchmark graphs, CFGs of known edit distances. We show the result of applying our methodology to four popular algorithms. Our results show that an algorithm proposed by Hu et al. is most efficient both in terms of running time and accuracy.

UR - http://www.scopus.com/inward/record.url?scp=84912095604&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84912095604&partnerID=8YFLogxK

U2 - 10.1109/QSIC.2014.28

DO - 10.1109/QSIC.2014.28

M3 - Conference contribution

AN - SCOPUS:84912095604

SN - 9781479971978

SP - 95

EP - 104

BT - Proceedings - International Conference on Quality Software

PB - IEEE Computer Society

ER -