Binary Code Similarity Detection through LSTM and Siamese Neural Network

Zhengping  Luo; Tao  Hou; Xiangrong  Zhou; Hui  Zeng; Zhuo  Lu

doi:10.4108/eai.14-9-2021.170956

Binary Code Similarity Detection through LSTM and Siamese Neural Network

Authors

Zhengping Luo Rider University
Tao Hou University of South Florida
Xiangrong Zhou Intelligent Automation (United States)
Hui Zeng Intelligent Automation (United States)
Zhuo Lu University of South Florida

DOI:

https://doi.org/10.4108/eai.14-9-2021.170956

Keywords:

Malware detection, binary analysis, LSTM, Siamese Neural Network, similarity detection

Abstract

Given the fact that many software projects are closed-source, analyzing security-related vulnerabilities at the binary level is quintessential to protect computer systems from attacks of malware. Binary code similarity detection is a potential solution for detecting malware from the binaries generated by the processor. In this paper, we proposed a malware detection mechanism based on the binaries using machine learning techniques. Through utilizing the Recurrent Neural Network (RNN), more specifically Long Short-Term Memory (LSTM) network, we generate the uniformed feature embedding of each binary file and further take advantage of the Siamese Neural Network to compute the similarity measure of the extracted features. Therefore, the security risks of the software projects can be evaluated through the similarity measure of the corresponding binaries with existing trained malware. Our real-world experimental results demonstrate a convincing performance in distinguishing out the outliers, and achieved slightly better performance compared with existing state-of-the-art methods.

References

Downloads

Published

14-09-2021

Issue

Vol. 8 No. 29 (2021): EAI Endorsed Transactions on Security and Safety

Section

Research article

License

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

This is an open-access article distributed under the terms of the Creative Commons Attribution CC BY 4.0 license, which permits unlimited use, distribution, and reproduction in any medium so long as the original work is properly cited.

How to Cite

Luo Z, Hou T, Zhou X, Zeng H, Lu Z. Binary Code Similarity Detection through LSTM and Siamese Neural Network. EAI Endorsed Trans Sec Saf [Internet]. 2021 Sep. 14 [cited 2026 Jul. 26];8(29):e1. Available from: https://publications.eai.eu/index.php/sesa/article/view/29

Download Citation

Binary Code Similarity Detection through LSTM and Siamese Neural Network

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite

Most read articles by the same author(s)

Latest publications

Make a Submission