A Multi-Channel Spam Detection System Utilizing Natural Language Processing and Machine Learning
DOI:
https://doi.org/10.4108/airo.8309Keywords:
Bidirectional Encoder Representations from Transformers, BERT, Machine Learning, ML, Natural Language Processing, NLP, Spam/Ham, Support Vector Machine, SVMAbstract
As digital communication rapidly expands, the issue of unsolicited and unwanted messages, commonly known as spam, has become a major concern. This paper introduces an advanced spam detection system that integrates Natural Language Processing (NLP) and Machine Learning (ML) techniques. The system differentiates between spam and legitimate messages by employing a hybrid model that combines Naive Bayes, Support Vector Machines (SVM), and deep learning models like Bidirectional Encoder Representations from
Transformers (BERT). The model demonstrates high effectiveness across various communication platforms, including emails, SMS, and social media, achieving an accuracy exceeding 98.5%.
Downloads
References
[1] Jancy S, Daisy S, Begum AR. Email Spam Behavioral Sieving Technique using Hybrid Algorithm. In: IEEE ISMAC. 2023:687–693.
[2] Ugwueze WO, Anigbogu SO, Asogwa EC, Asogwa DC, Anigbogu KS. Enhancing Email Security: A Hybrid Machine Learning Approach for Spam and Malware Detection. World J Adv Eng Technol Sci. 2024;12(1):187–200.
[3] Nakarmi A, Parajuli R, Sharma G. Hybrid Classifier for Enhancing Accuracy and Performance of Spam and Ham Email Detection. Int J Res Publ. 2024.
[4] Dupade R. Spam Email Identification. Indian Sci J Res Eng Manag. 2024.
[5] Mythili J, Deebeshkumar B, Eshwaramoorthy T, Ajay J. Enhancing Email Spam Detection with Temporal Naive Bayes Classifier. 2024.
[6] Salıcı M, Ölçer Ü. Impact of Transformer-Based Models in NLP: An In-Depth Study on BERT and GPT. 2024:1–6.
[7] Beydemir AB, Sezgin U, Dogan UA, Aşıklar BE, Yerlikaya FA, Bahtıyar Ş. A Dynamically Selected GPT Model for Phishing Detection. 2024:481–484.
[8] Agarwal K, Kumar T. Email spam detection using integrated approach of naïve bayes and particle swarm optimization. In: IEEE ICICCS. 2018:685–690.
[9] Arya V, Almomani AAD, Mishra A, Peraković D, Rafsanjani MK. Email spam detection using naive bayes and random forest classifiers. In: ICSPN. 2022:341–348.
[10] GuangJun L, Nazir S, Khan HU, Haq AU. Spam detection approach for secure mobile message communication using machine learning algorithms. Sec Commun Netw. 2020:1–6.
[11] Guo Y, Mustafaoglu Z, Koundal D. Spam detection using bidirectional transformers and machine learning classifier algorithms. J Comput Cogn Eng. 2023;2(1):5–9.
[12] Kontsewaya Y, Antonov E, Artamonov A. Evaluating the effectiveness of machine learning methods for spam detection. Procedia Comput Sci. 2021;190:479–486.
[13] Lee H, Jeong S, Cho S, Choi E. Visualization technology and deep-learning for multilingual spam message detection. Electronics. 2023;12(3):582.
[14] Nandhini S, Marseline KSJ. Performance evaluation of machine learning algorithms for email spam detection. In: IEEE icETITE. 2020:1–4.
[15] Navaney P, Dubey G, Rana A. SMS spam filtering using supervised machine learning algorithms. In: IEEE Confluence. 2018:43–48.
[16] Ponmalar A, Rajkumar K, Hariharan U, Kalaiselvi VKG, Deeba S. Analysis of spam detection using integration of logistic regression and PSO algorithm. In: IEEE ICCCT. 2021:396–402.
[17] Rathod SB, Pattewar TM. Content based spam detection in email using bayesian classifier. In: IEEE ICCSP. 2015:1257–1261.
[18] Sreedhar L, Kavya B, Kiran HS, Bhaskar CV. Email spam detection using machine learning algorithms. Network. 2023;52(4).
[19] Srinivasarao U, Sharaff A. Machine intelligence based hybrid classifier for spam detection and sentiment analysis of SMS messages. Multimed Tools Appl. 2023:1–31.
[20] Sun N, Lin G, Qiu J, Rimba P. Near real-time twitter spam detection with machine learning techniques. Int J Comput Appl. 2022;44(4):338–348.
[21] Sultana T, Sapnaz KA, Sana F,Najath J. Email based spam detection.
[22] Wang WB, Yin F, Sun H, Li P. Random forest algorithm for spam filtering based on machine learning. Electron Eng Inf Sci. 2015:225–228.
[23] Qazi A, Hasan N, Mao R, Mohamed Elhag MA, Dey SK, Hardaker G. Machine learning-based opinion spamdetection: A systematic literature review. IEEE Access. 2024.
[24] Iqbal A, Younas M, Iftikhar S, Fatima F, Saleem R. Spam detection using hybrid model on fusion of spammer behavior and linguistics features. Egypt Inform J. 2025;29:100605.
[25] Si S, Wu Y, Tang L, Zhang Y, Wosik J. Evaluating the performance of ChatGPT for spam email detection. arXiv preprint arXiv:2402.15537. 2024.
[26] Popova S, Nenov H, Stoyanova D. Spam detection system based on hybrid scoring. In: IEEE ICAI. 2024:120–125.
[27] Tusher EH, Ismail MA, Raffei AFM. Email spam classification based on deep learning methods: A review. Iraqi J Comput Sci Math. 2025;6(1):2.
[28] Wang Y, Gong C, Ji X, Yuan Q. Text classification for evaluating digital technology adoption maturity based on BERT: An evidence of industrial AI from China. Technol Forecast Soc Change. 2025;211:123903.
[29] Siagian AR, Sugiarto A. Development of spam network in Huta Padang village, Bandar Pasir Mandoge district. J Inf Technol Comput Sci Electr Eng. 2025;2(1):85–93.
[30] Ahmadi M, Khajavi M, Varmaghani A, Ala A, Danesh K, Javaheri D. Leveraging large language models for cybersecurity: Enhancing SMS spam detection with robust and context-aware text classification. arXiv preprint arXiv:2502.11014. 2025.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Mohini Tyagi, Pradeep Kumar Singh, Shivam Kumar Yadav, Sanjay Kumar Soni

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.