Unsupervised Approach for Email Spam Filtering using Data Mining

Mehdi Ebady Manaa; Ahmed J. Obaid; Mohammed Hussein Dosh

doi:10.4108/eai.9-3-2021.168962

Authors

Mehdi Ebady Manaa, University of Babylon
Ahmed J. Obaid, University of Kufa
Mohammed Hussein Dosh, University of Kufa

DOI:

https://doi.org/10.4108/eai.9-3-2021.168962

Keywords:

Spam Emails, Vector Space Model, Data Security, Machine Learning, M-DBSCAN

Abstract

Computer networks are overwhelmed with unwanted emails, known as spam. These emails cause financial damage to companies and harm users' reputations. The increasing volume of spam has created an intense need for robust anti-spam filtering, and this paper designs and implements such a filter using the vector space model and Machine Learning (ML). ML algorithms have been used successfully to detect and filter spam emails that jeopardize network resources and consume bandwidth. The main objective is to apply an unsupervised learning method, Modified Density-Based Spatial Clustering of Applications with Noise (M-DBSCAN), to separate spam from ham emails. During training, N representative points are extracted from each cluster. These points stand in for the objects of the cluster, so that both spherical and non-spherical clusters can be detected. In the online test, the same points are used to detect spam email by means of distance measures. The data set, taken from the Kaggle website, contains many ham and spam emails. The results show an accuracy of 97.848% for M-DBSCAN compared with 95.918% for standard DBSCAN, along with efficient values for false-negative rate, false-positive rate, F-score, and online detection time.
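
The abstract does not say how M-DBSCAN modifies DBSCAN or how the N representative points are selected, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' method. It assumes a plain term-frequency vector space model, cosine distance, standard DBSCAN, and farthest-point sampling to pick representatives; the function names, the toy emails, and the `eps`, `min_pts`, and `n` values are all hypothetical. Deciding which cluster counts as spam or ham is left outside the sketch.

```python
import math
from collections import Counter

def vectorize(text):
    """L2-normalised term-frequency vector (a very small vector space model)."""
    counts = Counter(text.lower().split())
    norm = math.sqrt(sum(c * c for c in counts.values())) or 1.0
    return {term: c / norm for term, c in counts.items()}

def cosine_distance(a, b):
    """1 - cosine similarity for two L2-normalised sparse vectors."""
    return 1.0 - sum(w * b.get(term, 0.0) for term, w in a.items())

def dbscan(points, eps, min_pts):
    """Standard DBSCAN (not M-DBSCAN); returns one label per point, -1 = noise."""
    labels = [None] * len(points)

    def neighbours(i):
        return [j for j in range(len(points))
                if cosine_distance(points[i], points[j]) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbours(i)
        if len(seeds) < min_pts:
            labels[i] = -1              # provisionally noise
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster     # border point: joins, is not expanded
            if labels[j] is not None:
                continue
            labels[j] = cluster
            nbrs = neighbours(j)
            if len(nbrs) >= min_pts:    # core point: keep growing the cluster
                queue.extend(nbrs)
    return labels

def representatives(points, labels, n):
    """Assumed selection rule: up to n spread-out points per cluster."""
    reps = {}
    for c in sorted(set(labels) - {-1}):
        members = [p for p, lab in zip(points, labels) if lab == c]
        chosen = [members[0]]
        while len(chosen) < min(n, len(members)):
            chosen.append(max(
                members,
                key=lambda p: min(cosine_distance(p, q) for q in chosen)))
        reps[c] = chosen
    return reps

def assign(text, reps, eps):
    """Cluster of the nearest representative, or -1 if none lies within eps."""
    v = vectorize(text)
    best_cluster, best_dist = -1, eps
    for c, pts in reps.items():
        for p in pts:
            d = cosine_distance(v, p)
            if d <= best_dist:
                best_cluster, best_dist = c, d
    return best_cluster

# Toy training data (hypothetical, not from the Kaggle data set).
emails = [
    "win free money now",
    "free money win prize now",
    "win free prize money",
    "meeting agenda for monday",
    "monday meeting agenda attached",
    "agenda for monday meeting",
]
vectors = [vectorize(e) for e in emails]
labels = dbscan(vectors, eps=0.5, min_pts=2)
reps = representatives(vectors, labels, n=2)
```

In this sketch the online step compares a new email only against the stored representatives rather than against every training email, which is one plausible reason such points would reduce online detection time. For example, `assign("free prize money", reps, eps=0.5)` looks up the nearest representative and returns its cluster id.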

Published

09-03-2021

Issue

Vol. 8 No. 36 (2021): EAI Endorsed Transactions on Energy Web

Section

Research articles

License

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

This is an open-access article distributed under the terms of the Creative Commons Attribution CC BY 4.0 license, which permits unlimited use, distribution, and reproduction in any medium so long as the original work is properly cited.

How to Cite

Unsupervised Approach for Email Spam Filtering using Data Mining. EAI Endorsed Trans Energy Web [Internet]. 2021 Mar. 9 [cited 2025 Nov. 1];8(36):e3. Available from: https://publications.eai.eu/index.php/ew/article/view/756
