Detecting Fake Social Media Profiles Using the Majority Voting Approach

Authors

  • Dharmaraj R Patil R.C. Patel Institute of Technology
  • Tareek M Pattewar Vishwakarma University image/svg+xml
  • Vipul D Punjabi R.C. Patel Institute of Technology
  • Shailendra M Pardeshi R.C. Patel Institute of Technology

DOI:

https://doi.org/10.4108/eetsis.4264

Keywords:

Fake social media profiles, social media security, majority voting approach,, machine learning, user behaviour analysis

Abstract

INTRODUCTION: The rise of social media platforms has brought about a concerning surge in the creation of fraudulent user profiles, with intentions ranging from spreading false information and perpetrating fraud to engaging in cyberbullying. The detection of these deceptive profiles has emerged as a critical imperative to safeguard the trustworthiness and security of online communities.
OBJECTIVES: This paper focused on the detection and identification of fake social media profiles.
METHODS: This paper introduces an innovative approach for discerning and categorizing counterfeit social media profiles by leveraging the majority voting approach. The proposed methodology integrates a range of machine learning algorithms, including Decision Trees, XGBoost, Random Forest, Extra Trees, Logistic Regression, AdaBoost and K-Nearest Neighbors each tailored to capture distinct facets of user behavior and profile attributes. This amalgamation of diverse methods results in an ensemble of classifiers, which are subsequently subjected to a majority voting mechanism to render a conclusive judgment regarding the legitimacy of a given social media profile.
RESULTS: We conducted thorough experiments using a dataset containing both legitimate and fake social media profiles to determine the efficiency of our methodology. Our findings substantiate that the Majority Voting Technique surpasses individual classifiers, attaining an accuracy rate of 99.12%, a precision rate of 99.12%, a recall rate of 99.12%, and an F1-score of 99.12%.
CONCLUSION: The results show that the majority vote method is reliable for detecting and recognising fake social media profiles.

Author Biographies

Dharmaraj R Patil, R.C. Patel Institute of Technology

Department of Computer Engineering, R.C. Patel Institute of Technology, Shirpur, India

Tareek M Pattewar, Vishwakarma University

Department of Computer Engineering, Vishwakarma University, Pune, India

Vipul D Punjabi, R.C. Patel Institute of Technology

Department of Computer Engineering, R.C.Patel Institute of Technology, Shirpur, India

Shailendra M Pardeshi, R.C. Patel Institute of Technology

Department of Computer Science & Engineering (Data Science), R.C.Patel Institute of Technology, Shirpur, India

References

Facebook: fake account removal as of Q4 2022, Published by Stacy Jo Dixon, March 9, 2023. Available online, https://www.statista.com/statistics/1013474/facebookfake-account-removal-quarter.

Twitter Inc. (2013-2020). Twitter annual reports. Twitter Inc. Available online, https://www.annualreports.com/Company/twitterinc.

Snap Inc. Snap annual report 2020. Available online, https://s25.q4cdn.com/442043304/files/doc_presentations/presentation/2021/Snap-Inc.-2020-Annual-Report.pdf.

LinkedIn. Annual report 2015. Available online, https://news.linkedin.com/2016/linkedin-announcesfourth-quarter-and-full-year-2015-results.

Ramalingam D, Chinnaiah V. Fake profile detection techniques in large-scale online social networks: A comprehensive review. Computers & Electrical Engineering. 2018 Jan 1;65:165-77.

Ezarfelix J, Jeffrey N, Sari N. A Systematic Literature Review: Instagram Fake Account Detection Based on Machine Learning. Engineering, MAthematics and Computer Science (EMACS) Journal. 2022 Feb 5;4(1):25-31.

Kaubiyal J, Jain AK. A feature based approach to detect fake profiles in Twitter. InProceedings of the 3rd international conference on big data and internet of things 2019 Aug 22 (pp. 135-139).

Mohammadrezaei M, Shiri ME, Rahmani AM. Identifying fake accounts on social networks based on graph analysis and classification algorithms. Security and Communication Networks. 2018 Aug 5;2018.

Joshi UD, Vanshika, Singh AP, Pahuja TR, Naval S, Singal G. Fake social media profile detection. Machine Learning Algorithms and Applications. 2021 Aug 19:193-209.

Ajesh F, Aswathy SU, Philip FM, Jeyakrishnan V. A hybrid method for fake profile detection in social networkusing artificial intelligence. Security Issues and Privacy Concerns in Industry 4.0 Applications. 2021 Jun 15:89-112.

Khaled S, El-Tazi N, Mokhtar HM. Detecting fake accounts on social media. In2018 IEEE international conference on big data (big data) 2018 Dec 10 (pp. 3672-3681). IEEE.

Xiao C, Freeman DM, Hwa T. Detecting clusters of fake accounts in online social networks. In Proceedings of the 8th ACM Workshop on Artificial Intelligence and Security 2015 Oct 16 (pp. 91-101).

Kodati S, Reddy KP, Mekala S, Murthy PS, Reddy PC. Detection of Fake Profiles on Twitter Using Hybrid SVM Algorithm. In E3SWeb of Conferences 2021 (Vol. 309, p.01046). EDP Sciences.

Sahoo SR, Gupta BB. Fake profile detection in multimedia big data on online social networks. International Journal of Information and Computer Security. 2020;12(2-3):303-31.

Akyon FC, Kalfaoglu ME. Instagram fake and automated account detection. In2019 Innovations in intelligent systems and applications conference (ASYU) 2019 Oct 31 (pp. 1-7). IEEE.

Bordbar J, Mohammadrezaie M, Ardalan S, Shiri ME. Detecting fake accounts through Generative Adversarial Network in online social media. arXiv preprint arXiv:2210.15657. 2022 Oct 25.

Samreen S, Joshna A, Neelima B, Varsha C, Nikitha G. ARTIFICIAL NEURAL NETWORKS FOR FAKE ACCOUNT DETECTION FROM SOCIAL MEDIA. Turkish Journal of Computer and Mathematics Education (TURCOMAT). 2023 Jul 6;14(03):53-64.

Ali AK, Abdullah AM. Fake accounts detection on social media using stack ensemble system. International Journal of Electrical and Computer Engineering (IJECE). 2022 Jun 1;12(3):3013-22.

Muñoz SD, Pinto EP. A dataset for the detection of fake profiles on social networking services. In 2020 International Conference on Computational Science and Computational Intelligence (CSCI) 2020 Dec 16 (pp. 230-237). IEEE.

Rostami RR, Karbasi S. Detecting Fake Accounts on Twitter Social Network Using Multi-Objective Hybrid Feature Selection Approach.Webology. 2020 Jun 1;17(1).

Kadam N, Sharma SK. Social media fake profile detection using data mining technique. Journal of Advances in Information Technology Vol. 2022 Oct;13(5).

Al-Qurishi M, Rahman SM, Hossain MS, Almogren A, Alrubaian M, Alamri A, Al-Rakhami M, Gupta BB. An efficient key agreement protocol for Sybil-precaution in online social networks. Future Generation Computer Systems. 2018 Jul 1;84:139-48.

Conti M, Poovendran R, Secchiero M. Fakebook: Detecting fake profiles in on-line social networks. In2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2012 Aug 26 (pp. 1071-1078). IEEE.

Gurajala S, White JS, Hudson B, Matthews JN. Fake Twitter accounts: profile characteristics obtained using an activity-based pattern detection approach. InProceedings of the 2015 international conference on social media & society 2015 Jul 27 (pp. 1-7).

Gurajala S, White JS, Hudson B, Voter BR, Matthews JN. Profile characteristics of fake Twitter accounts. Big Data & Society. 2016 Oct;3(2):2053951716674236.

Meligy AM, Ibrahim HM, Torky MF. Identity verification mechanism for detecting fake profiles in online social networks. Int. J. Comput. Netw. Inf. Secur.(IJCNIS). 2017 Jan 1;9(1):31-9.

Jin L, Takabi H, Joshi JB. Towards active detection of identity clone attacks on online social networks. In Proceedings of the first ACM conference on Data and application security and privacy 2011 Feb 21 (pp. 27-38).

Nagariya HG, Dhanotiya N, Joshi S, Jain S. Identifying Fake Profile in Online Social Network. In ACM Workshop on Advances in Computational Intelligence at ISIC 2021.

Uppada SK, Manasa K, Vidhathri B, Harini R, Sivaselvan B. Novel approaches to fake news and fake account detection in OSNs: user social engagement and visual content centric model. Social Network Analysis and Mining. 2022 Dec;12(1):52.

Du, Jiahua, Jia Rong, Hua Wang, and Yanchun Zhang. Neighbor-aware review helpfulness prediction. Decision Support Systems 148 (2021): 113581.

Du, Jiahua, Liping Zheng, Jiantao He, Jia Rong, Hua Wang, and Yanchun Zhang. An interactive network for end-to-end review helpfulness modeling. Data Science and Engineering 5 (2020): 261-279.

Zhou, Yuzhong, Zhengping Lin, Liang Tu, Junkai Huang, and Zifeng Zhang. Analysis and design of standard knowledge service system based on deep learning. EAI Endorsed Transactions on Scalable Information Systems 10, no. 2 (2022).

Reddy, Shiva Shankar, Mahesh Gadiraju, N. Meghana Preethi, and VVR Maheswara Rao. A Novel Approach for Prediction of Gestational Diabetes based on Clinical Signs and Risk Factors. EAI Endorsed Transactions on Scalable Information Systems 10, no. 3 (2023).

Cresci S, Di Pietro R, Petrocchi M, Spognardi A, Tesconi M. The paradigm-shift of social spambots: Evidence, theories, and tools for the arms race. In Proceedings of the 26th international conference on world wide web companion 2017 Apr 3 (pp. 963-972).

Elyusufi Y, Elyusufi Z, Kbir MH. Social networks fake profiles detection using machine learning algorithms. In Innovations in Smart Cities Applications Edition 3: The Proceedings of the 4th International Conference on Smart City Applications 4 2020 (pp. 30-40). Springer International Publishing.

Maimon O, Rokach L, editors. Data mining and knowledge discovery handbook. New York: Springer; 2005 Sep 1.

Breiman L. Random forests. Machine learning. 2001 Oct;45:5-32.

Stoltzfus JC. Logistic regression: a brief primer. Academic emergency medicine. 2011 Oct;18(10):1099-104.

Chen T, Guestrin C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining 2016 Aug 13 (pp. 785-794).

Geurts P, Ernst D, Wehenkel L. Extremely randomized trees. Machine learning. 2006 Apr;63:3-42.

Peterson LE. K-nearest neighbor. Scholarpedia. 2009 Feb 21;4(2):1883.

Schapire RE. Explaining adaboost. In Empirical Inference: Festschrift in Honor of Vladimir N. Vapnik 2013 Oct 9 (pp. 37-52). Berlin, Heidelberg: Springer Berlin Heidelberg.

Ruta D, Gabrys B. Classifier selection for majority voting. Information fusion. 2005 Mar 1;6(1):63-81.

Sokolova M, Lapalme G. A systematic analysis of performance measures for classification tasks. Information processing & management. 2009 Jul 1;45(4):427-37.

Downloads

Published

14-02-2024

How to Cite

1.
Patil DR, Pattewar TM, Punjabi VD, Pardeshi SM. Detecting Fake Social Media Profiles Using the Majority Voting Approach. EAI Endorsed Scal Inf Syst [Internet]. 2024 Feb. 14 [cited 2024 Dec. 4];11(3). Available from: https://publications.eai.eu/index.php/sis/article/view/4264