Stacking of machine learning classifiers for bot detection using account level data

Jwala Sharma, Samarjeet Borah

Abstract


Social media is a platform for individuals to connect, share, and create information. Social bots produce automated content and interact with humans; in the process, they learn and mimic humans’ behaviour. This research study addresses the challenge of identifying social media bots (SMB) that can rapidly disseminate information or misinformation on platforms like Twitter. It contributes to the field by reviewing literature to define bot behaviours and exploring advanced machine learning classifiers for effective bot detection using account-level data. The study employed Spearman's rank correlation coefficient to select relevant features for SMB classification, then trained six different machine learning models: decision tree (DT), random forest (RF), logistic regression (LR), support vector machine (SVM), and k-nearest neighbour (KNN). To further improve accuracy, a classifier stacking technique was applied. Key findings revealed that while individual classifiers performed variably, with RF leading at 89% accuracy, the stacked classifier approach outperformed all single-classifier methods with an impressive 90% accuracy rate. The results underscore the potential of combining multiple classifiers to enhance the precision of social media bot detection efforts.

Keywords


Bot detection; Feature selection; Machine learning; Social media; Stacking classifier

Full Text:

PDF


DOI: http://doi.org/10.11591/ijict.v15i2.pp477-487

Refbacks

  • There are currently no refbacks.


Copyright (c) 2026 Jwala Sharma, Samarjeet Borah

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

The International Journal of Informatics and Communication Technology (IJ-ICT)
p-ISSN 2252-8776, e-ISSN 2722-2616
This journal is published by the Intelektual Pustaka Media Utama (IPMU).

Web Analytics View IJICT Stats