Jurnal / Konferensi2025

A Semi-Supervised Machine Learning Framework for Detecting Political Hoaxes Based on Keyword-Guided Clustering

Penulis

Sayfudin, Deris Stiawan, Ferdiansyah, Andri Wijaya, Anto Satriyo Nugroho, Siti Hajar Othman

Dipublikasikan di

International Electronics Symposium (IES)

Abstrak

The spread of political hoaxes poses a significant threat to democratic integrity and social stability. This study proposes an efficient and structured machine learning approach to detect political hoax content. The text data undergoes preprocessing, including normalization, removal of punctuation and numbers, tokenization, and stopword elimination. Numerical representation is generated using Term FrequencyInverse Document Frequency (TF-IDF), followed by clustering using the K-Means algorithm to group the text based on feature similarity. The core contribution of this research is the development of an automatic labeling strategy based on political keywords, applied after the clustering stage. Unlike previous studies that rely on pre-labeled datasets or stop at unsupervised clustering, this approach introduces a contextual labeling process to distinguish between political and non-political hoaxes by identifying keyword occurrences within the clustered data. The automatically labeled data is then classified using three classical machine learning algorithms: Logistic Regression, Support Vector Machine (SVM), and Naive Bayes. The models are evaluated to assess their ability to accurately detect political hoaxes. Among them, SVM demonstrates the highest and most consistent classification performance, indicating its suitability for this task. The proposed integrated pipeline proves effective in addressing the challenge of unlabeled data. This study contributes methodologically to the development of adaptive and efficient hoax detection systems that are applicable across various socio-political contexts.

Tim Penulis

1

Sayfudin

Universitas Sriwijaya

2

Deris Stiawan

Universitas Sriwijaya

3

Ferdiansyah

Universitas Sriwijaya

4

Andri Wijaya

Universitas Sriwijaya

5

Anto Satriyo Nugroho

Universitas Sriwijaya

6

Siti Hajar Othman

Universitas Sriwijaya

Kutip

Sayfudin, Deris Stiawan, Ferdiansyah, Andri Wijaya, Anto Satriyo Nugroho, Siti Hajar Othman (2025). A Semi-Supervised Machine Learning Framework for Detecting Political Hoaxes Based on Keyword-Guided Clustering. International Electronics Symposium (IES).
Logo Unsri

Grup Riset Jaringan Komputer, Keamanan, dan Sistem Terdistribusi. Fakultas Ilmu Komputer, Universitas Sriwijaya.

Kontak

Alamat

Gedung Diploma Komputer, Fakultas Ilmu Komputer, Universitas Sriwijaya, Jl. Srijaya Negara, Bukit Besar, Ilir Barat I, Palembang, Sumatera Selatan, 30128

Afiliasi

Diktisaintek Berdampak
Kemdikbud
Unsri
IEEE
ACM

Pengunjung

Flag Counter

© 2026 COMNETS Research Group. Hak Cipta Dilindungi Undang-Undang.