Detection of Negative Content (Hoax) On Microblog Data That Contains Covid-19 Information

Putra Tresna Linge; Alfan Farizki Wicaksono

doi:10.36418/syntax-literate.v7i6.8279

Putra Tresna Linge, Magister Teknologi Informasi, Universitas Indonesia, Jakarta, Indonesia
Alfan Farizki Wicaksono, Magister Teknologi Informasi, Universitas Indonesia, Jakarta, Indonesia

Keywords: Hoax Detection, Twitter, Sentiment Orientation Classification, Machine Learning, Text Analysis

Abstract

Over the past few years, the volume of information being disseminated has increased, especially since the advent of social media. Some of the information in circulation is negative content, or hoaxes, which can have harmful effects such as social division caused by incorrect information. According to the 2018 Kominfo performance report, Twitter is the largest contributor to the spread of hoaxes. To reduce the impact of hoaxes, a method is needed to detect them on Twitter so that preventive action can be taken, such as taking down tweets that are hoaxes. The purpose of this research is to develop a model that can detect negative content (hoaxes) automatically and to examine the correlation between hoax content and sentiment orientation. The result of this study is a machine learning-based model that uses a decision tree algorithm and achieves an accuracy of 97.2%, with a precision of 85.4, a recall of 81.4, and an F1-score of 93. In addition, the analysis shows that tweets identified as hoaxes by the model are dominated by positive sentiment orientation, which accounts for 52.64% of the total data identified as hoaxes.
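
The abstract reports accuracy, precision, recall, and F1-score for the hoax classifier. As a reminder of how these four figures are usually derived from a model's predictions, the following is a minimal sketch in Python. The function name `classification_metrics`, the label strings, and the toy data are illustrative assumptions and are not taken from the paper, which may have computed or averaged its scores differently.

```python
def classification_metrics(y_true, y_pred, positive="hoax"):
    """Return accuracy, precision, recall and F1 for one positive class."""
    pairs = list(zip(y_true, y_pred))
    # Confusion-matrix counts with `positive` as the class of interest.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn

    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical toy labels (not data from the study).
y_true = ["hoax", "hoax", "hoax", "real", "real", "real", "real", "real"]
y_pred = ["hoax", "hoax", "real", "hoax", "real", "real", "real", "real"]

metrics = classification_metrics(y_true, y_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

On imbalanced data, where hoaxes are a small share of all tweets, accuracy can be high even when precision and recall on the hoax class are noticeably lower, which is why the per-class figures are reported alongside it.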

Journal title	Syntax Literate : Jurnal Ilmiah Indonesia
Initials	JSL
Abbreviation	JSL
Frequency	12 issues per year (monthly)
DOI	prefix 10.36418
Online ISSN	2548-1398
Print ISSN	2541-0849
Editor-in-chief	Aen Fariah
Publisher	CV. Syntax Corporation
Citation Analysis	Google Scholar