Data Augmentation for Hoax Detection through the Method of Convolutional Neural Network in Indonesian News

Authors

Atik Zilziana Muflihati Noor , Rahmat Gernowo , Oky Dwi Nurhayati

DOI:

10.29303/jppipa.v9i7.4214

Published:

2023-07-25

Issue:

Vol. 9 No. 7 (2023): July

Keywords:

CNN, Data augmentation, Detection, EDA technique, Hoax

Research Articles

Downloads

How to Cite

Noor , A. Z. M., Gernowo, R. ., & Nurhayati, O. D. . (2023). Data Augmentation for Hoax Detection through the Method of Convolutional Neural Network in Indonesian News. Jurnal Penelitian Pendidikan IPA, 9(7), 5078–5084. https://doi.org/10.29303/jppipa.v9i7.4214

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

Abstract

The concept of hoax or fake news refers to the intentional spread of false information on social media that aims to confuse and mislead readers to achieve an economic or political agenda. In addition, the increasingly diverse and numerous actors in the field of news writing and dissemination have led to the creation of news articles that need to be recognized whether they are credible or not. Furthermore, hoax can harm the social and political aspects of Indonesian society. Central Connecticut University released a study entitled The World's Most Literate Nations in 2016, where Indonesia ranked 60th out of 61 countries, indicating that Indonesian media literacy still needs to improve in critically evaluating information and distinguishing between fake news and valid news. Based on this description, the research will create the Synonym-Based Data Augmentation for Hoax Detection using the Convolutional Neural Network (CNN ) method and Easy Data Augmentation (EDA). This research resulted in an accuracy of 8,.81, indicating that it can be stated to be accurate in detecting hoax news

References

Albashish, D. (2022). Ensemble of adapted convolutional neural networks (CNN) methods for classifying colon histopathological images. PeerJ Computer Science, 8. https://doi.org/10.7717/peerj-cs.1031

Anton, A., Nissa, N. F., Janiati, A., Cahya, N., & Astuti, P. (2021). Application of Deep Learning Using Convolutional Neural Network (CNN) Method For Women’s Skin Classification. Scientific Journal of Informatics, 8(1). https://doi.org/10.15294/sji.v8i1.26888

Apriliyanto, A., & Kusumaningrum, R. (2020). Hoax Detection in Indonesia Language Using Long Short-Term Memory Model. Sinergi, 24(3), 189. https://doi.org/10.22441/sinergi.2020.3.003

Borisov, V., Leemann, T., Sessler, K., Haug, J., Pawelczyk, M., & Kasneci, G. (2022). Deep Neural Networks and Tabular Data: A Survey. IEEE Transactions on Neural Networks and Learning Systems. https://doi.org/10.1109/TNNLS.2022.3229161

Buhrmester, V., Münch, D., & Arens, M. (2021). Analysis of Explainers of Black Box Deep Neural Networks for Computer Vision: A Survey. Machine Learning and Knowledge Extraction, 3(4). https://doi.org/10.3390/make3040048

Christina Gregory. (2017). Internet Addiction Disorder - Signs, Symptoms, and Treatments. Vertical Health LLC.

Defriani, M., & Jaelani, I. J. (2022). Recognition of Regional Traditional House in Indonesia Using Convolutional Neural Network (CNN) Method. Journal of Computer Networks, Architecture and High Performance Computing, 4(2). https://doi.org/10.47709/cnahpc.v4i2.1562

Ghinadya, & Suyanto, S. (2020). Synonyms-Based Augmentation to Improve Fake News Detection using Bidirectional LSTM. 2020 8th International Conference on Information and Communication Technology, ICoICT 2020. https://doi.org/10.1109/ICoICT49345.2020.9166230

Gusfa, H., & Kadjuand, F. E. D. (2020). Political Agonism for Indonesian Cyberpolitic: Critical Cyberculture to Political Campaign of 2019 Indonesian Presidential Election in Twitter. Nyimak: Journal of Communication, 4(2). https://doi.org/10.31000/nyimak.v4i2.2685

Hao, Z., Li, M., Yang, W., & Li, X. (2022). Evaluation of UAV spraying quality based on 1D-CNN model and wireless multi-sensors system. Information Processing in Agriculture. https://doi.org/10.1016/j.inpa.2022.07.004

Herrero-Diz, P., & Pérez-Escolar, M. (2022). Analysis of Hoaxes about COVID-19 Debunked by Maldita and Colombiacheck: Effects of the Infodemic on the Behavior of Society. Palabra Clave, 25(1). https://doi.org/10.5294/pacla.2022.25.1.7

Jormand, H., Bashirian, S., Barati, M., Khazaei, S., Jenabi, E., & Zareian, S. (2021). A Qualitative Study On People’s Experiences Of Covid-19 Media Literacy. Media Literacy And Academic Research, 4(1). Retrieved from https://www.ceeol.com/search/article-detail?id=946040

Khidhir, S. (2019). Indonesia is too slow! Theaseanpost.

Kurniawan, A. A., & Mustikasari, M. (2021). Implementasi Deep Learning Menggunakan Metode CNN dan LSTM untuk Menentukan Berita Palsu dalam Bahasa Indonesia. Jurnal Informatika Universitas Pamulang, 5(4), 544. https://doi.org/10.32493/informatika.v5i4.6760

Montavon, G., Samek, W., & Müller, K. R. (2018). Methods for interpreting and understanding deep neural networks. In Digital Signal Processing: A Review Journal (Vol. 73). https://doi.org/10.1016/j.dsp.2017.10.011

Mufid, F. L., & Hariandja, T. R. (2019). Efektivitas Pasal 28 Ayat (1) UU ITE tentang Penyebaran Berita Bohong (Hoax). Jurnal Rechtens, 8(2). https://doi.org/10.36835/rechtens.v8i2.533

Nayoga, B. P., Adipradana, R., Suryadi, R., & Suhartono, D. (2021). Hoax Analyzer for Indonesian News Using Deep Learning Models. Procedia Computer Science, 179, 704–712. https://doi.org/10.1016/j.procs.2021.01.059

Putra, O. V., Musthafa, A., Nur, M., & Rido, M. (2021). Classification of Calligraphy Writing Types Using Convolutional Neural Network Method (CNN). Procedia of Engineering and Life Science, 2. https://doi.org/10.21070/pels.v2i0.1136

Salahuddin, Z., Woodruff, H. C., Chatterjee, A., & Lambin, P. (2022). Transparency of deep neural networks for medical image analysis: A review of interpretability methods. In Computers in Biology and Medicine (Vol. 140). https://doi.org/10.1016/j.compbiomed.2021.105111

Santoso, H. A., Rachmawanto, E. H., Nugraha, A., Nugroho, A. A., Setiadi, D. R. I. M., & Basuki, R. S. (2020). Hoax classification and sentiment analysis of Indonesian news using Naive Bayes optimization. Telkomnika (Telecommunication Computing Electronics and Control), 18(2). https://doi.org/10.12928/TELKOMNIKA.V18I2.14744

Semiawan, C. R. (2017). Metode Penelitian Kuantitatif. Grasindo

Shao, X., & Kim, C. S. (2022). Unsupervised Domain Adaptive 1D-CNN for Fault Diagnosis of Bearing. Sensors, 22(11). https://doi.org/10.3390/s22114156

Singh, K., Scholar, R., Mahajan, A., & Mansotra, V. (2021). 1D-CNN based Model for Classification and Analysis of Network Attacks. International Journal of Advanced Computer Science and Applications, 12(11). https://doi.org/10.14569/IJACSA.2021.0121169

Spera, R., & Peña-Guzmán, D. M. (2019). The Anatomy of a Philosophical Hoax: The Politics of Delegitimation in Contemporary Philosophy. Metaphilosophy, 50(1–2). https://doi.org/10.1111/meta.12343

Sugiyono. (2019). Metode Penelitian Kuantitatif, Kualitatif, dan R&D (1st ed.). Penerbit Alfabeta.

Tang, W., Long, G., Liu, L., Zhou, T., Jiang, J., & Blumenstein, M. (2020). Rethinking 1D-CNN for Time Series Classification: A Stronger Baseline. ArXiv. Retrieved from https://www.arxiv-vanity.com/papers/2002.10061/

Utami, P. (2019). Hoax in modern politics: The meaning of hoax in Indonesian politics and democracy. Jurnal Ilmu Sosial Dan Ilmu Politik, 22(2). https://doi.org/10.22146/jsp.34614

Utami, P. (2019b). Hoax in Modern Politics. Jurnal Ilmu Sosial Dan Ilmu Politik, 22(2). https://doi.org/10.22146/jsp.34614

Wei, J. & Zou, K. (2019). EDA:Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 6382–6388. https://doi.org/10.48550/arXiv.1901.11196

Wu, Y. Y., Hu, Y. S., Wang, J., Zang, Y. F., & Zhang, Y. (2022). Toward Precise Localization of Abnormal Brain Activity: 1D CNN on Single Voxel fMRI Time-Series. Frontiers in Computational Neuroscience, 16. https://doi.org/10.3389/fncom.2022.822237

Youssef, A. M., Pradhan, B., Dikshit, A., Al-Katheri, M. M., Matar, S. S., & Mahdi, A. M. (2022). Landslide susceptibility mapping using CNN-1D and 2D deep learning algorithms: comparison of their performance at Asir Region, KSA. Bulletin of Engineering Geology and the Environment, 81(4). https://doi.org/10.1007/s10064-022-02657-4

Author Biographies

Atik Zilziana Muflihati Noor , Universitas Diponegoro

Rahmat Gernowo, Universitas Diponegoro

Oky Dwi Nurhayati, Universitas Diponegoro

License

Copyright (c) 2023 Atik Zilziana Muflihati Noor , Rahmat Gernowo, Oky Dwi Nurhayati

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors who publish with Jurnal Penelitian Pendidikan IPA, agree to the following terms:

  1. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution 4.0 International License (CC-BY License). This license allows authors to use all articles, data sets, graphics, and appendices in data mining applications, search engines, web sites, blogs, and other platforms by providing an appropriate reference. The journal allows the author(s) to hold the copyright without restrictions and will retain publishing rights without restrictions.
  2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in Jurnal Penelitian Pendidikan IPA.
  3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).