Enhancing Natural Disaster Monitoring: A Deep Learning Approach to Social Media Analysis Using Indonesian BERT Variants
Downloads
Social media has become a primary source of real-time information that can be leveraged by artificial intelligence to identify relevant messages, thereby enhancing disaster management. The rapid dissemination of disaster-related information through social media allows authorities to respond to emergencies more effectively. However, filtering and accurately categorizing these messages remains a challenge due to the vast amount of unstructured data that must be processed efficiently. This study compares the performance of IndoRoBERTa, IndoRoBERTa MLM, IndoDistilBERT, and IndoDistilBERT MLM in classifying social media messages about natural disasters into three categories: eyewitness, non-eyewitness, and don’t know. Additionally, this study analyzes the impact of batch size on model performance to determine the optimal batch size for each type of disaster dataset. The dataset used in this study consists of 1000 messages per category related to natural disasters in the Indonesian language, ensuring sufficient data diversity. The results show that IndoDistilBERT achieved the highest accuracy of 81.22%, followed by IndoDistilBERT MLM at 80.83%, IndoRoBERTa at 79.17%, and IndoRoBERTa MLM at 78.72%. Compared to previous studies, this study demonstrates a significant improvement in classification accuracy and model efficiency, making it more reliable for real-world disaster monitoring. Pre-training with MLM enhances IndoRoBERTa’s sensitivity and IndoDistilBERT’s specificity, allowing both models to better understand context and optimize classification results. Additionally, this study identifies the optimal batch sizes for each disaster dataset: 32 for floods, 128 for earthquakes, and 256 for forest fires, contributing to improved model performance. These findings confirm that this approach significantly improves classification accuracy, supporting the development of machine learning-based early warning systems for disaster management. This study highlights the potential for further model optimization to enhance real-time disaster response and improve public safety measures more effectively and efficiently.
Copyright (c) 2025 Karlina Elreine Fitriani, Mohammad Reza Faisal, Muhammad Itqan Mazdadi, Fatma Indriani, Dodon Turianto Nugrahadi, Septyan Eka Prastya (Author)

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-ShareAlikel 4.0 International (CC BY-SA 4.0) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).