A Hybrid Feature-Enriched IndoBERT Framework for Sentiment Analysis of Ride-Hailing Service Reviews in Indonesia

Authors

  • Puas Triawan Universitas Amikom Purwokerto, Indonesia
  • Imam Tahyudin Universitas Amikom Purwokerto, Indonesia
  • Purwadi Universitas Amikom Purwokerto, Indonesia
Pages Icon

DOI:

https://doi.org/10.63158/journalisi.v8i2.1587

Keywords:

Sentiment Classification, Hybrid Feature Representation, IndoBERT, Indonesian NLP, Ride-Hailing Reviews

Abstract

This study examines sentiment classification for Indonesian ride-hailing user reviews, which often contain informal expressions, ambiguity, and strong contextual dependency. Existing studies commonly rely on either traditional machine learning or transformer-based models, while limited attention has been given to integrating heterogeneous feature representations. To address this gap, this study proposes a feature-level hybrid integration strategy combining TF-IDF and IndoBERT embeddings. This approach enables the model to capture statistical term importance and contextual semantic meaning within a unified representation. A quantitative experimental design was applied to approximately 20,000 reviews collected from Gojek, Grab, and Maxim. Sentiment labels were generated through rating-based mapping and manually validated for consistency. The dataset, which was relatively balanced across positive, neutral, and negative classes, was divided into training and testing sets using an 80:20 split. Model performance was evaluated on the test set using accuracy, precision, recall, and F1-score. The proposed hybrid model achieved the highest accuracy of 93.5%, outperforming IndoBERT (91.8%) and traditional machine learning models (78.4%–87.6%). The results show that feature-level integration improves sentiment classification performance, although neutral sentiment remains challenging due to contextual ambiguity.

Downloads

Download data is not yet available.

References

[1] M. A. Akbar and A. Solichin, “Sentiment Comparison of User Reviews of Ride-Hailing Apps Gojek and Grab Using Multinomial Naïve Bayes Algorithm,” KRESNA: Jurnal Riset dan Pengabdian Masyarakat, vol. 4, pp. 1–11, 2024.

[2] P. Triawan, I. Tahyudin, and P. Purwadi, “Impact of NLP Algorithms on Sentiment Analysis Efficiency and Accuracy,” Journal of Information Systems and Informatics, vol. 7, no. 3, pp. 2684–2709, Sep. 2025, doi: 10.51519/journalisi.v7i3.1222.

[3] S. P. Yuliani, A. A. P. Muharani, R. Q. Fatmawati, and F. Fahmi, “Sentiment Analysis in User Reviews of Gojek Application using Natural Language Processing,” Journal of System and Computer Engineering (JSCE), vol. 6, no. 4, pp. 296–305, Oct. 2025, doi: 10.61628/jsce.v6i4.2062.

[4] H. Jayadianti, W. Kaswidjanti, A. T. Utomo, S. Saifullah, F. A. Dwiyanto, and R. Drezewski, “Sentiment analysis of Indonesian reviews using fine-tuning IndoBERT and R-CNN,” ILKOM Jurnal Ilmiah, vol. 14, no. 3, pp. 348–354, Dec. 2022, doi: 10.33096/ilkom.v14i3.1505.348-354.

[5] V. H. Pranatawijaya, N. N. K. Sari, R. A. Rahman, E. Christian, and S. Geges, “Unveiling User Sentiment: Aspect-Based Analysis and Topic Modeling of Ride-Hailing and Google Play App Reviews,” Journal of Information Systems Engineering and Business Intelligence, vol. 10, no. 3, pp. 328–339, 2024, doi: 10.20473/jisebi.10.3.328-339.

[6] C. H. Lin and U. Nuha, “Sentiment analysis of Indonesian datasets based on a hybrid deep-learning strategy,” J. Big Data, vol. 10, no. 1, Dec. 2023, doi: 10.1186/s40537-023-00782-9.

[7] R. F. Ananda, A. Syahri, and F. N. Hasan, “Sentiment Analysis of Customer Satisfaction In Gojek And Grab Application Reviews Using The Naive Bayes Algorithm,” Jurnal Teknik Informatika (Jutif), vol. 5, no. 1, pp. 233–241, Feb. 2024, doi: 10.52436/1.jutif.2024.5.1.1680.

[8] V. Arifin, Y. A. Putri, and R. Wiputra, “A dataset of subjectivity classification in Indonesian ride-hailing app reviews,” Data Brief, vol. 64, Feb. 2026, doi: 10.1016/j.dib.2025.112348.

[9] V. Atina and P. Srisuk, “Sentiment Analysis of Grab App Reviews with Machine Learning Approach,” International Conference of Health, Science and Technology , Sep. 2024.

[10] N. N. I. Prova, V. Ravi, M. P. Singh, V. K. Srivastava, S. Chippagiri, and A. P. Singh, “Multilingual sentiment analysis in e-commerce customer reviews using GPT and deep learning-based weighted-ensemble model,” International Journal of Cognitive Computing in Engineering, vol. 7, no. 1, pp. 268–286, Dec. 2026, doi: 10.1016/j.ijcce.2025.10.003.

[11] E. C. M. Torres and L. G. de Picado-Santos, “Sentiment Analysis and Topic Modeling in Transportation: A Literature Review,” Jun. 01, 2025, Multidisciplinary Digital Publishing Institute (MDPI). doi: 10.3390/app15126576.

[12] S. Ali, G. Wang, and S. Riaz, “Aspect based sentiment analysis of ridesharing platform reviews for kansei engineering,” IEEE Access, vol. 8, pp. 173186–173196, 2020, doi: 10.1109/ACCESS.2020.3025823.

[13] O. B. J. Putro, A. Jacobus, and F. D. Kambey, “Aspect-Based Sentiment Analysis Product Review Using CNN and Bidirectional LSTM,” Jurnal Teknik Informatika, vol. 20, no. 2, pp. 117–124, May 2025.

[14] N. K. K. Navanigha and R. K, “A Deep Learning Approach to Comparative Sentiment Analysis for Ride-hailing Apps,” International Scientific Journal of Engineering and Management, vol. 4, no. 4, 2025, doi: 10.55041/ISJEM02634.

[15] C. H. P. Panjaitan, “Systematic literature review of sentiment analysis on various review platforms in the tourism sector,” Journal of Advanced Computer Knowledge and Algorithms, vol. 2, no. 1, pp. 12–18, 2025.

[16] A. A. P. Simarmata and T. B. Sasongko, “Sentiment analysis on BRImo application reviews using IndoBERT,” Journal of Applied Informatics and Computing, vol. 9, no. 3, pp. 851–862, 2025.

[17] D. Indra, Ramdaniah, and W. Sukur, “Analysis of Hybrid Learning Sentiment Among Information Systems Students Using the Naïve Bayes Classifier,” Jurnal ELTIKOM, vol. 8, no. 2, pp. 91–99, Dec. 2024, doi: 10.31961/eltikom.v8i2.1144.

[18] P. Kurniawati, R. Y. Fa’rifah, and D. Witarsyah, “Sentiment Analysis of Maxim Online Transportation App Reviews Using Support Vector Machine (SVM) Algorithm,” Building of Informatics, Technology and Science, vol. 5, no. 2, pp. 466–475, Sep. 2023, doi: 10.47065/bits.v5i2.4265.

[19] P. A. Amri, D. M. Suri, and Syuhada, “The Analysis of Ride-Hailing User Characteristics from App Reviews,” Jurnal Siasat Bisnis, vol. 28, no. 2, pp. 241–262, Nov. 2024, doi: 10.20885/jsb.vol28.iss2.art7.

[20] V. H. Pranatawijaya, N. N. K. Sari, R. A. Rahman, E. Christian, and S. Geges, “Unveiling User Sentiment: Aspect-Based Analysis and Topic Modeling of Ride-Hailing and Google Play App Reviews,” Journal of Information Systems Engineering and Business Intelligence, vol. 10, no. 3, pp. 328–339, Oct. 2024, doi: 10.20473/jisebi.10.3.328-339.

[21] D. D. Purwanto, “Empirical Evaluation of IndoBERT and LSTM for Sentiment Analysis of Tourism Reviews: A Data-Driven Study on Kenjeran Park,” Jurnal Teknik Informatika (JUTIF), vol. 7, no. 1, pp. 463–474, Feb. 2026, doi: 10.52436/1.jutif.2026.7.1.4901.

[22] S. Mujilahwati, M. R. Zamroni, and M. Sholihin, “Hybrid Deep Learning Approach for Indonesian Hoax Detection: A Comparative Evaluation with IndoBERT,” International Journal of Advances in Applied Sciences, vol. 15, no. 1, pp. 322–332, 2026.

[23] N. F. Adhim and N. Cahyono, “Optimization of IndoBERT for Sentiment Analysis of FOMO on Social Media Through Fine-Tuning and Hybrid Labeling,” Journal of Applied Informatics and Computing, vol. 9, no. 6, pp. 3786–3797, 2025.

Downloads

Published

2026-05-02

Issue

Section

Articles

How to Cite

[1]
P. Triawan, I. Tahyudin, and Purwadi, “A Hybrid Feature-Enriched IndoBERT Framework for Sentiment Analysis of Ride-Hailing Service Reviews in Indonesia”, journalisi, vol. 8, no. 2, pp. 2550–2569, May 2026, doi: 10.63158/journalisi.v8i2.1587.

Most read articles by the same author(s)