Comparison of random forest and SVM methods in sentiment analysis about electric cars in Indonesia

Penulis

  • Indra Pratistha Institut Bisnis dan Teknologi Indonesia, Indonesia
  • Adi Panca Saputra Iskandar Institut Bisnis dan Teknologi Indonesia, Indonesia
  • Eugenius Gene Rangga Lanang Institut Bisnis dan Teknologi Indonesia, Indonesia
  • Ni Wayan Jeri Kusuma Dewi Institut Bisnis dan Teknologi Indonesia, Indonesia

DOI:

https://doi.org/10.35335/mandiri.v14i1.424

Kata Kunci:

Electric Vehicles, Random Forest Method, Sentiment Analysis, SVM Method

Abstrak

This study examined public sentiment toward electric vehicles (EVs) in Indonesia, where the adoption of EVs reached 28,188 registered units in 2023. The research analyzed user-generated content from the social media platform X (formerly known as Twitter), collecting 1,507 tweets that underwent preprocessing, including text normalization and sentiment labeling. Two machine learning models, Random Forest and Support Vector Machine (SVM), were implemented to classify the tweets into positive and negative sentiments. Each model was evaluated under three experimental scenarios with varying training dataset sizes. The results indicated that the SVM model achieved the best performance in the third scenario, with an accuracy of 81.3%, precision of 88%, and recall of 91%. In comparison, Random Forest achieved its highest results in the same scenario, with an accuracy of 77%, precision of 91%, and recall of 81%. These findings demonstrated that SVM outperformed Random Forest in terms of overall balance between accuracy and recall, making it the more effective model for sentiment classification in this context.

Referensi

Ahsan, M. M. (2021). Effect of Data Scaling Methods on Machine Learning Algorithms and Model Performance. Technologies, 9(3). https://doi.org/10.3390/technologies9030052

Alabi, R. O. (2021). Machine learning in oral squamous cell carcinoma: Current status, clinical concerns and prospects for future—A systematic review. In Artificial Intelligence in Medicine (Vol. 115). https://doi.org/10.1016/j.artmed.2021.102060

Alzamzami, F. (2020). Light Gradient Boosting Machine for General Sentiment Classification on Short Texts: A Comparative Evaluation. IEEE Access, 8, 101840–101858. https://doi.org/10.1109/ACCESS.2020.2997330

Amjad, M. (2022). Prediction of Pile Bearing Capacity Using XGBoost Algorithm: Modeling and Performance Evaluation. Applied Sciences Switzerland, 12(4). https://doi.org/10.3390/app12042126

Bressan, T. S. (2020). Evaluation of machine learning methods for lithology classification using geophysical data. Computers and Geosciences, 139. https://doi.org/10.1016/j.cageo.2020.104475

Dutta, A. (2020). An efficient convolutional neural network for coronary heart disease prediction. Expert Systems with Applications, 159. https://doi.org/10.1016/j.eswa.2020.113408

Fang, Z. (2020). Integration of convolutional neural network and conventional machine learning classifiers for landslide susceptibility mapping. Computers and Geosciences, 139. https://doi.org/10.1016/j.cageo.2020.104470

Fitriyani, N. L. (2020). HDPM: An Effective Heart Disease Prediction Model for a Clinical Decision Support System. IEEE Access, 8, 133034–133050. https://doi.org/10.1109/ACCESS.2020.3010511

Ghorbani, R. (2020). Comparing Different Resampling Methods in Predicting Students’ Performance Using Machine Learning Techniques. IEEE Access, 8, 67899–67911. https://doi.org/10.1109/ACCESS.2020.2986809

Guo, Y. (2021). Integrated phenology and climate in rice yields prediction using machine learning methods. Ecological Indicators, 120. https://doi.org/10.1016/j.ecolind.2020.106935

Ishaq, A. (2021). Improving the Prediction of Heart Failure Patients’ Survival Using SMOTE and Effective Data Mining Techniques. IEEE Access, 9, 39707–39716. https://doi.org/10.1109/ACCESS.2021.3064084

Kim, Y. (2020). Gap-filling approaches for eddy covariance methane fluxes: A comparison of three machine learning algorithms and a traditional method with principal component analysis. Global Change Biology, 26(3), 1499–1518. https://doi.org/10.1111/gcb.14845

Li, Y. (2020). Do we really need deep CNN for plant diseases identification? Computers and Electronics in Agriculture, 178. https://doi.org/10.1016/j.compag.2020.105803

Liang, Y. C. (2020). Machine learning-based prediction of air quality. Applied Sciences Switzerland, 10(24), 1–17. https://doi.org/10.3390/app10249151

Liu, L. (2021). Intrusion Detection of Imbalanced Network Traffic Based on Machine Learning and Deep Learning. IEEE Access, 9, 7550–7563. https://doi.org/10.1109/ACCESS.2020.3048198

Mohajane, M. (2021). Application of remote sensing and machine learning algorithms for forest fire mapping in a Mediterranean area. Ecological Indicators, 129. https://doi.org/10.1016/j.ecolind.2021.107869

Nayarisseri, A. (2021). Artificial intelligence, big data and machine learning approaches in precision medicine & drug discovery. Current Drug Targets, 22(6), 631–655. https://doi.org/10.2174/1389450122999210104205732

Olisah, C. C. (2022). Diabetes mellitus prediction and diagnosis from a data preprocessing and machine learning perspective. Computer Methods and Programs in Biomedicine, 220. https://doi.org/10.1016/j.cmpb.2022.106773

Perez-Diaz, J. A. (2020). A Flexible SDN-Based Architecture for Identifying and Mitigating Low-Rate DDoS Attacks Using Machine Learning. IEEE Access, 8, 155859–155872. https://doi.org/10.1109/ACCESS.2020.3019330

Prasad, P. (2020). Application of machine learning techniques in groundwater potential mapping along the west coast of India. Giscience and Remote Sensing, 735–752. https://doi.org/10.1080/15481603.2020.1794104

Rani, P. (2023). Intelligent transportation system for internet of vehicles based vehicular networks for smart cities. Computers and Electrical Engineering, 105. https://doi.org/10.1016/j.compeleceng.2022.108543

Rasyidi, M. A. (2021). Classification of handwritten javanese script using random forest algorithm. Bulletin of Electrical Engineering and Informatics, 10(3), 1308–1315. https://doi.org/10.11591/eei.v10i3.3036

Saleh, H. (2021). OPCNN-FAKE: Optimized Convolutional Neural Network for Fake News Detection. IEEE Access, 9, 129471–129489. https://doi.org/10.1109/ACCESS.2021.3112806

Sarkar, C. (2023). Leaf disease detection using machine learning and deep learning: Review and challenges. In Applied Soft Computing (Vol. 145). https://doi.org/10.1016/j.asoc.2023.110534

Seraphina, Y. A., & Gunawan, P. H. (2025). Sentiment Analysis of Public Responses Regarding The Use of Electric Cars in Indonesia with Support Vector Machine and Random Forest Methods. The Indonesian Journal of Computer Science, 14(1), 510–519. https://doi.org/10.33022/ijcs.v14i1.4649

Shahriar, S. (2021). Prediction of EV charging behavior using machine learning. IEEE Access, 9, 111576–111586. https://doi.org/10.1109/ACCESS.2021.3103119

Singh, V. (2022). A Deep Neural Network for Early Detection and Prediction of Chronic Kidney Disease. Diagnostics, 12(1). https://doi.org/10.3390/diagnostics12010116

Sivaranjani, S. (2021). Diabetes Prediction using Machine Learning Algorithms with Feature Selection and Dimensionality Reduction. In 2021 7th International Conference on Advanced Computing and Communication Systems Icaccs 2021 (pp. 141–146). https://doi.org/10.1109/ICACCS51430.2021.9441935

Sri, G. A., Prameswari, A., Rakib, M., & Aris, V. (2024). Sentiment Analysis Of Electric Vehicles On Twitter Using Machine Learning. INNOVATIVE: Journal Of Social Science Research, 4(4), 837–845.

Tan, L. (2021). Tomato Leaf Diseases Classification Based on Leaf Images: A Comparison between Classical Machine Learning and Deep Learning Methods. Agriengineering, 3(3), 542–558. https://doi.org/10.3390/agriengineering3030035

Tang, J. (2020). Statistical and machine-learning methods for clearance time prediction of road incidents: A methodology review. Analytic Methods in Accident Research, 27. https://doi.org/10.1016/j.amar.2020.100123

Tarigan, V. T., & Yusupa, A. (2024). Perbandingan Algoritma Maching Learning dalam Analisis Sentimen Mobil Listrik di Indonesia pada Media Sosial Twitter/X. Jurnal Informatika Polinema, 10(4), 479–490. https://doi.org/10.33795/jip.v10i4.5130

Tetila, E. C. (2020). Detection and classification of soybean pests using deep learning with UAV images. Computers and Electronics in Agriculture, 179. https://doi.org/10.1016/j.compag.2020.105836

Yazdinejad, A. (2023). An ensemble deep learning model for cyber threat hunting in industrial internet of things. Digital Communications and Networks, 9(1), 101–110. https://doi.org/10.1016/j.dcan.2022.09.008

Diterbitkan

2025-07-15

Cara Mengutip

Pratistha, I., Iskandar, A. P. S., Lanang, E. G. R., & Dewi, N. W. J. K. (2025). Comparison of random forest and SVM methods in sentiment analysis about electric cars in Indonesia. Jurnal Mandiri IT, 14(1), 29–36. https://doi.org/10.35335/mandiri.v14i1.424