Image-Based User Feature Classification on Social Media Using Machine Learning
Main Article Content
Abstract
This study presents a robust machine learning framework for image-based user feature classification on social media, with a particular emphasis on applications in e-commerce environments. To address the critical challenge of limited labeled data within large-scale and unstructured image repositories, the proposed approach integrates semi-supervised learning techniques to enhance model generalization and data utilization efficiency. Three widely adopted machine learning algorithms, Decision Tree, Random Forest, and Support Vector Machine (SVM), were systematically evaluated using K-fold cross-validation and standard performance metrics, including accuracy, precision, recall, and F1-score. Experimental results demonstrate that the Random Forest model outperforms other approaches, achieving an accuracy of 0.9468 and an F1-score of 0.9456, while maintaining strong computational efficiency and robustness. These findings underscore the effectiveness of ensemble learning methods in handling high-dimensional, imbalanced datasets. The proposed framework offers significant practical implications for user profiling, personalized recommendation systems, and targeted marketing strategies in modern digital ecosystems
Downloads
Article Details
Data Availability Statement
The data used in this study are not publicly available due to privacy and platform restrictions but may be available from the corresponding author upon reasonable request
Section

This work is licensed under a Creative Commons Attribution 4.0 International License.
All articles published in Interdisciplinary Journal of AI, Machine Learning & Data Science (IJAIMLDS) are licensed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0).
This license allows others to share, copy, distribute, and adapt the work, provided that proper credit is given to the original author(s) and the source.
Authors retain copyright and grant Interdisciplinary Journal of AI, Machine Learning & Data Science (IJAIMLDS) the right of first publication.
How to Cite
References
1. Bakhshi, S., Shamma, D. A., & Gilbert, E. (2014). Faces engage us: Photos with faces attract more likes and comments on Instagram. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (pp. 965–974). https://doi.org/10.1145/2556288.2557403 DOI: https://doi.org/10.1145/2556288.2557403
2. Baltrušaitis, T., Ahuja, C., & Morency, L.-P. (2019). Multimodal machine learning: A survey and taxonomy. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(2), 423–443. DOI: https://doi.org/10.1109/TPAMI.2018.2798607
3. Chapelle, O., Schölkopf, B., & Zien, A. (2006). Semi-supervised learning. MIT Press. DOI: https://doi.org/10.7551/mitpress/9780262033589.001.0001
4. Chen, T., Kornblith, S., Norouzi, M., & Hinton, G. (2020). A simple framework for contrastive learning of visual representations. Proceedings of the 37th International Conference on Machine Learning (ICML).
5. DataReportal. (2024). Digital 2024: Vietnam. https://datareportal.com
6. Dosovitskiy, A., Beyer, L., Kolesnikov, A., et al. (2021). An image is worth 16×16 words: Transformers for image recognition at scale. International Conference on Learning Representations (ICLR).
7. Gelli, F., Uricchio, T., Bertini, M., Del Bimbo, A., & Chang, S.-F. (2015). Image popularity prediction in social media using sentiment and context features. Proceedings of the ACM International Conference on Multimedia. DOI: https://doi.org/10.1145/2733373.2806361
8. Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT Press.
9. He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770–778. DOI: https://doi.org/10.1109/CVPR.2016.90
10. Highfield, T., & Leaver, T. (2016). Instagrammatics and digital methods: Studying visual social media. Communication Research and Practice, 2(1), 47–62. https://doi.org/10.1080/22041451.2016.1155332 DOI: https://doi.org/10.1080/22041451.2016.1155332
11. Hu, Y., Manikonda, L., & Kambhampati, S. (2014). What we Instagram: A first analysis of Instagram photo content and user types. In Proceedings of the International AAAI Conference on Web and Social Media (ICWSM). DOI: https://doi.org/10.1609/icwsm.v8i1.14578
12. Jannach, D., Zanker, M., Felfernig, A., & Friedrich, G. (2016). Recommender systems: An introduction. Cambridge University Press.
13. Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems, 25.
14. Oliver, A., Odena, A., Raffel, C., Cubuk, E. D., & Goodfellow, I. (2018). Realistic evaluation of deep semi-supervised learning algorithms. Advances in Neural Information Processing Systems, 31.
15. Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., ... & Sutskever, I. (2021). Learning transferable visual models from natural language supervision. In Proceedings of the International Conference on Machine Learning (ICML).
16. Ricci, F., Rokach, L., & Shapira, B. (2015). Recommender systems handbook (2nd ed.). Springer. DOI: https://doi.org/10.1007/978-1-4899-7637-6
17. Simonyan, K., & Zisserman, A. (2015). Very deep convolutional networks for large-scale image recognition. ICLR.
18. Sohn, K., Berthelot, D., Li, C.-L., Zhang, Z., Carlini, N., Cubuk, E. D., Kurakin, A., & Raffel, C. (2020). FixMatch: Simplifying semi-supervised learning with consistency and confidence. NeurIPS.
19. Tarvainen, A., & Valpola, H. (2017). Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. In Advances in Neural Information Processing Systems (NeurIPS).
20. Wu, L., Sun, P., Hong, R., Wang, Y., & Wang, M. (2022). Multimodal recommendation systems: A survey. ACM Transactions on Information Systems, 40(2), 1–38. DOI: https://doi.org/10.1145/3483611
21. Xie, Q., Luong, M. T., Hovy, E., & Le, Q. V. (2020). Self-training with noisy student improves ImageNet classification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). DOI: https://doi.org/10.1109/CVPR42600.2020.01070
22. Zhang, S., Yao, L., Sun, A., & Tay, Y. (2020). Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys, 52(1), 1–38. DOI: https://doi.org/10.1145/3285029
23. Zhu, X., & Goldberg, A. B. (2009). Introduction to semi-supervised learning. Synthesis Lectures on Artificial Intelligence and Machine Learning, 3(1), 1–130. DOI: https://doi.org/10.1007/978-3-031-01548-9_1