PCA - Enhanced regression approach for predicting internet use based on formal education

Authors

  • Ivana Petkovski Computer sciences, Mathematical Institute of the Serbian Academy of Sciences and Arts, Belgrade, Serbia
  • Petar Vranić Computer sciences, Mathematical Institute of the Serbian Academy of Sciences and Arts, Belgrade, Serbia

DOI:

https://doi.org/10.31181/jdaic10014062025p

Keywords:

principal component analysis (PCA), correlation, linear regression, formal education, internet users

Abstract

The regular use of digital devices and Internet applications in modern society is significantly influenced by the user's educational attainment. In educational institutions, particularly at the primary and secondary levels, digital technologies (DT) are recognized as essential to the teaching process. Current educational programs involve active use of DT in class and thus improve digital skills necessary for active engagement in the digital world. A hybrid machine learning model was built to analyze the impact of varying educational degrees on Internet usage and innovation investments across the European Union (EU) population. Principal component analysis (PCA) was employed for detecting the primary indicators of education and innovation, resulting in the reduction of the initial variables to two factors, one including four indicators and the other containing two indicators. Linear regression (LR) was applied on the PCA factor loadings derived from the primary factor, which exhibited a statistically significant relationship with the percentage of Internet usage (r = 0.767). The results demonstrate that formal education, supported by investments in innovations within the education system, are essential preconditions for the continued development of the digital society.

Downloads

Download data is not yet available.

References

Abdi, H., & Williams, L. J. (2010). Principal component analysis. Wiley interdisciplinary reviews: computational statistics, 2(4), 433-459.

Afzal, A., Khan, S., Daud, S., Ahmad, Z., & Butt, A. (2023). Addressing the Digital Divide: Access and Use of Technology in Education. Journal of Social Sciences Review, 3(2), 883-895.

Aghion, P., Boustan, L. P., Hoxby, C. M., & Vandenbussche, J. (2019). The contribution of higher education to innovation and productivity: Evidence from U.S. states. Journal of Economic Growth, 24(1), 35-82.

Barro, R. J., & Lee, J. W. (2013). A new data set of educational attainment in the world, 1950–2010. Journal of Development Economics, 104, 184-198.

Bingham, N. H., & Fry, J. M. (2010). Regression: Linear models in statistics. Springer Undergraduate Mathematics Series. London: Springer-Verlag.

Bonaccorsi, A., & Daraio, C. (2007). Universities and strategic knowledge creation: Specialization and performance in Europe. Cheltenham: Edward Elgar Publishing Limited.

Cedefop (2020). Empowering adults through upskilling and reskilling pathways: Vol. 1: adult population with potential for upskilling and reskilling. Luxembourg: Publications Office. Cedefop reference series, No 112.

Duong, C. D., Vu, T. N., & Ngo, T. V. N. (2023). Applying a modified technology acceptance model to explain higher education students’ usage of ChatGPT: A serial multiple mediation model with knowledge sharing as a moderator. The International Journal of Management Education, 21(3), 100883.

European Commission. (2022). Indicator database. https://ec.europa.eu/eurostat/data/database, Accessed 15 July 2024.

Everyone On. (2022). The digital skill and trust. https://static1.squarespace.com/static/5aa8af1fc3c16a54bcbb0415/t/61fc71248a56247e899c2a20/1643933997111/EveryoneOn_Report_2_DigitalSkills_and_Trust.pdf, Accessed 10 May 2025.

Greenacre, M., Groenen, P. J., Hastie, T., d’Enza, A. I., Markos, A. & Tuzhilina, E. (2022). Principal component analysis. Nature Reviews Methods Primers, 2(1), 100.

International Telecommunication Union. (2010). World Telecommunication/ICT Development Report 10: Monitoring the WSIS Targets. https://uis.unesco.org/sites/default/files/documents/monitoring-the-wsis-targets-a-mid-term-review-world-telecommunication-ict-development-report-2010-en.pdf, Accessed 10 May 2025.

International Telecommunication Union. (2021). Digital skills insights. https://academy.itu.int/sites/default/files/media2/file/21-00668_Digital-Skill-Insight-210831_CSD%20Edits%206_Accessible-HD.pdf, Accessed 10 May 2025.

Jamil S. 2021. From digital divide to digital inclusion: Challenges for wide-ranging digitalization in Pakistan. Telecommunications Policy, 45(8), 102206.

Komarudin, K., Suherman, S., & Vidákovich, T. (2024). The RMS teaching model with brainstorming technique and student digital literacy as predictors of mathematical literacy. Heliyon, 10(13), e33877.

Kumar, V. (2024). Digitalization Higher Education in India. Journal of dvanced esearch in ducation, 3(1), 14-17.

Lakhotia, R., Nagesh, C. K., & Madgula, K. (2019). Identifying Missing Component in the Bechdel Test Using Principal Component Analysis Method. arXiv preprint arXiv: 1907.03702.

Lee, J. (2017). Internet Usage Effect on Educational Attainment: Evidence of Benefits (pp. 1-17). California: Milken Institute.

Li, F., Cheng, L., Wang, X., Shen, L., Ma, Y., & Islam, A. Y. M. (2025). The causal relationship between digital literacy and students’ academic achievement: a meta-analysis. Humanities and Social Sciences Communications, 12(1), 1-12.

Li, L., Ma, Y., Friesen, D., Zhang, Z., Jin, S., & Rozelle, S. (2021). The impact of Internet use on adolescent learning outcomes: evidence from rural China. China Agricultural Economic Review, 13(3), 569-592.

Manyika, J., Lund, S., Chui, M., Bughin, J., Woetzel, J., Batra, P., Ko, R., & Sanghvi, S. (2017). Jobs lost, jobs gained: Workforce transitions in a time of automation. McKinsey Global Institute. https://www.mckinsey.com/~/media/McKinsey/Industries/Public%20and%20Social%20Sector/Our%20Insights/What%20the%20future%20of%20work%20will%20mean%20for%20jobs%20skills%20and%20wages/MGI-Jobs-Lost-Jobs-Gained-Executive-summary-December-6-2017.pdf , Accessed 01 May 2025.

OECD. (2019). OECD Skills Outlook 2019 Thriving in a Digital World. Paris: OECD Publishing.

Peng, F., Guo, M., Zheng, C., Wang, S., Wang, X., & Xu, M. (2023). An Assessment Model of Digital Literacy for the Students in Vocational Education Based on Principal Component Analysis in Machine Learning. Proceedings of the 6th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC) (pp. 1382-1386). Chongqing: IEEE.

Petkovski, I., & Vranić, P. (2024). Modeling Education and Internet Usage: PCA and Linear Regression Approaches. Proceedings of the 6h Virtual International Conference Path to a Knowledge Society-Managing Risks and Innovation (PaKSoM) (pp. 71-77). Belgrade: Mathematical Institute SANU.

Romano, M. F., Sardella, M. V., Alboni, F., Russo, L., Mariotti, R., Nicastro, I., Barletta, V., & Di Bello, V. (2015). Is the digital divide an obstacle to e-health? An analysis of the situation in Europe and in Italy. Telemedicine and e-Health, 21(1), 24-35.

Rückert, D., Veugelers, R., & Weiss, C. (2020). The growing digital divide in Europe and the United States (No. 2020/07). EIB Working Papers.

Sarker, I. H. (2021). Machine learning: Algorithms, real-world applications and research directions. SN computer science, 2(3), 160.

Segbenya, M., Bervell, B., Frimpong-Manso, E., Otoo, I. C., Andzie, T. A., & Achina, S. (2023). Artificial intelligence in higher education: Modelling the antecedents of artificial intelligence usage and effects on 21st century employability skills among postgraduate students in Ghana. Computers and Education: Artificial Intelligence, 5, 100188.

Shah, A., & Jimenez-Duran, R. (2023). Broadband Access and Standardized Test Scores: A Causal Analysis. Journal of Student Research, 12(4), 1-10.

Tislenko, M. I. (2024). Digital divide in the European Union: assessing spatial disparities and neighborhood effects. Journal of the Geographical Institute “Jovan Cvijić” SASA, 74(2), 181–194.

Tseng, T. H., Wu, T. Y., Lian, Y. H., & Zhuang, B. K. (2023). Developing a value-based online learning model to predict learners’ reactions to internet entrepreneurship education: The moderating role of platform type. The International Journal of Management Education, 21(3), 100867.

UNESCO Institute for Statistics. (2018). A Global Framework of Reference on Digital Literacy Skills for Indicator 4.4.2. https://uis.unesco.org/sites/default/files/documents/ip51-global-framework-reference-digital-literacy-skills-2018-en.pdf, Accessed 10 May 2025.

Valero, A., & Van Reenen, J. (2019). The economic impact of universities: Evidence from across the globe. Economics of Education Review, 68, 53-67.

van Deursen, A. J. A. M., & van Dijk, J. A. G. M. (2016). Modeling traditional literacy, Internet skills and Internet usage: An empirical study. Interacting with computers, 28(1), 13-26.

van Dijk J. A. G. M. (2013). A theory of the digital divide. In M. Ragnedda, & G.W. Muschert (Eds.), The digital divide: the internet and social inequality in international perspective (pp. 29-51). (Routledge advances in sociology; Vol. 73, No. 73). Routledge.

Wang, Y., Xu, G., Cao, J., Chen, Y., & Wu, J. (2025). Does digital literacy affect farmers’ adoption of agricultural social services? An empirical study based on China Land Economic Survey data. PLoS One, 20(4), e0320318.

Weisberg, S. (2005). Applied linear regression (Vol. 528). New Jersey: John Wiley & Sons.

Wold, S., Esbensen, K., & Geladi, P. (1987). Principal component analysis. Chemometrics and intelligent laboratory systems, 2(1-3), 37-52.

World Data Bank. (2024). Indicator database. https://data.worldbank.org/, Accessed 15 July 2024.

World Economic Forum. (2020). The Future of Jobs Report 2020. https://www3.weforum.org/docs/WEF_Future_of_Jobs_2020.pdf, Accessed 10 May 2025.

Published

14.06.2025

How to Cite

Petkovski, I., & Vranić, P. (2025). PCA - Enhanced regression approach for predicting internet use based on formal education. Journal of Decision Analytics and Intelligent Computing, 5(1), 87–98. https://doi.org/10.31181/jdaic10014062025p