Multimodal analysis and prediction of risk-related disclosures in financial reports

Authors

  • Moumita Chatterjee Department of Computer Science & Engineering, Aliah University, Kolkata, India
  • Dhrubasish Sarkar Department of Computer Science & Information Technology, Supreme Institute of Management and Technology, Hooghly, India

DOI:

https://doi.org/10.31181/jdaic10001082025c

Keywords:

risk analysis, financial data mining, machine learning, topic modelling, multimodal approach, human–machine intelligence

Abstract

The rapid changes in the business environment have made it increasingly challenging for experts to accurately analyze and classify risk-related statements in security reports, which often contain large volumes of unstructured information. Over time, several methods have been developed; however, these approaches still encounter difficulties when processing diverse risk expressions. This study utilizes a large dataset of economic and financial statements to explore the relationship between financial risks and the sentiment associated with them. A multimodal approach is proposed, integrating both supervised and unsupervised techniques such as Bag of Words, Term Frequency-Inverse Document Frequency (TF-IDF), Word Embeddings, and topic modeling methods like Latent Dirichlet Allocation (LDA), to develop a model capable of efficiently and accurately predicting a company's risk structure from its security reports. Sentiment analysis is performed on the texts, where negative sentiment is indicative of risk. Various feature sets are then combined, and the resulting model is tested using four classifiers, achieving a highest accuracy of 80.9%. The findings suggest that the model can be effectively developed for risk analysis and identification within financial data and other relevant sectors.

Downloads

Download data is not yet available.

References

Arjun, R., Kuanr, A., & Suprabha, K. R. (2021). Developing banking intelligence in emerging markets: Systematic review and agenda. International Journal of Information Management Data Insights, 1(2), 100026.

Bellstam, G., Bhagat, S., & Cookson, J. A. (2021). A text-based analysis of corporate innovation. Management Science, 67(7), 4004-4031.

Carracedo, P., Puertas, R., & Marti, L. (2021). Research lines on the impact of the COVID-19 pandemic on business. A text mining analysis. Journal of Business Research, 132, 586-593.

Chatterjee, M., & Sarkar, D. (2024). Analyzing and Predicting Risk-Related Statements from Financial Reports. Book of abstracts of International Conference on Recent Advances in Operations Research & Business Analytics (RAORBA-2024) (p. 23). Kolkata: Bharatiya Vidya Bhavan Institute of Management Science.

Das, A. S., Gupta, A., Singh, G., & Subramaniam, L. V. (2016). Mining Qualitative Attributes to Assess Corporate Performance. INFORMS Tutorials in Operations Research, 269-281. https://doi.org/10.1287/educ.2016.0155

De Herve, M. D. G. (2024). Near or distant time horizons? The determinants of the integration of long-term perspectives in disaster risk management evaluation. Progress in Disaster Science, 24, 100365.

Feldman, R. (2013). Techniques and applications for sentiment analysis. Communications of the ACM, 56(4), 82–89.

Fujii, M., Sakaji, H., Masuyama, S., & Sasaki, H. (2022). Extraction and classification of risk-related sentences from securities reports. International Journal of Information Management Data Insights, 2(2), 100096.

Hegde, J., & Rokseth, B. (2020). Applications of machine learning methods for engineering risk assessment – A review. Safety Science, 122, 104492.

International Organization for Standardization. (2018). Occupational health and safety management systems – Requirements with guidance for use (ISO 45001:2018). Geneva: ISO.

Koch, J., Plattfaut, R., & Kregel, I. (2021). Looking for Talent in Times of Crisis – The Impact of the Covid-19 Pandemic on Public Sector Job Openings. International Journal of Information Management Data Insights, 1(2), 100014.

Komazec, N., Janković, K., Mladenović, M., Mijatović, M., & Lapčević, Z. (2024). Ranking of risk using the application of the AHP method in the risk assessment process on the Piraeus-Belgrade-Budapest railway corridor. Journal of Decision Analytics and Intelligent Computing, 4(1), 176–186.

Loughran, T., & McDonald, B. (2011). When is a liability not a liability? textual analysis, dictionaries, and 10-ks. The Journal of Finance, 66(1), 35–65.

Macek, D., & Vitásek, S. (2024). ESG risk analysis and preparedness of companies in the Czech Republic. International Journal of Economic Sciences, 13(2), 38-54.

Malo, P., Sinha, A., Korhonen, P., Wallenius, J., & Takala, P. (2014). Good debt or bad debt: Detecting semantic orientations in economic texts. Journal of the Association for Information Science and Technology, 65(4), 782-796.

Rawat, S., Rawat, A., Kumar, D., & Sabitha, A. S. (2021). Application of machine learning and data visualization techniques for decision support in the insurance sector. International Journal of Information Management Data Insights, 1 (2) (2021), 100012.

Safaeian, M., Moses, R., Ozguven, E. E., & Dulebenets, M. A. (2024). An optimization-based risk management framework with risk interdependence for effective disaster risk reduction. Progress in Disaster Science, 21, 100313

Turgay, S., & Aydin, A. (2025). Improving decision making under uncertainty with data analytics: Bayesian networks, reinforcement learning, and risk perception feedback for disaster management. Journal of Decision Analytics and Intelligent Computing, 5(1), 25–51.

Wang, B., Su, Q., Qiao, H., & Wang, C. (2024). Prevention and adaptation of intermodal interactive seaports and dry ports under asymmetric risk behavior. Computers & Industrial Engineering, 195, 110447

Zhao, L.-T., Guo, S.-Q., & Wang, Y. (2019). Oil market risk factor identification based on text mining technology. Energy Procedia, 158, 3589-3595.

Published

01.08.2025

How to Cite

Chatterjee, M., & Sarkar, D. (2025). Multimodal analysis and prediction of risk-related disclosures in financial reports. Journal of Decision Analytics and Intelligent Computing, 5(1), 159–166. https://doi.org/10.31181/jdaic10001082025c