From the unknown to transparency: a review of information technologies in explainable AI

Cover Page

Cite item

Full Text

Open Access Open Access
Restricted Access Access granted
Restricted Access Subscription or Fee Access

Abstract

With the rapid advancement of artificial intelligence, and deep learning in particular, models have emerged that are capable of delivering highly accurate predictions. However, the internal logic of such models remains difficult to interpret—an issue of critical importance, especially in domains where the correctness of an algorithm directly affects high-stakes decision-making. One promising avenue for addressing this challenge is Explainable Artificial Intelligence (XAI), which focuses on developing approaches that clarify model behavior and provide transparent reasoning behind the results obtained. This work examines theoretical foundations of XAI, with particular attention to the classification of methods and the challenges posed by the "black box" nature of machine learning models. The review highlights the necessity of advancing new XAI techniques, outlines potential ways to reconcile high predictive accuracy with sufficient interpretability, and lays the groundwork for further research in this field.

Full Text

Restricted Access

About the authors

S. M. Avdoshin

HSE University — National Research University Higher School of Economics

Author for correspondence.
Email: savdoshin@hse.ru

Cand. Tech. Sc., Professor

Russian Federation, Moscow

E. Y. Pesotskaya

HSE University — National Research University Higher School of Economics

Email: epesotskaya@hse.ru

Cand. Econ. Sc., Ass. Professor

Russian Federation, Moscow

References

  1. Samek W., Wiegand T., Müller K.-R. Explainable artificial intelligence: Understanding, visualizing and interpreting deep learning models, ITU Journal: ICT Discoveries, 2018, vol. 1, no. 1, pp. 39—48.
  2. Barredo Arrieta A., Díaz-Rodríguez N., Del Ser J. et al. Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI, Information Fusion, 2020, vol. 58, pp. 82—115.
  3. Guidotti R., Monreale A., Ruggieri S. et al. A survey of methods for explaining black box models, ACM Computing Surveys, 2018, vol. 51, no. 5, art. 93.
  4. Gunning D., Stefik M., Choi J. et al. DARPA’s explainable AI (XAI) program: A retrospective, Applied AI Letters, 2021, vol. 2, no. 4. e61.
  5. Adadi A., Berrada M. Peeking inside the black-box: A survey on explainable artificial intelligence (XAI), IEEE Access, 2018, vol. 6, pp. 52138—52160.
  6. Murdoch W. J., Singh C., Kumbier K. et al. Definitions, methods, and applications in interpretable machine learning, Proceedings of the National Academy of Sciences, 2019, vol. 116, no. 44, pp. 22071—22080.
  7. Doshi-Velez F., Kim B. Towards a rigorous science of interpretable machine learning, arXiv:1702.08608, 2017.
  8. Tjoa E., Guan C. A survey on explainable artificial intelligence (XAI): Toward medical XAI, IEEE Transactions on Neural Networks and Learning Systems. 2021, vol. 32, no. 11, pp. 4793—4813.
  9. Lipton Z. C. The mythos of model interpretability, arXiv: 1606.03490, 2016.
  10. Tabassi E. (ed.). Artificial intelligence risk management framework (AI RMF 1.0), National Institute of Standards and Technology (NIST), 2023.
  11. Mylrea M., Robinson N. Artificial intelligence (AI) trust framework and maturity model, Entropy, 2023, vol. 25, no. 10, art. 1429.
  12. Molnar C. Interpretable machine learning: A guide for making black box models explainable, 2022, 317 p., Self-published, free PDF version, available at: https://originalstatic.aminer.cn/misc/pdf/Molnar-interpretable-machine-learning_compressed.pdf (accessed: 5 Feb 2026).
  13. Ghosh D., Dhingra D., Hassija V. et al. A review on trustworthy and explainable artificial intelligence (XAI), IEEE Access, 2023, vol. 11, pp. 78994—79015.
  14. Phillips P. J., Hahn C. A., Fontana P. C. et al. Four principles of explainable artificial intelligence, National Institute of Standards and Technology, Internal Report 8312, 2021, 37 p., doi: 10.6028/NIST.IR.8312.
  15. Gregor K., LeCun Y. Learning fast approximations of sparse coding, Proceedings of ICML, 2010, pp. 1—8.
  16. Springenberg J. T., Dosovitskiy A., Brox T., Riedmiller M. Striving for simplicity: The all convolutional net, arXiv:1412.6806, 2015.
  17. Smilkov D., Thorat N., Kim B. et al. SmoothGrad: Removing noise by adding noise, arXiv:1706.03825, 2017.
  18. Zhou B., Khosla A., Lapedriza A. et al. Learning deep features for discriminative localization, Proceedings of CVPR, 2016, pp. 2921—2929.
  19. Selvaraju R. R., Cogswell M., Das A. et al. Grad-CAM: Visual explanations from deep networks via gradient-based localization, Proceedings of ICCV, 2017, pp. 618—626.
  20. Ribeiro M. T., Singh S., Guestrin C. ‘Why should I trust you?’: Explaining the predictions of any classifier, Proceedings of KDD, 2016, pp. 1135—1144.
  21. Guo W., Mu D., Zhang X. et al. LEMNA: Explaining deep learning based security applications, Proceedings of CCS, 2018, pp. 364—379.
  22. Lundberg S. M., Lee S.-I. A unified approach to interpreting model predictions, Advances in Neural Information Processing Systems, 2017, vol. 30, pp. 4765—4774.
  23. Ozturkkal B., Wahlström R. R. Explaining mortgage defaults using SHAP and LASSO, Computational Economics, 2024 (online first).
  24. Kapranov V. V., Tugaenko V. Yu. A machine learning approach for transmitted power estimation in power beaming systems, Informacionnye Tehnologii, 2023, vol. 29, no. 3, pp. 126—135.
  25. Mishra P. Practical Explainable AI Using Python, Moscow, DMK Press, 2022, 298 p.
  26. Abdul A., Vermeulen J., Wang D. et al. Trends and trajectories for explainable, accountable and intelligible systems, Proceedings of CHI, 2018, Paper 582, 18 p.
  27. European Data Protection Supervisor. EDPS TechDispatch 2/2023 — Explainable artificial intelligence, Luxembourg, Publications Office of the European Union, 2023.
  28. Hamon R., Junklewitz H., Sanchez I. Robustness and explainability of artificial intelligence: From technical to policy solutions, Luxembourg,: Publications Office of the European Union, 2020.
  29. Avdoshin S. M., Pesotskaya E. Yu., Patrushev K. A. Trusted artificial intelligence technologies, Informacionnye Tehnologii, 2024, vol. 30, no. 8, pp. 400—410.

Supplementary files

Supplementary Files
Action
1. JATS XML
2. Fig. 1. XAI: generalized classification

Download (96KB)
3. Fig. 2. High-level scheme of XAI trade-offs

Download (48KB)
4. Fig. 3. Detailed diagram of the trade-offs of explainable AI models

Download (53KB)
5. Fig. 4. Main directions of explainable AI development

Download (64KB)

Copyright (c) 2026 Informacionnye Tehnologii



СМИ зарегистрировано Федеральной службой по надзору в сфере связи, информационных технологий и массовых коммуникаций (Роскомнадзор).
Регистрационный номер и дата принятия решения о регистрации СМИ: серия ПИ № 77 - 15565 от 02 июня 2003 г.