Development of intellectual information system for solving typical tasks of text data analysis, processing and synthesis
- Authors: Zakharova О.I.1
-
Affiliations:
- Volga Region State University of Telecommunications and Informatics
- Issue: Vol 31, No 9 (2025)
- Pages: 485-495
- Section: Intelligent systems and technologies
- Published: 15.09.2025
- URL: https://journals.eco-vector.com/1684-6400/article/view/702207
- DOI: https://doi.org/10.17587/it.31.485-495
- ID: 702207
Cite item
Abstract
We consider the architecture of an intelligent information system (hereinafter — IIS), which contains the necessary components for solving typical tasks of analysis, processing and synthesis of text data, as well as means for configuring such components and monitoring their operation. In this case, the main task was to create a modular structure of the system, allowing horizontal scaling by flexible customization of the components and the sequence of their work as part of a single automated data processing pipeline. To describe the system structure we used the "C4" model of three levels: Context, Containers, Components. The 4th level "Code" is not used in this article, as it represents, designing a solution for module level abstractions. As an example for the design we have chosen an information system for collecting data on literary works, processing unstructured texts of such documents, loading the information into the analytical storage and analytical constructions based on the available data. To test the design solutions, a service catalog for data extraction, transformation and loading pipelines, a catalog of interaction contracts between services and a project structure for a containerization and multi-container management tool — Docker Compose — were developed. Apache Airflow and its REST API for integration with the IIS configuration component was used as an orchestration solution. To configure and optimize the system settings, a diagram of components of the module of own development in Python using the Django framework is given, as well as a description of the logic of its components. In conclusion, the advantage of the used approach over the implementation of ready-made software products with a set of necessary functions is described.
About the authors
О. I. Zakharova
Volga Region State University of Telecommunications and Informatics
Author for correspondence.
Email: o.zaharova@psuti.ru
Ph.D., Associate Professor, Deputy Head of Research Laboratory, Research Laboratory of Artificial Intelligence
Russian Federation, SamaraReferences
- Ivanov V. K., Palyukh B. V., Sotnikov A. N. Architecture of intellectual system of information support of innovations in science and education, Programmnye produkty i sistemy, 2013, no. 4, pp. 197—202 (in Russian).
- Martin R. Pure Architecture. The Art of Software Development, SPb, Piter, 2020, pp. 26—33 (in Russian).
- Barseghyan A. A. Data and Process Analysis: textbook, SPb., BHV-Peterburg, 2018, pp. 36—41 (in Russian).
- Berson A., Dubov L. Master data management, M., Publishing House "Noosphere", 2017, pp. 103—105 (in Russian).
- Harenslak B., de Ruyter J. Apache Airflow and data processing pipelines, Moscow, DMK Press, 2021, pp. 346—347 (in Russian).
- Schoenig G.-J. PostgreSQL 11. Mastery of development, Moscow, DMK Press, 2019, pp. 25—26 (in Russian).
- Osipov D. L. Technologies of database design, Moscow, DMK Press, 2019, pp. 467—468 (in Russian).
- Anderson K. Analytical Culture. From data collection to business results, Moscow, Mann, Ivanov & Ferber, 2017, pp. 81 (in Russian).
- Forcier J., Bissex P., Chan W. Django. Development of Web Applications in Python, SPb., Symbol Plus, 2009, pp. 170—172 (in Russian).
- Arsky Y. M., Finn W. K. Principles of the intelligent systems design, ITiVS, 2008, no. 4, pp. 4—37 (in Russian).
- Evteeva E. V. Features and signs of information systems intelligence, Vestnik VUiT, 2010, no. 15, available at: https://cyberleninka.ru/article/n/osobennosti-i-priznaki-intellektualnostiinformatsionnyh-sistem (date of reference: 01.11.2024) (in Russian).
- Vorobyeva I. A., Lazareva M. V. System and activity approaches in the design of intellectual learning system, Gumanitarnye issledovaniya Central’noj Rossii, 2024, no. 2 (31), available at: https://cyberleninka.ru/article/n/sistemnyy-i-deyatelnostnyy-podhody-pri-proektirovanii-intellektualnoy-obuchayuschey-sistemy (date of reference: 01.11.2024) (in Russian).
- Densmore J. Data Conveyors. Pocket guide, Astana, ALIST, 2024, pp. 27—30 (in Russian).
- Grant S. Ingersoll, Thomas S. Morton, Andrew L. Farris Unstructured Text Processing. Retrieval, organization, and manipulation, Moscow, DMK Press, 2015, pp. 178—179 (in Russian).
- Bolshakova E. I., Vorontsov K. V., Efremova N. E., Klyshinsky E. S., Lukashevich N. V., Sapin A. S. Automatic natural language text processing and data analysis, Moscow, Publishing house of NIU HSE, 2017, pp. 147—148 (in Russian).
- Pivotto J., Brasil B. Launch Prometheus, Moscow, Books.kz, 2023, pp. 56—57 (in Russian).
Supplementary files


