On the Problem of Applicability of Synthetic Data in Testing Intelligent Transport Systems

Cover Page

Cite item

Full Text

Open Access Open Access
Restricted Access Access granted
Restricted Access Subscription or Fee Access

Abstract

Intelligent Transport Systems (ITS) are now being implemented to ensure optimal and safe road traffic. Increasingly, these systems use artificial intelligence to obtain characteristics about traffic flows. The number of sensors and transducers is increasing dramatically, resulting in higher loads on ITSs. Therefore, it is necessary to develop distributed monitoring systems with scalability and fault tolerance in mind. However, extensive testing is required before implementation. It is not possible to fully conduct such testing on real data due to various factors. Therefore, this paper proposes a tool for generating synthetic traffic flow data with subject matter specificity. The generation system is designed to be integrated into different systems, which will allow different ITS vendors to use it. This service fulfils the scalability requirements and is close to real data. The study proposes a scalable architecture of intelligent transport subsystem that fulfils the requirements of scalability and fault tolerance. As part of this work, a testbed is assembled and the proposed architecture is tested through the developed service of synthetic traffic flow state data generation.

Full Text

Restricted Access

About the authors

Mikhail G. Gorodnichev

Moscow Technical University of Communications and Informatics

Author for correspondence.
Email: m.g.gorodnichev@mtuci.ru
ORCID iD: 0000-0003-1739-9831
Scopus Author ID: 55836031600
ResearcherId: D-3256-2019

Cand. Sci. (Eng.), Associate Professor, Dean of the Faculty of Information Technology

Russian Federation, Moscow

References

  1. Rabchevsky A.N. Review of methods and systems for generating synthetic training data. Applied Mathematics and Control Sciences. 2023. No. 4. Pp. 6–45. (In Rus.). doi: 10.15593/2499-9873/2023.4.01.
  2. Lundin E., Kvarnström H., Jonsson E.A. Synthetic fraud data generation methodology. In: Information and Communications Security. Deng Robertand Bao, Fengand Zhou Jianyingand, Qing Sihan (eds.). Berlin; Heidelberg: Springer Berlin Heidelberg, 2002. Рp. 265–277. doi: 10.1007/3-540-36159-6_23.
  3. McKenna R., Miklau G., Sheldon D. Winning the NIST Contest: A scalable and general approach to differentially private synthetic data. CoRR. 2021. Art. abs/2108.04978.
  4. Awan J., Cai Z. One step to efficient synthetic data. arXiv. 2020. Art. bs/2006.02397. doi: 10.48550/ARXIV.2006.02397.
  5. Mukherjee M., Khushi M. SMOTE-ENC: A novel SMOTE-based method to generate synthetic data for nominal and continuous features. Applied System Innovation. 2021. Vol. 4. Issue 1. Art. 18. doi: 10.3390/asi4010018.
  6. Bernstein D. Containers and cloud: From LXC to docker to Kubernetes. IEEE Cloud Computing. 2014. No. 3. Pp. 81– 84.
  7. Barletta M., Cinque M., Simone L.D., Corte R.D. Introducing k4.0s: A model for mixed-criticality container orchestration in Industry 4.0. In: IEEE Intl. Conf. on Dependable, Autonomic and Secure Computing, 2022.
  8. Polyantseva K., Gorodnichev M., Moseva M. Ensuring the reliability of a highly loaded vehicle monitoring and traffic control platform. In: Systems of signals generating and processing in the field of on board communications. Moscow, 2023. Pp. 1–8. doi: 10.1109/IEEECONF56737.2023.10092031
  9. Slamnik-Krijestorac N., Yilma G.M., Zarrar Y.F. et al. Multi-domain mech orchestration platform for enhanced back situation awareness. In: IEEE Conference on Computer Communications Workshops, 2021.
  10. Mason K., Vejdan S., Grijalva S. An “On the Fly” framework for efficiently generating synthetic big data sets. CoRR. 2019. Art. abs/1903.06798.
  11. Gesmundo A., Tomeh N. HadoopPerceptron: A toolkit for distributed perceptron training and prediction with MapReduce. In: Conference of the European Chapter of the Association for Computational Linguistics. 2012.

Supplementary files

Supplementary Files
Action
1. JATS XML
2. Fig. 1. Scenarios of interaction with the system

Download (148KB)
3. Fig. 2. Organization of system components

Download (173KB)
4. Fig. 3. Database Datalogical Scheme

Download (107KB)
5. Fig. 4. Class diagram of domain models

Download (171KB)
6. Fig. 5. UML diagram of classes

Download (125KB)
7. Fig. 6. Diagram of the sequence of adding a service by a user

Download (117KB)
8. Fig. 7. Sequence of creating a table and sending a new metric

Download (195KB)
9. Fig. 8. Load distribution

Download (60KB)