Infokommunikacionnye tehnologii

Инфокоммуникационные технологии

2073-3909

Povolzhskiy State University of Telecommunications and Informatics

689815

10.18469/ikt.2024.22.1.01

Theoretical foundations of technologies for transmitting and processing information and signals

Research Article

Suppression of artifacts in video stream frames during FPV control of unmanned systems

Berezkin

A. A.

Березкин

А. А.

Russian Federation

Associate Professor, Department of Software Engineering and Computer Science, PhD in Technical Sciences

berezkin.aa@sut.ru

Chenskiy

A. A.

Ченский

А. А.

Russian Federation

Master’s Degree Student, Department of Software Engineering and Computer Science

chenskii.aa@sut.ru

Kirichek

R. V.

Киричек

Р. В.

Russian Federation

Rector, Professor, Department of Software Engineering and Computer Science, Doctor of Technical Sciences

kirichek@sut.ru

The Bonch-Bruevich Saint Petersburg State University of Telecommunications

09.03.2025

22(1)

7–17

23.08.2025

23.08.2025

2025

Berezkin A.A., Chenskiy A.A., Kirichek R.V.

Березкин А.А., Ченский А.А., Киричек Р.В.

https://creativecommons.org/licenses/by-nc-nd/4.0

https://journals.eco-vector.com/2073-3909/article/view/689815

Using packet networks for first-person view (FPV) control of unmanned systems raises the problem of large volumes of transmitted data. Video stream frames account for most of the traffic during FPV control. To use the communication network between the unmanned systems and the external pilot station more efficiently, the video stream frames therefore have to be compressed. Variational autoencoders can provide a high degree of compression, but using them for frame compression produces specific artifacts in the frames. This article proposes methods for suppressing the artifacts that arise when a neural network decoder restores frames from the latent space, together with an empirical scale for assessing autoencoder artifacts. The proposed approach consists of preparing the pixel data of a video stream frame for encoding and restoring them after decoding. Experiments show that one of the proposed methods eliminates the vast majority of artifacts without introducing significant distortions into the reconstructed frames.

neural network; artifact suppression; frame artifacts; video stream transmission; variational autoencoder; neural codec; FPV control; first-person view control; unmanned systems; UAV; unmanned aerial vehicles
