The influence of CNN architecture, image size and quality to object detection model on histological specimens

Nina V. Fedosova; Федосова Нина Вениаминовна; Gennadiy N. Berchenko; Берченко Геннадий Николаевич; Olga B. Shugaeva; Шугаева Ольга Борисовна; Dmitriy V. Mashoshin; Машошин Дмитрий Викторович; Mikhail G. Kochan; Кочан Михаил Геннадьевич

doi:10.17816/vto637087

The influence of CNN architecture, image size and quality to object detection model on histological specimens

作者: Fedosova N.V.¹, Berchenko G.N.¹, Shugaeva O.B.¹, Mashoshin D.V.¹, Kochan M.G.¹
隶属关系:
1. Priorov National Medical Research Center of Traumatology and Orthopedics
期: 卷 31, 编号 4 (2024)
页面: 751-758
栏目: SCIENTIFIC REVIEWS
##submission.dateSubmitted##: 15.10.2024
##submission.dateAccepted##: 25.10.2024
##submission.datePublished##: 25.12.2024
URL: https://journals.eco-vector.com/0869-8678/article/view/637087
DOI: https://doi.org/10.17816/vto637087
ID: 637087

如何引用文章

全文:

开放存取

##reader.subscriptionAccessGranted##
受限制的访问

订阅或者付费存取

详细
全文:
作者简介
参考
补充文件
统计

详细

Improving convolutional neural network (CNN) quality for object search in histology scans is a long-standing problem that essentially involves selecting the best CNN architecture and creating a high-quality dataset. The efficacy of object detection algorithms is determined by numerous factors, including image quality, image size, and the search object. The primary aim of this study was to identify published studies on the impact of various image characteristics in a training sample and CNN architecture on the quality of a created model. Literature published in the last 5 years was reviewed, which addressed data pre-processing, methodology, requirements to images included in datasets, image preparation for CNN model development, and architecture selection. At the time of the study, there were no requirements to image size, and there was no data on the ratio of object size to image size for the best model performance. Moreover, the selection of neural network architecture is lacking in transparency and algorithmization. In the majority of cases, researchers recommend architectures that they have developed or used themselves, without explaining the reasons and selection criteria or comparing them to alternative options. All these factors significantly complicate the development of CNN models for medical image processing. This paper presents a brief overview of publications that address image preparation for datasets, as well as a potential approach to CNN architecture selection.

关键词

neural network, mathematical model, artificial intelligence, machine learning

全文:

作者简介

Nina Fedosova

Priorov National Medical Research Center of Traumatology and Orthopedics

Email: hard_sign@mail.ru
ORCID iD: 0000-0002-0829-9188
SPIN 代码: 5380-3194

俄罗斯联邦, 10 Priorova str., 127299 Moscow

Gennadiy Berchenko

Priorov National Medical Research Center of Traumatology and Orthopedics

编辑信件的主要联系方式.
Email: berchenko@cito-bone.ru
ORCID iD: 0000-0002-7920-0552
SPIN 代码: 3367-2493

MD, Dr. Sci. (Medicine), professor

俄罗斯联邦, 10 Priorova str., 127299 Moscow

参考

Hort M, Chen Z, Zhang JM, Harman M, Sarro F. Bias Mitigation for Machine Learning Classifiers: A Comprehensive Survey. ACM Journal on Responsible Computing. 2024;1(2):1–52. doi: 10.1145/3631326
Kamiran F, Toon C. Data preprocessing techniques for classification without discrimination. Knowledge and Information Systems. 2012;33(1):1–33. doi: 10.1007/s10115-011-0463-8
Li T, Chen K-S, Jin M. Analysis and simulation on imaging performance of backward and forward bistatic synthetic aperture radar. Remote Sensing. 2018;10(11):1676. doi: 10.3390/rs10111676
Rius A, Cardellach E, Fabra F, et al. Feasibility of GNSS-R ice sheet altimetry in greenland using TDS-1. Remote Sensing. 2017;9(7):742. doi: 10.3390/rs9070742
Yan J, Liu X, Wang X. Object detection in videos with tubelet proposal networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 2017. Р. 727–735. doi: 10.1109/cvpr.2017.101
Kendall A, Gal Y, Cipolla R. Multi-task learning using uncertainty to weigh losses for scene geometry and semantics. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 2018. Р. 7482–7491. doi: 10.1109/cvpr.2018.00781
Chen W, Daneau S, Mannan F, Heide F. Steady-state nonline- of-sight imaging. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019. Р. 6790–6799. doi: 10.1109/cvpr.2019.00695
Kaneyasu H, Etter SB, Sakai T, Sigrist M. Evolution of the filamentary 3-Kelvin phase in Pb-Ru-Sr2RuO4 Josephson junctions. Physical Review B. 2015;92(13):134515. doi: 10.1103/physrevb.92.134515
Ren S, He K, Girshick R, Sun J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2015;28. doi: 10.1109/tpami.2016.2577031
Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detector. In: Computer Vision–ECCV. Part I. Springer; 2016. Р. 21–37. doi: 10.1007/978-3-319-46448-0_2
Redmon J, Farhadi A. YOLOv3: An incremental improvement. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2017. doi: 10.1109/cvpr.2017.690
Zhang S, Zhu X, Lei Z, et al. S3FD: Single shot scale-invariant face detector. In: IEEE international conference on computer vision. 2017. Р. 192–201. doi: 10.1109/iccv.2017.30
Kumar A, Mital U, Gajera A, Varanasi S, Patra D. Empirical Study of the Impact of Image Quality, Object size, and Occlusion to Object Detection [Internet]. EasyChair Preprint 9786. 2023. Available from: https://easychair.org/publications/preprint/Wf1V
Buyukkinaci M. Fruit images for object detection [Internet]. Available from: https://www.kaggle.com/datasets/mbkinaci/fruit-images-for-objectdetection
Hao Y, Pei H, Lyu Y, et al. Understanding the Impact of Image Quality and Distance of Objects to Object Detection Performance. NYU Multimedia and Visual Computing Lab. 2022. doi: 10.48550/arXiv.2209.08237
Seker E, Talburt JR, Greer ML. Preprocessing to Address Bias in Healthcare Data. Studies in Health Technology and Informatics. 2022;294:327–331. doi: 10.3233/shti220468
Celis LE, Keswani V, Vishnoi N. Data preprocessing to mitigate bias: A maximum entropy-based approach. In: International Conference on Machine Learning, PMLR. 2020. Р. 1349–1359. doi: 10.1201/9781003055129-6

补充文件

附件文件

动作

1. JATS XML

下载

用户名
密码
记住我

忘记您的密码?	注册

用户名
密码
记住我

忘记您的密码?	注册

The influence of CNN architecture, image size and quality to object detection model on histological specimens

全文:

详细

关键词

全文:

作者简介

Nina Fedosova

Gennadiy Berchenko

Olga Shugaeva

Dmitriy Mashoshin

Mikhail Kochan

参考

补充文件