Different gait combinations based on multi-modal deep CNN architectures

Yaprak, Büşranur; Gedikli, Eyüp

Gelişmiş Arama

Erişim

info:eu-repo/semantics/openAccess

Tarih

2024

Yazar

Yaprak, Büşranur
Gedikli, Eyüp

Erişim

info:eu-repo/semantics/openAccess

Üst veri

Tüm öğe kaydını göster

Künye

Scopus EXPORT DATE: 02 May 2024 @ARTICLE{Yaprak2024, url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85187673967&doi=10.1007%2fs11042-024-18859-9&partnerID=40&md5=bdb0cb5f246c714354818ce14deec05c}, affiliations = {Department of Software Engineering, Gümüşhane University, Gümüşhane, 29100, Turkey; Department of Software Engineering, Karadeniz Technical University, Trabzon, 61080, Turkey}, correspondence_address = {B. Yaprak; Department of Software Engineering, Gümüşhane University, Gümüşhane, 29100, Turkey; email: busra.kucukugurlu@gumushane.edu.tr}, publisher = {Springer}, issn = {13807501}, coden = {MTAPF}, language = {English}, abbrev_source_title = {Multimedia Tools Appl} }

Özet

Gait recognition is the process of identifying a person from a distance based on their walking patterns. However, the recognition rate drops significantly under cross-view angle and appearance-based variations. In this study, the effectiveness of the most well-known gait representations in solving this problem is investigated based on deep learning. For this purpose, a comprehensive performance evaluation is performed by combining different modalities, including silhouettes, optical flows, and concatenated image of the Gait Energy Image (GEI) head and leg region, with GEI itself. This evaluation is carried out across different multimodal deep convolutional neural network (CNN) architectures, namely fine-tuned EfficientNet-B0, MobileNet-V1, and ConvNeXt-base models. These models are trained separately on GEIs, silhouettes, optical flows, and concatenated image of GEI head and leg regions, and then extracted GEI features are fused in pairs with other extracted modality features to find the most effective gait combination. Experimental results on the two different datasets CASIA-B and Outdoor-Gait show that the concatenated image of GEI head and leg regions significantly increased the recognition rate of the networks compared to other modalities. Moreover, this modality demonstrates greater robustness under varied carrying (BG) and clothing (CL) conditions compared to optical flows (OF) and silhouettes (SF). Codes available at https://github.com/busrakckugurlu/Different-gait-combinations-based-on-multi-modal-deep-CNN-architectures.git © The Author(s) 2024.

Bağlantı

https://link.springer.com/article/10.1007/s11042-024-18859-9
https://hdl.handle.net/20.500.12440/6207

Koleksiyonlar

Scopus İndeksli Yayınlar Koleksiyonu [2037]
WoS İndeksli Yayınlar Koleksiyonu [1814]