Fine-to-coarse self-attention graph convolutional network for skeleton-based action recognition
| dc.contributor.author | Kilic, Ugur | |
| dc.contributor.author | Karadag, Ozge Oztimur | |
| dc.contributor.author | Ozyer, Gulsah Tumuklu | |
| dc.date.accessioned | 2026-01-24T12:31:07Z | |
| dc.date.available | 2026-01-24T12:31:07Z | |
| dc.date.issued | 2026 | |
| dc.department | Alanya Alaaddin Keykubat Üniversitesi | |
| dc.description.abstract | Skeleton data has become an important modality in action recognition due to its robustness to environmental changes, computational efficiency, compact structure, and privacy-oriented nature. With the rise of deep learning, many methods for action recognition using skeleton data have been developed. Among these methods, spatial-temporal graph convolutional networks (ST-GCNs) have seen growing popularity due to the suitability of skeleton data for graph-based modeling. However, ST-GCN models use fixed graph topologies and fixed-size spatial-temporal convolution kernels. This limits their ability to model coordinated movements of joints in different body regions and long-term spatial-temporal dependencies. To address these limitations, we propose a fine-to-coarse self-attention graph convolutional network (FCSA-GCN). Our approach employs a fine-to-coarse scaling strategy for multi-scale feature extraction. This strategy effectively models both local and global spatial temporal relationships and better represents the interactions among joint groups in different body regions. By integrating a temporal self-attention mechanism (TSA) into the multi-scale feature extraction process, we enhance the model's ability to capture long-term temporal dependencies effectively. Additionally, during training, we employ the dynamic weight averaging (DWA) approach to ensure balanced optimization across the multi-scale feature extraction stages. Comprehensive experiments conducted on the NTU-60, NTU-120, and NW-UCLA datasets demonstrate that FCSA-GCN outperforms state-of-the-art methods. These results highlight that the proposed approach effectively addresses the current challenges in skeleton-based action recognition (SBAR). | |
| dc.description.sponsorship | Scientific and Technological Research Council of Turkey (TUBITAK) [124E309, 123E635]; Ataturk University [FDK-2023-11957]; TUBITAK | |
| dc.description.sponsorship | This study was supported by Scientific and Technological Research Council of Turkey (TUBITAK) under the Grant Numbers 124E309 and 123E635, and by Ataturk University under the BAP project code FDK-2023-11957. The authors thank to TUBITAK and University for their support. | |
| dc.identifier.doi | 10.1016/j.asoc.2025.114268 | |
| dc.identifier.issn | 1568-4946 | |
| dc.identifier.issn | 1872-9681 | |
| dc.identifier.scopus | 2-s2.0-105022187562 | |
| dc.identifier.scopusquality | Q1 | |
| dc.identifier.uri | https://doi.org/10.1016/j.asoc.2025.114268 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.12868/5645 | |
| dc.identifier.volume | 186 | |
| dc.identifier.wos | WOS:001624435100009 | |
| dc.identifier.wosquality | Q1 | |
| dc.indekslendigikaynak | Web of Science | |
| dc.indekslendigikaynak | Scopus | |
| dc.language.iso | en | |
| dc.publisher | Elsevier | |
| dc.relation.ispartof | Applied Soft Computing | |
| dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | |
| dc.rights | info:eu-repo/semantics/closedAccess | |
| dc.snmz | KA_WoS_20260121 | |
| dc.subject | Fine-to-coarse approach | |
| dc.subject | Graph convolutional networks | |
| dc.subject | Multi-scale | |
| dc.subject | Skeleton-based action recognition | |
| dc.subject | Skeletal data | |
| dc.subject | Temporal self-attention | |
| dc.subject | multi-scale feature extraction stages. Comprehensive experiments conducted on the NTU-60 | |
| dc.subject | NTU-120 | |
| dc.title | Fine-to-coarse self-attention graph convolutional network for skeleton-based action recognition | |
| dc.type | Article |












