Maqola

Egocentric zone-aware action recognition across environments

Simone PeironeDepartment of Control and Computer Engineering, Polytechnic of Turin Corso Castelfidardo, 34/d, Turin, 10138, ItalyGabriele GolettoDepartment of Control and Computer Engineering, Polytechnic of Turin Corso Castelfidardo, 34/d, Turin, 10138, ItalyMirco PlanamenteDepartment of Control and Computer Engineering, Polytechnic of Turin Corso Castelfidardo, 34/d, Turin, 10138, ItalyA. BottinoDepartment of Control and Computer Engineering, Polytechnic of Turin Corso Castelfidardo, 34/d, Turin, 10138, ItalyBarbara CaputoDepartment of Control and Computer Engineering, Polytechnic of Turin Corso Castelfidardo, 34/d, Turin, 10138, ItalyGiuseppe AvertaDepartment of Control and Computer Engineering, Polytechnic of Turin Corso Castelfidardo, 34/d, Turin, 10138, Italy

Pattern Recognition Lettersjournal2024en

ABI

Annotatsiya

Human activities exhibit a strong correlation between actions and the places where these are performed, such as washing something at a sink. More specifically, in daily living environments we may identify particular locations, hereinafter named activity-centric zones , which may afford a set of homogeneous actions. Their knowledge can serve as a prior to favor vision models to recognize human activities. However, the appearance of these zones is scene-specific, limiting the transferability of this prior information to unfamiliar areas and domains. This problem is particularly relevant in egocentric vision, where the environment takes up most of the image, making it even more difficult to separate the action from the context. In this paper, we discuss the importance of decoupling the domain-specific appearance of activity-centric zones from their universal, domain-agnostic representations, and show how the latter can improve the cross-domain transferability of Egocentric Action Recognition (EAR) models. We validate our solution on the EPIC-Kitchens-100 and Argo1M datasets. • We shed light on the side-effects of co-occurrence biases in egocentric vision. • We present EgoZAR, an action recognition model leveraging environmental affordances. • We achieve SOTA performance on EPIC-Kitchens-100 in the domain generalization setting.

Hali tarjima qilinmagan

Mavzular

Human Pose and Action Recognition Gait Recognition and Analysis Anomaly Detection Techniques and Applications

Identifikatorlar

DOI: 10.1016/j.patrec.2024.12.008

Iqtiboslar va manbalar

0 ta iqtibos53 ta foydalanilgan manba

Koʻrsatkichlar — AkademScholar · Tez orada