Перейти к основному содержанию
AkademIndex

Продукты

Для разработчиков

AkademBaseОткрытый API экосистемы
Статья

Attention mechanisms in computer vision: A survey

Meng-Hao GuoBNRist, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, ChinaTian-Xing XuBNRist, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, ChinaJiangjiang LiuTKLNDST, College of Computer Science, Nankai University, Tianjin 300350, ChinaZheng-Ning LiuBNRist, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, ChinaPeng-Tao JiangTKLNDST, College of Computer Science, Nankai University, Tianjin 300350, ChinaTai‐Jiang MuBNRist, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, ChinaSong–Hai ZhangBNRist, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, ChinaRalph R. MartinSchool of Computer Science and Informatics, Cardiff University, Cardiff, UKMing‐Ming ChengTKLNDST, College of Computer Science, Nankai University, Tianjin 300350, ChinaShi‐Min HuBNRist, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
2022en
ABI

Аннотация

Humans can naturally and effectively find salient regions in complex scenes. Motivated by this observation, attention mechanisms were introduced into computer vision with the aim of imitating this aspect of the human visual system. Such an attention mechanism can be regarded as a dynamic weight adjustment process based on features of the input image. Attention mechanisms have achieved great success in many visual tasks, including image classification, object detection, semantic segmentation, video understanding, image generation, 3D vision, multimodal tasks, and self-supervised learning. In this survey, we provide a comprehensive review of various attention mechanisms in computer vision and categorize them according to approach, such as channel attention, spatial attention, temporal attention, and branch attention; a related repository https://github.com/MenghaoGuo/Awesome-Vision-Attentions is dedicated to collecting related work. We also suggest future directions for attention mechanism research.

Перевод пока недоступен

Идентификаторы

Цитирования и источники

Цитирований: 3Использованных источников: 0