23 April 2022 Collaborative Attention Transformer on facial expression recognition under partial occlusion
Yan Luo, Jie Shao, Runxia Yang
Author Affiliations +
Abstract

Recent research on facial expression recognition (FER) in the wild shows challenges still remain. Different from laboratory-controlled expression in the past, images in the wild contain more uncertainties, such as different forms of face information occlusion, ambiguous facial images, noisy labels, and so on. Among them, real-world facial occlusion is the most general and crucial challenge for FER. In addition, because of the COVID-19 disease epidemic, people have to wear masks in public, which brings new challenges to FER tasks. Due to the recent success of the Transformer on numerous computer vision tasks, we propose a Collaborative Attention Transformer (CAT) network that first uses Cross-Shaped Window Transformer as the backbone for the FER task. Meanwhile, two attention modules are collaborated. Channel-Spatial Attention Module is designed to increase the attention of the network to global features. Moreover, Window Attention Gate is used to enhance the ability of the model to focus on local details. The proposed method is evaluated on two public in-the-wild facial expression datasets, RAF-DB and FERPlus, and the results demonstrate that our CAT performs superior to the state-of-the-art methods.

© 2022 SPIE and IS&T 1017-9909/2022/$28.00© 2022 SPIE and IS&T
Yan Luo, Jie Shao, and Runxia Yang "Collaborative Attention Transformer on facial expression recognition under partial occlusion," Journal of Electronic Imaging 31(2), 023037 (23 April 2022). https://doi.org/10.1117/1.JEI.31.2.023037
Received: 1 December 2021; Accepted: 5 April 2022; Published: 23 April 2022
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Transformers

Computed tomography

Facial recognition systems

Virtual reality

Data modeling

Visual process modeling

Eye models

Back to Top