GSA-Gaze: Generative Self-adversarial Learning for Domain Generalized Driver Gaze Estimation

Feb 13, 2024·

HAN Hongcheng

Zhiqiang Tian

Yuying Liu

Shengpeng Li

Dong Zhang

Shaoyi Du*

Corresponding author

· 0 min read

Source Document Code PDF DOI

Abstract

Estimating driver gaze accurately is critical for the human-machine cooperative driving, but the significant facial appearance diversions caused by background, illumination, personal characteristics, etc. pose a challenge to the generalizability of gaze estimation models. In this paper, we propose the generative self-adversarial learning mechanism for generalized gaze estimation that aims to learn general gaze features while eliminating sample-specific features and preventing cross-domain feature over-fitting. Firstly, to reduce information redundancy, the feature encoder is designed based on pyramid-grouped convolution to extract a sparse feature representation from the facial appearance. Secondly, the gaze regression module supervises the model to learn as many gaze-relevant features as possible. Thirdly, the adversarial image reconstruction task prompts the model to eliminate the domain-specific features. The adversarial learning of the gaze regression and the image reconstruction tasks guides the model to learn only general gaze features across domains, preventing cross-domain feature over-fitting, enhancing the domain generalization capability. The results of cross-domain testing of four active gaze datasets prove the effectiveness of the proposed method.

Type

Article-Conference

Publication

IEEE International Conference on Intelligent Transportation Systems (ITSC)

License

CC-BY-4.0

Last updated on Feb 13, 2024

IEEE ITSC

Authors

HAN Hongcheng (he/him)

PhD Candidate in Control Science and Engineering

Han Hongcheng (韩泓丞) received the degree of B.Eng. in School of Energy and Power Engineering, Xi’an Jiaotong University, Xi’an, China in 2020. Since then, he is studying for Ph.D. degree in Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong University. His interests focus on intelligent transportation and medical image analysis, specializing in Multimodal data fusion and Image Synthesis.

← HSC-T: B-ultrasound-to-elastography translation via hierarchical structural consistency learning for thyroid cancer diagnosis Jan 4, 2025

基于皮肤镜图像分类的多模态医学语料库构建 Jun 25, 2023 →

No results found

GSA-Gaze: Generative Self-adversarial Learning for Domain Generalized Driver Gaze Estimation