上一条:Jia Li, Yangchen Yu, Yin Chen, Yu Zhang, et al.; DAT: Dialogue-Aware Transformer with Modality-Group Fusion for Human Engagement Estimation, In Proceedings of the 32nd ACM International Conference on Multimedia, Melbourne, VIC, Australia, 2024.
下一条:Jia Li, Yin Chen, Xuesong Zhang, et al. Multimodal feature extraction and fusion for emotional reaction intensity estimation and expression classification in videos with transformers[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops. 2023: 5837-5843.