Previous: Cheng H, Zhao Z, He Y, Z Hu, J Li* (corresponding author), et al. VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection. ACM Multimedia 2025.
Next: Xiao J, Song Z, Hu J, Z Hu*, J Li* (corresponding author), R Hong. Contrastive Alignment with Semantic Gap-Aware Corrections in Text-Video Retrieval. NeurIPS 2025.