以下是部分论文清单:(主要包括CCF-A会议/期刊、IEEE/ACM Transactions期刊等)
2025
1. Kun Li, Dan Guo*, Guoliang Chen*, Chunxiao Fan, Jingyuan Xu, zhiliang wu, Hehe Fan, Meng Wang*. “Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition,AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2025.
2. Shengeng Tang, Jiayi He, Dan Guo, Yanyan Wei, Feng Li, Richang Hong. “Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production”, AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2025.
3. Pengcheng Zhao, Jinxing Zhou, Dan Guo*, Yang Zhao, Yanxiang Chen*. “Multimodal Class-aware Semantic Enhancement Network for Audio-Visual Video Parsing”, AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2025.
4. Ziheng Zhou, Jinxing Zhou, Wei Qian, Shengeng Tang, Xiaojun Chang, Dan Guo*. “Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration”, AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2025.
5. Wei Qian, Gaoji Su, Dan Guo*, Jinxing Zhou, Xiaobai Li, Bin Hu, Shengeng Tang, Meng Wang*. “PhysDiff: Physiology-based Dynamicity Disentangled Diffusion Model for Remote Physiological Measurement”, AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2025.
6. Jingjing Hu, Dan Guo*, Zhan Si, Deguang Liu, Yunfeng Diao, Jing Zhang, Jinxing Zhou, Meng Wang*. “MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights”, AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2025.
7. Zhangbin Li, Jinxing Zhou, Jing Zhang, Shengeng Tang, Kun Li, Dan Guo*. “Patch-level Sounding Object Tracking for Audio-Visual Question Answering”, AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2025.
8. Xinyi Wang, Na Zhao, Zhiyuan Han, Dan Guo, Xun Yang. “AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring”, AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2025.
2024
9. Jinxing Zhou, Xuyang Shen, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo, Lingpeng Kong, Meng Wang* , Yiran Zhong*. “Audio-Visual Segmentation with Semantics”, International Journal of Computer Vision (IJCV, CCF-A期刊), 2024.
10. Jinxing Zhou, Dan Guo*, Yiran Zhong, Meng Wang*. "Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling", International Journal of Computer Vision (IJCV, CCF-A期刊), 2024.
11. Shuaiyang Li, Feng Xue, Kang Liu, Dan Guo, Richang Hong. "Multimodal Graph Causal Embedding for Multimedia-based Recommendation", IEEE Transactions on Knowledge and Data Engineering (TKDE, Trans.汇刊, CCF-A 期刊),2024.
12. Wei Qian, Kun Li, Dan Guo*, Bin Hu, Meng Wang*. "Cluster-Phys: Facial Clues Clustering Towards Efficient Remote Physiological Measurement", ACM Mutilmedia (ACM MM, CCF-A会议, Oral paper, top 3.97%), 2024.
13. Jingjing Hu, Dan Guo*, Kun Li, Zhan Si, Xun Yang*, Meng Wang*. "Maskable Retentive Network for Video Moment Retrieval", ACM Mutilmedia (ACM MM, CCF-A会议,), 2024.
14. Xun Yang*, Jianming Zeng, Dan Guo, Shanshan Wang, Jianfeng Dong, Meng Wang. "Robust video question answering via contrastive cross-modality representation learning", Science China Information Sciences (SCIS, CCF-A 期刊 ), 2024.
15. Jinxing Zhou, Dan Guo*, Yuxin Mao, Yiran Zhong, Xiaojun Chang, Meng Wang. "Label-anticipated Event Disentanglement for Audio-Visual Video Parsing", European Conference on Computer Vision (ECCV), 2024.
16. Jing Zhang, Liang Zheng*, Meng Wang, Dan Guo*. "Training A Small Emotional Vision Language Model for Visual Art Comprehension", European Conference on Computer Vision (ECCV), 2024.
17. Fei Wang, Dan Guo*, Kun Li, Zhun Zhong, Meng Wang*. "Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Architecture", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A会议), 2024.
18. Chunxiao Fan, Ziqi Wang, Dan Guo*, Meng Wang. "Data-Free Quantization via Pseudo-label Filtering", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A会议), 2024.
19. Fei Wang, Dan Guo*, Kun Li, Meng Wang*. "EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer", AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2024.
20. Zhangbin Li, Dan Guo*, Jinxing Zhou*, Jing Zhang, Meng Wang. "Object-aware Adaptive-Positivity Learning for Audio-Visual Question Answering", AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2024.
21. Zhao Xie, Yadong Shi, Kewei Wu, Yaru Cheng, Dan Guo*. "Towards Understanding Future: Consistency Guided Probabilistic Modeling for Action Anticipation", AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2024.
22. Liu Liu, Anran Huang, Qi Wu, Dan Guo*, Xun Yang, Meng Wang. "KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking". AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2024.
23. Xinyi Wu, Wentao Ma, Dan Guo, Tongqing Zhou, Shan Zhao, Zhiping Cai. "Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning", AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2024.
24. Peipei Song, Dan Guo*, Xun Yang, Shengeng Tang, and Meng Wang. "Emotional Video Captioning with Vision-based Emotion Interpretation Network", IEEE Transactions on Image Processing (IEEE TIP, Trans.汇刊, CCF-A期刊), 2024.
25. Zhao Xie, Chang Jiao, Kewei Wu*, Dan Guo* and Richang Hong. "Active Factor Graph Network for Group Activity Recognition", IEEE Transactions on Image Processing (IEEE TIP, Trans.汇刊, CCF-A期刊), 2024.
26. Dan Guo, Kun Li*, Bin Hu, Yan Zhang, Meng Wang*. "Benchmarking Micro-action Recognition: Dataset, Methods, and Applications", IEEE Transactions on Circuits and Systems for Video Technology. (IEEE TCSVT, Trans.汇刊), 2024.
27. Feiyang Liu, Kun Li, Zhun Zhong, Wei Jia, Bin Hu, Xun Yang*, Meng Wang*, Dan Guo*. “Depth Matters: Spatial Proximity-based Gaze Cone Generation for Gaze Following in Wild”, ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP , Trans.汇刊), 2024.
28. Xin Liu, Biao Qian, Haipeng Liu*, Dan Guo, Yang Wang, Meng Wang*. "Seeking False Hard Negatives for Graph Contrastive Learning", IEEE Transactions on Circuits and Systems for Video Technology. (IEEE TCSVT, Trans.汇刊), 2024.
29. Kewei Wu , Wenjie Luo , Zhao Xie , Dan Guo , Zhao Zhang , and Richang Hong. "Ensemble Prototype Network For Weakly-Supervised Temporal Action Localization", IEEE Transactions on Neural Networks and learning systems (IEEE TNNLS, Trans.汇刊), 2024.
30. Wei Qian, Dan Guo*, Kun Li, Xiaowei Zhang, Xilan Tian, Xun Yang, Meng Wang*, "Dual-path TokenLearner for Remote Photoplethysmography-based Physiological Measurement with Facial Videos", IEEE Transactions on Computational Social Systems (IEEE TCSS, Trans.汇刊), 2024.
2023
31. Peipei Song, Dan Guo*, Xun Yang, Shengeng Tang, Erkun Yang, and Meng Wang*. "Emotion-Prior Awareness Network for Emotional Video Captioning", ACM International Conference on Multimedia (ACM MM ,CCF-A 会议, Oral paper, top 5.4%), 2023.
32. Sheng Zhou, Dan Guo*, Jia Li, Xun Yang*, and Meng Wang. "Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA", IEEE Transactions on Image Processing (TIP, Trans.汇刊, CCF-A期刊 ), 2023.
33. Kun Li, Dan Guo*, and Meng Wang*. "ViGT: Proposal-free Video Grounding with Learnable Token in Transformer", Science China Information Sciences (SCIS, CCF-A期刊), 2023.
34. Xinge Peng, Kun Li*, Jiaxiu Li, Guoliang Chen, and Dan Guo*. "Multi-modality Fusion for Emotion Recognition in Videos", IJCAI (CCF-A会议) Challenge paper, 2023.
35. Kun Li, Dan Guo*, Guoliang Chen, Xinge Peng, and Meng Wang. "Joint Skeletal and Semantic Embedding Loss for Micro-gesture Classification", IJCAI (CCF-A会议) Challenge paper, 2023.
36. Jia Li, Wei Qian, Kun Li, Qi Li, Dan Guo*, and Meng Wang*. "Exploiting Diverse Feature for Multimodal Sentiment Analysis", ACM MM (CCF-A 会议) Challenge paper, 2023.
37. Kun Li, Dan Guo* , Guoliang Chen, Feiyang Liu and Meng Wang. "Data Augmentation for Human Behavior Analysis in Multi-Person Conversations", ACM MM (CCF-A 会议) Challenge paper, 2023.
38. Kun Li, Jiaxiu Li, Dan Guo*, Xun Yang*, and Meng Wang. "Transformer-based Visual Grounding with Cross-modality Interaction", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP , Trans.汇刊), 2023.
39. Qi Li, Dan Guo*, Wei Qian, Xilan Tian, Xiao Sun, Haifeng Zhao, and Meng Wang*. "Channel-wise Interactive Learning for Remote Heart Rate Estimation from Facial Video", IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT, Trans.汇刊),2023.
40. Jing Zhang, Dan Guo*, Xun Yang*, Peipei Song, and Meng Wang*. "Visual-Linguistic-Stylistic Triple Reward for Cross-Lingual Image Captioning", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP , Trans.汇刊), 2023.
41. Sheng Zhou, Dan Guo*, Xun Yang*, Jianfeng Dong, and Meng Wang*. "Graph Pooling Inference Network for Text-Based VQA", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP , Trans.汇刊), 2023.
42. Shuaiyang Li, Dan Guo, Kang Liu, Richang Hong, and Feng Xue. "Multimodal Counterfactual Learning Network for Multimedia-based Recommendation", Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR, CCF-A会议), 2023.
43. Kang Liu, Feng Xue*, Dan Guo, Peijie Sun, Shengsheng Qian, and Richang Hong. "Multimodal Graph Contrastive Learning for Multimedia-based Recommendation", IEEE Transactions on Multimedia (IEEE TMM, Trans.汇刊), 2023.
44. Wentao Ma, Xinyi Wu, Shan Zhao*, Tongqing Zhou*, Dan Guo, Lichuan Gu, Zhiping Cai, and Meng Wang. "FedSH: Towards Privacy-preserving Text-based Person Re-Identification", IEEE Transactions on Multimedia (IEEE TMM, Trans.汇刊), 2023.
45. Kang Liu, Feng Xue*, Dan Guo, Le Wu, Shujie Li, and Richang Hong. "MEGCF: Multimodal Entity Graph Collaborative Filtering for Personalized Recommendation", ACM Transactions on Information Systems (ACM TOIS, Trans.汇刊, CCF-A期刊), 2023.
46. Feng Xue*, Tian Yang, Kang Liu, Zikun Hong, Mingwei Cao, Dan Guo, and Richang Hong. "LCSNet: End-to-end Lipreading with Channel-aware Feature Selection", ACM Transactions on Multimedia Computing, Communications, and Applications (ACM TOMM, Trans.汇刊), 2023.
47. 郭丹,姚沈涛,王辉,汪萌.嵌入局部聚类描述符的视频问答Transformer模型[J]. 计算机学报 (CCF-A 中文期刊), 2023.
2022
48. Jinxing Zhou, Dan Guo* and Meng Wang*. "Contrastive Positive Sample Propagation along the Audio-Visual Event Line", IEEE Transactions on Pattern Analysis and Machine Intelligence(TPAMI, Trans.汇刊, CCF-A期刊, IF 24.314 ), 2022.
49. Shengeng Tang, Richang Hong*, Dan Guo*, and Meng Wang, "Gloss Semantic-Enhanced Network with Online Back-Translation for Sign Language Production", ACM International Conference on Multimedia (ACM MM ,CCF-A 会议), 2022.
50. Peipei Song, Dan Guo*, Jun Cheng, and Meng Wang*, "Contextual Attention Network for Emotional Video Captioning", IEEE Transactions on Multimedia (TMM, Trans.汇刊 ), 2022.
51. Peipei Song, Dan Guo*, Jinxing Zhou, Mingliang Xu, and Meng Wang*, "Memorial GAN with Joint Semantic Optimization for Unpaired Image Captioning", IEEE Transactions on Cybernetics (TCYB, Trans.汇刊 ), 2022.
52. Jinxing Zhou, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo, Meng Wang*, and Yiran Zhong*, "Audio−Visual Segmentation", European Conference on Computer Vision (ECCV), 2022.
53. Tianyuan Xu, Xueliang Liu*, Zhen Huang*, Dan Guo, Richang Hong, and Meng Wang. "Early-Learning regularized Contrastive Learning for Cross-Modal Retrieval with Noisy Labels", ACM International Conference on Multimedia (ACM MM, CCF-A会议), 2022.
54. Zhao Xie, Jiansong Chen, Kewei Wu*, Dan Guo, and Richang Hong. "Global Temporal Difference Network for Action Recognition", IEEE Transactions on Multimedia (IEEE TMM, Trans.汇刊), 2022.
55. Kang Liu, Feng Xue*, Xiangnan He, Dan Guo, and Richang Hong. "Joint Multi-Grained Popularity-Aware Graph Convolution Collaborative Filtering for Recommendation", IEEE Transactions on Computational Social Systems (IEEE TCSS, Trans.汇刊), 2022.
2021
56. Dan Guo, Hui Wang, and Meng Wang*, "Context-Aware Graph Inference with Knowledge Distillation for Visual Dialog", IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, Trans.汇刊, CCF-A期刊, IF 24.314 ), 2021.
57. Hui Wang, Dan Guo*, Xiansheng Hua, and Meng Wang*, "Pairwise VLAD Interaction Network for Video Question Answering", ACM International Conference on Multimedia (ACM MM, CCF-A 会议), 2021.
58. Kun Li, Dan Guo*, and Meng Wang*, "Proposal-Free Video Grounding with Contextual Pyramid Network", AAAI Conference on Artificial Intelligence (AAAI, CCF-A 会议), 2021.
59. Shengeng Tang, Dan Guo*, Richang Hong*, and Meng Wang, "Graph-Based Multimodal Sequential Embedding for Sign Language Translation", IEEE Transactions on Multimedia (TMM, Trans.汇刊), 2021.
2020
60. Dan Guo, Hui Wang, Shuhui Wang, and Meng Wang*, "Textual-Visual Reference-Aware Attention Network for Visual Dialog", IEEE Transactions on Image Processing (TIP, Trans.汇刊, CCF-A期刊), 2020.
61. Dan Guo, Wengang Zhou*, Anyang Li, Houqiang Li, and Meng Wang*, "Hierarchical Recurrent Deep Fusion Using Adaptive Clip Summarization for Sign Language Translation", IEEE Transactions on Image Processing (TIP, Trans.汇刊, CCF-A期刊), 2020.
62. Dan Guo, Hui Wang*, Hanwang Zhang, Zhengjun Zha, and Meng Wang*, "Iterative Context-Aware Graph Inference for Visual Dialog", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A 会议, oral paper, Top 5%), 2020.
63. Dan Guo, Yang Wang*, Peipei Song*, and Meng Wang, "Recurrent Relational Memory Network for Unsupervised Image Captioning", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A会议, 录取率12.6%), 2020.
2019
64. Dan Guo, Kun Li*, and Meng Wang, "DADNet:Dilated-Attention-Deformable ConvNet for Crowd Counting", ACM International Conference on Multimedia (ACM MM, CCF-A 会议, oral paper, Top 9.8% ), 2019.
65. Dan Guo, Shengeng Tang,and Meng Wang, "Connectionist Temporal Modeling of Video and Language:A Joint Model for Translation and Sign Labeling", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A会议 ), 2019.
66. Dan Guo, Shuo Wang, Qi Tian, and Meng Wang, "Dense Temporal Convolution Network for Sign Language Translation", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A会议), 2019.
67. Dan Guo, Hui Wang, and Meng Wang, "Dual Visual Attention Network for Visual Dialog", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A会议), 2019.
68. Shuo Wang, Dan Guo*, Xin Xu, Li Zhuo, and Meng Wang, "Cross-Modality Retrieval by Joint Correlation Learning", ACM Transactions on Multimedia Computing Communications and Applications (ACM TOMCCAP , Trans.汇刊 ), 2019.
2018&Before
69. Shuo Wang, Dan Guo*, Wengang Zhou, Zhengjun Zha, and Meng Wang, "Connectionist Temporal Fusion for Sign Language Translation", International ACM International Conference on Multimedia (ACM MM, CCF-A会议 ), 2018.
70. Dan Guo, Wengang Zhou, Houqiang Li, and Meng Wang, "Hierarchical LSTM for Sign Language Translation", AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议, oral paper, Top 5% ), 2018.
71. Dan Guo, Wengang Zhou*, Houqiang Li*, and Meng Wang*, "Online Early-Late Fusion Based on Adaptive HMM for Sign Language Recognition", ACM Transactions on Multimedia Computing Communications and Applications (ACM TOMCCAP , Trans.汇刊 ), 2018.
72. 鲁志红, 郭丹*, 汪萌. 基于加权运动估计和矢量分割的运动补偿内插算法[j]. 自动化学报 (CCF-A中文期刊), 2015.