以下是部分论文清单:(主要包括CCF-A会议/期刊、IEEE/ACM Transactions期刊等)
[1] Jinxing Zhou, Dan Guo*, Yiran Zhong, Meng Wang*. "Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling", International Journal of Computer Vision (IJCV, CCF-A期刊), 2024.
[2] Shuaiyang Li, Feng Xue, Kang Liu, Dan Guo, Richang Hong. "Multimodal Graph Causal Embedding for Multimedia-based Recommendation", IEEE Transactions on Knowledge and Data Engineering (TKDE, CCF-A 期刊),2024.
[3] Wei Qian, Kun Li, Dan Guo*, Bin Hu, Meng Wang*. "Cluster-Phys: Facial Clues Clustering Towards Efficient Remote Physiological Measurement", ACM Mutilmedia (ACM MM, CCF-A会议, Oral paper, top 3.97%), 2024.
[4] Jingjing Hu, Dan Guo*, Kun Li, Zhan Si, Xun Yang*, Meng Wang*. "Maskable Retentive Network for Video Moment Retrieval", ACM Mutilmedia (ACM MM, CCF-A会议,), 2024.
[5] Xun Yang*, Jianming Zeng, Dan Guo, Shanshan Wang, Jianfeng Dong, Meng Wang. "Robust video question answering via contrastive cross-modality representation learning", Science China Information Sciences (SCIS, CCF-A 期刊 ), 2024.
[6] Jinxing Zhou, Dan Guo*, Yuxin Mao, Yiran Zhong, Xiaojun Chang, Meng Wang. "Label-anticipated Event Disentanglement for Audio-Visual Video Parsing", European Conference on Computer Vision (ECCV, CCF-B会议), 2024.
[7] Jing Zhang, Liang Zheng*, Meng Wang, Dan Guo*. "Training A Small Emotional Vision Language Model for Visual Art Comprehension", European Conference on Computer Vision (ECCV, CCF-B会议), 2024.
[8] Fei Wang, Dan Guo*, Kun Li, Zhun Zhong, Meng Wang*. "Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Architecture", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A会议), 2024.
[9] Chunxiao Fan, Ziqi Wang, Dan Guo*, Meng Wang. "Data-Free Quantization via Pseudo-label Filtering", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A会议), 2024.
[10] Fei Wang, Dan Guo*, Kun Li, Meng Wang*. "EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer", AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2024.
[11] Zhangbin Li, Dan Guo*, Jinxing Zhou*, Jing Zhang, Meng Wang. "Object-aware Adaptive-Positivity Learning for Audio-Visual Question Answering", AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2024.
[12] Zhao Xie, Yadong Shi, Kewei Wu, Yaru Cheng, Dan Guo*. "Towards Understanding Future: Consistency Guided Probabilistic Modeling for Action Anticipation", AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2024.
[13] Liu Liu, Anran Huang, Qi Wu, Dan Guo*, Xun Yang, Meng Wang. "KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking". AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2024.
[14] Xinyi Wu, Wentao Ma, Dan Guo, Tongqing Zhou, Shan Zhao, Zhiping Cai. "Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning", AAAI Conference on Artificial Intelligence (AAAI, CCF-A会议), 2024.
[15] Peipei Song, Dan Guo*, Xun Yang, Shengeng Tang, and Meng Wang. "Emotional Video Captioning with Vision-based Emotion Interpretation Network", IEEE Transactions on Image Processing (IEEE TIP, CCF-A期刊), 2024.
[16] Zhao Xie, Chang Jiao, Kewei Wu*, Dan Guo* and Richang Hong. "Active Factor Graph Network for Group Activity Recognition", IEEE Transactions on Image Processing (IEEE TIP, CCF-A期刊), 2024.
[17] Dan Guo, Kun Li*, Bin Hu, Yan Zhang, Meng Wang*. "Benchmarking Micro-action Recognition: Dataset, Methods, and Applications", IEEE Transactions on Circuits and Systems for Video Technology. (IEEE TCSVT, CCF-B期刊), 2024.
[18] Xin Liu, Biao Qian, Haipeng Liu*, Dan Guo,Yang Wang, Meng Wang*. "Seeking False Hard Negatives for Graph Contrastive Learning", IEEE Transactions on Circuits and Systems for Video Technology. (IEEE TCSVT, CCF-B期刊), 2024.
[19] Kewei Wu , Wenjie Luo , Zhao Xie , Dan Guo , Zhao Zhang , and Richang Hong. "Ensemble Prototype Network For Weakly-Supervised Temporal Action Localization", IEEE Transactions on Neural Networks and learning systems (IEEE TNNLS, CCF-B期刊), 2024.
[20] Wei Qian, Dan Guo*, Kun Li, Xiaowei Zhang, Xilan Tian, Xun Yang, Meng Wang*, "Dual-path TokenLearner for Remote Photoplethysmography-based Physiological Measurement with Facial Videos", IEEE Transactions on Computational Social Systems (IEEE TCSS, SCI 2区), 2024.
[21] Peipei Song, Dan Guo*, Xun Yang, Shengeng Tang, Erkun Yang, and Meng Wang*. "Emotion-Prior Awareness Network for Emotional Video Captioning", ACM International Conference on Multimedia (ACM MM ,CCF-A 会议, Oral paper, top 5.4%), 2023.
[22] Sheng Zhou, Dan Guo*, Jia Li, Xun Yang*, and Meng Wang. "Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA", IEEE Transactions on Image Processing (TIP, CCF-A 期刊 ), 2023.
[23] Kun Li, Dan Guo* , and Meng Wang*. "ViGT: Proposal-free Video Grounding with Learnable Token in Transformer", Science China Information Sciences (SCIS, CCF-A 期刊 ), 2023.
[24] Xinge Peng, Kun Li*, Jiaxiu Li, Guoliang Chen, and Dan Guo*. "Multi-modality Fusion for Emotion Recognition in Videos", IJCAI (CCF-A会议) Challenge paper, 2023.
[25] Kun Li, Dan Guo*, Guoliang Chen, Xinge Peng, and Meng Wang. "Joint Skeletal and Semantic Embedding Loss for Micro-gesture Classification", IJCAI (CCF-A会议) Challenge paper, 2023.
[26] Jia Li, Wei Qian, Kun Li, Qi Li, Dan Guo*, and Meng Wang*. "Exploiting Diverse Feature for Multimodal Sentiment Analysis", ACM MM (CCF-A 会议) Challenge paper, 2023.
[27] Kun Li, Dan Guo* , Guoliang Chen, Feiyang Liu and Meng Wang. "Data Augmentation for Human Behavior Analysis in Multi-Person Conversations", ACM MM (CCF-A 会议) Challenge paper, 2023.
[28] Kun Li, Jiaxiu Li, Dan Guo*, Xun Yang*, and Meng Wang. "Transformer-based Visual Grounding with Cross-modality Interaction", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP , CCF-B 期刊 ), 2023.
[29] Qi Li, Dan Guo*, Wei Qian, Xilan Tian, Xiao Sun, Haifeng Zhao, and Meng Wang*. "Channel-wise Interactive Learning for Remote Heart Rate Estimation from Facial Video", IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT, CCF-B期刊),2023.
[30] Jing Zhang, Dan Guo*, Xun Yang*, Peipei Song, and Meng Wang*. "Visual-Linguistic-Stylistic Triple Reward for Cross-Lingual Image Captioning", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP , CCF-B 期刊), 2023.
[31] Sheng Zhou, Dan Guo*, Xun Yang*, Jianfeng Dong, and Meng Wang*. "Graph Pooling Inference Network for Text-Based VQA", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP , CCF-B 期刊), 2023.
[32] Shuaiyang Li, Dan Guo, Kang Liu, Richang Hong, and Feng Xue. "Multimodal Counterfactual Learning Network for Multimedia-based Recommendation", Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR, CCF-A会议), 2023.
[33] Kang Liu, Feng Xue*, Dan Guo, Peijie Sun, Shengsheng Qian, and Richang Hong. "Multimodal Graph Contrastive Learning for Multimedia-based Recommendation", IEEE Transactions on Multimedia (IEEE TMM, CCF-B期刊), 2023.
[34] Wentao Ma, Xinyi Wu, Shan Zhao*, Tongqing Zhou*, Dan Guo, Lichuan Gu, Zhiping Cai, and Meng Wang. "FedSH: Towards Privacy-preserving Text-based Person Re-Identification", IEEE Transactions on Multimedia (IEEE TMM, CCF-B期刊), 2023.
[35] Kang Liu, Feng Xue*, Dan Guo, Le Wu, Shujie Li, and Richang Hong. "MEGCF: Multimodal Entity Graph Collaborative Filtering for Personalized Recommendation", ACM Transactions on Information Systems (ACM TOIS, CCF-A期刊), 2023.
[36] Feng Xue*, Tian Yang, Kang Liu, Zikun Hong, Mingwei Cao, Dan Guo, and Richang Hong. "LCSNet: End-to-end Lipreading with Channel-aware Feature Selection", ACM Transactions on Multimedia Computing, Communications, and Applications (ACM TOMM, CCF-B期刊), 2023.
[37] Jinxing Zhou, Dan Guo* and Meng Wang*. "Contrastive Positive Sample Propagation along the Audio-Visual Event Line", IEEE Transactions on Pattern Analysis and Machine Intelligence(TPAMI, CCF-A期刊, IF 24.314 ), 2022.
[38] Shengeng Tang, Richang Hong*, Dan Guo*, and Meng Wang, "Gloss Semantic-Enhanced Network with Online Back-Translation for Sign Language Production", ACM International Conference on Multimedia (ACM MM ,CCF-A 会议), 2022.
[39] Peipei Song, Dan Guo*, Jun Cheng, and Meng Wang*, "Contextual Attention Network for Emotional Video Captioning", IEEE Transactions on Multimedia (TMM, CCF-B期刊 ), 2022.
[40] Peipei Song, Dan Guo*, Jinxing Zhou, Mingliang Xu, and Meng Wang*, "Memorial GAN with Joint Semantic Optimization for Unpaired Image Captioning", IEEE Transactions on Cybernetics (TCYB, CCF-B期刊 ), 2022.
[41] Jinxing Zhou, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo, Meng Wang*, and Yiran Zhong*, "Audio−Visual Segmentation", European Conference on Computer Vision (ECCV, CCF-B会议), 2022.
[42] Tianyuan Xu, Xueliang Liu*, Zhen Huang*, Dan Guo, Richang Hong, and Meng Wang. "Early-Learning regularized Contrastive Learning for Cross-Modal Retrieval with Noisy Labels", ACM International Conference on Multimedia (ACM MM, CCF-A会议), 2022.
[43] Zhao Xie, Jiansong Chen, Kewei Wu*, Dan Guo, and Richang Hong. "Global Temporal Difference Network for Action Recognition", IEEE Transactions on Multimedia (IEEE TMM, CCF-B期刊), 2022.
[44] Kang Liu, Feng Xue*, Xiangnan He, Dan Guo, and Richang Hong. "Joint Multi-Grained Popularity-Aware Graph Convolution Collaborative Filtering for Recommendation", IEEE Transactions on Computational Social Systems (IEEE TCSS, SCI 2区), 2022.
[45] Dan Guo, Hui Wang, and Meng Wang*, "Context-Aware Graph Inference with Knowledge Distillation for Visual Dialog", IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, CCF-A期刊, IF 24.314 ), 2021.
[46] Hui Wang, Dan Guo*, Xiansheng Hua, and Meng Wang*, "Pairwise VLAD Interaction Network for Video Question Answering", ACM International Conference on Multimedia (ACM MM, CCF-A 会议 ), 2021.
[47] Kun Li, Dan Guo*, and Meng Wang*, "Proposal-Free Video Grounding with Contextual Pyramid Network", AAAI Conference on Artificial Intelligence (AAAI, CCF-A 会议 ), 2021.
[48] Shengeng Tang, Dan Guo*, Richang Hong*, and Meng Wang, "Graph-Based Multimodal Sequential Embedding for Sign Language Translation", IEEE Transactions on Multimedia (TMM, CCF-B期刊 ), 2021.
[49] Dan Guo, Hui Wang, Shuhui Wang, and Meng Wang*, "Textual-Visual Reference-Aware Attention Network for Visual Dialog", IEEE Transactions on Image Processing (TIP, CCF-A 期刊 ), 2020.
[50] Dan Guo, Wengang Zhou*, Anyang Li, Houqiang Li, and Meng Wang*, "Hierarchical Recurrent Deep Fusion Using Adaptive Clip Summarization for Sign Language Translation", IEEE Transactions on Image Processing (TIP, CCF-A 期刊 ), 2020.
[51] Dan Guo, Hui Wang*, Hanwang Zhang, Zhengjun Zha, and Meng Wang*, "Iterative Context-Aware Graph Inference for Visual Dialog", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A 会议, oral paper, Top 5%), 2020.
[52] Dan Guo, Yang Wang*, Peipei Song*, and Meng Wang, "Recurrent Relational Memory Network for Unsupervised Image Captioning", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A 会议, 录取率12.6%), 2020.
[53] Dan Guo, Kun Li*, and Meng Wang, "DADNet:Dilated-Attention-Deformable ConvNet for Crowd Counting", ACM International Conference on Multimedia (ACM MM, CCF-A 会议, oral paper, Top 9.8% ), 2019.
[54] Dan Guo, Shengeng Tang,and Meng Wang, "Connectionist Temporal Modeling of Video and Language:A Joint Model for Translation and Sign Labeling", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A 会议 ), 2019.
[55] Dan Guo, Shuo Wang, Qi Tian, and Meng Wang, "Dense Temporal Convolution Network for Sign Language Translation", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A 会议), 2019.
[56] Dan Guo, Hui Wang, and Meng Wang, "Dual Visual Attention Network for Visual Dialog", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A 会议 ), 2019.
[57] Shuo Wang, Dan Guo*, Xin Xu, Li Zhuo, and Meng Wang, "Cross-Modality Retrieval by Joint Correlation Learning", ACM Transactions on Multimedia Computing Communications and Applications (ACM TOMCCAP , CCF-B 期刊 ), 2019.
[58] Shuo Wang, Dan Guo*, Wengang Zhou, Zhengjun Zha, and Meng Wang, "Connectionist Temporal Fusion for Sign Language Translation", International ACM International Conference on Multimedia (ACM MM, CCF-A 会议 ), 2018.
[59] Dan Guo, Wengang Zhou, Houqiang Li, and Meng Wang, "Hierarchical LSTM for Sign Language Translation", AAAI Conference on Artificial Intelligence (AAAI, CCF-A 会议, oral paper, Top 5% ), 2018.
[60] Dan Guo, Wengang Zhou*, Houqiang Li*, and Meng Wang*, "Online Early-Late Fusion Based on Adaptive HMM for Sign Language Recognition", ACM Transactions on Multimedia Computing Communications and Applications (ACM TOMCCAP , CCF-B 期刊 ), 2018.
[61] 郭丹,姚沈涛,王辉,汪萌.嵌入局部聚类描述符的视频问答Transformer模型[J]. 计算机学报 (CCF-A 中文期刊), 2023.
[62] 鲁志红, 郭丹*, 汪萌. 基于加权运动估计和矢量分割的运动补偿内插算法[j]. 自动化学报 (CCF-A 中文期刊), 2015.