Dan Guo
Gender: Female
Education Level: Postgraduate (Doctoral)
Alma Mater: Huazhong University of Science and Technology
Scientific Research
Research Field
The main research directions are machine vision, machine learning, deep learning, and pattern recognition.
· Cross-modal Understanding and Reasoning
· Audio-Visual Event Understanding and Parsing
· Image/Video Captioning and Explanation
· Temporal Action Detection / Video Grounding
· Vision-based Sign Language Recognition and Translation
· Vision-based Physiological Measurement
Characteristic research
· Visual Affective Computing
· Visual Sign Language Machine Translation
· Video Semantic Analysis and Grounding
· Visual Chatbot
Paper Publications
The following is a partial list of papers (mainly CCF-A conferences/journals, IEEE/ACM Transactions journals, etc.):
[1] Fei Wang, Dan Guo*, Kun Li, Zhun Zhong, Meng Wang*. "Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Architecture", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A), 2024.
[2] Chunxiao Fan, Ziqi Wang, Dan Guo*, Meng Wang. "Data-Free Quantization via Pseudo-label Filtering", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A), 2024.
[3] Fei Wang, Dan Guo*, Kun Li, Meng Wang*. "EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer", AAAI Conference on Artificial Intelligence (AAAI, CCF-A), 2024.
[4] Zhangbin Li, Dan Guo*, Jinxing Zhou*, Jing Zhang, Meng Wang. "Object-aware Adaptive-Positivity Learning for Audio-Visual Question Answering", AAAI Conference on Artificial Intelligence (AAAI, CCF-A), 2024.
[5] Zhao Xie, Yadong Shi, Kewei Wu, Yaru Cheng, Dan Guo*. "Towards Understanding Future: Consistency Guided Probabilistic Modeling for Action Anticipation", AAAI Conference on Artificial Intelligence (AAAI, CCF-A), 2024.
[6] Liu Liu, Anran Huang, Qi Wu, Dan Guo*, Xun Yang, Meng Wang. "KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking", AAAI Conference on Artificial Intelligence (AAAI, CCF-A), 2024.
[7] Xinyi Wu, Wentao Ma, Dan Guo, Tongqing Zhou, Shan Zhao, Zhiping Cai. "Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning", AAAI Conference on Artificial Intelligence (AAAI, CCF-A), 2024.
[8] Peipei Song, Dan Guo*, Xun Yang, Shengeng Tang, and Meng Wang. "Emotional Video Captioning with Vision-based Emotion Interpretation Network", IEEE Transactions on Image Processing (IEEE TIP, CCF-A), 2024.
[9] Zhao Xie, Chang Jiao, Kewei Wu*, Dan Guo* and Richang Hong. "Active Factor Graph Network for Group Activity Recognition", IEEE Transactions on Image Processing (IEEE TIP, CCF-A), 2024.
[10] Dan Guo, Kun Li*, Bin Hu, Yan Zhang, Meng Wang*. "Benchmarking Micro-action Recognition: Dataset, Methods, and Applications", IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT, CCF-B), 2024.
[11] Xin Liu, Biao Qian, Haipeng Liu*, Dan Guo, Yang Wang, Meng Wang*. "Seeking False Hard Negatives for Graph Contrastive Learning", IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT, CCF-B), 2024.
[12] Kewei Wu, Wenjie Luo, Zhao Xie, Dan Guo, Zhao Zhang, and Richang Hong. "Ensemble Prototype Network For Weakly-Supervised Temporal Action Localization", IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS, CCF-B), 2024.
[13] Wei Qian, Dan Guo*, Kun Li, Xiaowei Zhang, Xilan Tian, Xun Yang, Meng Wang*, "Dual-path TokenLearner for Remote Photoplethysmography-based Physiological Measurement with Facial Videos", IEEE Transactions on Computational Social Systems (IEEE TCSS, SCI 2), 2024.
[14] Peipei Song, Dan Guo*, Xun Yang, Shengeng Tang, Erkun Yang, and Meng Wang*. "Emotion-Prior Awareness Network for Emotional Video Captioning", ACM International Conference on Multimedia (ACM MM, CCF-A, Oral paper, top 5.4%), 2023.
[15] Sheng Zhou, Dan Guo*, Jia Li, Xun Yang*, and Meng Wang. "Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA", IEEE Transactions on Image Processing (TIP, CCF-A), 2023.
[16] Kun Li, Dan Guo*, and Meng Wang*. "ViGT: Proposal-free Video Grounding with Learnable Token in Transformer", Science China Information Sciences (SCIS, CCF-A), 2023.
[17] Xinge Peng, Kun Li*, Jiaxiu Li, Guoliang Chen, and Dan Guo*. "Multi-modality Fusion for Emotion Recognition in Videos", IJCAI (CCF-A) Challenge paper, 2023.
[18] Kun Li, Dan Guo*, Guoliang Chen, Xinge Peng, and Meng Wang. "Joint Skeletal and Semantic Embedding Loss for Micro-gesture Classification", IJCAI (CCF-A) Challenge paper, 2023.
[19] Jia Li, Wei Qian, Kun Li, Qi Li, Dan Guo*, and Meng Wang*. "Exploiting Diverse Feature for Multimodal Sentiment Analysis", ACM MM (CCF-A) Challenge paper, 2023.
[20] Kun Li, Dan Guo*, Guoliang Chen, Feiyang Liu and Meng Wang. "Data Augmentation for Human Behavior Analysis in Multi-Person Conversations", ACM MM (CCF-A) Challenge paper, 2023.
[21] Kun Li, Jiaxiu Li, Dan Guo*, Xun Yang*, and Meng Wang. "Transformer-based Visual Grounding with Cross-modality Interaction", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP, CCF-B), 2023.
[22] Qi Li, Dan Guo*, Wei Qian, Xilan Tian, Xiao Sun, Haifeng Zhao, and Meng Wang*. "Channel-wise Interactive Learning for Remote Heart Rate Estimation from Facial Video", IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT, CCF-B), 2023.
[23] Jing Zhang, Dan Guo*, Xun Yang*, Peipei Song, and Meng Wang*. "Visual-Linguistic-Stylistic Triple Reward for Cross-Lingual Image Captioning", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP, CCF-B), 2023.
[24] Sheng Zhou, Dan Guo*, Xun Yang*, Jianfeng Dong, and Meng Wang*. "Graph Pooling Inference Network for Text-Based VQA", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP, CCF-B), 2023.
[25] Shuaiyang Li, Dan Guo, Kang Liu, Richang Hong, and Feng Xue. "Multimodal Counterfactual Learning Network for Multimedia-based Recommendation", Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR, CCF-A), 2023.
[26] Kang Liu, Feng Xue*, Dan Guo, Peijie Sun, Shengsheng Qian, and Richang Hong. "Multimodal Graph Contrastive Learning for Multimedia-based Recommendation", IEEE Transactions on Multimedia (IEEE TMM, CCF-B), 2023.
[27] Wentao Ma, Xinyi Wu, Shan Zhao*, Tongqing Zhou*, Dan Guo, Lichuan Gu, Zhiping Cai, and Meng Wang. "FedSH: Towards Privacy-preserving Text-based Person Re-Identification", IEEE Transactions on Multimedia (IEEE TMM, CCF-B), 2023.
[28] Kang Liu, Feng Xue*, Dan Guo, Le Wu, Shujie Li, and Richang Hong. "MEGCF: Multimodal Entity Graph Collaborative Filtering for Personalized Recommendation", ACM Transactions on Information Systems (ACM TOIS, CCF-A), 2023.
[29] Feng Xue*, Tian Yang, Kang Liu, Zikun Hong, Mingwei Cao, Dan Guo, and Richang Hong. "LCSNet: End-to-end Lipreading with Channel-aware Feature Selection", ACM Transactions on Multimedia Computing, Communications, and Applications (ACM TOMM, CCF-B), 2023.
[30] Jinxing Zhou, Dan Guo* and Meng Wang*. "Contrastive Positive Sample Propagation along the Audio-Visual Event Line", IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, CCF-A, IF 24.314), 2022.
[31] Shengeng Tang, Richang Hong*, Dan Guo*, and Meng Wang, "Gloss Semantic-Enhanced Network with Online Back-Translation for Sign Language Production", ACM International Conference on Multimedia (ACM MM, CCF-A), 2022.
[32] Peipei Song, Dan Guo*, Jun Cheng, and Meng Wang*, "Contextual Attention Network for Emotional Video Captioning", IEEE Transactions on Multimedia (TMM, CCF-B), 2022.
[33] Peipei Song, Dan Guo*, Jinxing Zhou, Mingliang Xu, and Meng Wang*, "Memorial GAN with Joint Semantic Optimization for Unpaired Image Captioning", IEEE Transactions on Cybernetics (TCYB, CCF-B), 2022.
[34] Jinxing Zhou, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo, Meng Wang*, and Yiran Zhong*, "Audio-Visual Segmentation", European Conference on Computer Vision (ECCV, CCF-B), 2022.
[35] Tianyuan Xu, Xueliang Liu*, Zhen Huang*, Dan Guo, Richang Hong, and Meng Wang. "Early-Learning regularized Contrastive Learning for Cross-Modal Retrieval with Noisy Labels", ACM International Conference on Multimedia (ACM MM, CCF-A), 2022.
[36] Zhao Xie, Jiansong Chen, Kewei Wu*, Dan Guo, and Richang Hong. "Global Temporal Difference Network for Action Recognition", IEEE Transactions on Multimedia (IEEE TMM, CCF-B), 2022.
[37] Kang Liu, Feng Xue*, Xiangnan He, Dan Guo, and Richang Hong. "Joint Multi-Grained Popularity-Aware Graph Convolution Collaborative Filtering for Recommendation", IEEE Transactions on Computational Social Systems (IEEE TCSS, SCI 2), 2022.
[38] Dan Guo, Hui Wang, and Meng Wang*, "Context-Aware Graph Inference with Knowledge Distillation for Visual Dialog", IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, CCF-A, IF 24.314), 2021.
[39] Hui Wang, Dan Guo*, Xiansheng Hua, and Meng Wang*, "Pairwise VLAD Interaction Network for Video Question Answering", ACM International Conference on Multimedia (ACM MM, CCF-A), 2021.
[40] Kun Li, Dan Guo*, and Meng Wang*, "Proposal-Free Video Grounding with Contextual Pyramid Network", AAAI Conference on Artificial Intelligence (AAAI, CCF-A), 2021.
[41] Shengeng Tang, Dan Guo*, Richang Hong*, and Meng Wang, "Graph-Based Multimodal Sequential Embedding for Sign Language Translation", IEEE Transactions on Multimedia (TMM, CCF-B), 2021.
[42] Dan Guo, Hui Wang, Shuhui Wang, and Meng Wang*, "Textual-Visual Reference-Aware Attention Network for Visual Dialog", IEEE Transactions on Image Processing (TIP, CCF-A), 2020.
[43] Dan Guo, Wengang Zhou*, Anyang Li, Houqiang Li, and Meng Wang*, "Hierarchical Recurrent Deep Fusion Using Adaptive Clip Summarization for Sign Language Translation", IEEE Transactions on Image Processing (TIP, CCF-A), 2020.
[44] Dan Guo, Hui Wang*, Hanwang Zhang, Zhengjun Zha, and Meng Wang*, "Iterative Context-Aware Graph Inference for Visual Dialog", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A, oral paper, Top 5%), 2020.
[45] Dan Guo, Yang Wang*, Peipei Song*, and Meng Wang, "Recurrent Relational Memory Network for Unsupervised Image Captioning", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A, top 12.6%), 2020.
[46] Dan Guo, Kun Li*, and Meng Wang, "DADNet: Dilated-Attention-Deformable ConvNet for Crowd Counting", ACM International Conference on Multimedia (ACM MM, CCF-A, oral paper, Top 9.8%), 2019.
[47] Dan Guo, Shengeng Tang, and Meng Wang, "Connectionist Temporal Modeling of Video and Language: A Joint Model for Translation and Sign Labeling", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A), 2019.
[48] Dan Guo, Shuo Wang, Qi Tian, and Meng Wang, "Dense Temporal Convolution Network for Sign Language Translation", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A), 2019.
[49] Dan Guo, Hui Wang, and Meng Wang, "Dual Visual Attention Network for Visual Dialog", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A), 2019.
[50] Shuo Wang, Dan Guo*, Xin Xu, Li Zhuo, and Meng Wang, "Cross-Modality Retrieval by Joint Correlation Learning", ACM Transactions on Multimedia Computing Communications and Applications (ACM TOMCCAP, CCF-B), 2019.
[51] Shuo Wang, Dan Guo*, Wengang Zhou, Zhengjun Zha, and Meng Wang, "Connectionist Temporal Fusion for Sign Language Translation", ACM International Conference on Multimedia (ACM MM, CCF-A), 2018.
[52] Dan Guo, Wengang Zhou, Houqiang Li, and Meng Wang, "Hierarchical LSTM for Sign Language Translation", AAAI Conference on Artificial Intelligence (AAAI, CCF-A, oral paper, Top 5%), 2018.
[53] Dan Guo, Wengang Zhou*, Houqiang Li*, and Meng Wang*, "Online Early-Late Fusion Based on Adaptive HMM for Sign Language Recognition", ACM Transactions on Multimedia Computing Communications and Applications (ACM TOMCCAP, CCF-B Journal), 2018.
[54] Dan Guo, Shentao Yao, Hui Wang, and Meng Wang. "Embedding VLAD in Transformer for Video Question Answering", Chinese Journal of Computers (CCF-A Chinese Journal), 2023.
[55] Zhihong Lu, Dan Guo*, and Meng Wang, "Motion-compensated Frame Interpolation Based on Weighted Motion Estimation and Vector Segmentation", Acta Automatica Sinica (CCF-A Chinese Journal), 2015.
Patents
[1] Dan Guo; Qi Li; Xiao Sun; Jie Huang; Meng Wang; End-to-end remote heart rate detection method based on channel-enhanced spatiotemporal attention network, April 26, 2024, China, ZL202210507744.7.
[2] Dan Guo; Ziyi He; Youwei Ni; Kun Li; Zixin Xu; Jiaqi Ma; Kuang Luo; A dishwashing device based on object detection (utility model), May 12, 2023, China, ZL202220873705.4.
[3] Dan Guo; Shengeng Tang; Xianglong Liu; Richang Hong; Meng Wang; A multimodal fusion sign language recognition system and method based on graph convolution, March 14, 2023, China, ZL202010049714.7.
[4] Dan Guo; Shengeng Tang; Xianglong Liu; Meng Wang; A sign language translation system and method based on multi-level semantic parsing, March 28, 2023, China, ZL202010103960.6.
[5] Ye Zhao; Xiaobin Hu; Zhenzhen Hu; Xueliang Liu; Dan Guo; Yanrong Guo; Le Wu; A method and device for generating video summary descriptions based on attention models, December 9, 2022, China, ZL202110565400.7.
[6] Dan Guo; Peipei Song; Xianglong Liu; Meng Wang; A method for generating an unsupervised image description model based on recursive memory networks, March 15, 2022, China, ZL202010049142.2.
[7] Dan Guo; Peipei Song; Xianglong Liu; Meng Wang; A method for sign language translation based on data-driven multi-level feature dynamic fusion, March 15, 2022, China, ZL202010096391.7.
[8] Dan Guo; Hui Wang; Meng Wang; A method for visual dialogue generation based on context-aware graph neural networks, June 8, 2021, China, ZL201910881298.4.
[9] Dan Guo; Kun Li; Meng Wang; A crowd density estimation method based on multi-scale attention mechanism, March 9, 2021, China, ZL201910531606.0.
[10] Dan Guo; Peipei Song; Ye Zhao; Meng Wang; A multi-feature fusion sign language recognition method based on adaptive hidden Markov models, July 10, 2020, China, ZL201811131806.9.
[11] Dan Guo; Meng Wang; Wengang Zhou; Houqiang Li; Chuanqing Li; Anyang Li; An Asymmetric Multilayer LSTM-Based Approach for Automatic Translation of Continuous Sign Language Videos, 2020-2-11, China, ZL201810027551.5.
[12] Dan Guo; Shuo Wang; Meng Wang; A Sign Language Video Translation Method Based on the Fusion of Temporal Domain Convolutional Networks and Recurrent Neural Networks, 2019-10-18, China, ZL201811070290.1.
[13] Meng Wang; Luming Zhang; Dan Guo; A fast recognition system and a fast recognition method for aerial images based on multi-task topology learning, 2018-2-6, China, ZL201510080478.4.
[14] Meng Wang; Luming Zhang; Dan Guo; Xuting Tian; A viewpoint tracking method based on geometric reconstruction and semantic fusion, 2017-10-3, China, ZL201410733763.7.
[15] Dan Guo; Xuegang Hu; Wu Ni; Xindong Wu; A road network evacuation planning method based on maximum flow rate path prioritization, 2017-6-6, China, ZL201510451828.3.
[16] Meng Wang; Xun Yang; Richang Hong; Dan Guo; Yiqun Liu; Maosong Sun; An image retrieval method based on semantic mapping space construction, 2017-5-17, China, ZL201410393094.3.
[17] Meng Wang; Richang Hong; Bingnan Li; Yiqun Liu; Dan Guo; Xueliang Liu; Xindong Wu; Xun Yang; Retrieval reordering method based on continuous number labeled subspace learning, 2017-2-22, China, ZL201410196946.X.
[18] Meng Wang; Luming Zhang; Dan Guo; Yiqun Liu; Maosong Sun; Zhihong Lu; 3D scene reconstruction method based on GPS information video, 2017-2-22, China, ZL201410752454.4.
[19] Shengeng Tang; Tonghuan Xiao; Dan Guo; Jihao Gu; Chenxi Cao; Wanqiang Song; Bin Huang; A collision warning method based on image object detection and visual depth estimation, 2023-2-27, China, CN202310188292.5. (Practical review)
[20] Shengeng Tang; Wanqiang Song; Dan Guo; Jihao Gu; Tonghuan Xiao; Chenxi Cao; A route planning method for visually impaired people based on weighted undirected graph, 2023-3-6, China, CN202310228006.3. (Practical review)
[21] Zihang Xu; Yangjun Huang; Changlin Chen; Yi He; Murou Li; Zan Huang; Dan Guo; A domain-adaptive image classification method based on regularized joint autonomous training, 2023-4-20, China, CN202310150489.X.
Published Books
English monograph
[1] Multimedia for Accessible Human Computer Interfaces. Springer. 2021.
[2] Pattern Matching with Wildcards and Length Constraint. Science Press. 2016.
Computer software copyright
[1] Xiyi Long; Ruyue Jin; Jinjun Yi; Peipei Song; Dan Guo; Real-time multi-modal fake news detection system in multiple fields V1.0, 2023R11L1048667, original acquisition, all rights, 2023-11-15.
[2] Shengeng Tang; Xueyu Xiu; Dan Guo; Xiaohu Dong; Jun Yao; Weihao Xie; Cross-language sign language translation system V1.0, 2023SR1107827, original acquisition, all rights, 2023-09-20.
[3] Shengeng Tang; Bin Huang; Dan Guo; Jihao Gu; Blind obstacle avoidance travel assistance system V1.0, 2023SR0517944, original acquisition, all rights, 2023-05-05.
[4] Dan Guo; Shengeng Tang; Yinnan Chen; ZiLong Wu; Zehan Wen; Zekuan Liu; Human posture cartoonization system based on key point estimation V1.0, 2022SR0771364, original acquisition, all rights reserved, 2022-06-16.
[5] Zhihong Lu; Dan Guo; Jingwei Wu; Fei Liu; Lijin Zhang; Xuting Tian; Video HD playback software based on motion compensation V1.0, 2014SR098634, original acquisition, all rights reserved, 2014-07-16.
Grants and Awards
Grants
· Top-notch young talents for young scholars of High-end Talent Cultivation Action Program of Anhui Province, China, 2023-09.
· Outstanding Reviewer Award of IEEE International Conference on Multimedia and Expo (IEEE ICME), 2020-07.
· Outstanding Reviewer Award of Computer Science journal, 2021-12.
Competitions
· International Joint Conference on Artificial Intelligence (IJCAI) Challenge on Micro-gesture Analysis for Hidden Emotion Understanding, 1st Place in Micro-gesture Classification Track🏆, 2023.05.
· International Joint Conference on Artificial Intelligence (IJCAI) Challenge on Micro-gesture Analysis for Hidden Emotion Understanding, 2nd Place in Micro-gesture Online Recognition Track, 2023.05.
· ACM International Conference on Multimedia (ACM MM) Multi-modal Group Behaviour Analysis for Artificial Mediation, 1st Place in Bodily Behaviour Recognition Track🏆, 2023.07.
· ACM International Conference on Multimedia (ACM MM) Multi-modal Group Behaviour Analysis for Artificial Mediation, 1st Place in Eye Contact Detection Track🏆, 2023.07.
· ACM International Conference on Multimedia (ACM MM) Multi-modal Group Behaviour Analysis for Artificial Mediation, 3rd Place in Next Speaker Prediction Track, 2023.07.
· ACM International Conference on Multimedia (ACM MM) Multi-modal Sentiment Analysis Challenge, 3rd Place in MuSe-Personalisation Track, 2023.07.
Research Projects
- National Natural Science Foundation of China - General Program, No. 62272144, Principal Investigator, 2023-2026.
- National Key Research and Development Program of China, No. 2022YFB4500601, Sub-Project Principal Investigator, 2022-2025.
- National Natural Science Foundation of China - Key Program, No. U20A20183, Sub-Project Principal Investigator, 2021-2024.
- National Key Research and Development Program of China, No. 2018YFC0830103, Sub-Project Principal Investigator, 2018-2021.
- The General Program of National Natural Science Foundation of China, No. 61876058, Principal Investigator, 2018-2022.