Dan Guo

Doctoral degree

Postgraduate (Doctoral)

Personal Information

Business Address:Kejiao A Building, Feicui Campus of HFUT, Hefei, Anhui, China

VIEW MORE
Home > Scientific Research
Research Field

    The main research directions are machine vision, machine learning, deep learning, pattern recognition.

      · Cross-modal Understanding and Reasoning

      · Audio-Visual Event Understanding and Parsing

      · Image/Video Captioning and Explanation

      · Temporal Action Detection / Video Grounding

      · Vison-based Sign Language Recognition and Translation)

      · Vision-based Physiological Measurement


    Characteristic research

      · Visual Affective Computing

      · Visual Sign Language Machine Translation

      · Video Semantic Analysis and Grounding

      · Visual Chatbot


Paper Publications

    The following is a partial list of papers: (mainly CCF-A conferences/journals, IEEE/ACM Transactions journals, etc.)


    [1] Fei Wang, Dan Guo*, Kun Li, Zhun Zhong, Meng Wang*. "Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Architecture", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A), 2024.


    [2] Chunxiao Fan, Ziqi Wang, Dan Guo*, Meng Wang. "Data-Free Quantization via Pseudo-label Filtering", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A), 2024.


    [3] Fei Wang, Dan Guo*, Kun Li, Meng Wang*. "EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer", AAAI Conference on Artificial Intelligence (AAAI, CCF-A), 2024.


    [4] Zhangbin Li, Dan Guo*, Jinxing Zhou*, Jing Zhang, Meng Wang. "Object-aware Adaptive-Positivity Learning for Audio-Visual Question Answering", AAAI Conference on Artificial Intelligence (AAAI, CCF-A), 2024.


    [5] Zhao Xie, Yadong Shi, Kewei Wu, Yaru Cheng, Dan Guo*. "Towards Understanding Future: Consistency Guided Probabilistic Modeling for Action Anticipation", AAAI Conference on Artificial Intelligence (AAAI, CCF-A), 2024.


    [6] Liu Liu, Anran Huang, Qi Wu, Dan Guo*, Xun Yang, Meng Wang. "KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking". AAAI Conference on Artificial Intelligence (AAAI, CCF-A), 2024.


    [7] Xinyi Wu, Wentao Ma, Dan Guo, Tongqing Zhou, Shan Zhao, Zhiping Cai. "Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning", AAAI Conference on Artificial Intelligence (AAAI, CCF-A), 2024.


    [8] Peipei Song, Dan Guo*, Xun Yang, Shengeng Tang, and Meng Wang. "Emotional Video Captioning with Vision-based Emotion Interpretation Network", IEEE Transactions on Image Processing (IEEE TIP, CCF-A), 2024.


    [9] Zhao Xie, Chang Jiao, Kewei Wu*, Dan Guo* and Richang Hong. "Active Factor Graph Network for Group Activity Recognition", IEEE Transactions on Image Processing (IEEE TIP, CCF-A), 2024.


    [10] Dan Guo, Kun Li*, Bin Hu, Yan Zhang, Meng Wang*. "Benchmarking Micro-action Recognition: Dataset, Methods, and Applications", IEEE Transactions on Circuits and Systems for Video Technology. (IEEE TCSVT, CCF-B), 2024.


    [11] Xin Liu, Biao Qian, Haipeng Liu*, Dan Guo,Yang Wang, Meng Wang*. "Seeking False Hard Negatives for Graph Contrastive Learning", IEEE Transactions on Circuits and Systems for Video Technology. (IEEE TCSVT, CCF-B), 2024.


    [12] Kewei Wu , Wenjie Luo , Zhao Xie , Dan Guo , Zhao Zhang , and Richang Hong. "Ensemble Prototype Network For Weakly-Supervised Temporal Action Localization", IEEE Transactions on Neural Networks and learning systems (IEEE TNNLS, CCF-B), 2024.


    [13] Wei Qian, Dan Guo*, Kun Li, Xiaowei Zhang, Xilan Tian, Xun Yang, Meng Wang*, "Dual-path TokenLearner for Remote Photoplethysmography-based Physiological Measurement with Facial Videos", IEEE Transactions on Computational Social Systems (IEEE TCSS, SCI 2), 2024.


    [14] Peipei Song, Dan Guo*, Xun Yang, Shengeng Tang, Erkun Yang, and Meng Wang*. "Emotion-Prior Awareness Network for Emotional Video Captioning", ACM International Conference on Multimedia (ACM MM ,CCF-A, Oral paper, top 5.4%), 2023.


    [15] Sheng Zhou, Dan Guo*, Jia Li, Xun Yang*, and Meng Wang. "Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA", IEEE Transactions on Image Processing (TIP, CCF-A), 2023.


    [16] Kun Li, Dan Guo* , and Meng Wang*. "ViGT: Proposal-free Video Grounding with Learnable Token in Transformer", Science China Information Sciences (SCIS, CCF-A),2023.


    [17] Xinge Peng, Kun Li*, Jiaxiu Li, Guoliang Chen, and Dan Guo*. "Multi-modality Fusion for Emotion Recognition in Videos", IJCAI (CCF-A) Challenge paper, 2023.


    [18] Kun Li, Dan Guo*, Guoliang Chen, Xinge Peng, and Meng Wang. "Joint Skeletal and Semantic Embedding Loss for Micro-gesture Classification", IJCAI (CCF-A) Challenge paper, 2023.


    [19] Jia Li, Wei Qian, Kun Li, Qi Li, Dan Guo*, and Meng Wang*. "Exploiting Diverse Feature for Multimodal Sentiment Analysis", ACM MM (CCF-A) Challenge paper, 2023.


    [20] Kun Li, Dan Guo* , Guoliang Chen, Feiyang Liu and Meng Wang. "Data Augmentation for Human Behavior Analysis in Multi-Person Conversations", ACM MM (CCF-A) Challenge paper, 2023.


    [21] Kun Li, Jiaxiu Li, Dan Guo*, Xun Yang*, and Meng Wang. "Transformer-based Visual Grounding with Cross-modality Interaction", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP, CCF-B), 2023.


    [22] Qi Li, Dan Guo*, Wei Qian,  Xilan Tian, Xiao Sun,  Haifeng Zhao, and Meng Wang*. "Channel-wise Interactive Learning for Remote Heart Rate Estimation from Facial Video", IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT, CCF-B),2023.


    [23] Jing Zhang, Dan Guo*, Xun Yang*, Peipei Song, and  Meng Wang*. "Visual-Linguistic-Stylistic Triple Reward for Cross-Lingual Image Captioning", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP , CCF-B), 2023.


    [24] Sheng Zhou, Dan Guo*, Xun Yang*, Jianfeng Dong, and Meng Wang*. "Graph Pooling Inference Network for Text-Based VQA", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP , CCF-B), 2023.


    [25] Shuaiyang Li, Dan Guo, Kang Liu, Richang Hong, and Feng Xue. "Multimodal Counterfactual Learning Network for Multimedia-based Recommendation", Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR, CCF-A), 2023.


    [26] Kang Liu, Feng Xue*, Dan Guo, Peijie Sun, Shengsheng Qian, and Richang Hong. "Multimodal Graph Contrastive Learning for Multimedia-based Recommendation", IEEE Transactions on Multimedia (IEEE TMM, CCF-B), 2023.


    [27] Wentao Ma, Xinyi Wu, Shan Zhao*, Tongqing Zhou*, Dan Guo, Lichuan Gu, Zhiping Cai, and Meng Wang. "FedSH: Towards Privacy-preserving Text-based Person Re-Identification", IEEE Transactions on Multimedia (IEEE TMM, CCF-B), 2023.


    [28] Kang Liu, Feng Xue*, Dan Guo, Le Wu, Shujie Li, and Richang Hong. "MEGCF: Multimodal Entity Graph Collaborative Filtering for Personalized Recommendation",  ACM Transactions on Information Systems (ACM TOIS, CCF-A), 2023.


    [29] Feng Xue*, Tian Yang, Kang Liu, Zikun Hong, Mingwei Cao, Dan Guo, and Richang Hong. "LCSNet: End-to-end Lipreading with Channel-aware Feature Selection", ACM Transactions on Multimedia Computing, Communications, and Applications (ACM TOMM, CCF-B), 2023.


    [30] Jinxing Zhou, Dan Guo* and Meng Wang*. "Contrastive Positive Sample Propagation along the Audio-Visual Event Line", IEEE Transactions on Pattern Analysis and Machine Intelligence(TPAMI, CCF-A, IF 24.314), 2022.


    [31] Shengeng Tang, Richang Hong*, Dan Guo*, and Meng Wang, "Gloss Semantic-Enhanced Network with Online Back-Translation for Sign Language Production", ACM International Conference on Multimedia (ACM MM ,CCF-A), 2022.


    [32] Peipei Song, Dan Guo*, Jun Cheng, and Meng Wang*, "Contextual Attention Network for Emotional Video Captioning", IEEE Transactions on Multimedia (TMM, CCF-B), 2022.


    [33] Peipei Song, Dan Guo*, Jinxing Zhou, Mingliang Xu, and Meng Wang*, "Memorial GAN with Joint Semantic Optimization for Unpaired Image Captioning", IEEE Transactions on Cybernetics (TCYB, CCF-B), 2022.


    [34] Jinxing Zhou, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo, Meng Wang*, and Yiran Zhong*, "Audio−Visual Segmentation", European Conference on Computer Vision (ECCV, CCF-B), 2022.


    [35] Tianyuan Xu, Xueliang Liu*, Zhen Huang*, Dan Guo, Richang Hong, and Meng Wang. "Early-Learning regularized Contrastive Learning for Cross-Modal Retrieval with Noisy Labels", ACM International Conference on Multimedia (ACM MM, CCF-A), 2022.


    [36] Zhao Xie, Jiansong Chen, Kewei Wu*, Dan Guo, and Richang Hong. "Global Temporal Difference Network for Action Recognition", IEEE Transactions on Multimedia (IEEE TMM, CCF-B), 2022.


    [37] Kang Liu, Feng Xue*, Xiangnan He, Dan Guo, and Richang Hong. "Joint Multi-Grained Popularity-Aware Graph Convolution Collaborative Filtering for Recommendation", IEEE Transactions on Computational Social Systems (IEEE TCSS, SCI 2), 2022.


    [38] Dan Guo, Hui Wang, and Meng Wang*, "Context-Aware Graph Inference with Knowledge Distillation for Visual Dialog", IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, CCF-A, IF 24.314), 2021.


    [39] Hui Wang, Dan Guo*, Xiansheng Hua, and Meng Wang*, "Pairwise VLAD Interaction Network for Video Question Answering", ACM International Conference on Multimedia (ACM MM, CCF-A), 2021.


    [40] Kun Li, Dan Guo*, and Meng Wang*, "Proposal-Free Video Grounding with Contextual Pyramid Network", AAAI Conference on Artificial Intelligence (AAAI, CCF-A), 2021.


    [41] Shengeng Tang, Dan Guo*, Richang Hong*, and Meng Wang, "Graph-Based Multimodal Sequential Embedding for Sign Language Translation", IEEE Transactions on Multimedia (TMM, CCF-B), 2021.


    [42] Dan Guo, Hui Wang, Shuhui Wang, and Meng Wang*, "Textual-Visual Reference-Aware Attention Network for Visual Dialog", IEEE Transactions on Image Processing (TIP, CCF-A), 2020.


    [43] Dan Guo, Wengang Zhou*, Anyang Li, Houqiang Li, and Meng Wang*, "Hierarchical Recurrent Deep Fusion Using Adaptive Clip Summarization for Sign Language Translation", IEEE Transactions on Image Processing (TIP, CCF-A), 2020.


    [44] Dan Guo, Hui Wang*, Hanwang Zhang, Zhengjun Zha, and Meng Wang*, "Iterative Context-Aware Graph Inference for Visual Dialog", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A, oral paper, Top 5%), 2020. 


    [45] Dan Guo, Yang Wang*, Peipei Song*, and Meng Wang, "Recurrent Relational Memory Network for Unsupervised Image Captioning", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A , top 12.6%), 2020. 


    [46] Dan Guo, Kun Li*, and Meng Wang, "DADNet:Dilated-Attention-Deformable ConvNet for Crowd Counting", ACM International Conference on Multimedia (ACM MM, CCF-A, oral paper, Top 9.8%), 2019.


    [47] Dan Guo, Shengeng Tang,and Meng Wang, "Connectionist Temporal Modeling of Video and Language:A Joint Model for Translation and Sign Labeling", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A), 2019.


    [48] Dan Guo, Shuo Wang, Qi Tian, and Meng Wang, "Dense Temporal Convolution Network for Sign Language Translation", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A), 2019.


    [49] Dan Guo, Hui Wang, and Meng Wang, "Dual Visual Attention Network for Visual Dialog", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A), 2019.


    [50] Shuo Wang, Dan Guo*, Xin Xu, Li Zhuo, and Meng Wang, "Cross-Modality Retrieval by Joint Correlation Learning", ACM Transactions on Multimedia Computing Communications and Applications (ACM TOMCCAP , CCF-B), 2019.


    [51] Shuo Wang, Dan Guo*, Wengang Zhou, Zhengjun Zha, and Meng Wang, "Connectionist Temporal Fusion for Sign Language Translation", International ACM International Conference on Multimedia (ACM MM, CCF-A), 2018.


    [52] Dan Guo, Wengang Zhou, Houqiang Li, and Meng Wang, "Hierarchical LSTM for Sign Language Translation", AAAI Conference on Artificial Intelligence (AAAI, CCF-A, oral paper, Top 5%), 2018.


    [53] Dan Guo, Wengang Zhou*, Houqiang Li*, and Meng Wang*, "Online Early-Late Fusion Based on Adaptive HMM for Sign Language Recognition", ACM Transactions on Multimedia Computing Communications and Applications (ACM TOMCCAP , CCF-B Journal), 2018.


    [54] Dan Guo, Shentao Yao, Hui Wang, and Meng Wang. "Embedding VLAD in Transformer for Video Question Answering". Cinese Journal of Computers (CCF-A Chinese Journal), 2023. 


    [55] Zhihong Lu, Dan Guo*, and Meng Wang, "Motion-compensated Frame Interpolation Based on Weighted Motion Estimation and Vector Segmentation", Acta Automatica Sinica,(CCF-A Chinese Journal), 2015. 


Patents

    [1] Dan Guo; Ziyi He; Youwei Ni; Kun Li; Zixin Xu; Jiaqi Ma; Kuang Luo; A dishwashing device based on object detection (utility model), May 12, 2023, China, ZL202220873705.4.

    [2] Dan Guo; Shengeng Tang; Xianglong Liu; Richang Hong; Meng Wang; A multimodal fusion sign language recognition system and method based on graph convolution, March 14, 2023, China, ZL202010049714.7.

    [3] Dan Guo; Shengeng Tang; Xianglong Liu; Meng Wang; A sign language translation system and method based on multi-level semantic parsing, March 28, 2023, China, ZL202010103960.6.

    [4] Ye Zhao; Xiaobin Hu; Zhenzhen Hu; Xueliang Liu; Dan Guo; Yanrong Guo; Le Wu; A method and device for generating video summary descriptions based on attention models, December 9, 2022, China, ZL202110565400.7.

    [5] Dan Guo; Peipei Song; Xianglong Liu; Meng Wang; A method for generating an unsupervised image description model based on recursive memory networks, March 15, 2022, China, ZL202010049142.2.

    [6] Dan Guo; Peipei Song; Xianglong Liu; Meng Wang; A method for sign language translation based on data-driven multi-level feature dynamic fusion, March 15, 2022, China, ZL202010096391.7.

    [7] Dan Guo; Hui Wang; Meng Wang; A method for visual dialogue generation based on context-aware graph neural networks, June 8, 2021, China, ZL201910881298.4.

    [8] Dan Guo; Kun Li; Meng Wang; A crowd density estimation method based on multi-scale attention mechanism, March 9, 2021, China, ZL201910531606.0.

    [9] Dan Guo; Peipei Song; Ye Zhao; Meng Wang; A multi-feature fusion sign language recognition method based on adaptive hidden Markov models, July 10, 2020, China, ZL201811131806.9.

    [10] Dan Guo; Meng Wang; Wengang Zhou; Houqiang Li; Chuanqing LI; Anyang Li; An Asymmetric Multilayer LSTM-Based Approach for Automatic Translation of Continuous Sign Language Videos, 2020-2-11, China, ZL201810027551.5.

    [11] Dan Guo; Shuo Wang; Meng Wang; A Sign Language Video Translation Method Based on the Fusion of Temporal Domain Convolutional Networks and Recurrent Neural Networks, 2019-10-18, China, ZL201811070290.1.

    [12] Meng Wang; Luming Zhang; Dan Guo; A fast recognition system and a fast recognition method for aerial images based on multi-task topology learning, 2018-2-6, China, ZL201510080478.4.

    [13] Meng Wang; Luming Zhang; Dan Guo; Xuting Tian; A viewpoint tracking method based on geometric reconstruction and semantic fusion, 2017-10-3, China, ZL201410733763.7.

    [14] Dan Guo; Xuegang Hu; Wu Ni; Xindong Wu; A road network evacuation planning method based on maximum flow rate path prioritization, 2017-6-6, China, ZL201510451828.3.

    [15] Meng Wang; Xun Yang; Richang Hong; Dan Guo; Yiqun Liu; Maosong Sun; An image retrieval method based on semantic mapping space construction, 2017-5-17, China, ZL201410393094.3.

    [16] Meng Wang; Richang Hong; Bingnan Li; Yiqun Liu; Dan Guo; Xueliang Liu; Xindong Wu; Xun Yang; Retrieval reordering method based on continuous number labeled subspace learning, 2017-2-22, China, ZL201410196946.X.

    [17] Meng Wang; Luming Zhang; Dan Guo; Yiqun Liu; Maosong Sun; Zhihong Lu; 3D scene reconstruction method based on GPS information video, 2017-2-22, China, ZL201410752454.4.

    [18] Shengeng Tang; Tonghuan Xiao; Dan Guo; Jihao Gu; Chenxi Cao; Wanqiang Song; Bin Huang; A collision warning method based on image object detection and visual depth estimation, 2023-2-27, China, CN202310188292.5. (Practical review)

    [19] Shengeng Tang; Wanqiang Song; Dan Guo; Jihao Gu; Tonghuan Xiao; Chenxi Cao; A route planning method for visually impaired people based on weighted undirected graph, 2023-3-6, China, CN202310228006.3. (Practical review)

    [20] Zihang Xu; Yangjun Huang; Changlin Chen; Yi He; Murou Li; Zan Huang; Dan Guo; A domain-adaptive image classification method based on regularized joint autonomous training, 2023-4-20, China. CN202310150489.X.



Published Books

    English monograph

    [1] Multimedia for Accessible Human Computer Interfaces. Springer. 2021.

    [2] Pattern Matching with Wildcards and Length Constraint. Science Press. 2016. 


    Computer software copyright

    [1] Xiyi Long; Ruyue Jin; Jinjun Yi; Peipei Song; Dan Guo; Real-time multi-modal fake news detection system in multiple fields V1.0, 2023R11L1048667, original acquisition, all rights, 2023-11-15.

    [2] Shengeng Tang; Xueyu Xiu; Dan Guo; Xiaohu Dong; Jun Yao; Weihao Xie; Cross-language sign language translation system V1.0, 2023SR1107827, original acquisition, all rights, 2023-09-20.

    [3] Shengeng Tang; Bin Huang; Dan Guo; Jihao Gu; Blind obstacle avoidance travel assistance system V1.0, 2023SR0517944, original acquisition, all rights, 2023-05-05.

    [4] Dan Guo; Shengeng Tang; Yinnan Chen; ZiLong Wu; Zehan Wen; Zekuan Liu; Human posture cartoonization system based on key point estimation V1.0, 2022SR0771364, original acquisition, all rights reserved, 2022-06-16.

    [5] Zhihong Lu; Dan Guo; Jingwei Wu; Fei Liu; Lijin Zhang; Xuting Tian; Video HD playback software based on motion compensation V1.0, 2014SR098634, original acquisition, all rights reserved, 2014-07-16.



Grants and Awards

    Grants

    · Top-notch young talents for young scholars of High-end Talent Cultivation Action Program of Anhui Province, China, 2023-09.

    · Outstanding Reviewer Award of IEEE International Conference on Multimedia and Expo (IEEE ICME), 2020-07.

    · Outstanding Reviewer Award of Computer Science journal, 2021-12.


    Competitions

    · International Joint Conference on Artificial Intelligence (IJCAI) Challenge on Micro-gesture Analysis for Hidden Emotion Understanding, 1st Place in Micro-gesture Classification Track🏆, 2023.05.

    · International Joint Conference on Artificial Intelligence (IJCAI) Challenge on Micro-gesture Analysis for Hidden Emotion Understanding, 2nd Place in Micro-gesture Online Recognition Track, 2023.05.

    · ACM International Conference on Multimedia (ACM MM) Multi-modal Group Behaviour Analysis for Artificial Mediation, 1st Place in Bodily Behaviour Recognition Track🏆, 2023.07.

    · ACM International Conference on Multimedia (ACM MM) Multi-modal Group Behaviour Analysis for Artificial Mediation, 1st Place in Eye Contact Detection Track🏆, 2023.07.

    · ACM International Conference on Multimedia (ACM MM) Multi-modal Group Behaviour Analysis for Artificial Mediation, 3rd in Next Speaker Prediction Track, 2023.07.

    · ACM International Conference on Multimedia (ACM MM) Multi-modal Sentiment Analysis Challenge, 3rd in MuSe-Personalisation Track, 2023.07.


others

    No content