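郭丹 (Dan Guo), Professor

Doctoral supervisor, Master's supervisor

Affiliation: Department of Intelligence Science and Technology

Position: Professor

Education: Postgraduate (doctoral) graduate

Office: 9th floor, Block A, Science and Education Building

Gender: Female

Degree: Doctorate

Employment status: Currently employed

Alma mater: Huazhong University of Science and Technology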

   

Research Areas

    Main research directions are machine vision, machine learning, deep learning, and pattern recognition, including:

    · Cross-modal Understanding and Reasoning

    · Audio-Visual Event Understanding and Parsing

    · Vision and natural language (Image/Video Captioning and Explanation)

    · Temporal video detection (Temporal Action Detection / Video Grounding)

    · Vision-based Sign Language Recognition and Translation

    · Vision-based Physiological Measurement

    Featured research

    · Visual affective computing

    · Visual sign language machine translation

    · Video semantic parsing and localization

    · Visual chatbots

Publications

    Selected publications (mainly CCF-A conferences/journals and IEEE/ACM Transactions journals):

    [1] Fei Wang, Dan Guo*, Kun Li, Zhun Zhong, Meng Wang*. "Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Architecture", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A conference), 2024.

    [2] Chunxiao Fan, Ziqi Wang, Dan Guo*, Meng Wang. "Data-Free Quantization via Pseudo-label Filtering", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A conference), 2024.

    [3] Fei Wang, Dan Guo*, Kun Li, Meng Wang*. "EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer", AAAI Conference on Artificial Intelligence (AAAI, CCF-A conference), 2024.

    [4] Zhangbin Li, Dan Guo*, Jinxing Zhou*, Jing Zhang, Meng Wang. "Object-aware Adaptive-Positivity Learning for Audio-Visual Question Answering", AAAI Conference on Artificial Intelligence (AAAI, CCF-A conference), 2024.

    [5] Zhao Xie, Yadong Shi, Kewei Wu, Yaru Cheng, Dan Guo*. "Towards Understanding Future: Consistency Guided Probabilistic Modeling for Action Anticipation", AAAI Conference on Artificial Intelligence (AAAI, CCF-A conference), 2024.

    [6] Liu Liu, Anran Huang, Qi Wu, Dan Guo*, Xun Yang, Meng Wang. "KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking", AAAI Conference on Artificial Intelligence (AAAI, CCF-A conference), 2024.

    [7] Xinyi Wu, Wentao Ma, Dan Guo, Tongqing Zhou, Shan Zhao, Zhiping Cai. "Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning", AAAI Conference on Artificial Intelligence (AAAI, CCF-A conference), 2024.

    [8] Peipei Song, Dan Guo*, Xun Yang, Shengeng Tang, and Meng Wang. "Emotional Video Captioning with Vision-based Emotion Interpretation Network", IEEE Transactions on Image Processing (IEEE TIP, CCF-A journal), 2024.

    [9] Zhao Xie, Chang Jiao, Kewei Wu*, Dan Guo*, and Richang Hong. "Active Factor Graph Network for Group Activity Recognition", IEEE Transactions on Image Processing (IEEE TIP, CCF-A journal), 2024.

    [10] Dan Guo, Kun Li*, Bin Hu, Yan Zhang, Meng Wang*. "Benchmarking Micro-action Recognition: Dataset, Methods, and Applications", IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT, CCF-B journal), 2024.

    [11] Xin Liu, Biao Qian, Haipeng Liu*, Dan Guo, Yang Wang, Meng Wang*. "Seeking False Hard Negatives for Graph Contrastive Learning", IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT, CCF-B journal), 2024.

    [12] Kewei Wu, Wenjie Luo, Zhao Xie, Dan Guo, Zhao Zhang, and Richang Hong. "Ensemble Prototype Network For Weakly-Supervised Temporal Action Localization", IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS, CCF-B journal), 2024.

    [13] Wei Qian, Dan Guo*, Kun Li, Xiaowei Zhang, Xilan Tian, Xun Yang, Meng Wang*. "Dual-path TokenLearner for Remote Photoplethysmography-based Physiological Measurement with Facial Videos", IEEE Transactions on Computational Social Systems (IEEE TCSS, SCI Zone 2), 2024.

    [14] Peipei Song, Dan Guo*, Xun Yang, Shengeng Tang, Erkun Yang, and Meng Wang*. "Emotion-Prior Awareness Network for Emotional Video Captioning", ACM International Conference on Multimedia (ACM MM, CCF-A conference, oral paper, top 5.4%), 2023.

    [15] Sheng Zhou, Dan Guo*, Jia Li, Xun Yang*, and Meng Wang. "Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA", IEEE Transactions on Image Processing (TIP, CCF-A journal), 2023.

    [16] Kun Li, Dan Guo, and Meng Wang*. "ViGT: Proposal-free Video Grounding with Learnable Token in Transformer", Science China Information Sciences (SCIS, CCF-A journal), 2023.

    [17] Xinge Peng, Kun Li*, Jiaxiu Li, Guoliang Chen, and Dan Guo*. "Multi-modality Fusion for Emotion Recognition in Videos", IJCAI (CCF-A conference) Challenge paper, 2023.

    [18] Kun Li, Dan Guo*, Guoliang Chen, Xinge Peng, and Meng Wang. "Joint Skeletal and Semantic Embedding Loss for Micro-gesture Classification", IJCAI (CCF-A conference) Challenge paper, 2023.

    [19] Jia Li, Wei Qian, Kun Li, Qi Li, Dan Guo*, and Meng Wang*. "Exploiting Diverse Feature for Multimodal Sentiment Analysis", ACM MM (CCF-A conference) Challenge paper, 2023.

    [20] Kun Li, Dan Guo, Guoliang Chen, Feiyang Liu, and Meng Wang. "Data Augmentation for Human Behavior Analysis in Multi-Person Conversations", ACM MM (CCF-A conference) Challenge paper, 2023.

    [21] Kun Li, Jiaxiu Li, Dan Guo*, Xun Yang*, and Meng Wang. "Transformer-based Visual Grounding with Cross-modality Interaction", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP, CCF-B journal), 2023.

    [22] Qi Li, Dan Guo*, Wei Qian, Xilan Tian, Xiao Sun, Haifeng Zhao, and Meng Wang*. "Channel-wise Interactive Learning for Remote Heart Rate Estimation from Facial Video", IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT, CCF-B journal), 2023.

    [23] Jing Zhang, Dan Guo*, Xun Yang*, Peipei Song, and Meng Wang*. "Visual-Linguistic-Stylistic Triple Reward for Cross-Lingual Image Captioning", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP, CCF-B journal), 2023.

    [24] Sheng Zhou, Dan Guo*, Xun Yang*, Jianfeng Dong, and Meng Wang*. "Graph Pooling Inference Network for Text-Based VQA", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP, CCF-B journal), 2023.

    [25] Shuaiyang Li, Dan Guo, Kang Liu, Richang Hong, and Feng Xue. "Multimodal Counterfactual Learning Network for Multimedia-based Recommendation", Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR, CCF-A conference), 2023.

    [26] Kang Liu, Feng Xue*, Dan Guo, Peijie Sun, Shengsheng Qian, and Richang Hong. "Multimodal Graph Contrastive Learning for Multimedia-based Recommendation", IEEE Transactions on Multimedia (IEEE TMM, CCF-B journal), 2023.

    [27] Wentao Ma, Xinyi Wu, Shan Zhao*, Tongqing Zhou*, Dan Guo, Lichuan Gu, Zhiping Cai, and Meng Wang. "FedSH: Towards Privacy-preserving Text-based Person Re-Identification", IEEE Transactions on Multimedia (IEEE TMM, CCF-B journal), 2023.

    [28] Kang Liu, Feng Xue*, Dan Guo, Le Wu, Shujie Li, and Richang Hong. "MEGCF: Multimodal Entity Graph Collaborative Filtering for Personalized Recommendation", ACM Transactions on Information Systems (ACM TOIS, CCF-A journal), 2023.

    [29] Feng Xue*, Tian Yang, Kang Liu, Zikun Hong, Mingwei Cao, Dan Guo, and Richang Hong. "LCSNet: End-to-end Lipreading with Channel-aware Feature Selection", ACM Transactions on Multimedia Computing, Communications, and Applications (ACM TOMM, CCF-B journal), 2023.

    [30] Jinxing Zhou, Dan Guo*, and Meng Wang*. "Contrastive Positive Sample Propagation along the Audio-Visual Event Line", IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, CCF-A journal, IF 24.314), 2022.

    [31] Shengeng Tang, Richang Hong*, Dan Guo*, and Meng Wang. "Gloss Semantic-Enhanced Network with Online Back-Translation for Sign Language Production", ACM International Conference on Multimedia (ACM MM, CCF-A conference), 2022.

    [32] Peipei Song, Dan Guo*, Jun Cheng, and Meng Wang*. "Contextual Attention Network for Emotional Video Captioning", IEEE Transactions on Multimedia (TMM, CCF-B journal), 2022.

    [33] Peipei Song, Dan Guo*, Jinxing Zhou, Mingliang Xu, and Meng Wang*. "Memorial GAN with Joint Semantic Optimization for Unpaired Image Captioning", IEEE Transactions on Cybernetics (TCYB, CCF-B journal), 2022.

    [34] Jinxing Zhou, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo, Meng Wang*, and Yiran Zhong*. "Audio-Visual Segmentation", European Conference on Computer Vision (ECCV, CCF-B conference), 2022.

    [35] Tianyuan Xu, Xueliang Liu*, Zhen Huang*, Dan Guo, Richang Hong, and Meng Wang. "Early-Learning regularized Contrastive Learning for Cross-Modal Retrieval with Noisy Labels", ACM International Conference on Multimedia (ACM MM, CCF-A conference), 2022.

    [36] Zhao Xie, Jiansong Chen, Kewei Wu*, Dan Guo, and Richang Hong. "Global Temporal Difference Network for Action Recognition", IEEE Transactions on Multimedia (IEEE TMM, CCF-B journal), 2022.

    [37] Kang Liu, Feng Xue*, Xiangnan He, Dan Guo, and Richang Hong. "Joint Multi-Grained Popularity-Aware Graph Convolution Collaborative Filtering for Recommendation", IEEE Transactions on Computational Social Systems (IEEE TCSS, SCI Zone 2), 2022.

    [38] Dan Guo, Hui Wang, and Meng Wang*. "Context-Aware Graph Inference with Knowledge Distillation for Visual Dialog", IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, CCF-A journal, IF 24.314), 2021.

    [39] Hui Wang, Dan Guo*, Xiansheng Hua, and Meng Wang*. "Pairwise VLAD Interaction Network for Video Question Answering", ACM International Conference on Multimedia (ACM MM, CCF-A conference), 2021.

    [40] Kun Li, Dan Guo*, and Meng Wang*. "Proposal-Free Video Grounding with Contextual Pyramid Network", AAAI Conference on Artificial Intelligence (AAAI, CCF-A conference), 2021.

    [41] Shengeng Tang, Dan Guo*, Richang Hong*, and Meng Wang. "Graph-Based Multimodal Sequential Embedding for Sign Language Translation", IEEE Transactions on Multimedia (TMM, CCF-B journal), 2021.

    [42] Dan Guo, Hui Wang, Shuhui Wang, and Meng Wang*. "Textual-Visual Reference-Aware Attention Network for Visual Dialog", IEEE Transactions on Image Processing (TIP, CCF-A journal), 2020.

    [43] Dan Guo, Wengang Zhou*, Anyang Li, Houqiang Li, and Meng Wang*. "Hierarchical Recurrent Deep Fusion Using Adaptive Clip Summarization for Sign Language Translation", IEEE Transactions on Image Processing (TIP, CCF-A journal), 2020.

    [44] Dan Guo, Hui Wang*, Hanwang Zhang, Zhengjun Zha, and Meng Wang*. "Iterative Context-Aware Graph Inference for Visual Dialog", Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A conference, oral paper, top 5%), 2020.

    [45] Dan Guo, Yang Wang*, Peipei Song*, and Meng Wang. "Recurrent Relational Memory Network for Unsupervised Image Captioning", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A conference, acceptance rate 12.6%), 2020.

    [46] Dan Guo, Kun Li*, and Meng Wang. "DADNet: Dilated-Attention-Deformable ConvNet for Crowd Counting", ACM International Conference on Multimedia (ACM MM, CCF-A conference, oral paper, top 9.8%), 2019.

    [47] Dan Guo, Shengeng Tang, and Meng Wang. "Connectionist Temporal Modeling of Video and Language: A Joint Model for Translation and Sign Labeling", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A conference), 2019.

    [48] Dan Guo, Shuo Wang, Qi Tian, and Meng Wang. "Dense Temporal Convolution Network for Sign Language Translation", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A conference), 2019.

    [49] Dan Guo, Hui Wang, and Meng Wang. "Dual Visual Attention Network for Visual Dialog", International Joint Conference on Artificial Intelligence (IJCAI, CCF-A conference), 2019.

    [50] Shuo Wang, Dan Guo*, Xin Xu, Li Zhuo, and Meng Wang. "Cross-Modality Retrieval by Joint Correlation Learning", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP, CCF-B journal), 2019.

    [51] Shuo Wang, Dan Guo*, Wengang Zhou, Zhengjun Zha, and Meng Wang. "Connectionist Temporal Fusion for Sign Language Translation", ACM International Conference on Multimedia (ACM MM, CCF-A conference), 2018.

    [52] Dan Guo, Wengang Zhou, Houqiang Li, and Meng Wang. "Hierarchical LSTM for Sign Language Translation", AAAI Conference on Artificial Intelligence (AAAI, CCF-A conference, oral paper, top 5%), 2018.

    [53] Dan Guo, Wengang Zhou*, Houqiang Li*, and Meng Wang*. "Online Early-Late Fusion Based on Adaptive HMM for Sign Language Recognition", ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP, CCF-B journal), 2018.

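    [54] 郭丹, 姚沈涛, 王辉, 汪萌. "Video Question Answering Transformer Model with Embedded Local Clustering Descriptors" (in Chinese) [J]. Chinese Journal of Computers (计算机学报, CCF-A Chinese-language journal), 2023.

    [55] 鲁志红, 郭丹*, 汪萌. "Motion-Compensated Interpolation Algorithm Based on Weighted Motion Estimation and Vector Segmentation" (in Chinese) [J]. Acta Automatica Sinica (自动化学报, CCF-A Chinese-language journal), 2015.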




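Patents

    [1] 郭丹; 何梓贻; 倪友炜; 李坤; 徐梓鑫; 马嘉淇; 罗匡; A dish-washing device based on object detection (utility model), 2023-5-12, China, ZL202220873705.4.

    [2] 郭丹; 唐申庚; 刘祥龙; 洪日昌; 汪萌; A multimodal fusion sign language recognition system and method based on graph convolution, 2023-3-14, China, ZL202010049714.7.

    [3] 郭丹; 唐申庚; 刘祥龙; 汪萌; A sign language translation system and method based on multi-level semantic parsing, 2023-3-28, China, ZL202010103960.6.

    [4] 赵烨; 胡晓斌; 胡珍珍; 刘学亮; 郭丹; 郭艳蓉; 吴乐; A method and device for generating video summary descriptions based on an attention model, 2022-12-9, China, ZL202110565400.7.

    [5] 郭丹; 宋培培; 刘祥龙; 汪萌; A method for generating an unsupervised image captioning model based on a recurrent memory network, 2022-3-15, China, ZL202010049142.2.

    [6] 郭丹; 宋培培; 刘祥龙; 汪萌; A data-self-driven sign language translation method with dynamic fusion of multi-order features, 2022-3-15, China, ZL202010096391.7.

    [7] 郭丹; 王辉; 汪萌; A visual dialog generation method based on a context-aware graph neural network, 2021-6-8, China, ZL201910881298.4.

    [8] 郭丹; 李坤; 汪萌; A crowd density estimation method based on a multi-scale attention mechanism, 2021-3-9, China, ZL201910531606.0.

    [9] 郭丹; 宋培培; 赵烨; 汪萌; A multi-feature fusion sign language recognition method based on an adaptive hidden Markov model, 2020-07-10, China, ZL201811131806.9.

    [10] 郭丹; 汪萌; 周文罡; 李厚强; 李传青; 李安阳; An automatic translation method for continuous sign language video based on asymmetric multi-layer LSTM, 2020-2-11, China, ZL201810027551.5.

    [11] 郭丹; 王硕; 汪萌; A sign language video translation method based on the fusion of temporal convolutional networks and recurrent neural networks, 2019-10-18, China, ZL201811070290.1.

    [12] 汪萌; 张鹿鸣; 郭丹; A fast aerial image recognition system based on multi-task topology learning and its fast recognition method, 2018-2-6, China, ZL201510080478.4.

    [13] 汪萌; 张鹿鸣; 郭丹; 田绪婷; A viewpoint tracking method based on geometric reconstruction and semantic fusion, 2017-10-3, China, ZL201410733763.7.

    [14] 郭丹; 胡学钢; 倪武; 吴信东; A road network evacuation planning method based on maximum-flow-rate path priority, 2017-6-6, China, ZL201510451828.3.

    [15] 汪萌; 杨勋; 洪日昌; 郭丹; 刘奕群; 孙茂松; An image retrieval method based on semantic mapping space construction, 2017-5-17, China, ZL201410393094.3.

    [16] 汪萌; 洪日昌; 李炳南; 刘奕群; 郭丹; 刘学亮; 吴信东; 杨勋; A retrieval re-ranking method based on continuous-valued label subspace learning, 2017-2-22, China, ZL201410196946.X.

    [17] 汪萌; 张鹿鸣; 郭丹; 刘奕群; 孙茂松; 鲁志红; A 3D scene reconstruction method based on video with GPS information, 2017-2-22, China, ZL201410752454.4.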

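    [18] 唐申庚; 肖同欢; 郭丹; 谷纪豪; 曹晨曦; 宋万强; 黄滨; A collision warning method based on image object detection and visual depth estimation, 2023-2-27, China, CN202310188292.5 (under substantive examination).

    [19] 唐申庚; 宋万强; 郭丹; 黄滨; 谷纪豪; 肖同欢; 曹晨曦; A route planning method for visually impaired people based on a weighted undirected graph, 2023-3-6, China, CN202310228006.3 (under substantive examination).

    [20] 徐子航; 黄扬竣; 陈昌林; 贺意; 李沐柔; 黄赞; 郭丹; A domain-adaptive image classification method based on regularized joint self-training, 2023-4-20, China, CN202310150489.X.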




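Published Works

    English monographs (2)

    [1] Multimedia for Accessible Human Computer Interfaces. Springer. 2021.

    [2] Pattern Matching with Wildcards and Length Constraint. Science Press (科学出版社). 2016.


    Software copyrights (5)

    [1] 龙馨仪; 靳如月; 易锦均; 宋培培; 郭丹; Real-time multimodal fake news detection system for multiple domains V1.0, 2023R11L1048667, original acquisition, all rights, 2023-11-15.

    [2] 唐申庚; 修雪玉; 郭丹; 董晓虎; 姚骏; 谢伟豪; Cross-lingual sign language translation system V1.0, 2023SR1107827, original acquisition, all rights, 2023-09-20.

    [3] 唐申庚; 黄滨; 郭丹; 谷纪豪; Obstacle-avoidance travel assistance system for the blind V1.0, 2023SR0517944, original acquisition, all rights, 2023-05-05.

    [4] 郭丹; 唐申庚; 陈颖男; 武梓龙; 文则涵; 刘泽宽; Human pose cartoonization system based on keypoint estimation V1.0, 2022SR0771364, original acquisition, all rights, 2022-06-16.

    [5] 鲁志红; 郭丹; 吴经纬; 刘菲; 张立缙; 田旭婷; Video high-definition playback software based on motion compensation V1.0, 2014SR098634, original acquisition, all rights, 2014-07-16.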

Awards

    1. IJCAI Challenge on Micro-gesture Analysis for Hidden Emotion Understanding, 1st Place in Micro-gesture Classification Track. 🏆 (May 2023)

    2. IJCAI Challenge on Micro-gesture Analysis for Hidden Emotion Understanding, 2nd Place in Micro-gesture Online Recognition Track. (May 2023)

    3. ACM MM Multi-modal Group Behaviour Analysis for Artificial Mediation, 1st Place in Bodily Behaviour Recognition Track. 🏆 (July 2023)

    4. ACM MM Multi-modal Group Behaviour Analysis for Artificial Mediation, 1st Place in Eye Contact Detection Track. 🏆 (July 2023)

    5. ACM MM Multi-modal Group Behaviour Analysis for Artificial Mediation, 3rd Place in Next Speaker Prediction Track. (July 2023)

    6. ACM MM Multi-modal Sentiment Analysis Challenge, 3rd Place in MuSe-Personalisation Track. (July 2023)

    7. IEEE International Conference on Multimedia and Expo (IEEE ICME, a flagship international conference on multimedia), Outstanding Reviewer Award.

Research Projects
