Intrduction
Visual Understanding Team targets on understanding, generating, and transforming multimedia content via computer vision and natural language processing techniques. We are working on sign language translation, image/video captioning, visual dialogue, video grouding and VQA. We have published 30+ journal articles and conference papers, including IEEE TPAMI, IEEE TIP, IEEE TMM, ACM TOMCCAP, CVPR, AAAI, IJCAI, ACM MM, etc.
Student
Status | Name | Contact | Research Interest |
Ph.D student | Kun Li | kunli.hfut@gmail.com | Visual Grouding, Crowd Counting |
Ph.D student | Hui Wang | wanghui.hfut@gmail.com | Visual Dialogue, Video Question Answering |
Ph.D student | Jinxing Zhou | zhoujxhfut@gmail.com | Audio-Visual Event Localization |
Ph.D student | Qi Li | liqi_cs@stu.ahu.edu.cn | Remote Physiological Measurement |
Ph.D student | Jing Zhang | hfutzhangjing@gmail.com | Image Captioning |
Ph.D student | Sheng Zhou | hzgn97@gmail.com | Text Visual Question Answering |
Ph.D student | Wei Qian | qianwei.hfut@gmail.com | Remote Physiological Measurement |
Ph.D student | Xing Yi | -- | -- |
Master student | Fei Wang | -- | Video Magnification |
Master student | Guoliang Chen | -- | Emotion-Action Recognition |
Master student | Zhangbin Li | -- | Audio-Visual Event Localization |
Master student | Jingjing Hu | -- | Temporal Video Grounding |
Master student | Feiyang Liu | -- | Gaze Following(Gaze Target Detection) |
Master student | Jiahui Sun | -- | Video Captioning |