Supervisor of Master's Candidates
School/Department: School of Computer Science and Information Engineering
Administrative Position: Lecturer
Education Level: With Certificate of Graduation for Doctorate Study
Business Address: A904, Science and Education Building, Feicui Lake Campus, Hefei University of Technology
Alma Mater: Hefei University of Technology
Discipline: Computer Applications Technology
Scientific Research
Research Field
Tang focuses on sign language video translation and generation, an area that draws on computer science, artificial intelligence, machine learning, and linguistics. The main research directions include sign language video data processing and recognition, conversion between sign language videos and spoken language, and the generation of natural, smooth sign language videos.
Sign language video translation and generation aims to use artificial intelligence to improve communication for the deaf community. The field covers both directions: converting spoken language into sign language videos, and converting sign language videos into spoken language. On the translation side, the goal is to use computer algorithms and machine learning techniques to automatically recognize gestures and facial expressions in sign language videos and convert them into spoken language or other forms of text output. On the generation side, the field works to produce natural, fluent sign language videos so that the deaf community can more easily understand spoken-language content.
In this field, researchers can explore how to design effective algorithms for processing sign language video data, how to train neural networks to recognize sign language automatically, and how to leverage natural language processing techniques to generate natural and fluent sign language videos. They can also study the language and culture of the deaf community to better understand its needs and provide better services.
Paper Publications
- Dan Guo, Shengeng Tang, and Meng Wang, "Connectionist Temporal Modeling of Video and Language: A Joint Model for Translation and Sign Labeling", International Joint Conference on Artificial Intelligence (IJCAI), 2019: 751-757.
- Shengeng Tang, Dan Guo*, Richang Hong*, and Meng Wang, "Graph-Based Multimodal Sequential Embedding for Sign Language Translation", IEEE Transactions on Multimedia (TMM), 2022, 24: 4433-4445.
- Shengeng Tang, Richang Hong*, Dan Guo*, and Meng Wang, "Gloss Semantic-Enhanced Network with Online Back-Translation for Sign Language Production", ACM International Conference on Multimedia (ACM MM), 2022: 5630-5638.
- Peipei Song, Dan Guo*, Xun Yang*, Shengeng Tang, Erkun Yang, and Meng Wang*, "Emotion-Prior Awareness Network for Emotional Video Captioning", ACM International Conference on Multimedia (ACM MM), 2023: 589-600.
- Shengeng Tang, Feng Xue, Jingjing Wu, Shuo Wang, and Richang Hong, "Gloss-driven Conditional Diffusion Models for Sign Language Production", ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2024.
Patents
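- Dan Guo, Shengeng Tang, 刘祥龙, hrc, wm, "A Multimodal Fusion Sign Language Recognition System and Method Based on Graph Convolution", Invention, ZL202010049714.7, 2023/03/14
- Dan Guo, Shengeng Tang, 刘祥龙, wm, "A Sign Language Translation System and Method Based on Multi-Level Semantic Parsing", Invention, ZL202010103960.6, 2023/03/28
- Dan Guo, 谷纪豪, Shengeng Tang, 肖同欢, 曹晨曦, 宋万强, "An Outdoor Assistance Method for the Visually Impaired Based on Deep Intelligent Interaction", Invention, ZL202210371804.7, 2024/02/20
- Dan Guo, 曹晨曦, 肖同欢, Shengeng Tang, 谷纪豪, 黄滨, "A Selective Direction-Deviation Early-Warning System and Method Based on Semantic Segmentation", Invention, ZL202210374860.6, 2024/02/27
- Dan Guo, 刘泽宽, 郭义臣, Shengeng Tang, 武梓龙, 文则涵, 陈颖男, "A WiFi Sign Language Translation System and Method Based on Deep Learning", Invention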