刘浏  (副教授)

硕士生导师

出生日期:1993-09-21

入职时间:2022-09-08

所在单位:计算机科学与技术系

学历:研究生(博士后)

性别:男

学位:博士学位

在职信息:在职

毕业院校:中国科学技术大学

   
当前位置: 中文主页 >> 科学研究 >> 论文成果

Toward Real-World Category-Level Articulation Pose Estimation

点击次数:

影响因子:11.04

DOI码:10.1109/TIP.2021.3138644

发表刊物:IEEE Transactions on Image Processing

关键字:Articulated Object, Category-level 6D Pose Estimation

摘要:Human life is populated with articulated objects. Current Category-level Articulation Pose Estimation (CAPE) methods are studied under the single-instance setting with a fixed kinematic structure for each category. Considering these limitations, we aim to study the problem of estimating part-level 6D pose for multiple articulated objects with unknown kinematic structures in a single RGB-D image, and reform this problem setting for real-world environments and suggest a CAPE-Real (CAPER) task setting. This setting allows varied kinematic structures within a semantic category, and multiple instances to co-exist in an observation of real world. To support this task, we build an articulated model repository ReArt-48 and present an efficient dataset generation pipeline, which contains Fast Articulated Object Modeling (FAOM) and Semi-Authentic MixEd Reality Technique (SAMERT). Accompanying the pipeline, we build a large-scale mixed reality dataset ReArtMix and a real world dataset ReArtVal. Accompanying the CAPER problem and the dataset, we propose an effective framework that exploits RGB-D input to estimate part-level pose for multiple instances in a single forward pass. In our method, we introduce object detection from RGB-D input to handle the multi-instance problem and segment each instance into several parts. To address the unknown kinematic structure issue, we propose an Articulation Parsing Network to analyze the structure of detected instance, and also build a Pair Articulation Pose Estimation module to estimate per-part 6D pose as well as joint property from connected part pairs. Extensive experiments demonstrate that the proposed method can achieve good performance on CAPER, CAPE and instance-level Robot Arm pose estimation problems. We believe it could serve as a strong baseline for future research on the CAPER task. The datasets and codes in our work will be made publicly available.

合写作者:Haoyuan Fu,Han Xue,Wenqiang Xu

第一作者:Liu Liu

论文类型:期刊论文

通讯作者:Cewu Lu

卷号:31

页面范围:1072 - 1083

是否译文:

发表时间:2022-01-05

收录刊物:SCI

上一条: Deep Learning Based Automatic Multiclass Wild Pest Monitoring Approach Using Hybrid Global and Local Activated Features

下一条: AKB-48: A Real-World Articulated Object Knowledge Base