25 / 2023-08-28 11:20:39
Multi-View Contrastive Self-supervised Triplet network for Skeleton-based Action Recognition
self-supervised learning, skeleton-based action recognition, contrastive learning, multi-view modeling, triplet network
终稿
Huaigang Yang / Dongguan University of Technology
Ziliang Ren / Dongguan University of Technology
Huaqiang Yuan / Dongguan University of Technology
Qieshi Zhang / Shenzhen Institutes of Advanced Technology Chinese Academy of Sciences
Wenhong Wei / Dongguan University of Technology
Xiaoyu Hong / Dongguan University of Technology
Improving feature discrimination power from different viewpoints has long been a hot topic of interest for skeleton-based action recognition. Most of the existing methods are based on fully supervised approaches for feature representation learning, which need to rely on a large number of labeled samples. However, it is easy to lead to overfitting and insufficient generalization of the model. In this work, we propose a novel Multi-View contrastive self-supervised Triplet network for skeleton-based action recognition, dubbed MVT. The original skeleton is encoded with features from different viewpoints, and mutual information maximization through a triplet network combined with triplet-view loss by contrastive learning. In the finetune stage of the downstream task, we froze the encoder combined with a classifier to output the final classification score. The extensive experiments show that our proposed MVT dramatically outperforms the state-of-the-art self-supervised learning methods for NTU-60, NTU-120 and PKU-MMD benchmarks.
重要日期
  • 会议日期

    11月02日

    2023

    11月04日

    2023

  • 12月15日 2023

    初稿截稿日期

  • 12月20日 2023

    注册截止日期

主办单位
IEEE Instrumentation and Measurement Society
Xidian University
移动端
在手机上打开
小程序
打开微信小程序
客服
扫码或点此咨询