Completed part transformer for person re-identification

Zhong Zhang, Di He, Shuang Liu, Baihua Xiao, Tariq S. Durrani

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)
139 Downloads (Pure)

Abstract

Recently, part information of pedestrian images has been demonstrated to be effective for person re-identification (ReID), but the part interaction is ignored when using Transformer to learn long-range dependencies. In this paper, we propose a novel transformer network named Completed Part Transformer (CPT) for person ReID, where we design the part transformer layer to learn the completed part interaction. The part transformer layer includes the intra-part layer and the part-global layer, where they consider long-range dependencies from the aspects of the intra-part interaction and the part-global interaction, simultaneously. Furthermore, in order to overcome the limitation of fixed number of the patch tokens in the transformer layer, we propose the Adaptive Refined Tokens (ART) module to focus on learning the interaction between the informative patch tokens in the pedestrian image, which improves the discrimination of the pedestrian representation. Extensive experimental results on four person ReID datasets, i.e., MSMT17, Market1501, DukeMTMC-reID and CUHK03, demonstrate that the proposed method achieves a new state-of-the-art performance, e.g., it achieves 68.0% mAP and 84.6% Rank-1 accuracy on MSMT17.
Original languageEnglish
Pages (from-to)2303-2313
Number of pages11
JournalIEEE Transactions on Multimedia
Volume26
Early online date12 Jul 2023
DOIs
Publication statusPublished - 2 Feb 2024

Keywords

  • electrical and electronic engineering
  • computer science applications
  • media technology
  • signal processing

Fingerprint

Dive into the research topics of 'Completed part transformer for person re-identification'. Together they form a unique fingerprint.

Cite this