Publications

2026

  1. SplitFlux: Learning to Decouple Content and Style from a Single Image
    Yitong Yang, Yinglin Wang, Changshuo Wang, Yongjun Zhang, Ziyang Chen, and Shuting He
    In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2026
    Corresponding author
  2. Transferable Adversarial Attack on Referring Video Object Segmentation
    Meiwen Ding, Song Xia, Yi Yu, Shuting He, and Xudong Jiang
    IEEE Transactions on Information Forensics and Security (TIFS), 2026
  3. Quadratic Direct Forecast for Training Multi-Step Time-Series Forecast Models
    Hao Wang, Licheng Pan, Yuan Lu, Zhichao Chen, Tianqiao Liu, Shuting He, Zhixuan Chu, Qingsong Wen, Haoxuan Li, and Zhouchen Lin
    In International Conference on Learning Representations (ICLR), 2026
  4. DistDF: Time-Series Forecasting Needs Joint-Distribution Wasserstein Alignment
    Hao Wang, Licheng Pan, Yuan Lu, Zhixuan Chu, Xiaoxi Li, Shuting He, Zhichao Chen, Haoxuan Li, Qingsong Wen, and Zhouchen Lin
    In International Conference on Learning Representations (ICLR), 2026
  5. FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting
    Yitong Yang, Yinglin Wang, Changshuo Wang, Huajie Wang, and Shuting He
    In AAAI Conference on Artificial Intelligence (AAAI), 2026
    Corresponding author
  6. GREx: Generalized Referring Expression Segmentation, Comprehension, and Generation
    Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, and Yu-Gang Jiang
    International Journal of Computer Vision (IJCV), 2026
    Corresponding author

2025

  1. MeViS: A Multi-Modal Dataset for Referring Motion Expression Video Segmentation
    Henghui Ding, Chang Liu, Shuting He, Kaining Ying, Xudong Jiang, Chen Change Loy, and Yu-Gang Jiang
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
    Corresponding author
  2. ReferSplat: Referring Segmentation in 3D Gaussian Splatting
    Shuting He, Guangquan Jie, Changshuo Wang, Yun Zhou, Shuming Hu, Guanbin Li, and Henghui Ding
    In International Conference on Machine Learning ( ICML ), 2025
    Oral, Acceptance Rate 1.0%
  3. Reasoning Beyond Points: A Visual Introspective Approach for Few-Shot 3D Segmentation
    Changshuo Wang, Shuting He, Xiang Fang, Zhijian Hu, Jia-Hong Huang, Yixian Shen, and Prayag Tiwari
    In Annual Conference on Neural Information Processing Systems ( NeurIPS ), 2025
  4. Iterative Missing Data Imputation with Model Form Adaptation and Non-Missing Feature Supervision
    Hao Wang, Zhengnan Li, Zhichao Chen, Xu Chen, Shuting He, Guangyi Liu, Haoxuan Li, and Zhouchen Lin
    In Annual Conference on Neural Information Processing Systems ( NeurIPS ), 2025
  5. Prompt-Softbox-Prompt: A Free-Text Embedding Control for Image Editing
    Yitong Yang, Yinglin Wang, Tian Zhang, Jing Wang, and Shuting He
    In ACM International Conference on Multimedia ( ACM MM ), 2025
    Corresponding author
  6. Seeing the Overlooked: Bio-Visual Inspired Weak Saliency Feedback Transformer for Person Re-identification
    Changshuo Wang, Shuting He, Xiang Fang, Fangzhe Nan, and Prayag Tiwari
    In ACM International Conference on Multimedia ( ACM MM ), 2025
  7. HRSeg: High-Resolution Visual Perception and Enhancement for Reasoning Segmentation
    Weihuang Lin, Yiwei Ma, Xiaoshuai Sun, Shuting He, Jiayi Ji, Liujuan Cao, and Rongrong Ji
    In ACM International Conference on Multimedia ( ACM MM ), 2025
  8. GroundFlow: A Plug-in Module for Temporal Reasoning on 3D Point Cloud Sequential Grounding
    Zijun Lin, Shuting He, Cheston Tan, and Bihan Wen
    In IEEE International Conference on Computer Vision (ICCV), 2025
  9. SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation
    Shiqi Huang, Shuting He, Huaiyuan Qin, and Bihan Wen
    In IEEE International Conference on Computer Vision (ICCV), 2025
    Highlight, Acceptance Rate 5.0%
  10. GlFoMR: A Glance-then-Focus Multimodal Reasoning Framework for Diagram Question Answering Number
    Yaxian Wang, Bifan Wei, Jun Liu, Lingling Zhang, Shuting He, Jun Li, and Qika Lin
    In International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
  11. Point Clouds Meets Physics: Dynamic Acoustic Field Fitting Network for Point Cloud Understanding
    Changshuo Wang, Shuting He, Xiang Fang, Jiawei Han, Zhonghang Liu, Xin Ning, Weijun Li, and Prayag Tiwari
    In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025
  12. ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation
    Shiqi Huang, Shuting He, and Bihan Wen
    In AAAI Conference on Artificial Intelligence (AAAI), 2025
  13. Taylor Series-Inspired Local Structure Fitting Network for Few-shot Point Cloud Semantic Segmentation
    Changshuo Wang, Shuting He, Xiang Fang, Meiqing Wu, Siew Kei Lam, and Prayag Tiwari
    In AAAI Conference on Artificial Intelligence (AAAI), 2025
  14. Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension
    Yaxian Wang, Henghui Ding, Shuting He, Xudong Jiang, Bifan Wei, and Jun Liu
    In AAAI Conference on Artificial Intelligence (AAAI), 2025
  15. PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
    Henghui Ding, Chang Liu, Nikhila Ravi, Shuting He, Yunchao Wei, Song Bai, and Philip Torr
    In IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2025
  16. Looking Clearer with Text: A Hierarchical Context Blending Network for Occluded Person Re-Identification
    Changshuo Wang, Shuting He, Meiqing Wu, Siew-Kei Lam, Prayag Tiwari, and Xingyu Gao
    IEEE Transactions on Information Forensics and Security (TIFS), 2025

2024

  1. Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
    Shuting He and Henghui Ding
    In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
  2. SegPoint: Segment Any Point Cloud via Large Language Model
    Shuting He and Henghui Ding
    In European Conference on Computer Vision ( ECCV ), 2024
  3. RefMask3D: Language-Guided Transformer for 3D Referring Segmentation
    Shuting He and Henghui Ding
    In ACM International Conference on Multimedia ( ACM MM ), 2024
  4. Context-Aware Integration of Language and Visual References for Natural Language Tracking
    Yanyan Shao, Shuting He, Qi Ye, Yuchao Feng, Wenhan Luo, and Jiming Chen
    In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
  5. Dual-head Genre-instance Transformer Network for Arbitrary Style Transfer
    Meichen Liu, Shuting He, Songnan Lin, and Bihan Wen
    In ACM International Conference on Multimedia ( ACM MM ), 2024
  6. 1st Place Solution to VISDA-2020: Bias Elimination for Domain Adaptive Pedestrian Re-Identification
    Jianyang Gu, Hao Luo, Weihua Chen, Yiqi Jiang, Yuqi Zhang, Shuting He, Fan Wang, Hao Li, and Wei Jiang
    In European Conference on Computer Vision Workshops (ECCVW), 2024
  7. TIP
    VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search
    Shuting He, Hao Luo, Wei Jiang, Xudong Jiang, and Henghui Ding
    IEEE Transactions on Image Processing (TIP), 2024
  8. Region Generation and Assessment Network for Occluded Person Re-Identification
    Shuting He, Weihua Chen, Kai Wang, Hao Luo, Fan Wang, Wei Jiang, and Henghui Ding
    IEEE Transactions on Information Forensics and Security (TIFS), 2024
  9. RS
    Leveraging Mixed Data Sources for Enhanced Road Segmentation in Synthetic Aperture Radar Images
    Tian Lan, Shuting He, Yuanyuan Qing, and Bihan Wen
    Remote Sensing (RS), 2024

2023

  1. Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation
    Shuting He, Henghui Ding, and Wei Jiang
    In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
  2. Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation
    Shuting He, Henghui Ding, and Wei Jiang
    In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
  3. MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
    Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Philip H. S. Torr, and Song Bai
    In IEEE International Conference on Computer Vision (ICCV), 2023
  4. MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
    Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, and Chen Change Loy
    In IEEE International Conference on Computer Vision (ICCV), 2023
  5. TIP
    Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation
    Shuting He, Xudong Jiang, Wei Jiang, and Henghui Ding
    IEEE Transactions on Image Processing (TIP), 2023

2022

  1. Transformer-Based Domain-Specific Representation for Unsupervised Domain Adaptive Vehicle Re-Identification
    Ran Wei, Jianyang Gu, Shuting He, and Wei Jiang
    IEEE Transactions on Intelligent Transportation Systems (T-ITS), 2022

2021

  1. TransReID: Transformer-based Object Re-Identification
    Shuting He, Hao Luo, Pichao Wang, Fan Wang, Hao Li, and Wei Jiang
    In IEEE International Conference on Computer Vision (ICCV), 2021
  2. An Empirical Study of Vehicle Re-Identification on the AI City Challenge
    Hao Luo, Weihua Chen, Xianzhe Xu, Jianyang Gu, Yuqi Zhang, Chong Liu, Yiqi Jiang, Shuting He, and Fan Wang
    In IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2021

2020

  1. Multi-domain Learning and Identity Mining for Vehicle Re-Identification
    Shuting He, Hao Luo, Weihua Chen, Miao Zhang, Yuqi Zhang, Fan Wang, Hao Li, and Wei Jiang
    In IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2020