Point Cluster: A Compact Message Unit for Communication-Efficient Collaborative Perception
Zihan Ding, Jiahui Fu, Si Liu, Hongyu Li, Siheng Chen, Hongsheng Li, Shifeng Zhang, Xu Zhou
ICLR 2025
Single-stream Policy Optimization
Zhongwen Xu*, Zihan Ding*
arXiv Preprint
Topv-nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-Shot Object Navigation
Linqing Zhong, Chen Gao, Zihan Ding, Yue Liao, Huimin Ma, Shifeng Zhang, Xu Zhou, Si Liu
arXiv Preprint
Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Hongyu Li, Tianrui Hui, Zihan Ding, Jing Zhang, Bin Ma, Xiaoming Wei, Jizhong Han, Si Liu
ACM MM 2024
Region-Adaptive and Context-Complementary Cross Modulation for RGB-T Semantic Segmentation
Fengguang Peng, Zihan Ding, Ziming Chen, Gang Wang, Tianrui Hui, Si Liu, Hang Shi
Pattern Recognition
Language-Aware Spatial-Temporal Collaboration for Referring Video Segmentation
Tianrui Hui, Si Liu, Zihan Ding, Shaofei Huang, Guanbin Li, Wenguan Wang, Luoqi Liu, Jizhong Han
TPAMI
Enriching Phrases with Coupled Pixel and Object Contexts for Panoptic Narrative Grounding
Tianrui Hui, Zihan Ding, Junshi Huang, Xiaoming Wei, Xiaolin Wei, Jiao Dai, Jizhong Han, Si Liu
IJCAI 2023
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection
Luting Wang, Yi Liu, Penghui Du, Zihan Ding, Yue Liao, Qiaosong Qi, Biaolong Chen, Si Liu
CVPR 2023
Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
Zihan Ding, Tianrui Hui, Junshi Huang, Xiaoming Wei, Jizhong Han, Si Liu
CVPR 2022
PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding
Zihan Ding*, Zi-han Ding*, Tianrui Hui, Junshi Huang, Xiaoming Wei, Xiaolin Wei, Si Liu
ACM MM 2022
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
Tianrui Hui, Shaofei Huang, Si Liu, Zihan Ding, Guanbin Li, Wenguan Wang, Jizhong Han, Fei Wang
CVPR 2021
Progressive Multimodal Interaction Network for Referring Video Object Segmentation
Zihan Ding, Tianrui Hui, Shaofei Huang, Si Liu, Xuan Luo, Junshi Huang, Xiaoming Wei
The 3rd Large-scale Video Object Segmentation Challenge