About me

I received my Ph.D. degree from the University of Science and Technology of China (USTC) in 2024.
I am currently a researcher at Alibaba Tongyi Lab, where I am actively involved in the research and development of the Wan Video and Wan Image series of generative models.

Publications

2025-2026

  • Wan: Open and advanced large-scale video generative models. [arxiv]
    Team Wan.
    Technical Report

  • Wan-Image: Pushing the Boundaries of Generative Visual Intelligence. [arxiv]
    Team Wan.
    Technical Report

  • ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement. [arxiv]
    Zhihang Liu, Xiaoyi Bao, Pandeng Li , Junjie Zhou, Zhaohe Liao, Yefei He, Kaixun Jiang, Chen-Wei Xie, Yun Zheng, Hongtao Xie.
    Computer Vision and Pattern Recognition (CVPR 2026)

  • Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models.
    Zhihang Liu, Chen-Wei Xie, Pandeng Li , Liming Zhao, Longxiang Tang, Yun Zheng, Chuanbin Liu, Hongtao Xie.
    Computer Vision and Pattern Recognition (CVPR 2025)

  • What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Coverage of MLLMs. [arxiv]
    Zhihang Liu, Chen-Wei Xie, Bin Wen, Feiwu Yu, Jixuan Chen, Boqiang Zhang, Nianzu Yang, Pandeng Li , Yun Zheng, Hongtao Xie.
    Conference on Neural Information Processing Systems (NeurIPS 2025)

  • UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing [arxiv]
    Hao Tang, Chen-Wei Xie, Xiaoyi Bao, Tingyu Weng, Pandeng Li, Yun Zheng, Liwei Wang.
    International Conference on Learning Representations (ICLR 2026)

  • AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation. [arxiv]
    Zhaohe Liao, Kaixun Jiang, Zhihang Liu, Yujie Wei, Junqiu Yu, Quanhao Li, Hong-Tao Yu, Pandeng Li , Yuzheng Wang, Zhen Xing, Shiwei Zhang, Chen-Wei Xie, Yun Zheng, Xihui Liu .

2022-2023

  • MomentDiff: Generative Video Moment Retrieval from Random to Real. [arXiv] [code]
    Pandeng Li, Chen-Wei Xie, Hongtao Xie, Liming Zhao, Lei Zhang, Yun Zheng, Deli Zhao, Yongdong Zhang.
    Conference on Neural Information Processing Systems (NeurIPS 2023)

  • Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval. [paper] [supp] [code]
    Pandeng Li, Chen-Wei Xie, Liming Zhao, Hongtao Xie, Jiannan Ge, Yun Zheng, Deli Zhao, Yongdong Zhang.
    International Conference on Computer Vision (ICCV 2023) (Oral Presentation, 2%)

  • Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval. [paper] [code]
    Pandeng Li, Hongtao Xie, Jiannan Ge, Lei Zhang, Shaobo Min, Yongdong Zhang.
    European Conference on Computer Vision (ECCV 2022)

  • Neighborhood-Adaptive Structure Augmented Metric Learning. [paper] [code]
    Pandeng Li, Yan Li, Hongtao Xie, Lei Zhang.
    Association for the Advancement of Artificial Intelligence (AAAI 2022) (Oral Presentation, 4.5%)

  • Deep Fourier Ranking Quantization for Semi-supervised Image Retrieval. [paper] [code]
    Pandeng Li, Hongtao Xie, Shaobo Min, Jiannan Ge, Xun Chen, Yongdong Zhang.
    IEEE Transactions on Image Processing (TIP 2022)

  • Online Residual Quantization Via Streaming Data Correlation Preserving. [paper]
    Pandeng Li, Hongtao Xie, Shaobo Min, Zheng-Jun Zha, Yongdong Zhang.
    IEEE Transactions on Multimedia (TMM 2022)

  • Neighborhood-Adaptive Multi-cluster Ranking for Deep Metric Learning. [paper]
    Pandeng Li, Hongtao Xie, Yan Jiang, Jiannan Ge, Yongdong Zhang.
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT 2022)

Awards

  • 2022/09 National Scholarships, USTC
  • 2023/12 The third prize (¥ 40,000) at The 2nd Guangdong-Hong Kong-Macao International Algorithm Competition
  • 2024/5 CAS Presidential Scholarship