About me

I received my Ph.D. degree from the University of Science and Technology of China (USTC) in 2024.
I am currently a researcher at Alibaba Tongyi Lab, where I am actively involved in the research and development of the Wan Video and Wan Image series of generative models.

Publications

2025-2026

Wan: Open and advanced large-scale video generative models. [arxiv]
Team Wan.
Technical Report
Wan-Image: Pushing the Boundaries of Generative Visual Intelligence. [arxiv]
Team Wan.
Technical Report
ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement. [arxiv]
Zhihang Liu, Xiaoyi Bao, Pandeng Li ^✉, Junjie Zhou, Zhaohe Liao, Yefei He, Kaixun Jiang, Chen-Wei Xie, Yun Zheng, Hongtao Xie.
Computer Vision and Pattern Recognition (CVPR 2026)
Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models.
Zhihang Liu, Chen-Wei Xie, Pandeng Li ^✉, Liming Zhao, Longxiang Tang, Yun Zheng, Chuanbin Liu, Hongtao Xie.
Computer Vision and Pattern Recognition (CVPR 2025)
What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Coverage of MLLMs. [arxiv]
Zhihang Liu, Chen-Wei Xie, Bin Wen, Feiwu Yu, Jixuan Chen, Boqiang Zhang, Nianzu Yang, Pandeng Li ^✉, Yun Zheng, Hongtao Xie.
Conference on Neural Information Processing Systems (NeurIPS 2025)
UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing [arxiv]
Hao Tang, Chen-Wei Xie, Xiaoyi Bao, Tingyu Weng, Pandeng Li, Yun Zheng, Liwei Wang.
International Conference on Learning Representations (ICLR 2026)
AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation. [arxiv]
Zhaohe Liao, Kaixun Jiang, Zhihang Liu, Yujie Wei, Junqiu Yu, Quanhao Li, Hong-Tao Yu, Pandeng Li ^✉, Yuzheng Wang, Zhen Xing, Shiwei Zhang, Chen-Wei Xie, Yun Zheng, Xihui Liu ^✉.

2022-2023

MomentDiff: Generative Video Moment Retrieval from Random to Real. [arXiv] [code]
Pandeng Li, Chen-Wei Xie, Hongtao Xie, Liming Zhao, Lei Zhang, Yun Zheng, Deli Zhao, Yongdong Zhang.
Conference on Neural Information Processing Systems (NeurIPS 2023)
Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval. [paper] [supp] [code]
Pandeng Li, Chen-Wei Xie, Liming Zhao, Hongtao Xie, Jiannan Ge, Yun Zheng, Deli Zhao, Yongdong Zhang.
International Conference on Computer Vision (ICCV 2023) (Oral Presentation, 2%)
Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval. [paper] [code]
Pandeng Li, Hongtao Xie, Jiannan Ge, Lei Zhang, Shaobo Min, Yongdong Zhang.
European Conference on Computer Vision (ECCV 2022)
Neighborhood-Adaptive Structure Augmented Metric Learning. [paper] [code]
Pandeng Li, Yan Li, Hongtao Xie, Lei Zhang.
Association for the Advancement of Artificial Intelligence (AAAI 2022) (Oral Presentation, 4.5%)
Deep Fourier Ranking Quantization for Semi-supervised Image Retrieval. [paper] [code]
Pandeng Li, Hongtao Xie, Shaobo Min, Jiannan Ge, Xun Chen, Yongdong Zhang.
IEEE Transactions on Image Processing (TIP 2022)
Online Residual Quantization Via Streaming Data Correlation Preserving. [paper]
Pandeng Li, Hongtao Xie, Shaobo Min, Zheng-Jun Zha, Yongdong Zhang.
IEEE Transactions on Multimedia (TMM 2022)
Neighborhood-Adaptive Multi-cluster Ranking for Deep Metric Learning. [paper]
Pandeng Li, Hongtao Xie, Yan Jiang, Jiannan Ge, Yongdong Zhang.
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT 2022)

Awards

2022/09 National Scholarships, USTC
2023/12 The third prize (¥ 40,000) at The 2nd Guangdong-Hong Kong-Macao International Algorithm Competition
2024/5 CAS Presidential Scholarship

Pandeng Li (李攀登)

Publications

2025-2026

2022-2023

Awards