Biography

I am a Presidential Postdoctoral Fellow (PPF, Principal Investigator), jointly affiliated with Nanyang Technological University and Chalmers University of Technology (with Prof. Fredrik D. Johansson). In 2023, I completed my Ph.D. in NTU under Alibaba Talent Program, supervised by Prof. Hanwang Zhang and co-supervised by Prof. Qianru Sun. During Ph.D., I did an internship in Sea working under Prof. Pan Zhou. Prior to that, I received my bachelor's degree from NTU in 2017 under MOE SM2 scholarship.

My research lies at the intersection of multimodal LLMs (MLLMs), representation learning, and causal generalization, with the broader goal of building AI systems that can learn and reason beyond language. I have published in top-tier venues including NeurIPS, ICLR, CVPR, and ICCV, receiving multiple oral and spotlight recognitions and a CVPR 2025 Best Student Paper Honorable Mention. My recent works explore post-training for LLMs and MLLMs to advance reasoning, grounding, and agentic capabilities in foundation models.

News

[10, 2025] 2 papers accepted by NeurIPS 2025.
[06, 2025] 2 papers accepted by ICCV 2025 (1 highlight).
[06, 2025] 2 papers accepted by CVPR 2025 (1 best student paper honorable mention and 1 oral presentation).
[05, 2025] Released Selftok technical report (image tokenization, MLLM pre-training and post-training).
[02, 2025] Continued the PPF in Chalmers University of Technology, Sweden.
[06, 2024] 1 paper about few-shot learning accepted by CVPR 2024.
[05, 2024] 1 paper about unsupervised representation learning accepted by ICLR 2024.
[02, 2024] Started the PPF in NTU.
[10, 2023] Started a research internship in Sea.
[09, 2023] 1 paper about unsupervised domain adaptation accepted by NeurIPS 2023.
[08, 2023] Awarded Wallenberg-NTU Presidential Postdoctoral Fellowship.
[07, 2023] 2 papers about open-world detection and fair face recognition are accepted by ICCV 2023.
[03, 2023] 1 paper about video anomaly detection accepted by CVPR 2023.
[08, 2022] Received 2022 PREMIA Best Student Paper Awards (The Gold Award).
[09, 2021] 1 paper about self-supervised learning accepted by NeurIPS 2022 (Spotlight).
[07, 2021] 1 paper about unsupervised domain adaptation accepted by ICCV 2021 (Oral).
[03, 2021] 1 paper about zero-shot learning accepted by CVPR 2021.
[09, 2020] 1 paper about few-shot learning accepted by NeurIPS 2020.
[05, 2020] Joined Alibaba Talent Program to do a Ph.D. in NTU.

Publications [Google Scholar]

LLMs and MLLMs

Expanding the Action Space of LLMs to Reason Beyond Language

Zhongqi Yue, Weishi Wang*, Yundaichuan Zhan, Juncheng Li, Daniel Dahlmeier, Fredrik D. Johansson

2025

Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens

Kaihang Pan*, Wang Lin*, Zhongqi Yue*, Tenglong Ao, Liyu Jia, Wei Zhao, Juncheng Li, Siliang Tang, Hanwang Zhang

CVPR 2025

Best Student Paper Honorable Mention 7/13008

Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

Bohan Wang, Zhongqi Yue, Fengda Zhang, Shuo Chen, Li'an Bi, Junzhe Zhang, Xue Song, Kennard Yanting Chan, Jiachun Pan, Weijia Wu, Mingze Zhou, Wang Lin, Kaihang Pan, Saining Zhang, Liyu Jia, Wentao Hu, Wei Zhao, Hanwang Zhang

Technical Report

Selftok-Zero: Reinforcement Learning for Visual Generation via Discrete and Autoregressive Visual Tokens

Bohan Wang, Mingze Zhou, Zhongqi Yue, Wang Lin, Kaihang Pan, Liyu Jia, Wentao Hu, Wei Zhao, Hanwang Zhang

NeurIPS 2025

Mastering Collaborative Multi-Modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness

Qifan Yu*, Zhebei Shen*, Zhongqi Yue*, Yang Wu, Wenqiao Zhang, Yunfei Li, Juncheng Li, Siliang Tang, Yueting Zhuang

ICCV 2025

Highlight 262/11239

AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea

Qifan Yu*, Wei Chow*, Zhongqi Yue*, Kaihang Pan, Yang Wu, Xiaoyang Wan, Juncheng Li, Siliang Tang, Hanwang Zhang, Yueting Zhuang

CVPR 2025

Oral 96/13008

Counterfactual Evolution of Multimodal Datasets via Visual Programming

Minghe Gao*, Zhongqi Yue*, Wenjie Yan, Yihao Hu, Wei Ji, Siliang Tang, Jun Xiao, Tat-Seng Chua, Yueting Zhuang, Juncheng Li

NeurIPS 2025

Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program

Minghe Gao, Xuqi Liu, Zhongqi Yue, Yang Wu, Shuang Chen, Juncheng Li, Siliang Tang, Fei Wu, Tat-Seng Chua, Yueting Zhuang

ICCV 2025

Representation Learning

Self-Supervised Learning Disentangled Group Representation as Feature

Tan Wang, Zhongqi Yue, Jianqiang Huang, Qianru Sun, Hanwang Zhang

NeurIPS 2021

Spotlight Presentation 260/9122 PREMIA Best Student Paper 2022

Exploring Diffusion Time-Steps for Unsupervised Representation Learning

Zhongqi Yue, Jiankun Wang, Qianru Sun, Lei Ji, Eric I-Chao Chang, Hanwang Zhang

ICLR 2024

Invariant Feature Regularization for Fair Face Recognition

Jiali Ma, Zhongqi Yue, Tomoyuki Kagaya, Tomoki Suzuki, Karlekar Jayashree, Sugiri Pranata, Hanwang Zhang

ICCV 2023

Generalization

Few-Shot Learner Parameterization by Diffusion Time-Steps

Zhongqi Yue, Pan Zhou, Richang Hong, Hanwang Zhang, Qianru Sun

CVPR 2024

Transporting Causal Mechanisms for Unsupervised Domain Adaptation

Zhongqi Yue, Qianru Sun, Xian-Sheng Hua, Hanwang Zhang

ICCV 2021

Oral Presentation 210/6236

Make the U in UDA Matter: Invariant Consistency Learning for Unsupervised Domain Adaptation

Zhongqi Yue, Hanwang Zhang, Qianru Sun

NeurIPS 2023

Random Boxes Are Open-world Object Detectors

Yanghao Wang, Zhongqi Yue, Xian-Sheng Hua, Hanwang Zhang

ICCV 2023

Unbiased Multiple Instance Learning for Weakly Supervised Video Anomaly Detection

Hui Lv, Zhongqi Yue, Qianru Sun, Bin Luo, Zhen Cui, Hanwang Zhang

CVPR 2023

Counterfactual Zero-Shot and Open-Set Visual Recognition

Zhongqi Yue*, Tan Wang*, Qianru Sun, Xian-Sheng Hua, Hanwang Zhang

CVPR 2021

Interventional Few-Shot Learning

Zhongqi Yue, Hanwang Zhang, Qianru Sun, Xian-Sheng Hua

NeurIPS 2020