Chonghan Liu

I am a LLM Algorithm Engineer. My research interests include LLM Inference Optimization, Reinforcement Learning for LLMs, and Multimodal Large Language Models.

I received my B.S. in Computer Science from Nanjing University, and dropped out of the M.S. program at UCLA to work full-time in AI. I am a collaborator on LLaMA-Factory and an active contributor to NVIDIA-NeMo/Automodel.

Chonghan Liu
Publications [all Papers →]
LLM EnsembleRethinking LLM Ensembling from the Perspective of Mixture Models Spotlight
Jiale Fu, Yuchu Jiang, Chonghan Liu, Joey Tianyi Zhou, Xu Yang
ICML 2026
[paper]
AriadneAriadne: A Controllable Framework for Probing and Extending VLM Reasoning Boundaries Poster
Minghe Shen, Zhuo Zhi, Chonghan Liu, Shuo Xing, Zhengzhong Tu, Che Liu
ACL Main 2026
[paper]
d²Cached²Cache: Accelerating Diffusion-Based LLMs via Dual Adaptive Caching Poster
Yuchu Jiang, Yue Cai, Xiangzhong Luo, Jiale Fu, Jiarui Wang, Chonghan Liu, Xu Yang
ICLR 2026
[paper] [code]
Flatter TokensFlatter Tokens are More Valuable for Speculative Draft Model Training Poster
Jiaming Fan, Cao Daming, Xiangzhong Luo, Jiale Fu, Chonghan Liu, Xu Yang
ICLR 2026
[paper]
AdaptiveStepAdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence Poster
Yuliang Liu, Junjie Lu, Zhaoling Chen, Chaofeng Qu, Jason Klein Liu, Chonghan Liu, Zefan Cai, Yunhui Xia, Li Zhao, Jiang Bian, Chuheng Zhang, Wei Shen, Zhouhan Lin
ICML 2025
[paper]
VEPOVEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models
Chonghan Liu, Yimin Du, Qi An, Xin He, Cunqi Zhai, Fei Tan, Weijia Lin, Xiaochun Gong, Yongchao Deng, Shousheng Jia, ...
arXiv preprint arXiv:2603.19152, 2026
[paper]
MirageMirage: A Multi-modal Benchmark for Spatial Perception, Reasoning, and Intelligence
Chonghan Liu, Haoran Wang, Frank Henry, Peng Miao, Yifan Zhang, Yue Zhao, Peiran Wu
arXiv preprint arXiv:2505.10604, 2025
[paper]
Open Source [all PRs →]
NVIDIA-NeMo/Automodel— PyTorch-native LLM/VLM training framework
Speculative decoding (EAGLE / DFlash / DSpark / Domino / JetSpec): grew this from a single Llama EAGLE-1 recipe into Automodel's full draft-model training stack — P-EAGLE parallel drafting, DFlash, DeepSeek V4 DSpark, Domino online training, and JetSpec causal parallel drafting, plus a vLLM/SGLang serving bridge and an end-to-end tutorial
Model & MoE support: added training support for DeepSeek V4 Flash (plus Multi-Token Prediction), Hy3-preview, and Hy-MT2-30B-A3B, plus Rollout Routing Replay (R3) for MoE RL training
verl-project/verl— RLHF training framework
Other contributions
mergedhiyouga/LLaMA-Factory — Add TRL PPO implementation reference
mergedhiyouga/LLaMA-Factory — Add second ShareGPT conversation format support
mergedlanggenius/dify — Add missing workflow i18n keys
mergedOpenRLHF/OpenRLHF-M — Add Qwen model example training script
opengoogle-research/metricx — Fix prediction output readability and simplify inference installation
mergedmercurystraw/heat — Fix token_type_ids misdetected as a multimodal field in iLLaDA rollout