junkangwu

Follow

Junkang Wu junkangwu

Follow

Ph.D. student @ USTC.

42 followers · 1 following

University of Science and Technology of China
17:24 (UTC +08:00)
https://junkangwu.github.io/

Achievements

Achievements

Pinned Loading

QAE QAE Public

[ICLR 2026] Quantile Advantage Estimation for Entropy-Safe Reasoning

Python 23
alpha-DPO alpha-DPO Public

[ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"

Python 30
Dr_DPO Dr_DPO Public

[ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"

Python 18 3
beta-DPO beta-DPO Public

[NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$

Python 50 5
ADNCE ADNCE Public

[NeurIPS2023] Official code of "Understanding Contrastive Learning via Distributionally Robust Optimization"

Python 41 2
Adap_tau Adap_tau Public

[WWW 2023] Official code of "Adap-$\tau$: Adaptively Modulating Embedding Magnitude for Recommendation"

Python 29 3