arvyanh

Follow

Mingxuan Yu arvyanh

Follow

0 followers · 1 following

MT
M78

Achievements

Achievements

Pinned Loading

ms-swift ms-swift Public

Forked from modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python
mbridge mbridge Public

Forked from ISEEKYAN/mbridge

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python
verl verl Public

Forked from verl-project/verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python
meituan-search/verl meituan-search/verl Public

Forked from verl-project/verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 12 2