Skip to content
View arvyanh's full-sized avatar

Block or report arvyanh

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. ms-swift ms-swift Public

    Forked from modelscope/ms-swift

    Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

    Python

  2. mbridge mbridge Public

    Forked from ISEEKYAN/mbridge

    Bridge Megatron-Core to Hugging Face/Reinforcement Learning

    Python

  3. verl verl Public

    Forked from verl-project/verl

    verl: Volcano Engine Reinforcement Learning for LLMs

    Python

  4. meituan-search/verl meituan-search/verl Public

    Forked from verl-project/verl

    verl: Volcano Engine Reinforcement Learning for LLMs

    Python 12 2