Skip to content
View tardis-key's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Zhejiang University -> Huawei
  • Hangzhou,China

Block or report tardis-key

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. ROLL ROLL Public

    Forked from alibaba/ROLL

    An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

    Python

  2. siiRL siiRL Public

    Forked from sii-research/siiRL

    siiRL: Shanghai Inovation Institute RL Framework for LLM Post-Training

    Python

  3. verl verl Public

    Forked from verl-project/verl

    verl: Volcano Engine Reinforcement Learning for LLMs

    Python 2

  4. rl-insight rl-insight Public

    Forked from verl-project/rl-insight

    Provide performance insight capabilities for RL frameworks.

    Python

  5. verl-project/rl-insight verl-project/rl-insight Public

    Provide performance insight capabilities for RL frameworks.

    Python 36 27