cavities12

Pinned

  1. multiturn-rl-agent Public

    Multi-turn RL agents with simulation-based planning compatible with OpenRLHF

    Python

  2. verl-project/verl Public

    verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

    Python, 22.2k stars, 4.2k forks