JC-Chen1

Highlights

  • Pro

Organizations

@MetaEvo @lean-dojo @PRIME-RL

Pinned

  1. PRIME-RL/P1 Public

    P1: Mastering Physics Olympiads with Reinforcement Learning

    88 stars, 4 forks

  2. PRIME-RL/Entropy-Mechanism-of-RL Public

    The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

    Python, 443 stars, 15 forks

  3. MetaEvo/Symbol Public

    Python implementation of SYMBOL

    Python, 18 stars, 4 forks

  4. MetaEvo/MetaBox Public

    MetaBox: Benchmarking Platform for Meta-Black-Box Optimization

    Python, 168 stars, 15 forks

  5. THUDM/slime Public

    slime is an LLM post-training framework for RL Scaling.

    Python, 7.2k stars, 1k forks

  6. verl-project/verl Public

    verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

    Python, 22.2k stars, 4.2k forks