Jiacheng Chen JC-Chen1

PRIME-RL/P1 (Public)

P1: Mastering Physics Olympiads with Reinforcement Learning

88 stars, 4 forks

PRIME-RL/Entropy-Mechanism-of-RL (Public)

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python, 443 stars, 15 forks

MetaEvo/Symbol (Public)

Python implementation of SYMBOL

Python, 18 stars, 4 forks

MetaEvo/MetaBox (Public)

MetaBox: Benchmarking Platform for Meta-Black-Box Optimization

Python, 168 stars, 15 forks

THUDM/slime (Public)

slime is an LLM post-training framework for RL Scaling.

Python, 7.2k stars, 1k forks

verl-project/verl (Public)

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python, 22.2k stars, 4.2k forks