Skip to content
View shikicloud's full-sized avatar

Highlights

  • Pro

Block or report shikicloud

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. RL RL Public

    Forked from NVIDIA-NeMo/RL

    Scalable toolkit for efficient model reinforcement

    Python

  2. TensorRT-LLM TensorRT-LLM Public

    Forked from NVIDIA/TensorRT-LLM

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

    Python

  3. verl verl Public

    Forked from verl-project/verl

    verl: Volcano Engine Reinforcement Learning for LLMs

    Python