DylanChen-NV

Popular repositories

  1. TensorRT-Model-Optimizer Public

    Forked from NVIDIA/Model-Optimizer

    nvidia-modelopt is a unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for do…

    Python 1

  2. TensorRT-LLM Public

    Forked from NVIDIA/TensorRT-LLM

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

    C++

  3. verl Public

    Forked from verl-project/verl

    verl: Volcano Engine Reinforcement Learning for LLMs

    Python

  4. repo Public

  5. vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  6. cutlass Public

    Forked from StudyingShao/cutlass

    CUDA Templates for Linear Algebra Subroutines

    C++