Skip to content
View hchings's full-sized avatar
  • San Francisco Bay Area

Block or report hchings

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. NVIDIA/TensorRT-LLM NVIDIA/TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

    Python 14k 2.5k

  2. NVIDIA/Model-Optimizer NVIDIA/Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative decoding, etc. It compresses deep learning models for downs…

    Python 3.1k 465

  3. verl-project/verl verl-project/verl Public

    verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

    Python 22.2k 4.2k