Pinned

  1. Paddle Public

    Forked from PaddlePaddle/Paddle

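    PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training, and cross-platform deployment, for deep learning and machine learning)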

    C++ 1

  2. PaddleMIX Public

    Forked from PaddlePaddle/PaddleMIX

    Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …

    Python 1

  3. cutlass Public

    Forked from NVIDIA/cutlass

    CUDA Templates for Linear Algebra Subroutines

    C++ 2

  4. UseTritonInPaddle Public

    Forked from zhoutianzi666/UseTritonInPaddle

    Python 1

  5. marlin Public

    Forked from IST-DASLab/marlin

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.

    Python

  6. FastDeploy Public

    Forked from PaddlePaddle/FastDeploy

    High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle

    Python 1