Johnsonms

Johnson Johnsonms

flash-attention maintainer | CUTLASS | SGLang kernel contributor | HPC, C++, CUDA, LLM training & inference

14 followers · 11 following

Pinned

cutlass Public

Forked from NVIDIA/cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 1
flash-attention Public

Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python 1
quack Public

Forked from Dao-AILab/quack

A Quirky Assortment of CuTe Kernels

Python 1
sglang Public

Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 1
vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
Johnson Public

My personal repository

1