Skip to content
View andyluo7's full-sized avatar

Block or report andyluo7

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. autoresearch autoresearch Public

    Forked from karpathy/autoresearch

    AI agents running research on single-GPU nanochat training automatically

    Python 62 16

  2. cpu-gpu-codesign-agentic-inference cpu-gpu-codesign-agentic-inference Public

    CPU-GPU co-design analysis for agentic LLM inference. Blog: andyluo7.github.io

    Python 7 1

  3. turboquant-amd turboquant-amd Public

    TurboQuant: Near-optimal KV cache quantization for LLM serving on AMD GPUs (arXiv: 2504.19874, ICLR 2026)

    Python 4 1

  4. RFinference RFinference Public

    Python 1

  5. self-driving-car self-driving-car Public

    Forked from ndrplz/self-driving-car

    Udacity Self-Driving Car Engineer Nanodegree projects.

    C++ 1 1

  6. LMCache LMCache Public

    Forked from LMCache/LMCache

    Supercharge Your LLM with the Fastest KV Cache Layer

    Python 1