Skip to content
View simpleton's full-sized avatar
👷‍♂️
Hello World
👷‍♂️
Hello World

Sponsoring

@wez

Organizations

@facebook @wifi-io @facebookincubator

Block or report simpleton

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

AI

24 repositories

DFloat11 [NeurIPS '25]: Lossless Compression of LLMs and DiTs for Efficient GPU Inference

Python 639 38 Updated Nov 24, 2025

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,983 1,928 Updated Jun 26, 2026

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 16,858 1,607 Updated May 26, 2026

Lightweight coding agent that runs in your terminal

Rust 94,719 14,046 Updated Jul 1, 2026

Model Context Protocol Servers

TypeScript 87,900 11,112 Updated Jun 29, 2026

On-device AI across mobile, embedded and edge for PyTorch

Python 4,771 1,057 Updated Jul 1, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 60,364 10,386 Updated Nov 12, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 35,936 3,664 Updated Jun 30, 2026

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.

Python 16,657 1,211 Updated Mar 24, 2026

Autonomous coding agent as an SDK, IDE extension, or CLI assistant.

TypeScript 64,151 6,808 Updated Jul 1, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 84,950 18,731 Updated Jul 1, 2026

The agent engineering platform.

Python 140,600 23,352 Updated Jun 30, 2026

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,806 3,003 Updated Apr 14, 2026

An elegant PyTorch deep reinforcement learning library.

Python 10,826 1,322 Updated Apr 3, 2026

A framework for efficient model inference with omni-modality models

Python 5,376 1,204 Updated Jul 1, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 29,847 6,838 Updated Jul 1, 2026

The open source coding agent.

TypeScript 181,008 22,311 Updated Jul 1, 2026

Mobile and Web client for Codex and Claude Code, with realtime voice, encryption and fully featured

TypeScript 22,323 1,859 Updated Jun 27, 2026

Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.

Python 1,433 146 Updated Mar 19, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,727 1,074 Updated Apr 30, 2026

A kernel library written in tilelang

Python 1,616 144 Updated Apr 23, 2026

Web-based 3D visualization in Python

Python 2,654 202 Updated Jun 26, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 162,061 33,686 Updated Jul 1, 2026

Open-source, community-driven agent harness

Rust 39,231 3,397 Updated Jul 1, 2026