Skip to content
View transmissions11's full-sized avatar

Highlights

  • Pro

Organizations

@paradigmxyz @2ndwest

Block or report transmissions11

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

WebAssembly Virtual Machine

C++ 2,774 232 Updated Apr 5, 2026
TeX 1 Updated Jan 26, 2026
Go 2 Updated Mar 13, 2025

Open-source framework for the research and development of foundation models.

Python 1,153 136 Updated Jul 1, 2026

Named Tensors for Legible Deep Learning in JAX

Python 225 21 Updated Nov 8, 2025

A simple library for scaling up JAX programs

Python 148 11 Updated Nov 4, 2025
Jupyter Notebook 34 8 Updated Jun 3, 2024

A simple, performant, and scalable Jax LLM!

Python 2,345 546 Updated Jul 1, 2026

JaxPP is a library for JAX that enables flexible MPMD pipeline parallelism for large-scale LLM training

Python 79 2 Updated Jun 18, 2026

SpecTrax is a JAX-native library for neural networks and graph learning, built for performance, composability and modularity.

Python 41 1 Updated Jun 27, 2026

torchax is a PyTorch frontend for JAX. It gives JAX the ability to author JAX programs using familiar PyTorch syntax. It also provides JAX-Pytorch interoperability, meaning, one can mix JAX & Pytor…

Python 228 34 Updated Jun 17, 2026

A Python DSL to write Nvidia PTX for Hopper and Blackwell in JAX and PyTorch

Python 315 27 Updated May 8, 2026

Experimentation using the xla compiler from rust

Rust 109 18 Updated Aug 17, 2024

Tokamax: A GPU and TPU kernel library.

Python 238 37 Updated Jun 30, 2026

Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and integrated with your favorite AWS services

Python 613 194 Updated Jun 18, 2026

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Python 448 66 Updated Jan 5, 2026

ATLAS Autoformalized Textbook Library At Scale

Lean 254 31 Updated Jun 2, 2026

ThunderKittens LCF forward non-causal attention kernel benchmarked against FlashAttention-2 and FlashAttention-3 on Hopper.

Cuda 11 Updated May 23, 2026

This module defines a type system for distributed training code, based off of JAX's sharding in types, but adapted for the PyTorch ecosystem.

Python 32 1 Updated Jun 29, 2026

Disruptor BlockingQueue

Java 323 46 Updated Mar 3, 2025

FoundationDB - the open source, distributed, transactional key-value store

C++ 16,461 1,531 Updated Jun 30, 2026

FoundationDB Rust client api

Rust 220 46 Updated Jun 29, 2026

Seastar boilerplate project with cmake

C++ 35 17 Updated Mar 11, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,727 1,074 Updated Apr 30, 2026

DeepGEMM: clean and efficient BLAS kernel library on GPU

Cuda 7,461 1,079 Updated Jun 29, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 9,796 1,306 Updated Jun 15, 2026

NoSQL data store using the SEASTAR framework, compatible with Redis

C++ 1,331 167 Updated Oct 2, 2019

mTCP: A Highly Scalable User-level TCP Stack for Multicore Systems

C 2,130 462 Updated Jul 4, 2024

Algorithm powering the For You feed on X

Rust 26,343 4,519 Updated May 15, 2026
Next