Stars
- All languages
- Bikeshed
- BitBake
- C
- C++
- CMake
- CSS
- CoffeeScript
- Common Workflow Language
- Coq
- Cuda
- Dart
- Dockerfile
- Go
- HCL
- HTML
- Haskell
- Java
- JavaScript
- JetBrains MPS
- Jupyter Notebook
- Kotlin
- LLVM
- Lua
- M4
- MATLAB
- MLIR
- Makefile
- Objective-C
- PHP
- PowerShell
- Python
- R
- Ruby
- Rust
- Scala
- Shell
- TeX
- TypeScript
- Vala
- Verilog
- Vue
- WebAssembly
A framework for efficient model inference with omni-modality models
Control Google Pixel Buds Pro from the Linux command line.
Monocular whole-body 3D human pose estimation using the SOMA body model
🂻 JACK Audio Connection Kit (JACK) Client for Python 🐍
An curated list for feed-forward 3D scene modeling, including research directions, datasets, and applications.
Compact PipeWire system-wide parametric EQ for Linux desktops
A high-throughput and memory-efficient inference and serving engine for LLMs
A minimal Python logger that tracks everything you try when building AI - metrics, prompts, models, etc, so you can see what changed and why.
This repository contains the implementation of SAM3 trackers.
On-device AI across mobile, embedded and edge for PyTorch
DevSpace - The Fastest Developer Tool for Kubernetes ⚡ Automate your deployment workflow with DevSpace and develop software directly inside Kubernetes.
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
[ICLR 26 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention
Lightweight coding agent that runs in your terminal
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Reading, writing, and processing images in a wide variety of file formats, using a format-agnostic API, aimed at VFX applications.
[AAAI2026] X-SAM: From Segment Anything to Any Segmentation
[IJCV 2026] Multimodal Referring Segmentation
A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of-the-art methods, innovative applications, and key advanceme…
Official Repository of "ROSE: Remove Objects with Side Effects in Videos"




