Skip to content
View softwarefollower's full-sized avatar

Block or report softwarefollower

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,901 15,036 Updated Apr 1, 2026

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

C++ 9,905 653 Updated Apr 1, 2026

⚡ Native Swift LLM inference server for Apple Silicon. OpenAI-compatible API, SSD streaming for 100B+ MoE models, TurboQuant KV cache compression, + iOS iPhone app.

C++ 5 Updated Apr 1, 2026

Scrypted is a high performance video integration and automation platform

TypeScript 5,634 344 Updated Mar 31, 2026

A framework for efficient model inference with omni-modality models

Python 4,086 668 Updated Apr 1, 2026

Bonsai Demo

Shell 109 10 Updated Apr 1, 2026

Production-grade multi-agent orchestration framework. Model-agnostic, supports team collaboration, task scheduling, and inter-agent communication.

TypeScript 1,132 591 Updated Apr 1, 2026

Claude Code without any telemetry - the perfect cli to combo with CCR

Python 238 86 Updated Apr 1, 2026

Distribute and run LLMs with a single file.

C++ 23,947 1,291 Updated Mar 31, 2026

A fast, local-first "search engine" for !bang users

TypeScript 1,203 350 Updated Mar 22, 2026
Jupyter Notebook 12,272 886 Updated Oct 25, 2025

Windows alt-tab on macOS

Swift 15,285 515 Updated Mar 22, 2026

TurboQuant KV cache compression for MLX with fused Metal kernels. 4.6x compression at 98% FP16 speed.

Python 56 9 Updated Mar 31, 2026

Training the missing codec encoder for Mistral's Voxtral-4B-TTS, enabling zero-shot voice cloning

Python 77 10 Updated Mar 30, 2026

KV cache compression via block-diagonal rotation. Beats TurboQuant: better PPL (6.91 vs 7.07), 28% faster decode, 5.3x faster prefill, 44x fewer params. Drop-in llama.cpp integration.

Python 228 23 Updated Apr 1, 2026

Fastest file searching tool for macOS

Rust 5 1 Updated Mar 25, 2026

Fastest file searching tool for macOS

Rust 914 30 Updated Mar 24, 2026

OpenAI-compatible API server for Apple on-device models

Swift 836 59 Updated Oct 2, 2025

Chromium fork named after radioactive element No. 90. Source code and Linux releases. Windows/MacOS/ARM builds served in different repos, links are towards the top of the README.md.

C++ 6,979 244 Updated Mar 14, 2026

Mealie is a self hosted recipe manager and meal planner with a RestAPI backend and a reactive frontend application built in Vue for a pleasant user experience for the whole family. Easily add recip…

Python 11,852 1,187 Updated Apr 1, 2026

Application for managing recipes, planning meals, building shopping lists and much much more!

HTML 8,126 790 Updated Apr 1, 2026

Free and open source video editor, based on MLT Framework and KDE Frameworks

C++ 4,844 394 Updated Apr 1, 2026

🎥 Command line media player

C 34,625 3,267 Updated Mar 31, 2026

Official mirror of Blender

C++ 17,936 2,839 Updated Apr 1, 2026

OBS Studio - Free and open source software for live streaming and screen recording

C 71,307 9,114 Updated Mar 31, 2026

The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.

JavaScript 57,306 6,198 Updated Apr 1, 2026

A privacy-first, open-source platform for knowledge management and collaboration. Download link: http://github.com/logseq/logseq/releases. roadmap: https://discuss.logseq.com/t/logseq-product-roadm…

Clojure 41,790 2,529 Updated Apr 1, 2026

A lightweight macOS file browser and cleanup utility

Swift 17 Updated Jan 18, 2026

Fast, local-first voice app for Mac. Dictation and transcription powered by Parakeet TDT on the Neural Engine.

Swift 34 3 Updated Apr 1, 2026
Next