Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
LostRuins / koboldcpp
Forked from ggml-org/llama.cppRun GGUF models easily with a KoboldAI UI. One File. Zero Install.
⚡ Native Swift LLM inference server for Apple Silicon. OpenAI-compatible API, SSD streaming for 100B+ MoE models, TurboQuant KV cache compression, + iOS iPhone app.
Scrypted is a high performance video integration and automation platform
A framework for efficient model inference with omni-modality models
Production-grade multi-agent orchestration framework. Model-agnostic, supports team collaboration, task scheduling, and inter-agent communication.
Claude Code without any telemetry - the perfect cli to combo with CCR
Distribute and run LLMs with a single file.
A fast, local-first "search engine" for !bang users
TurboQuant KV cache compression for MLX with fused Metal kernels. 4.6x compression at 98% FP16 speed.
Training the missing codec encoder for Mistral's Voxtral-4B-TTS, enabling zero-shot voice cloning
KV cache compression via block-diagonal rotation. Beats TurboQuant: better PPL (6.91 vs 7.07), 28% faster decode, 5.3x faster prefill, 44x fewer params. Drop-in llama.cpp integration.
maddada / cardinal
Forked from cardisoft/cardinalFastest file searching tool for macOS
OpenAI-compatible API server for Apple on-device models
Chromium fork named after radioactive element No. 90. Source code and Linux releases. Windows/MacOS/ARM builds served in different repos, links are towards the top of the README.md.
Mealie is a self hosted recipe manager and meal planner with a RestAPI backend and a reactive frontend application built in Vue for a pleasant user experience for the whole family. Easily add recip…
Application for managing recipes, planning meals, building shopping lists and much much more!
Free and open source video editor, based on MLT Framework and KDE Frameworks
OBS Studio - Free and open source software for live streaming and screen recording
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
A privacy-first, open-source platform for knowledge management and collaboration. Download link: http://github.com/logseq/logseq/releases. roadmap: https://discuss.logseq.com/t/logseq-product-roadm…
A lightweight macOS file browser and cleanup utility
Fast, local-first voice app for Mac. Dictation and transcription powered by Parakeet TDT on the Neural Engine.