Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models
Updated Mar 27, 2024
Superpipe - optimized LLM pipelines for structured data
Implementation of the paper Fast Inference from Transformers via Speculative Decoding, Leviathan et al. 2023.
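The entry above implements draft-then-verify decoding from Leviathan et al. (2023). A minimal sketch of one acceptance round, assuming placeholder `target_prob`, `draft_prob`, and `draft_sample` callables rather than any real model API:

```python
import random

def speculative_decode(target_prob, draft_prob, draft_sample, prefix, k=4):
    """One acceptance round of speculative decoding.

    A cheap draft model proposes up to k tokens; each is accepted with
    probability min(1, p/q), where p and q are the target and draft
    probabilities of that token. Rejected tokens trigger a resample
    from the target model (omitted here).
    """
    accepted = []
    for _ in range(k):
        tok = draft_sample(prefix + accepted)
        p = target_prob(prefix + accepted, tok)
        q = draft_prob(prefix + accepted, tok)  # q > 0 for sampled tokens
        if random.random() < min(1.0, p / q):
            accepted.append(tok)  # draft token verified by the target model
        else:
            break  # rejection: stop and let the target model resample
    return accepted
```

When the draft and target distributions agree, all k draft tokens are accepted in a single target-model pass, which is where the speedup comes from.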
Nadir is a Python package that dynamically chooses the best LLM for your prompt by balancing complexity, cost, and response time.
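A cost-aware router of this kind might work as sketched below. The complexity heuristic and model table are hypothetical illustrations, not Nadir's actual logic:

```python
def route_prompt(prompt, models):
    """Pick the cheapest model whose capability covers the prompt's
    estimated complexity; fall back to the most capable model otherwise."""
    complexity = min(len(prompt.split()) / 100, 1.0)  # crude length proxy
    eligible = [m for m in models if m["capability"] >= complexity]
    pool = eligible or [max(models, key=lambda m: m["capability"])]
    # among eligible models, prefer low cost, then low latency
    return min(pool, key=lambda m: (m["cost_per_1k"], m["latency_ms"]))

MODELS = [
    {"name": "small", "capability": 0.3, "cost_per_1k": 0.1, "latency_ms": 200},
    {"name": "large", "capability": 0.9, "cost_per_1k": 1.0, "latency_ms": 900},
]
```

Simple prompts go to the cheap model; only prompts that exceed its capability score pay for the larger one.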
Optimize AI workflows with Arachne. Automatically assembles the perfect code context (Tree, Target, Deps, Semantic) to fit context windows without noise. Built for efficiency and scale.
Optimized fork of Superpowers — Retains full original features while adding automatic 3-tier workflow routing, integrated safety guards (OWASP-aligned), red-team adversarial testing with auto-fix pipeline, a built-in memory stack, and much more. Delivers faster, more reliable, hallucination-resistant coding sessions.
Automatically saves ~47% on Manus AI credits and pays for itself in roughly 27 prompts. Includes a free MCP server (on PyPI) and a $12 Manus Skill bundle with Fast Navigation (claimed 115x speed boost).
DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework
A Demo of Running Sleep-time Compute to Reduce LLM Latency
Declarative Self-improving Elixir - DSPy orchestration in Elixir
⚡ Cut LLM inference costs 80% with Programmatic Tool Calling. Instead of N tool call round-trips, generate JavaScript to orchestrate tools in Vercel Sandbox. Supports Anthropic, OpenAI, 100+ models via AI Gateway. Novel MCP Bridge for external service integration.
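The core idea above is to replace N model-to-tool round trips with one model-generated program that orchestrates the tools locally. The repo generates JavaScript for a sandbox; as a language-neutral analogue, this hypothetical sketch interprets a simple step list in one pass:

```python
def orchestrate(tools, plan):
    """Execute a model-generated plan in a single pass.

    Each step names a tool, its arguments (literals or names of earlier
    outputs), and a variable to bind the result to. This replaces one
    model round-trip per tool call with one generated plan.
    """
    env = {}
    for step in plan:
        # resolve argument names against earlier outputs, else pass literals
        args = [env.get(a, a) for a in step["args"]]
        env[step["out"]] = tools[step["tool"]](*args)
    return env
```

Chaining happens inside the runtime: the second step consumes the first step's output without the model ever seeing the intermediate value.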
Generate clean, AI-ready llms.txt files for your website or docs. Supports crawling, sitemaps, static builds, and framework-aware adapters (Next.js, Vite, Nuxt, Astro, Remix). Includes Markdown/MDX docs mode and robots.txt generator for LLM and search crawlers.
Hybrid adaptive memory system for Claude Code — Cheatsheet (positive patterns) + Immune (negative patterns)
A comprehensive toolkit for training and running lightweight adapters for GGUF-based language models (ERNIE, Llama, Mistral, Phi-3, etc.) without modifying the base model.
Opti-Oignon is a comprehensive optimization framework for local LLMs running on Ollama. It maximizes the performance of your local models through intelligent task routing based on a custom benchmark, RAG (Retrieval-Augmented Generation), and multi-model orchestration.
An application for LLM-based roster scheduling, prepared for my presentation at TTSH HINT.
End-to-End Python implementation of CompactPrompt (Choi et al., 2025): a unified pipeline for LLM prompt and data compression. Features modular compression pipeline with dependency-driven phrase pruning, reversible n-gram encoding, K-means quantization, and embedding-based exemplar selection. Achieves 2-4x token reduction while preserving accuracy.
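One of the techniques named above, reversible n-gram encoding, can be illustrated as follows. This sketch replaces the single most frequent word bigram with a short code and keeps a codebook for lossless decoding; it is a toy in the spirit of the idea, not CompactPrompt's exact algorithm:

```python
from collections import Counter

def ngram_encode(text, min_count=2):
    """Replace the most frequent word bigram with a short code.

    Returns (encoded_text, codebook); decoding with the codebook is
    lossless. The code token is chosen to be unlikely in normal prompts.
    """
    words = text.split()
    grams = Counter(" ".join(words[i:i + 2]) for i in range(len(words) - 1))
    if not grams:
        return text, {}
    gram, count = grams.most_common(1)[0]
    if count < min_count:
        return text, {}  # nothing repeats enough to be worth encoding
    code = "§0"
    return text.replace(gram, code), {code: gram}

def ngram_decode(text, codebook):
    for code, gram in codebook.items():
        text = text.replace(code, gram)
    return text
```

A full pipeline would iterate this over many n-grams and combine it with phrase pruning and quantization, but the round-trip property is the key invariant.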
Structured repository covering LLM foundations, fine-tuning workflows, optimization strategies, deployment patterns, evaluation methods, and Responsible AI considerations.
Adaptive semantic cache for LLMs with streaming support, ML-based thresholds, and real-time cost tracking. Built in Rust for sub-millisecond performance.
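A semantic cache of this kind returns a stored answer when a new query embeds close enough to a previous one, skipping the LLM call entirely. A minimal Python sketch (the repo itself is Rust), with `embed` standing in for a real embedding model:

```python
import math

class SemanticCache:
    """Embedding-similarity cache: a hit is any stored query whose
    cosine similarity to the new query clears the threshold."""

    def __init__(self, embed, threshold=0.9):
        self.embed, self.threshold, self.store = embed, threshold, []

    @staticmethod
    def _cos(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        na = math.sqrt(sum(x * x for x in a))
        nb = math.sqrt(sum(x * x for x in b))
        return dot / (na * nb) if na and nb else 0.0

    def get(self, query):
        v = self.embed(query)
        best = max(self.store, key=lambda e: self._cos(v, e[0]), default=None)
        if best and self._cos(v, best[0]) >= self.threshold:
            return best[1]  # cache hit: reuse the stored answer
        return None  # cache miss: caller falls through to the LLM

    def put(self, query, answer):
        self.store.append((self.embed(query), answer))
```

The threshold is the whole game: too low and unrelated queries collide, too high and paraphrases miss, which is why the repo makes it ML-tuned rather than fixed.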
Bio-inspired optimization pipeline for Claude Code. 3 systems: Slime Mold (explore→prune) → PRISM (perspectives→compile) → Immune (scan→correct→learn). Domain-agnostic.