Skip to content
@gpustack

GPUStack

GPU cluster manager for optimized AI model deployment

Pinned Loading

  1. gpustack gpustack Public

    A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

    Python 4.8k 490

  2. runner runner Public

    Collection of Dockerfiles to build images for various inference services across different accelerated backends.

    Dockerfile 11 9

  3. runtime runtime Public

    Provides a unified interface to detect GPU resources and manages GPU workloads.

    Python 12 15

  4. gguf-parser-go gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    Go 254 24

  5. vox-box vox-box Public

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    Python 201 33

Repositories

Showing 10 of 15 repositories
  • .github Public

    Meta-Github repository for all GPUStack repositories.

    gpustack/.github’s past year of commit activity
    1 Apache-2.0 4 0 0 Updated Apr 1, 2026
  • gpustack Public

    A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

    gpustack/gpustack’s past year of commit activity
    Python 4,764 Apache-2.0 490 511 29 Updated Apr 1, 2026
  • community-inference-backends Public

    Community Inference Backends for GPUStack V2

    gpustack/community-inference-backends’s past year of commit activity
    Python 11 Apache-2.0 8 0 0 Updated Apr 1, 2026
  • gpustack-ui Public
    gpustack/gpustack-ui’s past year of commit activity
    TypeScript 77 Apache-2.0 55 2 5 Updated Mar 31, 2026
  • runtime Public

    Provides a unified interface to detect GPU resources and manages GPU workloads.

    gpustack/runtime’s past year of commit activity
    Python 12 Apache-2.0 15 0 3 Updated Mar 31, 2026
  • gpustack/gpustack.github.io’s past year of commit activity
    HTML 1 2 0 0 Updated Mar 28, 2026
  • gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    gpustack/gguf-parser-go’s past year of commit activity
    Go 254 MIT 24 1 0 Updated Mar 25, 2026
  • gpustack/gpustack-higress-plugin’s past year of commit activity
    Go 1 2 0 0 Updated Mar 20, 2026
  • runner Public

    Collection of Dockerfiles to build images for various inference services across different accelerated backends.

    gpustack/runner’s past year of commit activity
    Dockerfile 11 Apache-2.0 9 0 0 Updated Mar 13, 2026
  • gpustack/benchmark-runner’s past year of commit activity
    Python 2 Apache-2.0 2 1 0 Updated Mar 6, 2026

Most used topics

Loading…