Skip to content
View arozanov's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@meta-edu

Block or report arozanov

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. turboquant-mlx turboquant-mlx Public

    TurboQuant KV cache compression for MLX with fused Metal kernels. 4.6x compression at 98% FP16 speed.

    Python 109 20

  2. ggml-ane ggml-ane Public

    Objective-C++ 12

  3. mlx-lm mlx-lm Public

    Forked from ml-explore/mlx-lm

    Run LLMs with MLX

    Python 2 1

  4. vllm-mlx vllm-mlx Public

    Forked from waybarrios/vllm-mlx

    OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX …

    Python 2

  5. mlx mlx Public

    Forked from ml-explore/mlx

    MLX: An array framework for Apple silicon

    C++ 1

  6. pack-calculator pack-calculator Public

    Pack calculator coding challenge

    Go 1