arozanov

Follow

🎯

Focusing

Anton Rozanov arozanov

🎯

Focusing

Follow

Software Developer. Node.js, React.

9 followers · 7 following

Achievements

Achievements

Organizations

Popular repositories Loading

turboquant-mlx turboquant-mlx Public

TurboQuant KV cache compression for MLX with fused Metal kernels. 4.6x compression at 98% FP16 speed.

Python 109 20
ggml-ane ggml-ane Public

Objective-C++ 12
mlx-lm mlx-lm Public

Forked from ml-explore/mlx-lm

Run LLMs with MLX

Python 2 1
vllm-mlx vllm-mlx Public

Forked from waybarrios/vllm-mlx

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX …

Python 2
mlx mlx Public

Forked from ml-explore/mlx

MLX: An array framework for Apple silicon

C++ 1
pack-calculator pack-calculator Public

Pack calculator coding challenge

Go 1