mlx-serve

PyPI page
Home page
Author: mlx-serve contributors
Summary: Local inference server for Apple Silicon that hot-swaps MLX models (LLM, vision, embeddings, TTS, STT) via OpenAI-compatible API
Latest version: 0.1.0
Required dependencies: fastapi | httpx | psutil | python-multipart | pyyaml | uvicorn
Optional dependencies: mlx-audio | mlx-embeddings | mlx-lm | mlx-vlm | mlx-whisper | pytest | pytest-asyncio | ruff

Downloads last day: 8
Downloads last week: 43
Downloads last month: 108