raullenchai/Rapid-MLX
The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s cached TTFT, 100% tool calling. 17 tool parsers, prompt cache, reasoning separation, cloud routing. Drop-in OpenAI replacement. Works with Claude Code, Cursor, Aider.
GitHub repository with 2,800 stars and 342 forks.
Language: Python
Topics: apple-silicon, claude-code, cursor, deepseek, fastapi, hacktoberfest, inference, llm, local-llm, m1