hwdsl2/docker-ai-stack
Deploy a complete, self-hosted AI stack on your own server with one command. Includes Ollama (LLM), AnythingLLM (chat UI), LiteLLM (AI gateway), Whisper (STT), Kokoro (TTS), Embeddings (RAG), and MCP Gateway. Most services run locally; LiteLLM optionally routes to external providers. Multi-arch: amd64, arm64. NVIDIA GPU (CUDA) acceleration.
GitHub repository with 62 stars and 11 forks.
Language: Shell
Topics: ai, ai-stack, arm64, docker, docker-compose, docker-image, embeddings, inference, linux, llm