alez007/modelship
Self-hosted, multi-model AI inference server. Run LLMs, TTS, STT, embeddings, and image generation with an OpenAI-compatible API.
GitHub repository with 35 stars and 4 forks.
Language: Python
Topics: ai, ai-platform, diffusers, embeddings, image-generation, inference, llm, openai, ray, self-hosted