cloudrift-ai/deplodock
Benchmark and deploy optimized LLMs on GPU servers with vLLM or SGLang. Choose from a list of optimized recipes for popular models, or create your own with custom configurations. Run benchmarks across different GPU types and configurations, track results, and share experiments with the community.
GitHub repository with 56 stars and 5 forks.
Language: Python