sgl-project/sglang
SGLang is a high-performance serving framework for large language models and multimodal models.
GitHub repository with 28,862 stars and 6,348 forks.
Language: Python
Topics: attention, blackwell, cuda, deepseek, diffusion, glm, gpt-oss, inference, llama, llm