ModelTC/LightLLM
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
GitHub repository with 4,086 stars and 332 forks.
Language: Python
Topics: deep-learning, gpt, llama, llm, model-serving, nlp, openai-triton