jd-opensource/xllm
A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.
GitHub repository with 1,320 stars and 221 forks.
Language: C++
Topics: deepseek, inference, inference-engine, llm-inference, qwen, large-language-models, glm