PaddlePaddle/FastDeploy
High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle
GitHub repository with 3,691 stars and 751 forks.
Language: Python
Topics: ernie, ernie-45, ernie-45-vl, inference, llm, llm-serving, openai, serving, vllm