modelscope/evalscope
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
GitHub repository with 2,888 stars and 386 forks.
Language: Python
Topics: evaluation, llm, performance, rag, vlm