outsourc-e/bench-loop
Local-first CLI for benchmarking LLMs on real hardware — quality, speed, reliability, and a real multi-turn agent loop.
GitHub repository with 32 stars and 6 forks.
Language: Python
Topics: agent, benchmark, cli, evaluation, llm, local-llm, mlx, ollama, vllm