kubernetes-sigs/devops-bench
A standardized benchmarking suite to evaluate how well different agents or models perform specific DevOps tasks. Its goal is to provide an open-source, reproducible way to transparently assess agent performance across various infrastructure platforms and operational environments.
GitHub repository with 5 stars and 7 forks.
Language: Python