benchflow-ai/skillsbench

SkillsBench evaluates how well skills work and how effective agents are at using them

GitHub repository with 1,299 stars and 313 forks.

Language: PDDL

Open provider repository

Latest metric snapshot

2026-06-05: 1,299 stars and 313 forks.