benchflow-ai/skillsbench
SkillsBench evaluates how well skills work and how effective agents are at using them.
GitHub repository with 1,299 stars and 313 forks.
Language: PDDL
2026-06-05: 1,299 stars and 313 forks.