TiesPetersen/SkillBenchmark
A benchmarking suite for Agent Skills (SKILL.md files) that measures whether a skill actually improves LLM output quality, and by how much.
GitHub repository with 10 stars and 1 forks.
Language: Python
Topics: ai, benchmark, claude, claude-code, llm, prompt-engineering, skill, tokens