strands-labs/benchmark-harnesses
Strands-based agents and harnesses for agentic benchmarks.
GitHub repository with 21 stars and 3 forks.
Language: Python
Topics: agentic, agentic-ai, ai, benchmarks, genai, llm, machine-learning, strands-agents, strands-labs, swe-bench