The open-source diagnostic for AI misalignment. 32 tests across five categories: fabrication, manipulation, deception, unpredictability, and opacity. Provider-agnostic: runs against OpenAI, Anthropic, Bedrock, Azure, Gemini, and more. Produces a letter grade in under 5 minutes and a content-addressed manifest for bit-identical replay. Built by iMe.
GitHub repository with 466 stars and 90 forks.
Trending score: 0.53; stars gained: +2; forks gained: +1.
Language: Python
Topics: ai, diagnostic-tool, misalignment, agent-evaluation, ai-alignment, ai-evaluation
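
The description mentions a content-addressed manifest for bit-identical replay. The listing does not show how the project implements this, so the following is only a minimal sketch of the general idea, assuming the manifest is a JSON-serializable dict and that SHA-256 over a canonical JSON encoding serves as the address. The function name `manifest_digest` and every manifest field shown (`provider`, `model`, `tests`, `seed`) are hypothetical and are not taken from the repository.

```python
import hashlib
import json


def manifest_digest(manifest: dict) -> str:
    """Return the SHA-256 hex digest of a canonical JSON encoding.

    Sorting keys and fixing the separators makes the byte encoding
    independent of dict insertion order, so equal manifests always
    map to the same address.
    """
    canonical = json.dumps(
        manifest,
        sort_keys=True,
        separators=(",", ":"),
        ensure_ascii=True,
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Hypothetical manifest fields, for illustration only.
run_a = {
    "provider": "openai",
    "model": "example-model",
    "tests": ["fabrication-01", "opacity-03"],
    "seed": 7,
}

# Same content, different key order: the digest must match.
run_b = {
    "seed": 7,
    "tests": ["fabrication-01", "opacity-03"],
    "model": "example-model",
    "provider": "openai",
}

# Any change to the content yields a different address.
run_c = {**run_a, "seed": 8}

assert manifest_digest(run_a) == manifest_digest(run_b)
assert manifest_digest(run_a) != manifest_digest(run_c)
```

Under these assumptions, a replay could recompute the digest of the manifest it was given and refuse to run if it differs from the recorded one. This catches configuration drift, though bit-identical model output would also depend on the provider behaving deterministically.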