vishwanathakuthota/openvals
Open-source AI model evaluation and benchmarking framework for LLMs (OpenAI, Ollama, Claude, Gemini)
GitHub repository with 10 stars and 6 forks.
Language: Python
Topics: ai-agents, ai-evaluation, ai-evaluation-framework, ai-quality, ai-reliability, ai-safety, ai-testing, claude, gemini, llm-benchmarking