exmergo/research-llm-car-wash-test
The car wash is 100m away from my house. Should I walk or drive? LLMs tackle this test in surprisingly different ways
GitHub repository with 8 stars and 1 forks.
Language: Python
Topics: car-wash, claude, exmergo, gemini, gpt, llama, llm, llm-evaluation, open-research, research