forecastingresearch/forecastbench
Forecastbench is a dynamic, contamination-free benchmark of LLM forecasting accuracy with human comparison groups, serving as a valuable proxy for general intelligence.
GitHub repository with 68 stars and 12 forks.
Language: Python
Topics: benchmark, forecastbench, forecasting, llm-benchmarking