InternScience/ResearchClawBench
🦞 ResearchClawBench: Evaluating AI Agents for Automated Research from Re-Discovery to New-Discovery
GitHub repository with 127 stars and 11 forks.
Language: Jupyter Notebook
Topics: auto-research, openclaw, agent, ai4science, benchmark, claude, claude-code, codex, discovery, evaluation