InternScience/ResearchHarness
A lightweight, general-purpose harness for tool-using LLM agents, fair benchmark evaluation, harness baselines, and personal assistant workflows.
GitHub repository with 29 stars and 6 forks.
Language: Python
Topics: agent, llm, agent-harness, assistant, baseline, benchmarking, evaluation, gemini, glm, gpt