JudgmentLabs/judgeval

The Continuous-Improvement Stack for Agents. Our environment data and evals power agent improvement and monitoring.

GitHub repository with 1,036 stars and 93 forks.

Language: Python

Topics: langchain, langgraph, llama-index, llm, llm-evaluation, llm-observability, open-source, openai, prompt-engineering, agent

Open provider repository

Latest metric snapshot

2026-06-10: 1,036 stars and 93 forks.

Similar repositories

  1. 1. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 24,986 stars and 1,636 forks.

    Trending score: 5.73; stars gained: +2,844; forks gained: +202.

    Language: Python

    Topics: agent, ai, anthropic, claude-code, compression, context-engineering

  2. 2. BerriAI/litellm

    Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM]

    GitHub repository with 50,215 stars and 8,841 forks.

    Trending score: 4.12; stars gained: +246; forks gained: +59.

    Language: Python

    Topics: ai-gateway, anthropic, azure-openai, bedrock, gateway, langchain

  3. 3. langchain-ai/deepagents

    The batteries-included agent harness.

    GitHub repository with 24,561 stars and 3,480 forks.

    Trending score: 3.81; stars gained: +159; forks gained: +21.

    Language: Python

    Topics: ai, deepagents, langchain, langgraph, python, typescript

  4. 4. langchain-ai/langchain

    The agent engineering platform.

    GitHub repository with 139,167 stars and 23,072 forks.

    Trending score: 3.80; stars gained: +155; forks gained: +42.

    Language: Python

    Topics: agents, ai, ai-agents, anthropic, chatgpt, deepagents

  5. 5. bytedance/deer-flow

    An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that could take minutes to hours.

    GitHub repository with 71,070 stars and 9,628 forks.

    Trending score: 3.76; stars gained: +154; forks gained: +29.

    Language: Python

    Topics: agent, agentic, agentic-framework, agentic-workflow, ai, ai-agents

  6. 6. plastic-labs/honcho

    Memory library for building stateful agents

    GitHub repository with 5,110 stars and 611 forks.

    Trending score: 3.72; stars gained: +66; forks gained: +16.

    Language: Python

    Topics: ai, llm, memory, personalization, embeddings, rag

Trending in Python

  1. 1. mvanhorn/last30days-skill

    AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

    GitHub repository with 40,614 stars and 3,271 forks.

    Trending score: 5.82; stars gained: +1,312; forks gained: +87.

    Language: Python

  2. 2. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 24,986 stars and 1,636 forks.

    Trending score: 5.73; stars gained: +2,844; forks gained: +202.

    Language: Python

    Topics: agent, ai, anthropic, claude-code, compression, context-engineering

  3. 3. pewdiepie-archdaemon/odysseus

    Self-hosted AI workspace.

    GitHub repository with 69,640 stars and 8,816 forks.

    Trending score: 5.70; stars gained: +951; forks gained: +165.

    Language: Python

  4. 4. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 192,310 stars and 33,527 forks.

    Trending score: 5.48; stars gained: +990; forks gained: +282.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  5. 5. safishamsi/graphify

    AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. App code + database schema + infrastructure in one graph.

    GitHub repository with 66,406 stars and 6,716 forks.

    Trending score: 5.25; stars gained: +1,314; forks gained: +109.

    Language: Python

    Topics: claude-code, graphrag, knowledge-graph, codex, openclaw, skills

  6. 6. hugohe3/ppt-master

    AI generates a real, editable PowerPoint from any document — native shapes & animations, speaker notes voiced as audio narration, and the option to follow your own .pptx template, not slide images · by Hugo He

    GitHub repository with 27,112 stars and 2,418 forks.

    Trending score: 5.10; stars gained: +903; forks gained: +61.

    Language: Python

    Topics: ai-agent, aippt, office, powerpoint, powerpoint-generation, ppt

Trending topic: langchain

  1. 1. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 24,986 stars and 1,636 forks.

    Trending score: 5.73; stars gained: +2,844; forks gained: +202.

    Language: Python

    Topics: agent, ai, anthropic, claude-code, compression, context-engineering

  2. 2. BerriAI/litellm

    Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM]

    GitHub repository with 50,215 stars and 8,841 forks.

    Trending score: 4.12; stars gained: +246; forks gained: +59.

    Language: Python

    Topics: ai-gateway, anthropic, azure-openai, bedrock, gateway, langchain

  3. 3. langchain-ai/deepagents

    The batteries-included agent harness.

    GitHub repository with 24,561 stars and 3,480 forks.

    Trending score: 3.81; stars gained: +159; forks gained: +21.

    Language: Python

    Topics: ai, deepagents, langchain, langgraph, python, typescript

  4. 4. langchain-ai/langchain

    The agent engineering platform.

    GitHub repository with 139,167 stars and 23,072 forks.

    Trending score: 3.80; stars gained: +155; forks gained: +42.

    Language: Python

    Topics: agents, ai, ai-agents, anthropic, chatgpt, deepagents

  5. 5. bytedance/deer-flow

    An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that could take minutes to hours.

    GitHub repository with 71,070 stars and 9,628 forks.

    Trending score: 3.76; stars gained: +154; forks gained: +29.

    Language: Python

    Topics: agent, agentic, agentic-framework, agentic-workflow, ai, ai-agents

  6. 6. plastic-labs/honcho

    Memory library for building stateful agents

    GitHub repository with 5,110 stars and 611 forks.

    Trending score: 3.72; stars gained: +66; forks gained: +16.

    Language: Python

    Topics: ai, llm, memory, personalization, embeddings, rag