JudgmentLabs/judgeval

The Continuous-Improvement Stack for Agents. Our environment data and evals power agent improvement and monitoring.

GitHub repository with 1,036 stars and 93 forks.

Language: Python

Topics: langchain, langgraph, llama-index, llm, llm-evaluation, llm-observability, open-source, openai, prompt-engineering, agent

Open provider repository

Latest metric snapshot

2026-06-10: 1,036 stars and 93 forks.

Similar repositories

1. chopratejas/headroom

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

GitHub repository with 24,986 stars and 1,636 forks.

Trending score: 5.73; stars gained: +2,844; forks gained: +202.

Language: Python

Topics: agent, ai, anthropic, claude-code, compression, context-engineering
2. BerriAI/litellm

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM]

GitHub repository with 50,215 stars and 8,841 forks.

Trending score: 4.12; stars gained: +246; forks gained: +59.

Language: Python

Topics: ai-gateway, anthropic, azure-openai, bedrock, gateway, langchain
3. langchain-ai/deepagents

The batteries-included agent harness.

GitHub repository with 24,561 stars and 3,480 forks.

Trending score: 3.81; stars gained: +159; forks gained: +21.

Language: Python

Topics: ai, deepagents, langchain, langgraph, python, typescript
4. langchain-ai/langchain

The agent engineering platform.

GitHub repository with 139,167 stars and 23,072 forks.

Trending score: 3.80; stars gained: +155; forks gained: +42.

Language: Python

Topics: agents, ai, ai-agents, anthropic, chatgpt, deepagents
5. bytedance/deer-flow

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that could take minutes to hours.

GitHub repository with 71,070 stars and 9,628 forks.

Trending score: 3.76; stars gained: +154; forks gained: +29.

Language: Python

Topics: agent, agentic, agentic-framework, agentic-workflow, ai, ai-agents
6. plastic-labs/honcho

Memory library for building stateful agents

GitHub repository with 5,110 stars and 611 forks.

Trending score: 3.72; stars gained: +66; forks gained: +16.

Language: Python

Topics: ai, llm, memory, personalization, embeddings, rag

Trending in Python

1. mvanhorn/last30days-skill

AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

GitHub repository with 40,614 stars and 3,271 forks.

Trending score: 5.82; stars gained: +1,312; forks gained: +87.

Language: Python
2. chopratejas/headroom

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

GitHub repository with 24,986 stars and 1,636 forks.

Trending score: 5.73; stars gained: +2,844; forks gained: +202.

Language: Python

Topics: agent, ai, anthropic, claude-code, compression, context-engineering
3. pewdiepie-archdaemon/odysseus

Self-hosted AI workspace.

GitHub repository with 69,640 stars and 8,816 forks.

Trending score: 5.70; stars gained: +951; forks gained: +165.

Language: Python
4. NousResearch/hermes-agent

The agent that grows with you

GitHub repository with 192,310 stars and 33,527 forks.

Trending score: 5.48; stars gained: +990; forks gained: +282.

Language: Python

Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude
5. safishamsi/graphify

AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. App code + database schema + infrastructure in one graph.

GitHub repository with 66,406 stars and 6,716 forks.

Trending score: 5.25; stars gained: +1,314; forks gained: +109.

Language: Python

Topics: claude-code, graphrag, knowledge-graph, codex, openclaw, skills
6. hugohe3/ppt-master

AI generates a real, editable PowerPoint from any document — native shapes & animations, speaker notes voiced as audio narration, and the option to follow your own .pptx template, not slide images · by Hugo He

GitHub repository with 27,112 stars and 2,418 forks.

Trending score: 5.10; stars gained: +903; forks gained: +61.

Language: Python

Topics: ai-agent, aippt, office, powerpoint, powerpoint-generation, ppt

JudgmentLabs/judgeval

Latest metric snapshot

Similar repositories

Trending in Python

Trending topic: langchain