Arize-ai/phoenix
AI Observability & Evaluation
GitHub repository with 9,989 stars and 909 forks.
Language: Python
Topics: llmops, ai-monitoring, ai-observability, llm-eval, aiengineering, datasets, agents, llms, prompt-engineering, anthropic
Trending score 3.03, activity score 4.67, stars gained +17, forks gained +2.
2026-06-05: 9,989 stars and 909 forks.
Observal is a unified platform for agent distribution, observability and insights.
GitHub repository with 1,997 stars and 376 forks.
Trending score: 4.07; stars gained: +119; forks gained: +63.
Language: Python
Topics: cli-tool, evaluation, large-language-models, llm, llm-evaluation, llm-observability
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing, and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM]
GitHub repository with 49,379 stars and 8,621 forks.
Trending score: 4.07; stars gained: +250; forks gained: +59.
Language: Python
Topics: ai-gateway, anthropic, azure-openai, bedrock, gateway, langchain
The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.
GitHub repository with 26,310 stars and 5,812 forks.
Trending score: 3.17; stars gained: +43; forks gained: +7.
Language: Python
Topics: machine-learning, ai, ml, mlflow, apache-spark, model-management
AI Observability & Evaluation
GitHub repository with 9,989 stars and 909 forks.
Trending score: 3.03; stars gained: +17; forks gained: +2.
Language: Python
Topics: llmops, ai-monitoring, ai-observability, llm-eval, aiengineering, datasets
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
GitHub repository with 19,434 stars and 1,496 forks.
Trending score: 2.37; stars gained: +10; forks gained: +1.
Language: Python
Topics: evaluation, hacktoberfest, hacktoberfest2025, langchain, llama-index, llm
Continuum — the agent runtime by ShyftLabs. Build, orchestrate, ship.
GitHub repository with 69 stars and 6 forks.
Trending score: 0.93; stars gained: +8; forks gained: +0.
Language: Python
Topics: agent-framework, agentic-ai, ai-agents, ai-orchestration, anthropic, enterprise-ai
The agent that grows with you
GitHub repository with 181,806 stars and 31,193 forks.
Trending score: 5.95; stars gained: +1,867; forks gained: +361.
Language: Python
Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
GitHub repository with 13,361 stars and 853 forks.
Trending score: 5.69; stars gained: +2,829; forks gained: +175.
Language: Python
Topics: agent, ai, anthropic, compression, context-engineering, context-window
Academic Research Skills for Claude Code: research → write → review → revise → finalize
GitHub repository with 27,484 stars and 2,256 forks.
Trending score: 5.52; stars gained: +1,079; forks gained: +89.
Language: Python
Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review
Learn it. Build it. Ship it for others.
GitHub repository with 28,622 stars and 4,680 forks.
Trending score: 5.32; stars gained: +1,261; forks gained: +238.
Language: Python
Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course
GitHub repository with 30,029 stars and 4,231 forks.
Trending score: 4.88; stars gained: +688; forks gained: +114.
Language: Python
An opinionated list of Python frameworks, libraries, tools, and resources
GitHub repository with 301,371 stars and 28,044 forks.
Trending score: 4.60; stars gained: +518; forks gained: +24.
Language: Python
Topics: awesome, python, collections, python-frameworks, python-libraries, python-tools
Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead at 5k RPS.
GitHub repository with 5,497 stars and 695 forks.
Trending score: 3.70; stars gained: +61; forks gained: +12.
Language: Go
Topics: ai-gateway, gateway, gateway-services, generative-ai, guardrails, llm
Community maintained hardware plugin for vLLM on Ascend
GitHub repository with 2,201 stars and 1,350 forks.
Trending score: 3.25; stars gained: +16; forks gained: +22.
Language: C++
Topics: ascend, inference, llm, llm-serving, llmops, mlops