Arize-ai/phoenix

AI Observability & Evaluation

GitHub repository with 9,989 stars and 909 forks.

Language: Python

Topics: llmops, ai-monitoring, ai-observability, llm-eval, aiengineering, datasets, agents, llms, prompt-engineering, anthropic

Open provider repository

24h trend summary

Trending score 3.03, activity score 4.67, stars gained +17, forks gained +2.

Latest metric snapshot

2026-06-05: 9,989 stars and 909 forks.

Similar repositories

1. BlazeUp-AI/Observal

Observal is a unified platform for agent distribution, observability and insights.

GitHub repository with 1,997 stars and 376 forks.

Trending score: 4.07; stars gained: +119; forks gained: +63.

Language: Python

Topics: cli-tool, evaluation, large-language-models, llm, llm-evaluation, llm-observability
2. BerriAI/litellm

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM]

GitHub repository with 49,379 stars and 8,621 forks.

Trending score: 4.07; stars gained: +250; forks gained: +59.

Language: Python

Topics: ai-gateway, anthropic, azure-openai, bedrock, gateway, langchain
3. mlflow/mlflow

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

GitHub repository with 26,310 stars and 5,812 forks.

Trending score: 3.17; stars gained: +43; forks gained: +7.

Language: Python

Topics: machine-learning, ai, ml, mlflow, apache-spark, model-management
4. Arize-ai/phoenix

AI Observability & Evaluation

GitHub repository with 9,989 stars and 909 forks.

Trending score: 3.03; stars gained: +17; forks gained: +2.

Language: Python

Topics: llmops, ai-monitoring, ai-observability, llm-eval, aiengineering, datasets
5. comet-ml/opik

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

GitHub repository with 19,434 stars and 1,496 forks.

Trending score: 2.37; stars gained: +10; forks gained: +1.

Language: Python

Topics: evaluation, hacktoberfest, hacktoberfest2025, langchain, llama-index, llm
6. shyftlabs/continuum

Continuum — the agent runtime by ShyftLabs. Build, orchestrate, ship.

GitHub repository with 69 stars and 6 forks.

Trending score: 0.93; stars gained: +8; forks gained: +0.

Language: Python

Topics: agent-framework, agentic-ai, ai-agents, ai-orchestration, anthropic, enterprise-ai

Trending in Python

1. NousResearch/hermes-agent

The agent that grows with you

GitHub repository with 181,806 stars and 31,193 forks.

Trending score: 5.95; stars gained: +1,867; forks gained: +361.

Language: Python

Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude
2. chopratejas/headroom

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

GitHub repository with 13,361 stars and 853 forks.

Trending score: 5.69; stars gained: +2,829; forks gained: +175.

Language: Python

Topics: agent, ai, anthropic, compression, context-engineering, context-window
3. Imbad0202/academic-research-skills

Academic Research Skills for Claude Code: research → write → review → revise → finalize

GitHub repository with 27,484 stars and 2,256 forks.

Trending score: 5.52; stars gained: +1,079; forks gained: +89.

Language: Python

Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review
4. rohitg00/ai-engineering-from-scratch

Learn it. Build it. Ship it for others.

GitHub repository with 28,622 stars and 4,680 forks.

Trending score: 5.32; stars gained: +1,261; forks gained: +238.

Language: Python

Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course
5. anthropics/financial-services

GitHub repository with 30,029 stars and 4,231 forks.

Trending score: 4.88; stars gained: +688; forks gained: +114.

Language: Python
6. vinta/awesome-python

An opinionated list of Python frameworks, libraries, tools, and resources

GitHub repository with 301,371 stars and 28,044 forks.

Trending score: 4.60; stars gained: +518; forks gained: +24.

Language: Python

Topics: awesome, python, collections, python-frameworks, python-libraries, python-tools

Arize-ai/phoenix

24h trend summary

Latest metric snapshot

Similar repositories

1. BlazeUp-AI/Observal

2. BerriAI/litellm

3. mlflow/mlflow

4. Arize-ai/phoenix

5. comet-ml/opik

6. shyftlabs/continuum

Trending in Python

1. NousResearch/hermes-agent

2. chopratejas/headroom

3. Imbad0202/academic-research-skills

4. rohitg00/ai-engineering-from-scratch

5. anthropics/financial-services

6. vinta/awesome-python

Trending topic: llmops

1. BlazeUp-AI/Observal

2. BerriAI/litellm

3. maximhq/bifrost

4. vllm-project/vllm-ascend

5. mlflow/mlflow

6. Arize-ai/phoenix