dagster-io/dagster

An orchestration platform for the development, production, and observation of data assets.

GitHub repository with 15,628 stars and 2,155 forks.

Language: Python

Topics: data-pipelines, dagster, workflow, data-science, workflow-automation, python, scheduler, data-orchestrator, etl, analytics

Open provider repository

Latest metric snapshot

2026-06-05: 15,628 stars and 2,155 forks.

Similar repositories

  1. 1. pathwaycom/pathway

    Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

    GitHub repository with 63,095 stars and 1,679 forks.

    Trending score: 0.46; stars gained: -30; forks gained: +4.

    Language: Python

    Topics: batch-processing, kafka, pathway, python, streaming, machine-learning-algorithms

  2. 2. ucbepic/docetl

    A system for agentic LLM-powered data processing and ETL

    GitHub repository with 3,759 stars and 400 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +1.

    Language: Python

    Topics: agents, data, data-pipelines, document-analysis, document-processing, elt

Trending in Python

  1. 1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 182,432 stars and 31,284 forks.

    Trending score: 5.95; stars gained: +1,867; forks gained: +361.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. 2. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 14,053 stars and 885 forks.

    Trending score: 5.69; stars gained: +2,829; forks gained: +175.

    Language: Python

    Topics: agent, ai, anthropic, compression, context-engineering, context-window

  3. 3. Imbad0202/academic-research-skills

    Academic Research Skills for Claude Code: research → write → review → revise → finalize

    GitHub repository with 27,616 stars and 2,272 forks.

    Trending score: 5.52; stars gained: +1,079; forks gained: +89.

    Language: Python

    Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review

  4. 4. rohitg00/ai-engineering-from-scratch

    Learn it. Build it. Ship it for others.

    GitHub repository with 28,711 stars and 4,695 forks.

    Trending score: 5.32; stars gained: +1,261; forks gained: +238.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course

  5. 5. vinta/awesome-python

    An opinionated list of Python frameworks, libraries, tools, and resources

    GitHub repository with 301,435 stars and 28,046 forks.

    Trending score: 4.60; stars gained: +518; forks gained: +24.

    Language: Python

    Topics: awesome, collections, python, python-frameworks, python-libraries, python-tools

  6. 6. Alishahryar1/free-claude-code

    Use claude-code for free in the terminal, VSCode extension or discord like OpenClaw (voice supported)

    GitHub repository with 32,540 stars and 4,942 forks.

    Trending score: 4.56; stars gained: +467; forks gained: +82.

    Language: Python

Trending topic: data-pipelines

  1. 1. pathwaycom/pathway

    Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

    GitHub repository with 63,095 stars and 1,679 forks.

    Trending score: 0.46; stars gained: -30; forks gained: +4.

    Language: Python

    Topics: batch-processing, kafka, pathway, python, streaming, machine-learning-algorithms

  2. 2. ucbepic/docetl

    A system for agentic LLM-powered data processing and ETL

    GitHub repository with 3,759 stars and 400 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +1.

    Language: Python

    Topics: agents, data, data-pipelines, document-analysis, document-processing, elt

  3. 3. opendatadiscovery/odd-platform

    First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.

    GitHub repository with 1,408 stars and 139 forks.

    Trending score: 0.04; stars gained: +0; forks gained: -1.

    Language: Java

    Topics: alerting, bigdata, data-catalog, data-discovery, data-engineering, data-exploration

  4. 4. dataflint/spark

    Drop-in replacement for Apache Spark UI

    GitHub repository with 463 stars and 54 forks.

    Trending score: 0.04; stars gained: +0; forks gained: +0.

    Language: TypeScript

    Topics: apache-spark, big-data, data-pipeline, data-pipelines, databricks, dataproc