MTSWebServices/data-rentgen
NextGen DataMotion Lineage
GitHub repository with 18 stars and 0 forks.
Language: Python
Topics: airflow, dbt, flink, hive, lineage, openlineage, rest-api, spark
2026-06-05: 18 stars and 0 forks.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
GitHub repository with 45,703 stars and 17,175 forks.
Trending score: 1.23; stars gained: +18; forks gained: +20.
Language: Python
Topics: airflow, apache, apache-airflow, automation, dag, data-engineering
Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.
GitHub repository with 79 stars and 10 forks.
Trending score: 0.94; stars gained: +8; forks gained: +1.
Language: Python
Topics: active-learning, agent, airflow, auto-config, data-engineering, data-quality
Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code
GitHub repository with 1,213 stars and 294 forks.
Trending score: 0.60; stars gained: +3; forks gained: +0.
Language: Python
Topics: airflow, airflow-operators, apache-airflow, dbt, python, workflow
Production-style real-time e-commerce lakehouse with Kafka, Airflow, Databricks, Medallion architecture, data quality, quarantine, Terraform, and Dash analytics.
GitHub repository with 27 stars and 0 forks.
Trending score: 0.31; stars gained: +1; forks gained: +0.
Language: Python
Topics: airflow, data-engineering, data-quality, databricks, delta-lake, kafka
✍️ Revise and enhance novels with ReNovel-AI, your smart tool for story reimagining and memory-driven writing assistance.
GitHub repository with 12 stars and 0 forks.
Trending score: 0.17; stars gained: +0; forks gained: +0.
Language: Python
Topics: ai, ai-agents, ai-writing, airflow, chromadb, creative-writing
Real-time fraud detection lakehouse with Kafka, medallion pipelines, data quality, explainable scoring, and dashboards.
GitHub repository with 12 stars and 0 forks.
Trending score: 0.03; stars gained: +0; forks gained: +0.
Language: Python
Topics: airflow, data-engineering, data-quality, databricks, fraud-detection, kafka
The agent that grows with you
GitHub repository with 181,790 stars and 31,192 forks.
Trending score: 5.95; stars gained: +1,867; forks gained: +361.
Language: Python
Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
GitHub repository with 13,361 stars and 853 forks.
Trending score: 5.69; stars gained: +2,829; forks gained: +175.
Language: Python
Topics: agent, ai, anthropic, compression, context-engineering, context-window
Academic Research Skills for Claude Code: research → write → review → revise → finalize
GitHub repository with 27,484 stars and 2,256 forks.
Trending score: 5.52; stars gained: +1,079; forks gained: +89.
Language: Python
Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review
Learn it. Build it. Ship it for others.
GitHub repository with 28,622 stars and 4,680 forks.
Trending score: 5.32; stars gained: +1,261; forks gained: +238.
Language: Python
Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course
GitHub repository with 30,029 stars and 4,231 forks.
Trending score: 4.88; stars gained: +688; forks gained: +114.
Language: Python
An opinionated list of Python frameworks, libraries, tools, and resources
GitHub repository with 301,371 stars and 28,044 forks.
Trending score: 4.60; stars gained: +518; forks gained: +24.
Language: Python
Topics: awesome, python, collections, python-frameworks, python-libraries, python-tools
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
GitHub repository with 14,296 stars and 5,041 forks.
Trending score: 0.49; stars gained: +2; forks gained: -1.
Language: Java
Topics: airflow, azkaban, cloud-native, data-pipelines, job-scheduler, orchestration