illuin-tech/data-pipeline
Library for describing data transformation pipelines by compositing simple reusable components.
GitHub repository with 6 stars and 0 forks.
Language: Java
Topics: data-pipeline, etl, java
Library for describing data transformation pipelines by compositing simple reusable components.
GitHub repository with 6 stars and 0 forks.
Language: Java
Topics: data-pipeline, etl, java
2026-06-05: 6 stars and 0 forks.
Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
GitHub repository with 20,731 stars and 6,893 forks.
Trending score: 0.94; stars gained: +4; forks gained: +0.
Language: Java
Topics: database, distributed-database, distributed-sql-database, sql, shard, database-cluster
Agentic Data Engineering Harness for building data pipelines, data products, data APIs, and data lakes autonomously
GitHub repository with 213 stars and 25 forks.
Trending score: 0.09; stars gained: +0; forks gained: +0.
Language: Java
Topics: streaming, data-pipeline, api, event-driven, data-engineering, harness
Multi-platform SDK for integrating GitHub Copilot Agent into apps and services
GitHub repository with 9,056 stars and 1,216 forks.
Trending score: 3.47; stars gained: +166; forks gained: +12.
Language: Java
Light, fluffy, and always free - The AWS Local Emulator alternative
GitHub repository with 13,640 stars and 1,293 forks.
Trending score: 3.33; stars gained: +78; forks gained: +7.
Language: Java
Topics: aws, aws-emulation, localstack, devops, docker, ec2
WebHomeTV 基于FongMi二次开发,增强了 WebHome 自定义首页、App Native SDK、网盘链接检测 和 Nostr推荐首页。 这个项目的核心目标是让 CSP 站点首页可以变成一个真正可开发的网页应用:开发者可以用 HTML/CSS/JavaScript 定制首页,再通过 App 暴露的 Native 能力完成搜索、播放、跨域请求、资源代理、最近观看、网盘检测和状态同步。
GitHub repository with 372 stars and 107 forks.
Trending score: 3.29; stars gained: +83; forks gained: +16.
Language: Java
AI equity research agent with resilient workflows, Redis Lua single-flight, pgvector RAG, versioned reports, evidence tracing, and RAG evaluation.
GitHub repository with 1,000 stars and 58 forks.
Trending score: 3.24; stars gained: +77; forks gained: +1.
Language: Java
Topics: ai-agent, financial-research, llm-evaluation, pgvector, postgresql, rabbitmq
GitHub repository with 742 stars and 106 forks.
Trending score: 2.98; stars gained: +53; forks gained: +7.
Language: Java
Apache Doris is an easy-to-use, high performance and unified analytics database.
GitHub repository with 15,438 stars and 3,812 forks.
Trending score: 2.65; stars gained: +11; forks gained: +7.
Language: Java
Topics: agent, ai, bigquery, database, dbt, delta-lake
Local-first ETL/ELT studio: a drag-and-drop visual pipeline designer that compiles to SQL and runs on DuckDB. Tiny desktop app, no servers, git-friendly workspaces.
GitHub repository with 275 stars and 20 forks.
Trending score: 2.64; stars gained: +36; forks gained: +0.
Language: Rust
Topics: data-engineering, data-integration, data-pipeline, data-quality, desktop-app, drag-and-drop
High-performance AI pipeline engine with a C++ core and 50+ Python-extensible nodes. Build, debug, and scale LLM workflows with 13+ model providers, 8+ vector databases, and agent orchestration, all from your IDE. Includes VS Code extension, TypeScript/Python SDKs, and Docker deployment.
GitHub repository with 3,760 stars and 1,226 forks.
Trending score: 2.16; stars gained: +7; forks gained: +8.
Language: C++
Topics: ai, cpp, data-pipeline, data-processing, machine-learning, mcp
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
GitHub repository with 3,656 stars and 128 forks.
Trending score: 1.93; stars gained: +77; forks gained: +5.
Language: Go
Topics: bigquery, copy-database, data-ingestion, data-integration, data-pipeline, duckdb
SeaTunnel Web is a visual tool for building and watching over your Apache SeaTunnel data pipelines, with drag-and-drop DAGs and simple connector setup.
GitHub repository with 279 stars and 30 forks.
Trending score: 1.85; stars gained: +57; forks gained: +5.
Language: TypeScript
Topics: batch, dag, data-engineering, data-integration, data-pipeline, etl
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
GitHub repository with 6,490 stars and 373 forks.
Trending score: 1.18; stars gained: +16; forks gained: -1.
Language: Python
Topics: data, data-analysis, data-pipeline, data-processing, data-science, data-visualization
Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
GitHub repository with 20,731 stars and 6,893 forks.
Trending score: 0.94; stars gained: +4; forks gained: +0.
Language: Java
Topics: database, distributed-database, distributed-sql-database, sql, shard, database-cluster