illuin-tech/data-pipeline

Library for describing data transformation pipelines by composing simple reusable components.

GitHub repository with 6 stars and 0 forks.

Language: Java

Topics: data-pipeline, etl, java

Latest metric snapshot

2026-06-05: 6 stars and 0 forks.

Similar repositories

  1. apache/shardingsphere

    Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.

    GitHub repository with 20,731 stars and 6,893 forks.

    Trending score: 0.94; stars gained: +4; forks gained: +0.

    Language: Java

    Topics: database, distributed-database, distributed-sql-database, sql, shard, database-cluster

  2. DataSQRL/sqrl

    Agentic Data Engineering Harness for building data pipelines, data products, data APIs, and data lakes autonomously

    GitHub repository with 213 stars and 25 forks.

    Trending score: 0.09; stars gained: +0; forks gained: +0.

    Language: Java

    Topics: streaming, data-pipeline, api, event-driven, data-engineering, harness

Trending in Java

  1. github/copilot-sdk

    Multi-platform SDK for integrating GitHub Copilot Agent into apps and services

    GitHub repository with 9,056 stars and 1,216 forks.

    Trending score: 3.47; stars gained: +166; forks gained: +12.

    Language: Java

  2. floci-io/floci

    Light, fluffy, and always free - The AWS Local Emulator alternative

    GitHub repository with 13,640 stars and 1,293 forks.

    Trending score: 3.33; stars gained: +78; forks gained: +7.

    Language: Java

    Topics: aws, aws-emulation, localstack, devops, docker, ec2

  3. fish2018/webhtv

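    WebHomeTV is a derivative of FongMi that enhances the WebHome custom home page, the App Native SDK, cloud-drive link detection, and the Nostr recommended home page. The project's core goal is to turn a CSP site's home page into a genuinely developable web application: developers can customize the home page with HTML/CSS/JavaScript, then use the Native capabilities exposed by the App to handle search, playback, cross-origin requests, resource proxying, recently watched history, cloud-drive detection, and state synchronization.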

    GitHub repository with 372 stars and 107 forks.

    Trending score: 3.29; stars gained: +83; forks gained: +16.

    Language: Java

  4. juanjuandog/FinSight-AI

    AI equity research agent with resilient workflows, Redis Lua single-flight, pgvector RAG, versioned reports, evidence tracing, and RAG evaluation.

    GitHub repository with 1,000 stars and 58 forks.

    Trending score: 3.24; stars gained: +77; forks gained: +1.

    Language: Java

    Topics: ai-agent, financial-research, llm-evaluation, pgvector, postgresql, rabbitmq

  5. Lucas0623z/NoteLite

    GitHub repository with 742 stars and 106 forks.

    Trending score: 2.98; stars gained: +53; forks gained: +7.

    Language: Java

  6. apache/doris

    Apache Doris is an easy-to-use, high performance and unified analytics database.

    GitHub repository with 15,438 stars and 3,812 forks.

    Trending score: 2.65; stars gained: +11; forks gained: +7.

    Language: Java

    Topics: agent, ai, bigquery, database, dbt, delta-lake

Trending topic: data-pipeline

  1. SouravRoy-ETL/duckle

    Local-first ETL/ELT studio: a drag-and-drop visual pipeline designer that compiles to SQL and runs on DuckDB. Tiny desktop app, no servers, git-friendly workspaces.

    GitHub repository with 275 stars and 20 forks.

    Trending score: 2.64; stars gained: +36; forks gained: +0.

    Language: Rust

    Topics: data-engineering, data-integration, data-pipeline, data-quality, desktop-app, drag-and-drop

  2. rocketride-org/rocketride-server

    High-performance AI pipeline engine with a C++ core and 50+ Python-extensible nodes. Build, debug, and scale LLM workflows with 13+ model providers, 8+ vector databases, and agent orchestration, all from your IDE. Includes VS Code extension, TypeScript/Python SDKs, and Docker deployment.

    GitHub repository with 3,760 stars and 1,226 forks.

    Trending score: 2.16; stars gained: +7; forks gained: +8.

    Language: C++

    Topics: ai, cpp, data-pipeline, data-processing, machine-learning, mcp

  3. bruin-data/ingestr

    ingestr is a CLI tool to copy data between any databases with a single command seamlessly.

    GitHub repository with 3,656 stars and 128 forks.

    Trending score: 1.93; stars gained: +77; forks gained: +5.

    Language: Go

    Topics: bigquery, copy-database, data-ingestion, data-integration, data-pipeline, duckdb

  4. weifuwan/seatunnel-web

    SeaTunnel Web is a visual tool for building and watching over your Apache SeaTunnel data pipelines, with drag-and-drop DAGs and simple connector setup.

    GitHub repository with 279 stars and 30 forks.

    Trending score: 1.85; stars gained: +57; forks gained: +5.

    Language: TypeScript

    Topics: batch, dag, data-engineering, data-integration, data-pipeline, etl

  5. datajuicer/data-juicer

    Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

    GitHub repository with 6,490 stars and 373 forks.

    Trending score: 1.18; stars gained: +16; forks gained: -1.

    Language: Python

    Topics: data, data-analysis, data-pipeline, data-processing, data-science, data-visualization

  6. apache/shardingsphere

    Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.

    GitHub repository with 20,731 stars and 6,893 forks.

    Trending score: 0.94; stars gained: +4; forks gained: +0.

    Language: Java

    Topics: database, distributed-database, distributed-sql-database, sql, shard, database-cluster