apache/paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

GitHub repository with 3,291 stars and 1,328 forks.

Language: Java

Topics: big-data, data-ingestion, flink, paimon, real-time-analytics, spark, streaming-datalake, table-store


Latest metric snapshot

2026-06-05: 3,291 stars and 1,328 forks.

Similar repositories

  1. StarRocks/starrocks

    The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

    GitHub repository with 11,756 stars and 2,435 forks.

    Trending score: 2.61; stars gained: +10; forks gained: +1.

    Language: Java

    Topics: analytics, big-data, cloudnative, database, datalake, delta-lake

  2. apache/beam

    Apache Beam is a unified programming model for Batch and Streaming data processing.

    GitHub repository with 8,605 stars and 4,576 forks.

    Trending score: 2.18; stars gained: +5; forks gained: +4.

    Language: Java

    Topics: python, java, big-data, beam, batch, golang

  3. vespa-engine/vespa

    The AI search platform

    GitHub repository with 6,946 stars and 717 forks.

    Trending score: 1.18; stars gained: +8; forks gained: +0.

    Language: Java

    Topics: vespa, search-engine, big-data, ai, serving-recommendation, machine-learning

  4. apache/flink

    Apache Flink

    GitHub repository with 26,042 stars and 13,940 forks.

    Trending score: 1.14; stars gained: +2; forks gained: -2.

    Language: Java

    Topics: scala, java, big-data, flink, python, sql

  5. apache/iotdb

    Apache IoTDB

    GitHub repository with 6,340 stars and 1,140 forks.

    Trending score: 0.93; stars gained: +4; forks gained: -1.

    Language: Java

    Topics: timeseries, iot, big-data, java, database, nosql

  6. apache/fluss

    Apache Fluss is a streaming storage built for real-time analytics.

    GitHub repository with 1,930 stars and 555 forks.

    Trending score: 0.90; stars gained: +4; forks gained: +3.

    Language: Java

    Topics: big-data, fluss, hacktoberfest, lakehouse, real-time-analytics, streaming

Trending in Java

  1. github/copilot-sdk

    Multi-platform SDK for integrating GitHub Copilot Agent into apps and services

    GitHub repository with 9,020 stars and 1,213 forks.

    Trending score: 3.47; stars gained: +166; forks gained: +12.

    Language: Java

  2. floci-io/floci

    Light, fluffy, and always free - The AWS Local Emulator alternative

    GitHub repository with 13,633 stars and 1,294 forks.

    Trending score: 3.33; stars gained: +78; forks gained: +7.

    Language: Java

    Topics: aws, aws-emulation, devops, docker, ec2, ecs

  3. fish2018/webhtv

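    WebHomeTV is a derivative of FongMi that adds a customizable WebHome home page, an App Native SDK, cloud-drive link detection, and a Nostr-recommended home page. The project's core goal is to turn a CSP site's home page into a genuinely developable web application: developers can customize the home page with HTML/CSS/JavaScript, then use the Native capabilities exposed by the app to handle search, playback, cross-origin requests, resource proxying, recently watched history, cloud-drive detection, and state synchronization.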

    GitHub repository with 361 stars and 107 forks.

    Trending score: 3.29; stars gained: +83; forks gained: +16.

    Language: Java

  4. juanjuandog/FinSight-AI

    AI equity research agent with resilient workflows, Redis Lua single-flight, pgvector RAG, versioned reports, evidence tracing, and RAG evaluation.

    GitHub repository with 978 stars and 57 forks.

    Trending score: 3.24; stars gained: +77; forks gained: +1.

    Language: Java

    Topics: ai-agent, financial-research, llm-evaluation, pgvector, postgresql, rabbitmq

  5. dbeaver/dbeaver

    Free universal database tool and SQL client

    GitHub repository with 50,402 stars and 4,220 forks.

    Trending score: 3.22; stars gained: +37; forks gained: +10.

    Language: Java

    Topics: ai, database, databricks, db2, dbeaver, erd

  6. Lucas0623z/NoteLite

    GitHub repository with 731 stars and 105 forks.

    Trending score: 2.98; stars gained: +53; forks gained: +7.

    Language: Java

Trending topic: big-data

  1. ClickHouse/ClickHouse

    ClickHouse® is a real-time analytics database management system

    GitHub repository with 47,822 stars and 8,466 forks.

    Trending score: 2.96; stars gained: +53; forks gained: +10.

    Language: C++

    Topics: ai, analytics, big-data, clickhouse, cloud-native, cpp

  2. StarRocks/starrocks

    The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

    GitHub repository with 11,756 stars and 2,435 forks.

    Trending score: 2.61; stars gained: +10; forks gained: +1.

    Language: Java

    Topics: analytics, big-data, cloudnative, database, datalake, delta-lake

  3. apache/beam

    Apache Beam is a unified programming model for Batch and Streaming data processing.

    GitHub repository with 8,605 stars and 4,576 forks.

    Trending score: 2.18; stars gained: +5; forks gained: +4.

    Language: Java

    Topics: python, java, big-data, beam, batch, golang

  4. apache/datafusion

    Apache DataFusion SQL Query Engine

    GitHub repository with 8,847 stars and 2,153 forks.

    Trending score: 2.07; stars gained: +6; forks gained: +3.

    Language: Rust

    Topics: arrow, big-data, dataframe, datafusion, olap, python

  5. Eventual-Inc/Daft

    High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

    GitHub repository with 5,546 stars and 483 forks.

    Trending score: 1.26; stars gained: +2; forks gained: +1.

    Language: Rust

    Topics: machine-learning, python, data-engineering, distributed-computing, rust, big-data

  6. vespa-engine/vespa

    The AI search platform

    GitHub repository with 6,946 stars and 717 forks.

    Trending score: 1.18; stars gained: +8; forks gained: +0.

    Language: Java

    Topics: vespa, search-engine, big-data, ai, serving-recommendation, machine-learning