hortonworks/cloudbreak

CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and artificial intelligence functionality along with secure user access and data governance features.

GitHub repository with 361 stars and 233 forks.

Language: Java

Topics: big-data, deployment, cloud, java, hadoop, cloudera, hacktoberfest

Open provider repository

24h trend summary

Trending score 0.05, activity score 0.05, stars gained +0, forks gained +0.

Latest metric snapshot

2026-06-05: 361 stars and 233 forks.

Similar repositories

  1. 1. StarRocks/starrocks

    The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

    GitHub repository with 11,760 stars and 2,434 forks.

    Trending score: 2.61; stars gained: +10; forks gained: +1.

    Language: Java

    Topics: analytics, big-data, cloudnative, database, datalake, delta-lake

  2. 2. apache/beam

    Apache Beam is a unified programming model for Batch and Streaming data processing.

    GitHub repository with 8,605 stars and 4,577 forks.

    Trending score: 2.18; stars gained: +5; forks gained: +4.

    Language: Java

    Topics: batch, beam, big-data, golang, java, python

  3. 3. vespa-engine/vespa

    The AI search platform

    GitHub repository with 6,948 stars and 717 forks.

    Trending score: 1.18; stars gained: +8; forks gained: +0.

    Language: Java

    Topics: ai, big-data, java, machine-learning, rag, search

  4. 4. apache/flink

    Apache Flink

    GitHub repository with 26,045 stars and 13,940 forks.

    Trending score: 1.14; stars gained: +2; forks gained: -2.

    Language: Java

    Topics: scala, java, big-data, flink, python, sql

  5. 5. apache/fluss

    Apache Fluss is a streaming storage built for real-time analytics.

    GitHub repository with 1,931 stars and 555 forks.

    Trending score: 0.90; stars gained: +4; forks gained: +3.

    Language: Java

    Topics: streaming, fluss, lakehouse, real-time-analytics, big-data, hacktoberfest

  6. 6. prestodb/presto

    The official home of the Presto distributed SQL query engine for big data

    GitHub repository with 16,710 stars and 5,539 forks.

    Trending score: 0.80; stars gained: +2; forks gained: +3.

    Language: Java

    Topics: java, presto, hive, hadoop, big-data, sql

Trending in Java

  1. 1. github/copilot-sdk

    Multi-platform SDK for integrating GitHub Copilot Agent into apps and services

    GitHub repository with 9,056 stars and 1,216 forks.

    Trending score: 3.47; stars gained: +166; forks gained: +12.

    Language: Java

  2. 2. floci-io/floci

    Light, fluffy, and always free - The AWS Local Emulator alternative

    GitHub repository with 13,640 stars and 1,293 forks.

    Trending score: 3.33; stars gained: +78; forks gained: +7.

    Language: Java

    Topics: aws, aws-emulation, localstack, devops, docker, ec2

  3. 3. fish2018/webhtv

    WebHomeTV 基于FongMi二次开发,增强了 WebHome 自定义首页、App Native SDK、网盘链接检测 和 Nostr推荐首页。 这个项目的核心目标是让 CSP 站点首页可以变成一个真正可开发的网页应用:开发者可以用 HTML/CSS/JavaScript 定制首页,再通过 App 暴露的 Native 能力完成搜索、播放、跨域请求、资源代理、最近观看、网盘检测和状态同步。

    GitHub repository with 372 stars and 107 forks.

    Trending score: 3.29; stars gained: +83; forks gained: +16.

    Language: Java

  4. 4. juanjuandog/FinSight-AI

    AI equity research agent with resilient workflows, Redis Lua single-flight, pgvector RAG, versioned reports, evidence tracing, and RAG evaluation.

    GitHub repository with 1,003 stars and 58 forks.

    Trending score: 3.24; stars gained: +77; forks gained: +1.

    Language: Java

    Topics: ai-agent, financial-research, llm-evaluation, pgvector, postgresql, rabbitmq

  5. 5. Lucas0623z/NoteLite

    GitHub repository with 742 stars and 106 forks.

    Trending score: 2.98; stars gained: +53; forks gained: +7.

    Language: Java

  6. 6. apache/doris

    Apache Doris is an easy-to-use, high performance and unified analytics database.

    GitHub repository with 15,438 stars and 3,812 forks.

    Trending score: 2.65; stars gained: +11; forks gained: +7.

    Language: Java

    Topics: agent, ai, bigquery, database, dbt, delta-lake

Trending topic: big-data

  1. 1. ClickHouse/ClickHouse

    ClickHouse® is a real-time analytics database management system

    GitHub repository with 47,832 stars and 8,468 forks.

    Trending score: 2.96; stars gained: +53; forks gained: +10.

    Language: C++

    Topics: ai, analytics, big-data, clickhouse, cloud-native, cpp

  2. 2. StarRocks/starrocks

    The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

    GitHub repository with 11,760 stars and 2,434 forks.

    Trending score: 2.61; stars gained: +10; forks gained: +1.

    Language: Java

    Topics: analytics, big-data, cloudnative, database, datalake, delta-lake

  3. 3. apache/beam

    Apache Beam is a unified programming model for Batch and Streaming data processing.

    GitHub repository with 8,605 stars and 4,577 forks.

    Trending score: 2.18; stars gained: +5; forks gained: +4.

    Language: Java

    Topics: batch, beam, big-data, golang, java, python

  4. 4. apache/datafusion

    Apache DataFusion SQL Query Engine

    GitHub repository with 8,848 stars and 2,153 forks.

    Trending score: 2.07; stars gained: +6; forks gained: +3.

    Language: Rust

    Topics: arrow, big-data, dataframe, datafusion, olap, python

  5. 5. Eventual-Inc/Daft

    High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

    GitHub repository with 5,546 stars and 483 forks.

    Trending score: 1.26; stars gained: +2; forks gained: +1.

    Language: Rust

    Topics: machine-learning, python, data-engineering, distributed-computing, rust, big-data

  6. 6. vespa-engine/vespa

    The AI search platform

    GitHub repository with 6,948 stars and 717 forks.

    Trending score: 1.18; stars gained: +8; forks gained: +0.

    Language: Java

    Topics: ai, big-data, java, machine-learning, rag, search