alvincjin/Niagara
Niagara is a DaaS platform implemented by SDACK stack in Scala
GitHub repository with 6 stars and 2 forks.
Language: Scala
Topics: spark, docker, akka, cassandra, kafka, scala
Niagara is a DaaS platform implemented by SDACK stack in Scala
GitHub repository with 6 stars and 2 forks.
Language: Scala
Topics: spark, docker, akka, cassandra, kafka, scala
2026-06-05: 6 stars and 2 forks.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
GitHub repository with 8,835 stars and 2,107 forks.
Trending score: 0.60; stars gained: +3; forks gained: -1.
Language: Scala
Topics: spark, acid, big-data, analytics, delta-lake
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
GitHub repository with 977 stars and 283 forks.
Trending score: 0.36; stars gained: +0; forks gained: +0.
Language: Scala
Topics: spark, gpu, rapids, big-data
Apache Spark Connector for Azure Kusto
GitHub repository with 81 stars and 35 forks.
Trending score: 0.04; stars gained: +0; forks gained: +0.
Language: Scala
Topics: kusto, spark, scala, azure
♞ lichess.org: the forever free, adless and open source chess server ♞
GitHub repository with 18,312 stars and 2,683 forks.
Trending score: 1.10; stars gained: +7; forks gained: +2.
Language: Scala
Topics: chess, free-software, functional-programming, game, lichess, non-profit
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
GitHub repository with 8,835 stars and 2,107 forks.
Trending score: 0.60; stars gained: +3; forks gained: -1.
Language: Scala
Topics: spark, acid, big-data, analytics, delta-lake
The Community Maintained High Velocity Web Framework For Java and Scala.
GitHub repository with 12,621 stars and 4,031 forks.
Trending score: 0.47; stars gained: +1; forks gained: +0.
Language: Scala
Topics: scala, java, reactive, web-framework, restful, play
Typed Dataset api for Scala 3
GitHub repository with 6 stars and 4 forks.
Trending score: 0.42; stars gained: +1; forks gained: +2.
Language: Scala
Build type-safe, boilerplate-less MCP servers and clients in Scala
GitHub repository with 82 stars and 7 forks.
Trending score: 0.42; stars gained: +1; forks gained: +0.
Language: Scala
Production-grade Arrow FlightSQL gateway in front of DuckDB Quack + DuckLake. Multi-tenant pools, pluggable auth (DB/JWT/OIDC), table-level ACLs, role-aware routing, and a live admin console
GitHub repository with 21 stars and 1 forks.
Trending score: 0.39; stars gained: +1; forks gained: +0.
Language: Scala
Apache Doris is an easy-to-use, high performance and unified analytics database.
GitHub repository with 15,438 stars and 3,812 forks.
Trending score: 2.65; stars gained: +11; forks gained: +7.
Language: Java
Topics: agent, ai, bigquery, database, dbt, delta-lake
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼
GitHub repository with 41,902 stars and 8,303 forks.
Trending score: 1.49; stars gained: +23; forks gained: +3.
Language: Jupyter Notebook
Topics: course, data-engineering, dbt, docker, free, kafka
Python SQL Parser and Transpiler
GitHub repository with 9,303 stars and 1,158 forks.
Trending score: 0.95; stars gained: +5; forks gained: +3.
Language: Python
Topics: transpiler, sql, python, parser, optimizer, bigquery
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
GitHub repository with 28,616 stars and 4,601 forks.
Trending score: 0.88; stars gained: +3; forks gained: +1.
Language: Python
Topics: redash, python, visualization, analytics, bi, redshift
Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.
GitHub repository with 2,845 stars and 163 forks.
Trending score: 0.88; stars gained: +7; forks gained: +0.
Language: Rust
Topics: apache-iceberg, apache-spark, arrow, artificial-intelligence, big-data, data-engineering
YTsaurus is a scalable and fault-tolerant open-source big data platform.
GitHub repository with 2,195 stars and 205 forks.
Trending score: 0.84; stars gained: +2; forks gained: +0.
Language: C++
Topics: big-data, clickhouse, distributed-database, lakehouse, olap-database, spark