RumbleDB/rumble
Quick start: pip install jsoniq ⛈️ RumbleDB 2.1.0 "Cedrus Libani" 🌳 for Apache Spark | Run queries on your large-scale, messy datasets (JSON, text, CSV, Parquet, Delta...) | Data Lakehouse with Updates, Scripting, Declarative Machine Learning and more
GitHub repository with 239 stars and 84 forks.
Language: Java
Topics: azure, csv, data-science, dataframes, delta-lake, hdfs, json, jsoniq, lakehouse, machine-learning