adidas/lakehouse-engine
The Lakehouse Engine is a configuration-driven Spark framework, written in Python, that serves as a scalable, distributed engine for several lakehouse algorithms, data flows, and utilities for Data Products.
GitHub repository with 288 stars and 50 forks.
Language: Python
Topics: big-data, configuration-driven, data-engineering, data-quality, databricks, delta-lake, framework, great-expectations, lakehouse, spark