bakdata/kpops
Deploy Kafka pipelines to Kubernetes
GitHub repository with 14 stars and 3 forks.
Language: Python
Topics: kafka, kafka-connect, kafka-streams, kubernetes, pipelines, stream-processing
2026-06-05: 14 stars and 3 forks.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
GitHub repository with 63,100 stars and 1,679 forks.
Trending score: 0.46; stars gained: -30; forks gained: +4.
Language: Python
Topics: batch-processing, data-analytics, data-pipelines, data-processing, dataflow, etl
Real Time Sources for Apache Kafka, Azure Event Hubs, and Fabric Event Streams
GitHub repository with 25 stars and 7 forks.
Trending score: 0.33; stars gained: +1; forks gained: +0.
Language: Python
Topics: azure, cloudevents, event-hubs, event-streams, fabric, gtfs
Python client for Apache Kafka
GitHub repository with 5,893 stars and 1,457 forks.
Trending score: 0.32; stars gained: +0; forks gained: +0.
Language: Python
Topics: kafka, python
Production-style real-time e-commerce lakehouse with Kafka, Airflow, Databricks, Medallion architecture, data quality, quarantine, Terraform, and Dash analytics.
GitHub repository with 27 stars and 0 forks.
Trending score: 0.31; stars gained: +1; forks gained: +0.
Language: Python
Topics: airflow, data-engineering, data-quality, databricks, delta-lake, kafka
Create an executable project (API and Admin App) from a database or natural language prompt with 1 command, customize with declarative rules and Python in your IDE, containerize and deploy.
GitHub repository with 50 stars and 11 forks.
Trending score: 0.20; stars gained: +0; forks gained: +0.
Language: Python
Topics: api, flask, kafka, python, rules, sqlalchemy
AI agent skills for stream processing and event streaming
GitHub repository with 32 stars and 2 forks.
Trending score: 0.05; stars gained: +0; forks gained: +1.
Language: Python
Topics: ai, confluent, flink, kafka, skills, cdc
The agent that grows with you
GitHub repository with 181,345 stars and 31,117 forks.
Trending score: 5.95; stars gained: +1,867; forks gained: +361.
Language: Python
Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
GitHub repository with 12,942 stars and 833 forks.
Trending score: 5.69; stars gained: +2,829; forks gained: +175.
Language: Python
Topics: agent, ai, anthropic, claude-code, compression, context-engineering
Academic Research Skills for Claude Code: research → write → review → revise → finalize
GitHub repository with 27,327 stars and 2,249 forks.
Trending score: 5.52; stars gained: +1,079; forks gained: +89.
Language: Python
Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review
GitHub repository with 29,986 stars and 4,219 forks.
Trending score: 4.88; stars gained: +688; forks gained: +114.
Language: Python
Turn any technical book PDF into a Claude Code skill — ready to study, reference, and use while you work.
GitHub repository with 4,221 stars and 528 forks.
Trending score: 4.88; stars gained: +476; forks gained: +68.
Language: Python
An opinionated list of Python frameworks, libraries, tools, and resources
GitHub repository with 301,341 stars and 28,044 forks.
Trending score: 4.60; stars gained: +518; forks gained: +24.
Language: Python
Topics: awesome, python, collections, python-frameworks, python-libraries, python-tools
Apache Kafka - A distributed event streaming platform
GitHub repository with 32,714 stars and 15,251 forks.
Trending score: 2.36; stars gained: +10; forks gained: +7.
Language: Java
Topics: scala, kafka, java, streaming
Free and open log management
GitHub repository with 8,050 stars and 1,107 forks.
Trending score: 1.77; stars gained: +4; forks gained: +2.
Language: Java
Topics: log-analysis, log-collector, log-viewer, logging, logging-server, siem
Event streaming platform for agentic AI. Continuously ingest, transform, and serve event streams in real time, at scale.
GitHub repository with 9,062 stars and 776 forks.
Trending score: 1.62; stars gained: +2; forks gained: +0.
Language: Rust
Topics: apache-iceberg, data-engineering, database, etl-pipeline, event-streaming, kafka
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026.
GitHub repository with 41,902 stars and 8,303 forks.
Trending score: 1.49; stars gained: +23; forks gained: +3.
Language: Jupyter Notebook
Topics: course, data-engineering, dbt, docker, free, kafka
Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.
GitHub repository with 17,600 stars and 5,819 forks.
Trending score: 1.19; stars gained: +8; forks gained: +1.
Language: Go
Topics: telegraf, monitoring, time-series, metrics, golang, influxdb
Apache Kafka® running on Kubernetes
GitHub repository with 5,829 stars and 1,501 forks.
Trending score: 1.12; stars gained: +6; forks gained: +5.
Language: Java
Topics: kafka, kubernetes, openshift, messaging, kafka-connect, kafka-streams