sipist/sipist-workspace

This repository provides containerized applications and microservices for the Intelligent Retrieval Systems course @ Instituto Superior Técnico

GitHub repository with 5 stars and 1 forks.

Topics: data-engineering, data-science, docker, jupyter, jupyterlab, notebook, postgres, postgresql, python, search-engine

Open provider repository

Latest metric snapshot

2026-06-05: 5 stars and 1 forks.

Similar repositories

  1. 1. Kaelio/ktx

    ktx is an executable context layer for data and analytics agents 🐙 Allow Claude Code, Codex, and any AI agent to query data accurately through MCP with skills, memory and a semantic layer

    GitHub repository with 896 stars and 46 forks.

    Trending score: 2.76; stars gained: +24; forks gained: +1.

    Language: TypeScript

    Topics: agent, agent-skills, agents, ai-agent, ai-agents, analytics

  2. 2. SouravRoy-ETL/duckle

    Local-first ETL/ELT studio: a drag-and-drop visual pipeline designer that compiles to SQL and runs on DuckDB. Tiny desktop app, no servers, git-friendly workspaces.

    GitHub repository with 285 stars and 21 forks.

    Trending score: 2.64; stars gained: +36; forks gained: +0.

    Language: Rust

    Topics: data-engineering, data-integration, data-pipeline, data-quality, desktop-app, drag-and-drop

  3. 3. weifuwan/seatunnel-web

    SeaTunnel Web is a visual tool for building and watching over your Apache SeaTunnel data pipelines, with drag-and-drop DAGs and simple connector setup.

    GitHub repository with 290 stars and 30 forks.

    Trending score: 1.85; stars gained: +57; forks gained: +5.

    Language: TypeScript

    Topics: batch, dag, data-engineering, data-integration, data-pipeline, etl

  4. 4. risingwavelabs/risingwave

    Event streaming platform for agentic AI. Continuously ingest, transform, and serve event streams in real time, at scale.

    GitHub repository with 9,065 stars and 776 forks.

    Trending score: 1.62; stars gained: +2; forks gained: +0.

    Language: Rust

    Topics: apache-iceberg, data-engineering, database, etl-pipeline, event-streaming, kafka

  5. 5. DataTalksClub/data-engineering-zoomcamp

    Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼

    GitHub repository with 41,902 stars and 8,303 forks.

    Trending score: 1.49; stars gained: +23; forks gained: +3.

    Language: Jupyter Notebook

    Topics: course, data-engineering, dbt, docker, free, kafka

  6. 6. Eventual-Inc/Daft

    High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

    GitHub repository with 5,547 stars and 483 forks.

    Trending score: 1.26; stars gained: +2; forks gained: +1.

    Language: Rust

    Topics: ai-engineering, ai-pipeline, arrow, artificial-intelligence, big-data, data-engineering

Trending topic: data-engineering

  1. 1. Kaelio/ktx

    ktx is an executable context layer for data and analytics agents 🐙 Allow Claude Code, Codex, and any AI agent to query data accurately through MCP with skills, memory and a semantic layer

    GitHub repository with 896 stars and 46 forks.

    Trending score: 2.76; stars gained: +24; forks gained: +1.

    Language: TypeScript

    Topics: agent, agent-skills, agents, ai-agent, ai-agents, analytics

  2. 2. SouravRoy-ETL/duckle

    Local-first ETL/ELT studio: a drag-and-drop visual pipeline designer that compiles to SQL and runs on DuckDB. Tiny desktop app, no servers, git-friendly workspaces.

    GitHub repository with 285 stars and 21 forks.

    Trending score: 2.64; stars gained: +36; forks gained: +0.

    Language: Rust

    Topics: data-engineering, data-integration, data-pipeline, data-quality, desktop-app, drag-and-drop

  3. 3. weifuwan/seatunnel-web

    SeaTunnel Web is a visual tool for building and watching over your Apache SeaTunnel data pipelines, with drag-and-drop DAGs and simple connector setup.

    GitHub repository with 290 stars and 30 forks.

    Trending score: 1.85; stars gained: +57; forks gained: +5.

    Language: TypeScript

    Topics: batch, dag, data-engineering, data-integration, data-pipeline, etl

  4. 4. risingwavelabs/risingwave

    Event streaming platform for agentic AI. Continuously ingest, transform, and serve event streams in real time, at scale.

    GitHub repository with 9,065 stars and 776 forks.

    Trending score: 1.62; stars gained: +2; forks gained: +0.

    Language: Rust

    Topics: apache-iceberg, data-engineering, database, etl-pipeline, event-streaming, kafka

  5. 5. DataTalksClub/data-engineering-zoomcamp

    Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼

    GitHub repository with 41,902 stars and 8,303 forks.

    Trending score: 1.49; stars gained: +23; forks gained: +3.

    Language: Jupyter Notebook

    Topics: course, data-engineering, dbt, docker, free, kafka

  6. 6. Eventual-Inc/Daft

    High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

    GitHub repository with 5,547 stars and 483 forks.

    Trending score: 1.26; stars gained: +2; forks gained: +1.

    Language: Rust

    Topics: ai-engineering, ai-pipeline, arrow, artificial-intelligence, big-data, data-engineering