pik-piam/madrat
R package | May All Data be Reproducible and Transparent (MADRaT)
GitHub repository with 17 stars and 46 forks.
Language: R
Topics: data-processing, pipeline, r, reproducibility
R package | May All Data be Reproducible and Transparent (MADRaT)
GitHub repository with 17 stars and 46 forks.
Language: R
Topics: data-processing, pipeline, r, reproducibility
2026-06-05: 17 stars and 46 forks.
Produce PRISMA-2020 compliant flow diagrams
GitHub repository with 276 stars and 116 forks.
Trending score: 0.49; stars gained: +2; forks gained: +0.
Language: R
GitHub repository with 10 stars and 0 forks.
Trending score: 0.33; stars gained: +1; forks gained: -1.
Language: R
📊 A Scalable Phenotyping and Statistical Pipeline for UK Biobank RAP Data Analysis
GitHub repository with 33 stars and 4 forks.
Trending score: 0.33; stars gained: +1; forks gained: +0.
Language: R
Constituent history of the S&P 500 from various data sources
GitHub repository with 35 stars and 13 forks.
Trending score: 0.33; stars gained: +1; forks gained: +1.
Language: R
Topics: backtesting, equity-data, equity-research, sp500, sp500-data-analysis
🧭 Open source tools for air quality data analysis
GitHub repository with 357 stars and 122 forks.
Trending score: 0.32; stars gained: +1; forks gained: +0.
Language: R
Topics: air-quality, meteorology, air-quality-data, openair, package, r
Table of software for the analysis of single-cell RNA-seq data.
GitHub repository with 340 stars and 83 forks.
Trending score: 0.32; stars gained: +1; forks gained: +0.
Language: R
Topics: database, rna-seq, scrna-seq, single-cell, single-cell-rna-seq, software
High-performance AI pipeline engine with a C++ core and 50+ Python-extensible nodes. Build, debug, and scale LLM workflows with 13+ model providers, 8+ vector databases, and agent orchestration, all from your IDE. Includes VS Code extension, TypeScript/Python SDKs, and Docker deployment.
GitHub repository with 3,765 stars and 1,226 forks.
Trending score: 2.16; stars gained: +7; forks gained: +8.
Language: C++
Topics: ai, cpp, data-pipeline, data-processing, machine-learning, mcp
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
GitHub repository with 6,490 stars and 373 forks.
Trending score: 1.18; stars gained: +16; forks gained: -1.
Language: Python
Topics: data-analysis, data-science, large-language-models, llm, data-visualization, llms
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
GitHub repository with 63,095 stars and 1,679 forks.
Trending score: 0.46; stars gained: -30; forks gained: +4.
Language: Python
Topics: batch-processing, kafka, pathway, python, streaming, machine-learning-algorithms
Open pixelated STEM framework
GitHub repository with 126 stars and 73 forks.
Trending score: 0.28; stars gained: +0; forks gained: +0.
Language: Python
Topics: electron-microscopy, data-processing, image-processing, python
A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀
GitHub repository with 365 stars and 122 forks.
Trending score: 0.09; stars gained: +0; forks gained: +0.
Language: Python
Topics: python, big-data, data-processing, multicloud, big-data-analytics, serverless
📝 Streamline text processing in Arabic and English with ChunkWise, a library offering 31 chunking strategies for NLP and RAG systems.
GitHub repository with 5 stars and 0 forks.
Trending score: 0.05; stars gained: +0; forks gained: +0.
Language: Python
Topics: aggregation, attention-mechanism, chunk, chunkwise-processing, csv, data-analysis