ropensci/dataset
Create interoperable and well described data frames in R
GitHub repository with 23 stars and 8 forks.
Language: R
Topics: dataset, metadata-management, r, rstats
2026-06-05: 23 stars and 8 forks.
An R package for single-cell and spatial omics analysis, integration, interpretation, and visualization.
GitHub repository with 254 stars and 31 forks.
Trending score: 0.57; stars gained: +2; forks gained: +0.
Language: R
Produce PRISMA-2020 compliant flow diagrams
GitHub repository with 276 stars and 116 forks.
Trending score: 0.49; stars gained: +2; forks gained: +0.
Language: R
Topics: prisma, review, systematic, visualization, meta-analysis, reporting
Tools for making beautiful & useful command line interfaces
GitHub repository with 711 stars and 86 forks.
Trending score: 0.37; stars gained: +1; forks gained: +0.
Language: R
Topics: cli, r
GitHub repository with 10 stars and 0 forks.
Trending score: 0.33; stars gained: +1; forks gained: -1.
Language: R
📊 A Scalable Phenotyping and Statistical Pipeline for UK Biobank RAP Data Analysis
GitHub repository with 33 stars and 4 forks.
Trending score: 0.33; stars gained: +1; forks gained: +0.
Language: R
Constituent history of the S&P 500 from various data sources
GitHub repository with 35 stars and 13 forks.
Trending score: 0.33; stars gained: +1; forks gained: +1.
Language: R
Topics: backtesting, equity-data, equity-research, sp500, sp500-data-analysis
Single source of truth for GenAI and agentic AI security incidents, mapped to OWASP LLM Top 10, OWASP Agentic Top 10 (ASI), NIST AI RMF, and MITRE ATLAS.
GitHub repository with 13 stars and 3 forks.
Trending score: 0.87; stars gained: +6; forks gained: +1.
Language: Python
Topics: agentic-incidents, ai-incidents, ai-safety, cybersecurity, dataset, genai-incidents
Browser compatibility data for Web technologies as displayed on MDN
GitHub repository with 5,681 stars and 2,565 forks.
Trending score: 0.63; stars gained: +2; forks gained: +2.
Language: JSON
Topics: compat, compatibility, data, dataset, json
A hand-curated, topic-organized library of the best ML education — 923 docs (391 arXiv papers, 474 Stanford/MIT/Karpathy/fast.ai lectures, 58 explainer articles), normalized to Markdown with full provenance. Open it in Obsidian or point your agent at it. A clean ML corpus for learning, RAG & fine-tuning.
GitHub repository with 121 stars and 13 forks.
Trending score: 0.58; stars gained: +3; forks gained: +1.
Language: Python
Topics: arxiv, corpus, dataset, deep-learning, education, llm
Crowd-sourced lists of URLs to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ for the code.
GitHub repository with 69 stars and 93 forks.
Trending score: 0.42; stars gained: +1; forks gained: +2.
Topics: crawling, dataset, language-detection
200+ auto-updated space, astronomy & physics datasets on Hugging Face (NASA, NOAA, ESA, JPL, SpaceX, Wikidata). Satellites, asteroids, space probes (Voyager, Cassini, Mars), space weather, exoplanets, pulsars, radio/X-ray surveys, cosmic rays, particle physics, and more. Parquet format, no API keys.
GitHub repository with 7 stars and 1 fork.
Trending score: 0.36; stars gained: +1; forks gained: +1.
Language: Python
Topics: asteroids, astronomy, dataset, esa, exoplanet, huggingface-datasets
The National Gallery of Art Open Data Program
GitHub repository with 746 stars and 120 forks.
Trending score: 0.32; stars gained: +1; forks gained: +0.
Language: Python
Topics: art, collection, csv, csv-format, data, dataset