zengin-code/zengin-py
:snake: The python implementation of ZenginCode
GitHub repository with 18 stars and 5 forks.
Language: Python
Topics: dataset, pypi-package, python, zengin
:snake: The python implementation of ZenginCode
GitHub repository with 18 stars and 5 forks.
Language: Python
Topics: dataset, pypi-package, python, zengin
2026-06-05: 18 stars and 5 forks.
Single source of truth for GenAI and agentic AI security incidents, mapped to OWASP LLM Top 10, OWASP Agentic Top 10 (ASI), NIST AI RMF, and MITRE ATLAS.
GitHub repository with 13 stars and 3 forks.
Trending score: 0.87; stars gained: +6; forks gained: +1.
Language: Python
Topics: agentic-incidents, ai-incidents, ai-safety, cybersecurity, dataset, genai-incidents
A hand-curated, topic-organized library of the best ML education — 923 docs (391 arXiv papers, 474 Stanford/MIT/Karpathy/fast.ai lectures, 58 explainer articles), normalized to Markdown with full provenance. Open it in Obsidian or point your agent at it. A clean ML corpus for learning, RAG & fine-tuning.
GitHub repository with 120 stars and 13 forks.
Trending score: 0.58; stars gained: +3; forks gained: +1.
Language: Python
Topics: arxiv, corpus, dataset, deep-learning, education, llm
200+ auto-updated space, astronomy & physics datasets on Hugging Face (NASA, NOAA, ESA, JPL, SpaceX, Wikidata). Satellites, asteroids, space probes (Voyager, Cassini, Mars), space weather, exoplanets, pulsars, radio/X-ray surveys, cosmic rays, particle physics, and more. Parquet format, no API keys.
GitHub repository with 7 stars and 1 forks.
Trending score: 0.36; stars gained: +1; forks gained: +1.
Language: Python
Topics: asteroids, huggingface-datasets, machine-learning-dataset, nasa, noaa, open-data
URB - Urban Routing Benchmark - Benchmarking MARL algorithms on the fleet routing tasks.
GitHub repository with 16 stars and 9 forks.
Trending score: 0.07; stars gained: +0; forks gained: +1.
Language: Python
Topics: benchmark, dataset, reinforcement-learning, reinforcement-learning-algorithms, sumo, routerl
MoreFixes: A Large-Scale Dataset of CVE Fix Commits Mined through Enhanced Repository Discovery
GitHub repository with 72 stars and 17 forks.
Trending score: 0.07; stars gained: +0; forks gained: +0.
Language: Python
Topics: hacktoberfest, commit, cve, dataset, fix, patch
AgML is a centralized framework for agricultural machine learning. AgML provides access to public agricultural datasets for common agricultural deep learning tasks, with standard benchmarks and pretrained models, as well the ability to generate synthetic data and annotations.
GitHub repository with 281 stars and 45 forks.
Trending score: 0.05; stars gained: +0; forks gained: +1.
Language: Python
Topics: deep-learning, agriculture, pytorch, dataset, image-classification, object-detection
The agent that grows with you
GitHub repository with 181,607 stars and 31,156 forks.
Trending score: 5.95; stars gained: +1,867; forks gained: +361.
Language: Python
Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
GitHub repository with 13,361 stars and 853 forks.
Trending score: 5.69; stars gained: +2,829; forks gained: +175.
Language: Python
Topics: agent, ai, anthropic, compression, context-engineering, context-window
Academic Research Skills for Claude Code: research → write → review → revise → finalize
GitHub repository with 27,422 stars and 2,253 forks.
Trending score: 5.52; stars gained: +1,079; forks gained: +89.
Language: Python
Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review
GitHub repository with 30,002 stars and 4,224 forks.
Trending score: 4.88; stars gained: +688; forks gained: +114.
Language: Python
Turn any technical book PDF into a Claude Code skill — ready to study, reference, and use while you work.
GitHub repository with 4,250 stars and 534 forks.
Trending score: 4.88; stars gained: +476; forks gained: +68.
Language: Python
An opinionated list of Python frameworks, libraries, tools, and resources
GitHub repository with 301,371 stars and 28,044 forks.
Trending score: 4.60; stars gained: +518; forks gained: +24.
Language: Python
Topics: awesome, python, collections, python-frameworks, python-libraries, python-tools
Single source of truth for GenAI and agentic AI security incidents, mapped to OWASP LLM Top 10, OWASP Agentic Top 10 (ASI), NIST AI RMF, and MITRE ATLAS.
GitHub repository with 13 stars and 3 forks.
Trending score: 0.87; stars gained: +6; forks gained: +1.
Language: Python
Topics: agentic-incidents, ai-incidents, ai-safety, cybersecurity, dataset, genai-incidents
Browser compatibility data for Web technologies as displayed on MDN
GitHub repository with 5,681 stars and 2,565 forks.
Trending score: 0.63; stars gained: +2; forks gained: +2.
Language: JSON
Topics: compat, compatibility, data, dataset, json
A hand-curated, topic-organized library of the best ML education — 923 docs (391 arXiv papers, 474 Stanford/MIT/Karpathy/fast.ai lectures, 58 explainer articles), normalized to Markdown with full provenance. Open it in Obsidian or point your agent at it. A clean ML corpus for learning, RAG & fine-tuning.
GitHub repository with 120 stars and 13 forks.
Trending score: 0.58; stars gained: +3; forks gained: +1.
Language: Python
Topics: arxiv, corpus, dataset, deep-learning, education, llm
Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ for the code
GitHub repository with 69 stars and 93 forks.
Trending score: 0.42; stars gained: +1; forks gained: +2.
Topics: crawling, dataset, language-detection
200+ auto-updated space, astronomy & physics datasets on Hugging Face (NASA, NOAA, ESA, JPL, SpaceX, Wikidata). Satellites, asteroids, space probes (Voyager, Cassini, Mars), space weather, exoplanets, pulsars, radio/X-ray surveys, cosmic rays, particle physics, and more. Parquet format, no API keys.
GitHub repository with 7 stars and 1 forks.
Trending score: 0.36; stars gained: +1; forks gained: +1.
Language: Python
Topics: asteroids, huggingface-datasets, machine-learning-dataset, nasa, noaa, open-data
List of all datasets included in Google Earth Engine (generated from https://developers.google.com/earth-engine/datasets/catalog/)
GitHub repository with 92 stars and 27 forks.
Trending score: 0.32; stars gained: +1; forks gained: +0.
Topics: earth-engine, dataset