google/sedpack
Sedpack - Scalable and efficient data packing
GitHub repository with 36 stars and 11 forks.
Language: Python
Topics: dataset, deep-learning
2026-06-04: 36 stars and 11 forks.
MoreFixes: A Large-Scale Dataset of CVE Fix Commits Mined through Enhanced Repository Discovery
GitHub repository with 72 stars and 17 forks.
Trending score: 0.39; stars gained: +1; forks gained: +0.
Language: Python
Topics: hacktoberfest, commit, cve, dataset, fix, patch
A hand-curated, topic-organized library of the best ML education — 923 docs (391 arXiv papers, 474 Stanford/MIT/Karpathy/fast.ai lectures, 58 explainer articles), normalized to Markdown with full provenance. Open it in Obsidian or point your agent at it. A clean ML corpus for learning, RAG & fine-tuning.
GitHub repository with 120 stars and 13 forks.
Trending score: 0.30; stars gained: +1; forks gained: +1.
Language: Python
Topics: arxiv, corpus, dataset, deep-learning, education, llm
The National Gallery of Art Open Data Program
GitHub repository with 745 stars and 120 forks.
Trending score: 0.05; stars gained: +0; forks gained: +0.
Language: Python
Topics: open, data, art, collection, csv, national-gallery
International Securities Identification Numbers for various Indian Securities
GitHub repository with 50 stars and 6 forks.
Trending score: 0.05; stars gained: +0; forks gained: +0.
Language: Python
Topics: isin, funds, securities, india, nsdl, dataset
200+ auto-updated space, astronomy & physics datasets on Hugging Face (NASA, NOAA, ESA, JPL, SpaceX, Wikidata). Satellites, asteroids, space probes (Voyager, Cassini, Mars), space weather, exoplanets, pulsars, radio/X-ray surveys, cosmic rays, particle physics, and more. Parquet format, no API keys.
GitHub repository with 7 stars and 1 fork.
Trending score: 0.05; stars gained: +0; forks gained: +0.
Language: Python
Topics: asteroids, astronomy, dataset, esa, exoplanet, huggingface-datasets
AgML is a centralized framework for agricultural machine learning. AgML provides access to public agricultural datasets for common agricultural deep learning tasks, with standard benchmarks and pretrained models, as well as the ability to generate synthetic data and annotations.
GitHub repository with 281 stars and 44 forks.
Trending score: 0.04; stars gained: +0; forks gained: +0.
Language: Python
Topics: agriculture, computer-vision, dataset, deep-learning, image-classification, object-detection
The agent that grows with you
GitHub repository with 180,642 stars and 30,980 forks.
Trending score: 5.79; stars gained: +1,360; forks gained: +322.
Language: Python
Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude
Use claude-code for free in the terminal, the VSCode extension, or Discord, like OpenClaw (voice supported)
GitHub repository with 32,285 stars and 4,901 forks.
Trending score: 4.60; stars gained: +456; forks gained: +73.
Language: Python
SkillOpt is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.
GitHub repository with 4,876 stars and 485 forks.
Trending score: 4.55; stars gained: +340; forks gained: +27.
Language: Python
Topics: agent-skills, self-evolving-agents
754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Copilot, Codex CLI, Cursor, Gemini CLI & 20+ platforms · 26 security domains · Apache 2.0
GitHub repository with 13,233 stars and 1,551 forks.
Trending score: 4.53; stars gained: +301; forks gained: +38.
Language: Python
Topics: ai-agents, claude-code, cybersecurity, incident-response, mitre-attack, penetration-testing
Turn any technical book PDF into a Claude Code skill — ready to study, reference, and use while you work.
GitHub repository with 4,141 stars and 516 forks.
Trending score: 4.43; stars gained: +415; forks gained: +37.
Language: Python
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows - all through natural language commands.
GitHub repository with 130,099 stars and 21,139 forks.
Trending score: 4.42; stars gained: +277; forks gained: +38.
Language: Python
Label Studio is a multi-type data labeling and annotation tool with standardized output format
GitHub repository with 27,511 stars and 3,554 forks.
Trending score: 1.16; stars gained: +7; forks gained: +1.
Language: TypeScript
Topics: computer-vision, deep-learning, image-annotation, annotation-tool, annotation, labeling
MoreFixes: A Large-Scale Dataset of CVE Fix Commits Mined through Enhanced Repository Discovery
GitHub repository with 72 stars and 17 forks.
Trending score: 0.39; stars gained: +1; forks gained: +0.
Language: Python
Topics: hacktoberfest, commit, cve, dataset, fix, patch
Crowd-sourced lists of URLs to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ for the code.
GitHub repository with 69 stars and 92 forks.
Trending score: 0.33; stars gained: +1; forks gained: +1.
Topics: crawling, dataset, language-detection
The official website of l'Adresse
GitHub repository with 164 stars and 39 forks.
Trending score: 0.33; stars gained: +1; forks gained: +0.
Language: TypeScript
Topics: adresse, open-data, dataset
Large list of handpicked color names 🌈
GitHub repository with 2,934 stars and 222 forks.
Trending score: 0.31; stars gained: +1; forks gained: +0.
Language: JavaScript
Topics: color, colors, colour, colours, dataset, dictionary
A hand-curated, topic-organized library of the best ML education — 923 docs (391 arXiv papers, 474 Stanford/MIT/Karpathy/fast.ai lectures, 58 explainer articles), normalized to Markdown with full provenance. Open it in Obsidian or point your agent at it. A clean ML corpus for learning, RAG & fine-tuning.
GitHub repository with 120 stars and 13 forks.
Trending score: 0.30; stars gained: +1; forks gained: +1.
Language: Python
Topics: arxiv, corpus, dataset, deep-learning, education, llm