google/sedpack

Sedpack - Scalable and efficient data packing

GitHub repository with 36 stars and 11 forks.

Language: Python

Topics: dataset, deep-learning

Latest metric snapshot

2026-06-04: 36 stars and 11 forks.

Similar repositories

  1. JafarAkhondali/Morefixes

    MoreFixes: A Large-Scale Dataset of CVE Fix Commits Mined through Enhanced Repository Discovery

    GitHub repository with 72 stars and 17 forks.

    Trending score: 0.39; stars gained: +1; forks gained: +0.

    Language: Python

    Topics: hacktoberfest, commit, cve, dataset, fix, patch

  2. ATOM00blue/machine-learning-library

    A hand-curated, topic-organized library of the best ML education — 923 docs (391 arXiv papers, 474 Stanford/MIT/Karpathy/fast.ai lectures, 58 explainer articles), normalized to Markdown with full provenance. Open it in Obsidian or point your agent at it. A clean ML corpus for learning, RAG & fine-tuning.

    GitHub repository with 120 stars and 13 forks.

    Trending score: 0.30; stars gained: +1; forks gained: +1.

    Language: Python

    Topics: arxiv, corpus, dataset, deep-learning, education, llm

  3. NationalGalleryOfArt/opendata

    The National Gallery of Art Open Data Program

    GitHub repository with 745 stars and 120 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: open, data, art, collection, csv, national-gallery

  4. captn3m0/india-isin-data

    International Securities Identification Numbers for various Indian Securities

    GitHub repository with 50 stars and 6 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: isin, funds, securities, india, nsdl, dataset

  5. juliensimon/space-datasets

    200+ auto-updated space, astronomy & physics datasets on Hugging Face (NASA, NOAA, ESA, JPL, SpaceX, Wikidata). Satellites, asteroids, space probes (Voyager, Cassini, Mars), space weather, exoplanets, pulsars, radio/X-ray surveys, cosmic rays, particle physics, and more. Parquet format, no API keys.

    GitHub repository with 7 stars and 1 fork.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: asteroids, astronomy, dataset, esa, exoplanet, huggingface-datasets

  6. Project-AgML/AgML

    AgML is a centralized framework for agricultural machine learning. AgML provides access to public agricultural datasets for common agricultural deep learning tasks, with standard benchmarks and pretrained models, as well as the ability to generate synthetic data and annotations.

    GitHub repository with 281 stars and 44 forks.

    Trending score: 0.04; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: agriculture, computer-vision, dataset, deep-learning, image-classification, object-detection

Trending in Python

  1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 180,642 stars and 30,980 forks.

    Trending score: 5.79; stars gained: +1,360; forks gained: +322.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. Alishahryar1/free-claude-code

    Use claude-code for free in the terminal, VSCode extension or discord like OpenClaw (voice supported)

    GitHub repository with 32,285 stars and 4,901 forks.

    Trending score: 4.60; stars gained: +456; forks gained: +73.

    Language: Python

  3. microsoft/SkillOpt

    SkillOpt is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.

    GitHub repository with 4,876 stars and 485 forks.

    Trending score: 4.55; stars gained: +340; forks gained: +27.

    Language: Python

    Topics: agent-skills, self-evolving-agents

  4. mukul975/Anthropic-Cybersecurity-Skills

    754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Copilot, Codex CLI, Cursor, Gemini CLI & 20+ platforms · 26 security domains · Apache 2.0

    GitHub repository with 13,233 stars and 1,551 forks.

    Trending score: 4.53; stars gained: +301; forks gained: +38.

    Language: Python

    Topics: ai-agents, claude-code, cybersecurity, incident-response, mitre-attack, penetration-testing

  5. virgiliojr94/book-to-skill

    Turn any technical book PDF into a Claude Code skill — ready to study, reference, and use while you work.

    GitHub repository with 4,141 stars and 516 forks.

    Trending score: 4.43; stars gained: +415; forks gained: +37.

    Language: Python

  6. anthropics/claude-code

    Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows - all through natural language commands.

    GitHub repository with 130,099 stars and 21,139 forks.

    Trending score: 4.42; stars gained: +277; forks gained: +38.

    Language: Python

Trending topic: dataset

  1. HumanSignal/label-studio

    Label Studio is a multi-type data labeling and annotation tool with standardized output format

    GitHub repository with 27,511 stars and 3,554 forks.

    Trending score: 1.16; stars gained: +7; forks gained: +1.

    Language: TypeScript

    Topics: computer-vision, deep-learning, image-annotation, annotation-tool, annotation, labeling

  2. JafarAkhondali/Morefixes

    MoreFixes: A Large-Scale Dataset of CVE Fix Commits Mined through Enhanced Repository Discovery

    GitHub repository with 72 stars and 17 forks.

    Trending score: 0.39; stars gained: +1; forks gained: +0.

    Language: Python

    Topics: hacktoberfest, commit, cve, dataset, fix, patch

  3. commoncrawl/web-languages

    Crowd-sourced lists of URLs to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ for the code.

    GitHub repository with 69 stars and 92 forks.

    Trending score: 0.33; stars gained: +1; forks gained: +1.

    Topics: crawling, dataset, language-detection

  4. BaseAdresseNationale/adresse.data.gouv.fr

    The official Adresse website

    GitHub repository with 164 stars and 39 forks.

    Trending score: 0.33; stars gained: +1; forks gained: +0.

    Language: TypeScript

    Topics: adresse, open-data, dataset

  5. meodai/color-names

    Large list of handpicked color names 🌈

    GitHub repository with 2,934 stars and 222 forks.

    Trending score: 0.31; stars gained: +1; forks gained: +0.

    Language: JavaScript

    Topics: color, colors, colour, colours, dataset, dictionary

  6. ATOM00blue/machine-learning-library

    A hand-curated, topic-organized library of the best ML education — 923 docs (391 arXiv papers, 474 Stanford/MIT/Karpathy/fast.ai lectures, 58 explainer articles), normalized to Markdown with full provenance. Open it in Obsidian or point your agent at it. A clean ML corpus for learning, RAG & fine-tuning.

    GitHub repository with 120 stars and 13 forks.

    Trending score: 0.30; stars gained: +1; forks gained: +1.

    Language: Python

    Topics: arxiv, corpus, dataset, deep-learning, education, llm