amineHorseman/images-web-crawler

This package is a complete tool for creating a large dataset of images (designed especially, but not only, for machine learning enthusiasts). It can crawl the web, download images, rename / resize / convert the images, and merge folders.

GitHub repository with 107 stars and 24 forks.

Language: Python

Topics: dataset, images, crawler, dataset-creation, image-dataset, image-classification, machine-learning, image-processing, google-images-downloader, google-images-crawler


24h trend summary

Trending score 0.00, activity score 0.00, stars gained +0, forks gained +0.

Latest metric snapshot

2026-06-02: 107 stars and 24 forks.

Similar repositories

  1. emmanuelgjr/genai_incidents

    Single source of truth for GenAI and agentic AI security incidents, mapped to OWASP LLM Top 10, OWASP Agentic Top 10 (ASI), NIST AI RMF, and MITRE ATLAS.

    GitHub repository with 13 stars and 3 forks.

    Trending score: 0.87; stars gained: +6; forks gained: +1.

    Language: Python

    Topics: agentic-incidents, ai-incidents, ai-safety, cybersecurity, dataset, genai-incidents

  2. ATOM00blue/machine-learning-library

    A hand-curated, topic-organized library of the best ML education — 923 docs (391 arXiv papers, 474 Stanford/MIT/Karpathy/fast.ai lectures, 58 explainer articles), normalized to Markdown with full provenance. Open it in Obsidian or point your agent at it. A clean ML corpus for learning, RAG & fine-tuning.

    GitHub repository with 121 stars and 13 forks.

    Trending score: 0.58; stars gained: +3; forks gained: +1.

    Language: Python

    Topics: arxiv, corpus, dataset, deep-learning, education, llm

  3. juliensimon/space-datasets

    200+ auto-updated space, astronomy & physics datasets on Hugging Face (NASA, NOAA, ESA, JPL, SpaceX, Wikidata). Satellites, asteroids, space probes (Voyager, Cassini, Mars), space weather, exoplanets, pulsars, radio/X-ray surveys, cosmic rays, particle physics, and more. Parquet format, no API keys.

    GitHub repository with 7 stars and 1 fork.

    Trending score: 0.36; stars gained: +1; forks gained: +1.

    Language: Python

    Topics: asteroids, astronomy, dataset, esa, exoplanet, huggingface-datasets

  4. DataDog/malicious-software-packages-dataset

    An open-source dataset of malicious software packages found in the wild, 100% vetted by humans.

    GitHub repository with 348 stars and 52 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: dataset, guarddog, malicious-packages, software-supply-chain-security

  5. DeepMIMO/DeepMIMO

    DeepMIMOv4: A Toolchain and Database for Ray-tracing Datasets.

    GitHub repository with 107 stars and 23 forks.

    Trending score: 0.04; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: 5g, 6g, dataset, machine-learning, radio-propagation, raytracing

  6. EuniAI/TerminalWorld

    Benchmarking Agents on Real-World Terminal Tasks

    GitHub repository with 17 stars and 1 fork.

    Trending score: 0.03; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: agent, benchmark, cli, dataset, evaluation, llm

Trending in Python

  1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 182,737 stars and 31,332 forks.

    Trending score: 5.95; stars gained: +1,867; forks gained: +361.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. Imbad0202/academic-research-skills

    Academic Research Skills for Claude Code: research → write → review → revise → finalize

    GitHub repository with 27,643 stars and 2,276 forks.

    Trending score: 5.52; stars gained: +1,079; forks gained: +89.

    Language: Python

    Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review

  3. rohitg00/ai-engineering-from-scratch

    Learn it. Build it. Ship it for others.

    GitHub repository with 28,771 stars and 4,705 forks.

    Trending score: 5.32; stars gained: +1,261; forks gained: +238.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course

  4. vinta/awesome-python

    An opinionated list of Python frameworks, libraries, tools, and resources

    GitHub repository with 301,459 stars and 28,045 forks.

    Trending score: 4.60; stars gained: +518; forks gained: +24.

    Language: Python

    Topics: awesome, python, collections, python-frameworks, python-libraries, python-tools

  5. Alishahryar1/free-claude-code

    Use claude-code for free in the terminal, VSCode extension or discord like OpenClaw (voice supported)

    GitHub repository with 32,561 stars and 4,946 forks.

    Trending score: 4.56; stars gained: +467; forks gained: +82.

    Language: Python

  6. langchain-ai/langchain

    The agent engineering platform.

    GitHub repository with 138,601 stars and 22,962 forks.

    Trending score: 4.53; stars gained: +171; forks gained: +31.

    Language: Python

    Topics: ai, anthropic, gemini, langchain, llm, openai

Trending topic: dataset

  1. emmanuelgjr/genai_incidents

    Single source of truth for GenAI and agentic AI security incidents, mapped to OWASP LLM Top 10, OWASP Agentic Top 10 (ASI), NIST AI RMF, and MITRE ATLAS.

    GitHub repository with 13 stars and 3 forks.

    Trending score: 0.87; stars gained: +6; forks gained: +1.

    Language: Python

    Topics: agentic-incidents, ai-incidents, ai-safety, cybersecurity, dataset, genai-incidents

  2. ATOM00blue/machine-learning-library

    A hand-curated, topic-organized library of the best ML education — 923 docs (391 arXiv papers, 474 Stanford/MIT/Karpathy/fast.ai lectures, 58 explainer articles), normalized to Markdown with full provenance. Open it in Obsidian or point your agent at it. A clean ML corpus for learning, RAG & fine-tuning.

    GitHub repository with 121 stars and 13 forks.

    Trending score: 0.58; stars gained: +3; forks gained: +1.

    Language: Python

    Topics: arxiv, corpus, dataset, deep-learning, education, llm

  3. juliensimon/space-datasets

    200+ auto-updated space, astronomy & physics datasets on Hugging Face (NASA, NOAA, ESA, JPL, SpaceX, Wikidata). Satellites, asteroids, space probes (Voyager, Cassini, Mars), space weather, exoplanets, pulsars, radio/X-ray surveys, cosmic rays, particle physics, and more. Parquet format, no API keys.

    GitHub repository with 7 stars and 1 fork.

    Trending score: 0.36; stars gained: +1; forks gained: +1.

    Language: Python

    Topics: asteroids, astronomy, dataset, esa, exoplanet, huggingface-datasets

  4. meodai/color-names

    Large list of handpicked color names 🌈

    GitHub repository with 2,934 stars and 222 forks.

    Trending score: 0.31; stars gained: +1; forks gained: +0.

    Language: JavaScript

    Topics: color, colors, colour, colours, dataset, dictionary

  5. lutangar/cities.json

    🏙️ Cities of the world in JSON, based on GeoNames Gazetteer

    GitHub repository with 488 stars and 88 forks.

    Trending score: 0.12; stars gained: +0; forks gained: +0.

    Language: JavaScript

    Topics: cities, json, dataset, geonames-gazetteer, geolocation

  6. DataDog/malicious-software-packages-dataset

    An open-source dataset of malicious software packages found in the wild, 100% vetted by humans.

    GitHub repository with 348 stars and 52 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: dataset, guarddog, malicious-packages, software-supply-chain-security