adbar/trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

GitHub repository with 6,122 stars and 382 forks.

Language: Python

Topics: web-scraping, text-extraction, nlp, html2text, text-mining, crawler, text-cleaning, text-preprocessing, article-extractor, readability

24h trend summary

Trending score 2.05, freshness score 0.77, stars gained +8, forks gained +1.

Latest metric snapshot

2026-06-15: 6,122 stars and 382 forks.

Similar repositories

  1. CloakHQ/CloakBrowser

    Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with source-level fingerprint patches. 30/30 tests passed.

    GitHub repository with 26,177 stars and 2,071 forks.

    Trending score: 5.28; stars gained: +897; forks gained: +60.

    Language: Python

    Topics: anti-detect, bot-detection, browser-automation, chromium, cloudflare, fingerprint

  2. ScrapeGraphAI/Scrapegraph-ai

    Python scraper based on AI

    GitHub repository with 27,223 stars and 2,566 forks.

    Trending score: 4.00; stars gained: +93; forks gained: +15.

    Language: Python

    Topics: ai-crawler, ai-scraping, ai-search, crawler, data-extraction, firecrawl-alternative

  3. dgtlmoon/changedetection.io

    Best and simplest tool for website change detection, web page monitoring, and website change alerts. Perfect for tracking content changes, price drops, restock alerts, and website defacement monitoring—all for free or enjoy our SaaS plan!

    GitHub repository with 32,011 stars and 1,838 forks.

    Trending score: 3.80; stars gained: +161; forks gained: +16.

    Language: Python

    Topics: back-in-stock, change-alert, change-detection, change-monitoring, monitoring, notifications

  4. feder-cr/invisible_playwright

    Anti-Detect Browser that passes every bot detection test. Drop-in Playwright replacement.

    GitHub repository with 1,352 stars and 147 forks.

    Trending score: 3.23; stars gained: +49; forks gained: +5.

    Language: Python

    Topics: anti-bot, anti-detect-browser, automation, browser-automation, browser-fingerprinting, captcha-bypass

  5. scrapy/scrapy

    Scrapy, a fast high-level web crawling & scraping framework for Python.

    GitHub repository with 62,255 stars and 11,648 forks.

    Trending score: 3.17; stars gained: +42; forks gained: +7.

    Language: Python

    Topics: crawler, crawling, framework, hacktoberfest, python, scraping

  6. lexiforest/curl_cffi

    Python binding for the curl-impersonate fork via cffi. An HTTP client that can impersonate browser TLS/JA3/HTTP2 fingerprints.

    GitHub repository with 5,828 stars and 499 forks.

    Trending score: 2.24; stars gained: +12; forks gained: +0.

    Language: Python

    Topics: curl, http-client, curl-impersonate, http, https, ja3

Trending in Python

  1. harry0703/MoneyPrinterTurbo

    Generate high-definition short videos with one click using large AI models.

    GitHub repository with 88,031 stars and 12,625 forks.

    Trending score: 6.02; stars gained: +1,097; forks gained: +218.

    Language: Python

    Topics: ai, automation, chatgpt, moviepy, python, shortvideo

  2. pewdiepie-archdaemon/odysseus

    Self-hosted AI workspace.

    GitHub repository with 71,489 stars and 9,116 forks.

    Trending score: 5.98; stars gained: +834; forks gained: +140.

    Language: Python

  3. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 194,172 stars and 34,003 forks.

    Trending score: 5.92; stars gained: +753; forks gained: +209.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  4. NVIDIA/SkillSpector

    Security scanner for AI agent skills. Detect vulnerabilities, malicious patterns, and security risks.

    GitHub repository with 5,962 stars and 441 forks.

    Trending score: 5.61; stars gained: +874; forks gained: +76.

    Language: Python

  5. rohitg00/ai-engineering-from-scratch

    Learn it. Build it. Ship it for others.

    GitHub repository with 32,676 stars and 5,366 forks.

    Trending score: 5.59; stars gained: +762; forks gained: +135.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course

  6. Agents365-ai/drawio-skill

    Generate draw.io diagrams from natural language — 6 presets, vision self-check + up to 5-round refinement, codebase-to-diagram, 10,000+ official shapes & 321 AI/LLM brand logos. Exports PNG/SVG/PDF/JPG.

    GitHub repository with 3,445 stars and 240 forks.

    Trending score: 5.51; stars gained: +1,369; forks gained: +113.

    Language: Python

    Topics: agent-skill, agent-skills, architecture-diagram, claude-code, claude-code-skill, claude-skills

Trending topic: web-scraping

  1. CloakHQ/CloakBrowser

    Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with source-level fingerprint patches. 30/30 tests passed.

    GitHub repository with 26,177 stars and 2,071 forks.

    Trending score: 5.28; stars gained: +897; forks gained: +60.

    Language: Python

    Topics: anti-detect, bot-detection, browser-automation, chromium, cloudflare, fingerprint

  2. firecrawl/firecrawl

    The API to search, scrape, and interact with the web at scale. 🔥

    GitHub repository with 133,013 stars and 7,799 forks.

    Trending score: 5.24; stars gained: +635; forks gained: +11.

    Language: TypeScript

    Topics: ai, ai-agents, ai-crawler, ai-scraping, ai-search, crawler

  3. ScrapeGraphAI/Scrapegraph-ai

    Python scraper based on AI

    GitHub repository with 27,223 stars and 2,566 forks.

    Trending score: 4.00; stars gained: +93; forks gained: +15.

    Language: Python

    Topics: ai-crawler, ai-scraping, ai-search, crawler, data-extraction, firecrawl-alternative

  4. dgtlmoon/changedetection.io

    Best and simplest tool for website change detection, web page monitoring, and website change alerts. Perfect for tracking content changes, price drops, restock alerts, and website defacement monitoring—all for free or enjoy our SaaS plan!

    GitHub repository with 32,011 stars and 1,838 forks.

    Trending score: 3.80; stars gained: +161; forks gained: +16.

    Language: Python

    Topics: back-in-stock, change-alert, change-detection, change-monitoring, monitoring, notifications

  5. feder-cr/invisible_playwright

    Anti-Detect Browser that passes every bot detection test. Drop-in Playwright replacement.

    GitHub repository with 1,352 stars and 147 forks.

    Trending score: 3.23; stars gained: +49; forks gained: +5.

    Language: Python

    Topics: anti-bot, anti-detect-browser, automation, browser-automation, browser-fingerprinting, captcha-bypass

  6. scrapy/scrapy

    Scrapy, a fast high-level web crawling & scraping framework for Python.

    GitHub repository with 62,255 stars and 11,648 forks.

    Trending score: 3.17; stars gained: +42; forks gained: +7.

    Language: Python

    Topics: crawler, crawling, framework, hacktoberfest, python, scraping