apify/crawlee-python

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

GitHub repository with 9,144 stars and 747 forks.

Language: Python

Topics: apify, automation, beautifulsoup, crawler, crawling, headless, headless-chrome, parsel, pip, playwright

24h trend summary

Trending score 1.08, activity score 1.37, stars gained +7, forks gained -1.

Latest metric snapshot

2026-06-05: 9,144 stars and 747 forks.

Similar repositories

  1. apify/crawlee-python

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

    GitHub repository with 9,144 stars and 747 forks.

    Trending score: 1.08; stars gained: +7; forks gained: -1.

    Language: Python

    Topics: apify, automation, beautifulsoup, crawler, crawling, headless

  2. apify/apify-sdk-python

    Apify SDK for Python: the official library for building Apify Actors (serverless cloud programs for web scraping, browser automation, data processing, and AI agents). Manages the Actor lifecycle, storages (datasets, key-value stores, request queues), events, proxies, and pay-per-event monetization. Built on top of the Apify API Client.

    GitHub repository with 169 stars and 23 forks.

    Trending score: 0.21; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: actor, apify, automation, crawlee, data-extraction, proxy

Trending in Python

  1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 181,572 stars and 31,155 forks.

    Trending score: 5.95; stars gained: +1,867; forks gained: +361.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 13,361 stars and 853 forks.

    Trending score: 5.69; stars gained: +2,829; forks gained: +175.

    Language: Python

    Topics: agent, ai, anthropic, compression, context-engineering, context-window

  3. Imbad0202/academic-research-skills

    Academic Research Skills for Claude Code: research → write → review → revise → finalize

    GitHub repository with 27,422 stars and 2,253 forks.

    Trending score: 5.52; stars gained: +1,079; forks gained: +89.

    Language: Python

    Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review

  4. anthropics/financial-services

    GitHub repository with 30,002 stars and 4,224 forks.

    Trending score: 4.88; stars gained: +688; forks gained: +114.

    Language: Python

  5. virgiliojr94/book-to-skill

    Turn any technical book PDF into a Claude Code skill — ready to study, reference, and use while you work.

    GitHub repository with 4,250 stars and 534 forks.

    Trending score: 4.88; stars gained: +476; forks gained: +68.

    Language: Python

  6. vinta/awesome-python

    An opinionated list of Python frameworks, libraries, tools, and resources

    GitHub repository with 301,341 stars and 28,044 forks.

    Trending score: 4.60; stars gained: +518; forks gained: +24.

    Language: Python

    Topics: awesome, python, collections, python-frameworks, python-libraries, python-tools

Trending topic: apify

  1. apify/crawlee-python

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

    GitHub repository with 9,144 stars and 747 forks.

    Trending score: 1.08; stars gained: +7; forks gained: -1.

    Language: Python

    Topics: apify, automation, beautifulsoup, crawler, crawling, headless

  2. apify/apify-sdk-python

    Apify SDK for Python: the official library for building Apify Actors (serverless cloud programs for web scraping, browser automation, data processing, and AI agents). Manages the Actor lifecycle, storages (datasets, key-value stores, request queues), events, proxies, and pay-per-event monetization. Built on top of the Apify API Client.

    GitHub repository with 169 stars and 23 forks.

    Trending score: 0.21; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: actor, apify, automation, crawlee, data-extraction, proxy

  3. crawlee-cloud/crawlee-cloud

    Self-hosted, open-source platform for running Apify Actors. Drop-in compatible with the Apify SDK.

    GitHub repository with 21 stars and 4 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: TypeScript

    Topics: apify-sdk, automation, crawlee-framework, docker, open-source, self-hosted

  4. apismith-labs/instagram-reels-transcript-api

    Instagram Reels Transcript API examples using Apify. Integrations for Python, Node.js, Java, Go, Rust, cURL, batch processing, and MCP workflows for ChatGPT, Claude, and Gemini.

    GitHub repository with 7 stars and 4 forks.

    Trending score: 0.00; stars gained: +0; forks gained: +0.

    Topics: ai-agent, apify, chatgpt, claude, gemini, golang