bzsanti/oxidizePdf

Pure Rust PDF library for AI/RAG: structure-aware chunking, no ML, no C deps.

GitHub repository with 173 stars and 21 forks.

Language: Rust

Topics: data-extraction, document-processing, pdf, pdf-generation, pdf-parser, rust, text-extraction, digital-signatures, encryption, invoice

Latest metric snapshot

2026-06-05: 173 stars and 21 forks.

Trending in Rust

  1. BigPizzaV3/CodexPlusPlus

    An enhanced tool for CodexApp, striving to make Codex better to use and more comfortable

    GitHub repository with 14,059 stars and 871 forks.

    Trending score: 5.16; stars gained: +916; forks gained: +44.

    Language: Rust

  2. rtk-ai/rtk

    CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

    GitHub repository with 59,182 stars and 3,643 forks.

    Trending score: 4.96; stars gained: +654; forks gained: +44.

    Language: Rust

    Topics: agentic-coding, ai-coding, anthropic, claude-code, cli, command-line-tool

  3. openai/codex

    Lightweight coding agent that runs in your terminal

    GitHub repository with 88,941 stars and 13,069 forks.

    Trending score: 4.58; stars gained: +326; forks gained: +48.

    Language: Rust

  4. tinyhumansai/openhuman

    Your Personal AI super intelligence. Private, Simple and extremely powerful.

    GitHub repository with 30,892 stars and 2,983 forks.

    Trending score: 4.37; stars gained: +332; forks gained: +50.

    Language: Rust

  5. fallow-rs/fallow

    Codebase intelligence for TypeScript and JavaScript. Free static layer: unused code, duplication, circular deps, complexity hotspots, architecture boundaries. Optional paid runtime layer: hot-path review and cold-path deletion evidence from real production traffic. Rust-native, sub-second, zero-config framework support.

    GitHub repository with 3,118 stars and 96 forks.

    Trending score: 4.05; stars gained: +346; forks gained: +16.

    Language: Rust

    Topics: cli, code-duplication, code-quality, codebase-intelligence, copy-paste-detection, dead-code

  6. openlake-project/openlake

    High performance object store for fast LLM Inference and GPU Training. Feed your GPUs at blazing fast speeds

    GitHub repository with 1,118 stars and 176 forks.

    Trending score: 4.00; stars gained: +244; forks gained: +120.

    Language: Rust

    Topics: blackwell, gpt, gpu, high-performance, llm, llm-training

Trending topic: data-extraction

  1. firecrawl/firecrawl

    The API to search, scrape, and interact with the web at scale. 🔥

    GitHub repository with 129,046 stars and 7,682 forks.

    Trending score: 4.80; stars gained: +954; forks gained: +49.

    Language: TypeScript

    Topics: ai, ai-agents, ai-crawler, ai-scraping, ai-search, crawler

  2. ScrapeGraphAI/Scrapegraph-ai

    Python scraper based on AI

    GitHub repository with 26,745 stars and 2,486 forks.

    Trending score: 1.91; stars gained: +105; forks gained: +7.

    Language: Python

    Topics: scraping, scraping-python, llm, web-crawler, web-scraping, ai-scraping

  3. crackarchermoat/ai-ocr-document-parser-mathpix-nanonets

    AI OCR — mathpix, nanonets, ai document parser, ocr tool.

    GitHub repository with 5 stars and 0 forks.

    Trending score: 0.04; stars gained: +0; forks gained: +0.

    Topics: ai-ocr, data-extraction, document-parsing, mathpix, nanonets, text-recognition

  4. myklovenyzforever/chem-pdf-extractor

    LLM-powered scientific PDF data extraction tool for chemistry and chemical engineering literature.

    GitHub repository with 10 stars and 1 fork.

    Trending score: 0.04; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: catalysis, chemical-engineering, data-extraction, excel-export, literature-review, llm

  5. thunderbit-com/thunderbit-mcp-server

    AI-powered web scraping and structured data extraction. CLI + MCP server + Claude Code plugin for the Thunderbit Open API.

    GitHub repository with 13 stars and 3 forks.

    Trending score: 0.02; stars gained: +0; forks gained: +0.

    Language: JavaScript

    Topics: ai-agents, claude, data-extraction, llm, markdown, mcp

  6. mdowis/anansi

    A self-healing web scraper built for hostile sites: selectors repair themselves, browser rendering kicks in when needed, and Chrome TLS fingerprinting evades bot detection. Ships with an MCP server so any LLM can drive a full crawl through conversation.

    GitHub repository with 90 stars and 17 forks.

    Trending score: 0.00; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: adaptive-scraping, ai-agent, anti-bot, crawler, data-extraction, llm-tools