pdf repositories

Discover trending repositories tagged pdf, ranked by recent growth and activity.

  1. microsoft/markitdown

    Python tool for converting files and office documents to Markdown.

    GitHub repository with 139,256 stars and 9,481 forks.

    Trending score: 3.28; stars gained: +3,163; forks gained: +184.

    Language: Python

    Topics: langchain, openai, autogen-extension, autogen, markdown, microsoft-office

  2. opendatalab/MinerU

    Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

    GitHub repository with 66,498 stars and 5,607 forks.

    Trending score: 2.60; stars gained: +331; forks gained: +32.

    Language: Python

    Topics: ai4science, document-analysis, docx, extract-data, layout-analysis, ocr

  3. shipfastlabs/parsel

    A fast, helpful, and open-source document parser for PHP

    GitHub repository with 244 stars and 5 forks.

    Trending score: 2.38; stars gained: +16; forks gained: +0.

    Language: PHP

    Topics: docs, ocr, pdf, text-extarction

  4. run-llama/liteparse

    A fast, helpful, and open-source document parser

    GitHub repository with 9,137 stars and 550 forks.

    Trending score: 2.34; stars gained: +193; forks gained: +16.

    Language: Rust

    Topics: document-ocr, document-processing, ocr, ocr-recognition, pdf, pdf-parser

  5. docling-project/docling

    Get your documents ready for gen AI

    GitHub repository with 60,986 stars and 4,257 forks.

    Trending score: 2.19; stars gained: +117; forks gained: +15.

    Language: Python

    Topics: ai, convert, document-parser, document-parsing, documents, docx

  6. Stirling-Tools/Stirling-PDF

    #1 PDF Application on GitHub that lets you edit PDFs on any device anywhere

    GitHub repository with 80,215 stars and 7,028 forks.

    Trending score: 1.94; stars gained: +113; forks gained: +13.

    Language: TypeScript

    Topics: docker, java, pdf, pdf-converter, pdf-manipulation, pdf-merger

  7. datadrivenconstruction/OpenConstructionERP

    Open-source construction ERP - BOQ, PDF/CAD/BIM takeoff, AI cost matching. 42 regional catalogues, 21 languages, 71 modules. AGPL-3.0. v3.0 - pip install openconstructionerp

    GitHub repository with 306 stars and 108 forks.

    Trending score: 1.83; stars gained: +7; forks gained: +1.

    Language: TypeScript

    Topics: 4d, 5d, ai, autocad, bill-of-quantities, bim

  8. koreader/koreader

    An ebook reader application supporting PDF, DjVu, EPUB, FB2 and many more formats, running on Cervantes, Kindle, Kobo, PocketBook and Android devices

    GitHub repository with 27,117 stars and 1,737 forks.

    Trending score: 1.82; stars gained: +36; forks gained: +5.

    Language: Lua

    Topics: cbz, djvu, djvu-reflow, ebook, ebook-reader, eink

  9. documenso/documenso

    The Open Source DocuSign Alternative.

    GitHub repository with 13,209 stars and 2,709 forks.

    Trending score: 1.75; stars gained: +37; forks gained: +8.

    Language: TypeScript

    Topics: document-signing, next-auth, nextjs, open-source, pades-standard, pdf-sign

  10. hehonghui/awesome-english-ebooks

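    Free downloads of English-language magazines including The Economist (with audio), The New Yorker, The Guardian, Wired, and The Atlantic, in epub, mobi, and pdf formats, updated weekly.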

    GitHub repository with 31,927 stars and 2,619 forks.

    Trending score: 1.58; stars gained: +45; forks gained: -1.

    Language: CSS

    Topics: download, ebooks, economist, economist-ebooks, new-yorker, pdf

  11. siyuan-note/siyuan

    A privacy-first, self-hosted, fully open-source personal knowledge management application, written in TypeScript and Golang.

    GitHub repository with 44,269 stars and 2,826 forks.

    Trending score: 1.49; stars gained: +24; forks gained: +0.

    Language: TypeScript

    Topics: agent, anki, deepseek, electron, evernote, knowledge-base

  12. Aryan-Raj3112/episteme

    A native Android document reader application built with Kotlin and Jetpack Compose.

    GitHub repository with 765 stars and 37 forks.

    Trending score: 1.48; stars gained: +22; forks gained: +0.

    Language: Kotlin

    Topics: foss, jetpack-compose, kotlin, opensource, pdf, reader

  13. ciromattia/kcc

    KCC (a.k.a. Kindle Comic Converter) is a comic and manga converter for ebook readers.

    GitHub repository with 5,187 stars and 333 forks.

    Trending score: 1.16; stars gained: +8; forks gained: -1.

    Language: Python

    Topics: azw3, cbz, comics, eink, epub, kindle

  14. notoriouslab/doc-cleaner

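    doc-cleaner: an open-source document-cleaning tool designed for Traditional Chinese financial documents. It runs fully offline: your documents shouldn't have to leave your computer just to be organized :)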

    GitHub repository with 263 stars and 37 forks.

    Trending score: 0.93; stars gained: +8; forks gained: +0.

    Language: Python

    Topics: bank-statement, pdf, python

  15. papermark/papermark

    Papermark is the open-source DocSend alternative with built-in analytics and custom domains.

    GitHub repository with 8,454 stars and 1,247 forks.

    Trending score: 0.92; stars gained: +4; forks gained: +2.

    Language: TypeScript

    Topics: nextjs, typescript, dataroom, next-auth, open-source, pdf

  16. karimz1/imgcompress

    Imgcompress is a self-hosted image processing toolbox that handles compression, format conversion, and AI background removal in a single web interface. It supports over 70 input formats (including PSD, HEIC, and RAW) and can output common formats or generate PDFs, all without sending files to external services.

    GitHub repository with 236 stars and 22 forks.

    Trending score: 0.86; stars gained: +5; forks gained: +0.

    Language: TypeScript

    Topics: imagecompression, imageoptimizer, self-hosted, webtool, datahoarder, docker

  17. beltromatti/get-it

    Read it. See it. Get it. Built at GDG AI Hack Milan 2026 for the "Learn Different" track.

    GitHub repository with 66 stars and 14 forks.

    Trending score: 0.86; stars gained: +7; forks gained: +2.

    Language: JavaScript

    Topics: agents, ai, codex, edtech, feynman-technique, flashcards

  18. py-pdf/pypdf

    A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files.

    GitHub repository with 10,028 stars and 1,576 forks.

    Trending score: 0.84; stars gained: +5; forks gained: +1.

    Language: Python

    Topics: pypdf2, pdf, python, pdf-parser, pdf-parsing, pdf-manipulation

  19. Karna14314/Pdf_Tools

    Privacy-first, offline PDF editor for Android. Merge, split, compress, convert and annotate PDFs — 100% on-device, no internet required.

    GitHub repository with 329 stars and 13 forks.

    Trending score: 0.82; stars gained: +6; forks gained: +0.

    Language: Kotlin

    Topics: android, android-app, jetpack-compose, kotlin, material-design, offline

  20. pdfme/pdfme

    Open-source PDF generation library built with TypeScript and React. Features a WYSIWYG template designer, PDF viewer, and powerful generation capabilities. Create custom PDFs effortlessly in both browser and Node.js environments.

    GitHub repository with 4,390 stars and 471 forks.

    Trending score: 0.82; stars gained: +3; forks gained: +1.

    Language: TypeScript

    Topics: pdf, pdf-generation, pdf-designer, pdf-viewer, typescript, react

  21. yuroyami/KitePDF

    PDF library written in pure Kotlin (reading, viewing, rendering, editing and creating PDFs) for Android, iOS, Web and JVM Desktop. No JNI or expect/actuals, only 100% Kotlin. Supports Compose multiplatform.

    GitHub repository with 10 stars and 0 forks.

    Trending score: 0.80; stars gained: +5; forks gained: +0.

    Language: Kotlin

    Topics: compose-multiplatform, klib, kotlin, kotlin-multiplatform, library, pdf

  22. simpledms/simpledms

    Document management for small businesses.

    GitHub repository with 141 stars and 3 forks.

    Trending score: 0.78; stars gained: +6; forks gained: +0.

    Language: Go

    Topics: dms, documents, documents-management, documents-manager, document, document-management

  23. Tencent/tgfx

    A lightweight 2D graphics library for modern GPUs, delivering high-performance text, image, and vector rendering across major platforms.

    GitHub repository with 1,539 stars and 128 forks.

    Trending score: 0.72; stars gained: +3; forks gained: +1.

    Language: C++

    Topics: 2d, graphics, tgfx, rendering, gpu, filter

  24. mbret/prose-reader

    Reading engine: render EPUBs in the browser or on mobile.

    GitHub repository with 24 stars and 1 fork.

    Trending score: 0.67; stars gained: +3; forks gained: +0.

    Language: TypeScript

    Topics: cbr, cbz, cbz-archive, ebook, ebook-library, ebook-reader