yfedoseev/pdf_oxide
The fastest PDF library for Python and Rust. Text extraction, image extraction, markdown conversion, PDF creation & editing. 0.8ms mean, 5× faster than industry leaders, 100% pass rate on 3,830 PDFs. MIT/Apache-2.0.
GitHub repository with 803 stars and 87 forks.
Language: Rust
Topics: document-processing, pdf, pdf-editor, pdf-generation, pdf-library, pdf-parser, python, rust, text-extraction, data-extraction