pymupdf/PyMuPDF
PyMuPDF is a high-performance Python library for data extraction, analysis, conversion, and manipulation of PDF (and other) documents.
GitHub repository with 9,973 stars and 738 forks.
Language: Python
Topics: mupdf, xps, pdf-documents, epub, ocr, pdf, font, python, data-science, extract-data