iyulab/unpdf
High-performance Rust PDF extraction library with Markdown/JSON output, CJK/RTL support, multi-column layout detection, and Python/.NET/CLI bindings.
GitHub repository with 30 stars and 3 forks.
Language: Rust
Topics: cjk, document-extraction, ffi, markdown, pdf, rust