Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
📊 Project Info
- Language
- Python
- Stars
- ⭐ 74,149
- Forks
- 10,114
- Today
- +439
- Ranking
- #8
- Collection
- Overall
- Trending Date
- March 31, 2026
- Last Push
- 3/31/2026
🏷️ Topics
ai4sciencechineseocrdocument-parsingdocument-translationkieocrpaddleocr-vlpdf-extractor-ragpdf-parserpdf2markdownpp-ocrpp-structurerag


