PaddlePaddle

PaddlePaddle / PaddleOCR

#8
74,14910,114+439 todayPython

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

📊 Project Info

Language
Python
Stars
74,149
Forks
10,114
Today
+439
Ranking
#8
Collection
Overall
Trending Date
March 31, 2026
Last Push
3/31/2026

🏷️ Topics

ai4sciencechineseocrdocument-parsingdocument-translationkieocrpaddleocr-vlpdf-extractor-ragpdf-parserpdf2markdownpp-ocrpp-structurerag

📸 Screenshots

PaddleOCR screenshot 1PaddleOCR screenshot 2PaddleOCR screenshot 3