New Python OCR Libraries 2026

paddlepaddle/paddleocr 75K +2166

added 1 year ago

Awesome multilingual OCR toolkit based on PaddlePaddle. It's an ultra-lightweight OCR system with support for 80+ languages, data annotation, and data synthesis.

sirfz/tesserocr 2K

added 1 year ago

A Python wrapper for the tesseract-ocr API

madmaze/pytesseract 6K +6

added 1 year ago

A Python wrapper for Google Tesseract
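
As a rough illustration of what such a wrapper call looks like, here is a minimal sketch. It assumes `pytesseract`, Pillow, and the Tesseract binary are installed; the function names `image_to_text` and `normalize_ocr_text` are hypothetical helpers, not part of the library. Only `pytesseract.image_to_string` is the library's own call.

```python
def normalize_ocr_text(raw):
    """Strip surrounding whitespace from each line and drop blank lines."""
    lines = (line.strip() for line in raw.splitlines())
    return "\n".join(line for line in lines if line)


def image_to_text(image_path, lang="eng"):
    """OCR one image file (hypothetical helper).

    Assumes pytesseract, Pillow and the tesseract executable are available.
    """
    # Imported lazily so normalize_ocr_text stays usable without the OCR stack.
    import pytesseract
    from PIL import Image

    with Image.open(image_path) as img:
        return normalize_ocr_text(pytesseract.image_to_string(img, lang=lang))
```

Raw Tesseract output often carries blank lines and a trailing form feed, which is why the sketch tidies the text before returning it.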

lukas-blecher/latex-ocr 16K +32

added 1 year ago

Takes an image of a math formula and returns corresponding LaTeX code.

ocrmypdf/ocrmypdf 33K +144

added 1 year ago

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched or copy-pasted.
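
One way to script this is to call the `ocrmypdf` command-line tool from Python. The sketch below is an assumption-laden example rather than the project's recommended workflow: `build_ocrmypdf_command` and `add_text_layer` are hypothetical helper names, and the file names are placeholders. The `--language` and `--skip-text` flags are existing OCRmyPDF options.

```python
import shutil
import subprocess
from pathlib import Path


def build_ocrmypdf_command(src, dst, languages=("eng",), skip_text=True):
    """Return the argument list for an `ocrmypdf` CLI call (hypothetical helper)."""
    # Several Tesseract languages are joined with "+", e.g. "eng+deu".
    cmd = ["ocrmypdf", "--language", "+".join(languages)]
    if skip_text:
        # --skip-text leaves pages that already contain text untouched.
        cmd.append("--skip-text")
    cmd += [str(Path(src)), str(Path(dst))]
    return cmd


def add_text_layer(src, dst, **options):
    """Run OCRmyPDF if it is on PATH; return True when it exits with code 0."""
    if shutil.which("ocrmypdf") is None:
        return False  # tool not installed, so nothing is run
    completed = subprocess.run(build_ocrmypdf_command(src, dst, **options))
    return completed.returncode == 0
```

Keeping command construction separate from execution makes the arguments easy to inspect before any PDF is touched.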

hiroi-sora/umi-ocr 36K +207

added 1 year ago

Free, open-source, offline OCR tool for batch text recognition.

jaidedai/easyocr 29K +97

added 1 year ago

Ready-to-use OCR with 80+ supported languages and all popular writing scripts
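
A short sketch of typical usage, assuming the `easyocr` package is installed: `easyocr.Reader` and `readtext` are the library's calls, and `readtext` returns `(bounding_box, text, confidence)` tuples by default. The helper names `confident_text` and `read_image` and the 0.5 threshold are illustrative assumptions.

```python
def confident_text(results, min_confidence=0.5):
    """Keep the text of detections at or above min_confidence."""
    return [text for _bbox, text, conf in results if conf >= min_confidence]


def read_image(image_path, languages=("en",)):
    """OCR one image with EasyOCR (hypothetical helper; needs easyocr installed)."""
    # Imported lazily; constructing a Reader loads models and may download them.
    import easyocr

    reader = easyocr.Reader(list(languages))
    return confident_text(reader.readtext(image_path))
```

Filtering on the confidence score is one simple way to discard noisy detections; the right threshold depends on the images.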

libs.tech

Discover the best Python libraries and hidden gems. Coded at night under caffeine, ad-free, curated by the Python community.