OCR scanned PDF
Tesseract OCR → text (15 languages).
First-run note: downloads ~3 MB tesseract engine + ~10 MB language model the first time. After that, cached in the browser. ~1–3 seconds per page on typical hardware.
Tesseract OCR → text (15 languages).