Local Searchable OCR
Make scanned PDFs searchable. Runs 100% locally with WebAssembly.
Language data is downloaded once on first use and cached locally by your browser.
How It Works
Each page is rendered to an image and recognized in a Web Worker via Tesseract.js. The recognized text is then written back into the PDF as a hidden text layer. Your document never leaves the browser.
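For the hidden text layer to line up with the page image, each recognized word's pixel box has to be mapped into PDF coordinates. The sketch below shows one way that mapping could look. It is not this tool's actual code: the function name and the two interfaces are hypothetical, the x0/y0/x1/y1 field names follow the convention Tesseract.js uses for word boxes, and the image is assumed to cover the full page with no rotation or crop.

```typescript
// Bounding box in image pixels, origin at the top-left corner.
// Field names follow the x0/y0/x1/y1 convention of Tesseract.js word boxes.
interface PixelBox { x0: number; y0: number; x1: number; y1: number; }

// Rectangle in PDF points (1/72 inch), origin at the bottom-left of the page.
interface PdfRect { x: number; y: number; width: number; height: number; }

// Hypothetical helper: convert a pixel box on a rendered page image into
// PDF user-space points. Assumes the image covers the whole page, unrotated.
function pixelBoxToPdfRect(
  box: PixelBox,
  imageWidthPx: number,
  imageHeightPx: number,
  pageWidthPt: number,
  pageHeightPt: number,
): PdfRect {
  const scaleX = pageWidthPt / imageWidthPx;
  const scaleY = pageHeightPt / imageHeightPx;
  return {
    x: box.x0 * scaleX,
    // Flip the vertical axis: the box's bottom edge (y1) becomes the rect's y.
    y: pageHeightPt - box.y1 * scaleY,
    width: (box.x1 - box.x0) * scaleX,
    height: (box.y1 - box.y0) * scaleY,
  };
}

// Usage: a US Letter page (612 x 792 pt) rendered at 2x (1224 x 1584 px).
const rect = pixelBoxToPdfRect({ x0: 100, y0: 200, x1: 300, y1: 250 }, 1224, 1584, 612, 792);
```

The vertical flip is the step most easily missed: image rows count downward from the top, while default PDF coordinates count upward from the bottom.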
Key Features
Runs in Your Browser
Tesseract WebAssembly runs entirely on your device. No server processing, no upload limits.
Multi-Language
Supports English, Chinese (simplified & traditional), Japanese, French, German, Spanish, and more.
Searchable Output
Adds a hidden text layer aligned with the page image. The PDF looks identical but becomes fully searchable and indexable.
Copy & Paste
Select and copy text from your scanned documents in any PDF reader.
100% Private
Sensitive documents never leave your device — perfect for legal, medical, and corporate use.
Real-Time Progress
Watch page-by-page progress as Tesseract recognizes text in real time.
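Page-by-page progress can be derived from the per-page messages the recognizer emits. The sketch below is an illustration, not this tool's implementation. The `overallPercent` helper is hypothetical, and the message shape (`status` plus a `progress` value between 0 and 1, with the status string "recognizing text" during recognition) is assumed to match what Tesseract.js passes to its logger callback.

```typescript
// Assumed shape of a progress message: `status` names the current phase,
// `progress` is a fraction between 0 and 1 for that phase.
interface ProgressMessage { status: string; progress: number; }

// Hypothetical helper: combine finished pages and the current page's
// recognition progress into a whole-document percentage (0..100).
function overallPercent(pagesDone: number, totalPages: number, msg: ProgressMessage): number {
  if (totalPages <= 0) return 0;
  // Only the recognition phase counts toward the current page;
  // loading and initialization phases contribute nothing.
  const current = msg.status === "recognizing text" ? msg.progress : 0;
  const clamped = Math.min(Math.max(current, 0), 1);
  const fraction = Math.min((pagesDone + clamped) / totalPages, 1);
  return Math.round(fraction * 100);
}

// Usage: one of two pages done, second page halfway through recognition.
const percent = overallPercent(1, 2, { status: "recognizing text", progress: 0.5 }); // 75
```

Ignoring non-recognition phases keeps the bar from jumping backward when a worker reports loading progress between pages.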
How It Works
Upload Scanned PDF
Drag your scanned or image-based PDF into the editor.
Choose Language
Select the recognition language. The language model is downloaded to your browser cache on first use.
Run OCR
Click Run OCR. Each page is processed locally in a Web Worker — your CPU does the work.
Download Searchable PDF
Save the OCR-enhanced PDF. Open it in any PDF reader and press Ctrl+F (Cmd+F on macOS) to search.
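The language choice in the steps above ultimately becomes a Tesseract language code. The sketch below maps the languages named on this page to their standard traineddata codes. The `toLanguageString` helper and the display-name keys are hypothetical. Joining codes with "+" is Tesseract's convention for recognizing several languages at once, and whether this tool exposes multi-language selection is an assumption.

```typescript
// Tesseract traineddata codes for the languages named on this page.
const LANGUAGE_CODES: Record<string, string> = {
  "English": "eng",
  "Chinese (Simplified)": "chi_sim",
  "Chinese (Traditional)": "chi_tra",
  "Japanese": "jpn",
  "French": "fra",
  "German": "deu",
  "Spanish": "spa",
};

// Hypothetical helper: build a Tesseract language string such as "eng+fra"
// from display names, rejecting unknown names and skipping duplicates.
function toLanguageString(names: string[]): string {
  const codes: string[] = [];
  for (const name of names) {
    const code = LANGUAGE_CODES[name];
    if (code === undefined) throw new Error(`Unsupported language: ${name}`);
    if (codes.indexOf(code) === -1) codes.push(code); // skip duplicates
  }
  if (codes.length === 0) throw new Error("Select at least one language");
  return codes.join("+");
}

// Usage: a document mixing English and French text.
const lang = toLanguageString(["English", "French"]); // "eng+fra"
```

Each code corresponds to a separate language data file, so every extra language adds one more first-use download to the browser cache.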