Upload a scanned PDF
Use this page for image-only PDFs where text cannot be selected. If the PDF already has selectable text, copying directly from the PDF may be faster.
Render scanned PDF pages, run OCR locally, and turn page images into searchable, editable text exports.
Choose files to begin OCR.
Selected files and page previews will appear here.
This OCR workspace is built for real files, not just a single demo image. Use these controls to improve recognition, manage batches, and export the text in the format you need.
Enter ranges like 1-3, 7, 10-12. For large PDFs, start with a short range so the browser does not spend time rendering pages you do not need.
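A range string such as 1-3, 7, 10-12 can be expanded into a sorted list of page numbers before any rendering starts. The sketch below is illustrative only: `parsePageRanges` and its `pageCount` parameter are hypothetical names, not the tool's actual code, and the choice to reject out-of-bounds ranges instead of clamping them is an assumption.

```typescript
// Hypothetical helper, not taken from the tool's source.
// Expands "1-3, 7, 10-12" into [1, 2, 3, 7, 10, 11, 12].
function parsePageRanges(input: string, pageCount: number): number[] {
  const pages = new Set<number>();
  for (const rawPart of input.split(",")) {
    const part = rawPart.trim();
    if (part === "") continue; // tolerate stray or trailing commas
    // Accept a single page ("7") or a range ("10-12").
    const match = /^(\d+)(?:\s*-\s*(\d+))?$/.exec(part);
    if (match === null) {
      throw new Error(`Invalid page range: "${part}"`);
    }
    const start = Number(match[1]);
    const end = match[2] !== undefined ? Number(match[2]) : start;
    // Assumption: out-of-bounds or reversed ranges are errors, not clamped.
    if (start < 1 || end < start || end > pageCount) {
      throw new Error(`Page range out of bounds: "${part}"`);
    }
    for (let page = start; page <= end; page++) {
      pages.add(page); // the Set removes duplicates from overlapping ranges
    }
  }
  return Array.from(pages).sort((a, b) => a - b);
}

// parsePageRanges("1-3, 7, 10-12", 20) returns [1, 2, 3, 7, 10, 11, 12]
```

Deduplicating and sorting means overlapping input such as 1-3, 2-4 still renders each page only once.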
Use 2x for most scanned documents. Try 3x for small print, but remember that sharper rendering uses more memory and takes longer.
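The memory cost of a higher scale grows with the square of the scale, because both width and height are multiplied. The rough estimate below assumes a US Letter page (612 x 792 PDF points), one pixel per point at 1x, and an uncompressed RGBA bitmap at 4 bytes per pixel. These are illustrative assumptions, and `estimateBitmapBytes` is a hypothetical name, not part of the tool.

```typescript
// Hypothetical estimate of the raw bitmap size for one rendered page.
// Assumes 1 pixel per PDF point at scale 1 and 4 bytes per pixel (RGBA).
function estimateBitmapBytes(widthPt: number, heightPt: number, scale: number): number {
  const widthPx = Math.ceil(widthPt * scale);
  const heightPx = Math.ceil(heightPt * scale);
  return widthPx * heightPx * 4;
}

const at2x = estimateBitmapBytes(612, 792, 2); // 1224 x 1584 px = 7,755,264 bytes
const at3x = estimateBitmapBytes(612, 792, 3); // 1836 x 2376 px = 17,449,344 bytes
const growth = at3x / at2x;                    // 2.25, i.e. (3 / 2) squared
```

Under these assumptions, moving from 2x to 3x more than doubles the pixels that have to be rendered and read for every page.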
Choose the language that appears in the document, then use Smart cleanup for paragraphs or Trim lines for forms, lists, invoices, and tables.
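The difference between the two cleanup modes can be pictured with a small sketch. This is a guess at the general idea, not the tool's implementation: it assumes Smart cleanup joins wrapped lines into paragraphs separated by blank lines, and that Trim lines keeps each line separate and only strips surrounding whitespace. The function names `smartCleanup` and `trimLines` are hypothetical.

```typescript
// Assumed behaviour for illustration only, not the tool's actual code.

// Keep every line on its own row (useful for forms, lists, invoices, tables).
function trimLines(text: string): string {
  return text
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line !== "")
    .join("\n");
}

// Join wrapped lines into paragraphs; blank lines mark paragraph breaks.
function smartCleanup(text: string): string {
  return text
    .split(/\r?\n\s*\r?\n/)
    .map((paragraph) =>
      paragraph
        .split(/\r?\n/)
        .map((line) => line.trim())
        .filter((line) => line !== "")
        .join(" ")
    )
    .filter((paragraph) => paragraph !== "")
    .join("\n\n");
}

// smartCleanup("The quick brown\nfox jumps.\n\nSecond para\ncontinues.")
//   returns "The quick brown fox jumps.\n\nSecond para continues."
// trimLines("  Item A   12.00 \n\n Item B  3.50  ")
//   returns "Item A   12.00\nItem B  3.50"
```

Keeping line breaks intact is what makes the second mode a better fit for row-oriented content, where joining lines would merge separate entries.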
Each PDF page is rendered as an image and then read by OCR. You can pause or cancel long jobs while keeping pages that already finished.
V1 exports extracted text as TXT, PDF, DOCX, CSV, JSON, or ZIP. It does not create an invisible searchable layer over the original scanned PDF.
PDF OCR accuracy depends on the page image embedded in the PDF. A clean scan at a readable resolution is much easier to recognize than a tilted phone scan saved as a PDF.
Scanned forms, printed documents, worksheets, invoices, receipts, and image-only PDFs with straight pages and readable text.
Low-resolution scans, sideways pages, handwriting, stamps, multi-column tables, faded copies, and pages photographed in poor lighting.
High confidence means the rendered page was readable. Low confidence usually means the page image is blurry, skewed, too small, or visually noisy.
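One way to act on confidence is to flag low-scoring pages for manual review or a re-run at a higher render scale. The sketch below assumes each page result carries a confidence score from 0 to 100, as many OCR engines report. The `PageResult` shape, the function name, and the threshold of 70 are illustrative assumptions, not values from the tool.

```typescript
// Hypothetical result shape; the 0-100 confidence scale is an assumption.
interface PageResult {
  page: number;
  confidence: number;
}

// Returns the page numbers whose confidence falls below the threshold.
// The default of 70 is an arbitrary illustrative cut-off.
function pagesNeedingReview(results: PageResult[], threshold: number = 70): number[] {
  return results
    .filter((result) => result.confidence < threshold)
    .map((result) => result.page);
}

const sampleResults: PageResult[] = [
  { page: 1, confidence: 93 },
  { page: 2, confidence: 58 },
  { page: 3, confidence: 71 },
];
// pagesNeedingReview(sampleResults) returns [2]
```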
PDF OCR is for scanned PDFs, image-only PDFs, forms, worksheets, receipts, and documents where text cannot be selected. Pages are rendered locally and then recognized with OCR.
Run OCR locally, review the editable text, then copy or export the result in the format that fits your workflow.
OCR selected pages from scanned forms, applications, archived records, and document packets without uploading the PDF.
Extract text from scanned worksheets or book pages, then export a clean study note as TXT, PDF, or DOCX.
Read totals, vendor names, dates, and line-item text from image-only PDF receipts before saving structured exports.
Use 2x render scale for most scans, and try 3x for better recognition on small text.
OCR only the pages you need when the PDF is large.
V1 creates extracted text exports, not a searchable overlay on the original PDF.
V1 exports the extracted text as a new PDF or DOCX file (TXT, CSV, JSON, and ZIP are also available). It does not add an invisible searchable text layer over the original scan.
No. PDF pages are rendered and recognized by OCR in your browser, so the PDF is not uploaded.
Yes. Use ranges such as 1-3, 7, 10-12 before starting OCR.
Each selected page is rendered to an image and then recognized by OCR, which is CPU-heavy.