How does the image to text OCR work in my browser?
The tool uses a JavaScript port of an OCR engine that runs entirely on your device. When you drop an image onto the page, the engine analyses each region, identifies characters and outputs the recognised text. The first run downloads the OCR model, which takes a few seconds, and after that processing is fast. Because everything runs locally, your image and the recognised text never leave your computer, which keeps sensitive documents private.
Which languages can the OCR recognise?
Out of the box the tool supports more than 100 languages, including English, Spanish, French, German, Italian, Portuguese, Dutch, Polish, Russian, Arabic, Hindi, Chinese, Japanese and Korean. Pick the language from the dropdown before scanning so the engine knows which character set to use. Mixed language documents work best when the dominant language is selected, although the engine can pick up some words in the secondary language as long as they share a script.
What kinds of images give the best OCR results?
High contrast images with clear printed text and at least 300 pixels per inch produce the most accurate results. Straighten skewed scans before uploading, since rotation affects accuracy. Handwriting is hit or miss because the engine is trained on printed fonts. If a result looks wrong, increase the contrast in the image filters tool, then run the OCR again. Avoid extracting text from photos with strong perspective distortion, since they confuse the line detection step.
Can I OCR a PDF file or only image files?
The tool accepts image files like PNG, JPG and WebP. PDFs need to be converted to images first, either using a separate PDF to image tool or by exporting individual pages as PNG from a PDF reader. Once you have the images, drop them all in at once and the OCR runs page by page, concatenating the recognised text into a single editable result. For multi page exports, save the recognised text as a single TXT file when you are done.
Is my image uploaded for OCR processing?
No. The OCR engine runs locally in your browser, so the image and the recognised text stay on your device. The model files are downloaded from a CDN the first time you run a scan, but no part of your document is sent anywhere. This makes the tool safe for confidential paperwork like contracts, receipts and identity documents that you do not want to put on a third party OCR service. The page works offline after the first run.
In what formats can I export the recognised text?
You can copy the text to the clipboard, download it as a plain TXT file, or save it as a Word compatible DOCX. There is also an option to keep line breaks and paragraph spacing as they appear in the image, or to flow everything into a single block. For tables, the engine recognises rows and columns as best it can, but expect to clean up the result by hand because OCR is rarely perfect on complex layouts.