About

This Optical Character Recognition (OCR) engine transforms raster images into editable machine-encoded text. Unlike server-side converters, this tool processes sensitive documents entirely within your browser using WebAssembly technology, ensuring zero data leakage.

Accuracy in OCR is a function of image contrast and resolution. This utility includes a Pre-processing Pipeline that applies grayscale conversion, binarization, and contrast adjustment before the neural network analysis. This dramatically improves character recognition rates on low-quality scans or unevenly lit photographs.

Formulas

The pre-processing engine uses the Luma formula to convert RGB values to grayscale, matching human perception:

Y = 0.299R + 0.587G + 0.114B

Contrast adjustment is applied using the following transfer function, where C is the contrast factor (range -255 to 255) and F is the correction factor:

F = 259C + 255255259 − C

The pixel intensity I_new is calculated from the current intensity I_old:

I_new = clampFI_old − 128 + 128

Reference Data

Language	Code	Script Type	Accuracy Rating (Avg)
English	eng	Latin	99.1%
Spanish	spa	Latin	98.5%
French	fra	Latin	98.2%
German	deu	Latin	97.8%
Portuguese	por	Latin	98.4%
Italian	ita	Latin	98.0%
Russian	rus	Cyrillic	96.5%
Chinese (Simp)	chi_sim	Logographic	94.2%
Japanese	jpn	Logographic	93.8%
Arabic	ara	Abjad	91.5%
Hindi	hin	Devanagari	92.0%
Turkish	tur	Latin	97.5%
Polish	pol	Latin	97.2%
Dutch	nld	Latin	98.1%
Swedish	swe	Latin	98.3%

Frequently Asked Questions

Garbled text usually indicates poor image quality, low resolution, or complex fonts. To fix this: 1) Use the "Preprocessing" tab to increase Contrast and enable Binarization. 2) Use the "Crop" tool to select only the text area, excluding headers or images. 3) Ensure the correct Language is selected.

No. This tool uses a Wasm (WebAssembly) implementation of the Tesseract engine. All processing happens locally on your device's CPU/GPU. Your data never leaves your browser.

OCR engines are optimized for printed fonts (serif/sans-serif). While it may detect neat block handwriting, cursive or messy scripts will likely result in a high error rate.

Screenshots often have low DPI (72-96). Upscaling the image or using the "Sharpen" pre-processor (if available) can help. Also, ensure you are not capturing mixed languages without selecting the appropriate multi-language pack.