Optical Character Recognition (OCR) is the mechanical or electronic conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo or from subtitle text superimposed on an image.
Widely used as a form of information entry from printed paper data records – whether passport documents, invoices, bank statements, computerized receipts, business cards, mail, printouts of static-data, or any suitable documentation – it is a common method of digitizing printed texts so that they can be electronically edited, searched, stored more compactly.
API Den's OCR service supports more than 120 languages. The codes and descriptions of the available languages are as follows.
Supported File Types
We currently support JPG, PNG and TIF. Multi-page PDF support is on the way!
OCR output can be displayed in plain text or XHTML format. The XHTML output contains the coordinates and various information of the words on the page.
Below is a part of XHTML output of a document.
Let's use this sample file. - JPG, 822K
"fileBase64" : "..."
apiKey: Your API key.
language: Language of the document. [Default: eng]
outputFormat: Accepted values: "text", "xhtml". [Default: "text"]
fileBase64: Base64 encoded output of file contents.
Expected response for XHTML output format:
If we choose "text" for the output format, the output would look like this.
* Composer package will be available in few days.