Build the workflows you need without worrying about security. We don’t store any document data, and our API endpoints are served through encrypted connections.
Easy Integration
Get up and running in hours, not weeks. Access well-documented APIs and code samples that make integrating with your existing workflows a snap.
Robust and Flexible
With access to more than 30 tools, you can process one document in multiple ways by using API credits. Generate PDF from HTML, convert Word, Excel, PowerPoint and image files to PDF, and more.
Simple and Transparent Pricing
Select a package that suits your needs according to the number of credits you wish to spend. Each API tool and action has a specific credit cost.
Try It Out
This example will run English language OCR on your uploaded document, making any text in the document selectable and searchable.
Add a scanned PDF named document.pdf to your project folder. You can also use our sample file.
The file name is case sensitive. Make sure the file name matches the file name in the sample code.The file name is case sensitive. Make sure the file name matches the file name in the sample code.
The following section will walk you through how to best make use of all the functionality the OCR API provides.
The Basics of OCR
Let’s start with the basics. The OCR API allows you to take files in any supported format and make any text in
them selectable and searchable. This is useful for images and scanned documents. To learn more about OCR itself,
see
here.
For our first example, we’ll run OCR on a single image file. To do this, add a page1.jpg file to
the same folder as your code. You can use any image containing text, or use our provided
sample page.
Run the code, and you’ll get a result.pdf with your page OCRed. Our example sets the OCR language
to English. If your content is in a different language, make sure to update the language property
accordingly. You can find a list of all supported languages at the
bottom of this page.
While running OCR on a single page is useful, often you’ll have a folder full of scanned pages that you want to
both run OCR on and merge into a single searchable PDF.
Luckily, this is easy; pass in multiple images — one for each page in your request — and Nutrient DWS API will merge
all of them into a PDF before running OCR on it.
Add more files in the same folder as your code and run the updated code. You can duplicate and rename the
existing file you have, or you can add some other images containing text.