OCR Supported Languages in Java

The OCR component offered by many of our products supports a wide variety of languages. The identification of text in a given document requires understanding of the specific language and its symbols and rules (for example, the language’s use of ligatures and punctuation rules). Because of this, our OCR component requires you to pass in the language of the processed document as part of its configuration.

Below is an exhaustive list of the languages the OCR component supports on all supported platforms.

  • Croatian

  • Czech

  • Danish

  • Dutch

  • English

  • Finnish

  • French

  • German

  • Indonesian

  • Italian

  • Malay

  • Norwegian

  • Polish

  • Portuguese

  • Serbian

  • Slovak

  • Slovenian

  • Spanish

  • Swedish

  • Turkish

  • Welsh

Please note that languages aren’t region specific. For example, you’d use English for both American English and British English.

If you don’t see the language you need in this list, please contact support.