This processor executes OCR on documents after a Tika processor has run over the documents.

OCR is run only when Tika failed to extract data from a given PDF file.

Configuration

Example configuration in a processor:

{
  "parser": {
    "key": "/file"
  },
  "name": "OCR Processor",
  "active": true,
  "id": "b25f9a02-a8ca-471c-858e-51853c9e76a6",
  "type": "ocr-processor"
}

Configuration parameters:

parser.key

(Required, String) Record field where the document can be found