This processor executes OCR on documents after a Tika processor has run over the documents.
OCR is run only when Tika failed to extract data from a given PDF file.
Configuration
Example configuration in a processor:
{
"parser": {
"key": "/file"
},
"name": "OCR Processor",
"active": true,
"id": "b25f9a02-a8ca-471c-858e-51853c9e76a6",
"type": "ocr-processor"
}
Configuration parameters:
parser.key
(Required, String) Record field where the document can be found