...
{
  "parser": {
    "metadata": true,
    "key": "/input",
    "contentTypeField": "/metadata/content-type",
    "defaultEncoding": "UTF-8",
    "output": {
      "field": "outputFieldName",
      "toStorage": true
    },
    "extraction": {
      "type": "xpath",
      "xpathQuery": "/xhtml:html/xhtml:body//node()"
    }
  },
  "name": "Tika Processor",
  "active": true,
  "id": "b25f9a02-a8ca-471c-858e-51853c9e76a6",
  "type": "tika-processor"
}
...
(Required, String) Record field where the document can be found
parser.contentTypeField
(Optional, String) Record field with the content type to use during parsing
parser.defaultEncoding
(Optional, String) Encoding to use for the extracted text. Default is "UTF-8".
parser.output.field
(Optional, String) Field where the extracted content should be placed. Default is "tika".
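
The defaults described above can be illustrated with a small sketch. This is not the processor's own code: the function name `resolve_parser_settings` and the returned dictionary layout are hypothetical, and it assumes the required document field is the `key` entry shown in the example configuration. Only the two default values ("UTF-8" and "tika") come from the descriptions above.

```python
def resolve_parser_settings(config: dict) -> dict:
    """Return the effective parser settings for a processor configuration.

    Hypothetical helper for illustration only; it mirrors the documented
    defaults and is not part of the processor itself.
    """
    parser = config.get("parser", {})

    # Assumption: the required "record field where the document can be found"
    # is the "key" entry from the example configuration.
    if "key" not in parser:
        raise ValueError("parser.key is required")

    output = parser.get("output", {})
    return {
        "key": parser["key"],
        # Optional, no documented default, so None when absent.
        "contentTypeField": parser.get("contentTypeField"),
        # Documented default: "UTF-8".
        "defaultEncoding": parser.get("defaultEncoding", "UTF-8"),
        # Documented default: "tika".
        "outputField": output.get("field", "tika"),
    }


if __name__ == "__main__":
    # Only the required field is set, so both documented defaults apply.
    print(resolve_parser_settings({"parser": {"key": "/input"}}))
```

With only `key` set, the sketch falls back to "UTF-8" for the encoding and "tika" for the output field, which matches the defaults listed in the parameter descriptions.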
...