Page Comparison

...

{
  "parser": {
    "metadata": true,
    "key": "/input",
    "contentTypeField": "/metadata/content-type",
    "defaultEncoding": "UTF-8",
    "output" : {
      "field": "outputFieldName",
      "toStorage": true
    },
    "extraction" : {
      "type" : "xpath",
      "xpathQuery" : "/xhtml:html/xhtml:body//node()"
    }
  },
  "name": "Tika Processor",
  "active": true,
  "id": "b25f9a02-a8ca-471c-858e-51853c9e76a6",
  "type": "tika-processor"
}

...

(Required, String) Record field where the document can be found

parser.contentTypeField

(Optional, String) Record field with the content type to use during parsing

parser.defaultEncoding

(Optional, String) Encoding to use for the extracted text. Default is "UTF-8"

parser.output.field

(Optional, String) field where extracted content should be placed. Default is "tika".

...

Version	Old Version 1	New Version 2
Changes made by	Javier Mendez (Unlicensed)	Continuous Integration [bot]
Saved on	Jun 06, 2022	Jun 06, 2022

Versions Compared

Key