...
{
"servers": [
{
"host": "website",
"port": 5000,
"path": "yake"
}
],
"name": "Keyword Extraction Processor",
"active": true,
"type": "keyword-extraction-processor",
"sourceField": "source",
"language": "en",
"max_ngram_sizemaxNgramSize": 3,
"min_ngram_sizeminNgramSize": 2,
"number_of_keywordsmaxNumberOfKeywords": 20,
"deduplication_algo": "seqm",
"outputField": "output",
"id": "efe35dc7-fa16-4787-9362-db23395c96e8"
}
...
(Optional, String) The language of the text to be processed. Default: "en"
max_ngram_size
maxNgramSize
(Optional, int) Max contiguous sequence of items. Default: 3
min_ngram_size
minNgramSize
(Optional, int) Min contiguous sequence of items. Default: 2
number_of_keywords
maxNumberOfKeywords
(Optional, int) Total of keywords to be extracted. Default: 20
...
(Optional, String) Function that evaluates the recordset for duplicate records, with options being "leve", "jaro" and "seqm". Default: "seqm"
outputField
...