...

{
  "servers": [
    {
      "host": "website",
      "port": 5000,
      "path": "yake"
    }
  ],
  "name": "Keyword Extraction Processor",
  "active": true,
  "type": "keyword-extraction-processor",
  "sourceField": "source",
  "language": "en",
  "maxNgramSize": 3,
  "minNgramSize": 2,
  "maxNumberOfKeywords": 20,
  "deduplication_algo": "seqm",
  "outputField": "output",
  "id": "efe35dc7-fa16-4787-9362-db23395c96e8"
}

...

(Optional, String) The language of the text to be processed. Default: "en"

maxNgramSize

(Optional, int) Maximum n-gram size: the largest number of contiguous words a keyword may contain. Default: 3

minNgramSize

(Optional, int) Minimum n-gram size: the smallest number of contiguous words a keyword may contain. Default: 2

maxNumberOfKeywords

(Optional, int) Maximum number of keywords to extract. Default: 20

...

(Optional, String) Similarity function used to detect duplicate keywords. Options: "leve", "jaro", and "seqm". Default: "seqm"

outputField

...
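
The optional settings and defaults listed above can be checked before a configuration is submitted. The following is a minimal sketch, not part of the processor: `resolve_options`, `DEFAULTS`, and `DEDUP_OPTIONS` are hypothetical names, and the rules that the integer settings must be positive and that minNgramSize must not exceed maxNgramSize are assumptions rather than documented behavior.

```python
# Hypothetical helper: fills in the documented defaults and rejects
# values that fall outside the documented options.
DEFAULTS = {
    "language": "en",
    "maxNgramSize": 3,
    "minNgramSize": 2,
    "maxNumberOfKeywords": 20,
    "deduplication_algo": "seqm",
}

# Options listed in the parameter reference for deduplication_algo.
DEDUP_OPTIONS = {"leve", "jaro", "seqm"}


def resolve_options(config):
    """Return the optional settings of `config` with defaults applied."""
    overrides = {k: v for k, v in config.items() if k in DEFAULTS}
    resolved = {**DEFAULTS, **overrides}

    for key in ("maxNgramSize", "minNgramSize", "maxNumberOfKeywords"):
        value = resolved[key]
        # Assumption: these settings must be positive integers.
        if isinstance(value, bool) or not isinstance(value, int) or value < 1:
            raise ValueError(f"{key} must be a positive integer, got {value!r}")

    # Assumption: the minimum n-gram size may not exceed the maximum.
    if resolved["minNgramSize"] > resolved["maxNgramSize"]:
        raise ValueError("minNgramSize must not exceed maxNgramSize")

    if resolved["deduplication_algo"] not in DEDUP_OPTIONS:
        raise ValueError(
            f"deduplication_algo must be one of {sorted(DEDUP_OPTIONS)}"
        )

    return resolved


if __name__ == "__main__":
    # Only maxNgramSize is overridden; the rest fall back to the defaults.
    print(resolve_options({"sourceField": "source", "maxNgramSize": 4}))
```

Settings that have no documented default here, such as sourceField and outputField, are deliberately left out of the resolved result.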