Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 6 Current »

Chemical Tagger Processor

This processor takes input text from a field and extracts chemical elements using Oscar4 (Open Source Chemistry Analysis Routines) which is an open source extensible system for the automated annotation of chemistry in scientific articles. It can be used to identify chemical names, reaction names, ontology terms, enzymes and chemical prefixes and adjectives, and chemical data such as state, yield, IR, NMR and mass spectra and elemental analyses.

For more information about Oscar4: https://github.com/BlueObelisk/oscar4

Configuration

Example configuration in a processor:

{
  "key": "text",
  "structureType": "SMILES"
}

Configuration parameters:

key

(Required, String) field with the text.

structureType (Optional, String) format used to describe the structure of chemicals. Defaults to SMILES.

Available values: SMILES, STD_INCHI, STD_INCHI_KEY, CML.

Input/Output examples

Input

{
  "text": "The quick brown ethyl acetate jumps over the lazy bromine"
}

Output

{
  "chemicals": [
    {
      "name": "ethyl acetate", 
      "type": "COMPOUND", 
      "structure": "CCOC(C)=O"
    },
    {
      "name": "bromine",
      "type": "COMPOUND",
      "structure": "[Br]"
    }
  ]
}
  • No labels