This guide is a step-by-step manual for creating and using a training set that can be applied for the trainable calculations of Calculator Plugins (pKa, logP).
The training of the plugin starts with a file where you collect your experimental data (step 1, see below). From this, you generate a file which serves as knowledge base for the plugin (step 2), that you have to copy to a directory where the plugin picks it for usage (step 3). The plugin calculation will consider this knowledge base file if you turn on this option when running the calculation (step 4).
In this example this file is mydata.sdf.
The picture below is a detail from the training file. ID1 is the index
of the atom with the experimental pKa1 value (ID2 would
be the index of the second measured pKa value /pKa2/, etc.).
This atom index can be viewed by checking the Atom number option in the molecule editor (menu: View->Misc).
cxcalc -T pka -o [path] mydata.sdf(option
-o
gives the location of the folder
creates.) A 'pKaReg' folder will be created containing the training data.
Create a folder called config in the Marvin installation directory.
In Marvin, check the Use correction library box to activate the training option: |
pKa calculation without training data | pKa calculation with training data |
---|---|
cxcalc
:
cxcalc pka -c "NC1=CC(=CC=C1)C(O)=O" -a 3 -b 3
-c
use the correction library-a
generate the n highest value of acidic pK
a (in this example 3)-b
generate the n highest value of basic pK
a (in this example 3)For more options see this page.
cxcalc -T logP -t LOGP -o logPparameters.txt trainingset.sdfWith the
-o
option you can define a path for the file generated.
Create a folder called config in the Marvin installation directory