Markush DARC format

Codename: vmn

Contents

Import from VMN format

Markush structure files complying the Markush DARC format are processed by Marvin, though with some limitations. Interpretation of the main VMN features are listed below. The AMN file is looked for in the directory of the VMN file, with the same name and .amn extension (e.g. the AMN file of 46mrk001.vmn is 46mrk001.amn). The atom attributes and the AMN text notes are stored in Marvin atom properties, as described below.

Interpretation of VMN features

Structure shortcuts (abbreviated groups)

The following structure shortcuts (abbreviated groups) are supported:

C2, C3, ..., C50
ACEBUCNCO1CO2
COIETIBUIPRMBENBU
NO2NPROBEPBEPHPO3
PO4SBUSO2SO3TBU

Amino acids (peptides)

The following standard amino acids (peptide abbreviated groups) are supported:

ALAARGASNASPCYSGLN
GLUGLYHISILELEULYS
METPHEPROSERTHRTRY
TYRVAL

The following non-standard peptides are also supported:

ABUaminobutyric acid
ASUaminosuberic acid
GLPpyroglumatic acid
HCYhomocysteine
HSEhomoserine
NLEnorleucine
NVAnorvaline
ORNornithine
SARsarcosine
STAstatine

Note, that peptide connection bonds are not handled currently, therefore peptide sequences may not be correct.

For more information on peptide representation refer to the Peptide import documentation.

Superatoms (homology pseudo atoms)

Superatoms representing homology groups are read in as pseudo atoms. The following homologies are interpreted by enumeration and search:

CHKCHECHYCYCARYHET
HEAHEFUNKMXAMXA35
TRMLANACTHALACYPRT
XX

For a detailed description of the interpretation, refer to the Homology groups in Markush structures manual.

Multiple R-group attachments

From Marvin 5.4 multiple R-group attachments are fully supported by the new R-group attachment representation:

R-group attachment representation

VMN format corrections

Limitations

Repeating units with repetition ranges

The number of elements in a repetition range is limited to 10 (e.g. range M100=2,5- is interpreted as M100=2,5-13). Repeating units with more than 4 crossing bonds are not processed by search and enumeration.

Superatoms (homologies) that are not supported

The following superatoms (homologies) are not supported by search and enumeration (but read in and displayed as pseudo-atoms):

POLPEGDYEPRT

The detailed description of homology interpretation is described in the Homology groups in Markush structures manual.

Atom attributes that are not processed by search and enumeration

The following atom attributes are not processed by search and enumeration (but displayed in atom labels):

PAPolymer indicator
SPPosition indicator

Import options

o Import runs in the original mode. VMN format corrections are not executed.

Export to VMN format

VMN export is not available yet.

References

  1. The Markush DARC Format, T Ferns, Internal Technical Report, Thomson Scientific, 2002
  2. Derwent World Patents Index, Markush DARC User Manual, The Thomson Corporation, 1993, 2008