MOL V2000 files
- Atom block:
- x, y, z coordinates
- atom type:
- 1H, 2He, 3Li, ..., 103Lr,
- atom list and exclusive list L,
- "any" atoms A, Q, *,
- lonely pair LP
- R-Group, R or RN, where N > 0 integer. Before version 5.9, R, without number, was written as R#
- charge
- stereo care box
- valence
- atom-atom mapping (for reactions)
- inversion/retention flag (for reactions)
Codename: mol
Extension: .mol
- Bond block:
- bond type: 1, 2, 3, aromatic, "any", "single or double", "single or aromatic", "double or aromatic" or "hydrogen"
- bond stereo information: up or down
- bond topology: ring or chain
- Properties block:
M ALS
- atom list and exclusive listM APO
- Rgroup attachment pointM CHG
- chargeM RAD
- radicalM ISO
- isotope mass numbersM RGP
- Rgroup labels on root structureM LOG
- Rgroup logicM LIN
- link nodesM SUB
- substitution count query property (s)M UNS
- unsaturated atom query property (u)M RBC
- ring bond count query property (rb)M STY
- Sgroup typeM SST
- Sgroup subtypeM SCN
- Sgroup connectivity (head-to-head, head-to-tail or either/unknown)M SAL
- atoms that define the SgroupM SPA
- multiple group parent atom list (paradigmatic repeating unit atoms)M SBL
- Sgroup's crossing bondsM SMT
- Sgroup labelM SPL
- Sgroup parent listM SDS EXP
- Sgroup expansionM SDT
- Data sgroup field descriptionM SDD
- Data sgroup display informationM SCD
- Data sgroup dataM SED
- Data sgroup data end of lineM SNC
- Sgroup component numbersM CRS
- Sgroup correspondenceM SDI
- display coordinates in each S-group bracketM SBT
- the displayed S-group bracket styleM SAP
- the S-group attachment point informationM MRV SMA
- SMARTS H, X, R, r, a, A properties (Marvin extension)A
- Atom aliasV
- Atom value
Extended MOLfiles (V3000)
If the number of atoms or bonds in a molecule exceeds 999, the extended format is used. In an extended MOLfile, the following properties and features are supported:
- Atom block:
- x, y, z coordinates
- atom type:
- 1H, 2He, 3Li, ..., 103Lr,
- "any" atoms A, Q, *,
- lonely pair LP
- atom-atom mapping (for reactions)
- inversion/retention flag (INVRET)
CHG
- chargeRAD
- radicalCFG
- parityVAL
- valenceMASS
- isotope mass numberHCOUNT
- number of implicit hydrogensSTBOX
- stereo care boxINVRET
- inversion/retention flagATTCHPT
- R-group attachment pointRGROUPS
- R-groups that comprise this R# atomSUBST
- Substitution count query property (s)UNSAT
- Unsaturated atom query property (u)RBCNT
- Ring bond count query property
Restriction: only one R-group can comprise an atom in Marvin
Codename: mol:V3
Extension: .mol
- Bond block:
- bond type: 1, 2, 3, aromatic, "any", "single or double", "single or aromatic", "double or aromatic"
CFG
- bond stereo configuration: up or downTOPO
- bond topology: ring or chainSTBOX
- stereo care box
LINKNODE
- Link nodes.- Sgroup block:
ATOMS
- atoms that define the SgroupPATOMS
- multiple group parent atom list (paradigmatic repeating unit atoms)XBONDS
- crossing bondsMULT
- multiple group multiplierCONNECT
- connectivity (head-to-head, head-to-tail or either/unknown)LABEL
- display labelPARENT
- parent SgroupESTATE
- expanded stateFIELDNAME
- data Sgroup field nameFIELDINFO
- data Sgroup field information (type and units)FIELDDISP
- data Sgroup field display informationQUERYTYPE
- data Sgroup program query codeQUERYOP
- data Sgroup query operatorFIELDDATA
- data Sgroup field valueBRKXYZ
- display coordinates in each S-group bracketBRKTYP
- the displayed S-group bracket styleCOMPNO
- Sgroup component numbersCBONDS
- Sgroup's crossing bondsXBHEAD, XBCORR
- Sgroup correspondenceSUBTYPE
- Sgroup subtypeCLASS
- Sgroup class; AA: amino acidCSTATE
- Sgroup contracted stateESTATE
- Sgroup expanded stateSAP
- the S-group attachment point information
Enhanced stereo features, see also the V3ec and V3ea export options.MDLV30/STEABS
- ABSOLUTE stereochemical groupMDLV30/STEREL
- OR stereochemical groupMDLV30/STERAC
- AND stereochemical group
MDLV30/HILITE
- Highlighted atoms and bonds, currently as represented as atom/bond set 1. (This feature is experimental and import only!)
- Rgroup blocks with
RLOGIC
entries - Template block (import only)
Reaction files (V2000)
A reaction file consists of a REACTANT block, a PRODUCT block, and (optionally) an AGENT block. Reaction files containing reaction agents are non-standard.
Each block starts with 'Molecule or Reaction Identifier'. The form of a molecule identifier must be one of the following:
Codename: rxn
Extension: .rxn
$MFMT $MIREG N $MFMT $MEREG N $MIREG N $MEREG N.
Here $MFMT
means that a molecule is given in a molfile format, $MIREG N
is the internal and $MEREG N
is the external registry number of the molecule. Similarly, the identifier has the following form,
$RFMT $RIREG N $RFMT $REREG N $RIREG N $REREG N.
Here $RFMT
means that a reaction is given in a rxnfile format, $RIREG N
is internal and $REREG N
is the external registry number of the reaction.
A reaction agent is a molecule structure that does not take part in the chemical reaction, but is added to the reaction equation for informative purpose only. Agents are normally displayed graphically above the reaction arrow, added to the reaction file after the reactants and the products. The number of agents is displayed in the file header (after the number of reactants and the number of products) if it is non-zero. Reaction files containing agents are non-standard.
Extended reaction files (V3000)
Codename: rxn:V3
Extension: .rxn
SD Files
In SDfiles read by marvin, the name field is special, it overrides the molecule name specified in the molfile part.
Incompatibility note: The MDL definition declares the maximal line length for molecule properties in 200 characters. We ignore this limit.
Codename: .sdf
Extension: .sdf
RG Files
A special feature of Marvin RGfiles is that they can contain a reaction as the root structure. This feature is non-standard, such mixed RG/Rxnfiles can only be imported by Marvin.
Codename: rgf
Extension: .rgf