public class DocumentToStructure extends Object
Modifier and Type | Field and Description |
---|---|
static String | BYTE |
static String | CHARACTER: The starting character offset from the beginning of the document, for text formats (html, xml, txt). |
static String | CONFIDENCE: The confidence that the structure is correct. |
static String | CONTEXT: The context of the structure recognized in the text. |
static String | CONTEXT_INDEX: Index of the hit inside the context. |
static String | DOC_AUTHOR |
static String | DOC_CREATION_DATE |
static String | DOC_LAST_AUTHOR |
static String | DOC_PATENT_ASSIGNEES: The assignees of the patent, separated by newline characters. |
static String | DOC_PATENT_ID |
static String | DOC_PATENT_INVENTORS: The inventors of the patent, separated by newline characters. |
static String | DOC_PATENT_IPC: The IPC classification(s) for the patent, separated by newline characters. |
static String | DOC_PATENT_IPCR: The IPCR classification(s) for the patent, separated by newline characters. |
static String | DOC_TITLE |
static String | DOCUMENT: The file name of the source document. |
static String | DOCUMENT_METADATA |
static String | END_CHARACTER: The ending character offset from the beginning of the document, for text formats (html, xml, txt). |
static String | PAGE: The page number, if applicable. |
static String | SECTION: The section of the document where the structure was found. |
static String | SOURCE_TEXT: The source text, as it appears in the original document. |
static String | TYPE: The type of source for the structure. |
static String | TYPE_CAS: CAS number. |
static String | TYPE_CDX: Embedded ChemDraw structure. |
static String | TYPE_COMMON: Common name. |
static String | TYPE_GENERIC: Generic name, for instance "C1-C4 alkyl". |
static String | TYPE_INCHI: InChI string. |
static String | TYPE_ION: Ion abbreviation, for instance K+ or Ca2+. |
static String | TYPE_MRV: Embedded ChemAxon MRV structure. |
static String | TYPE_OSR: Structure image recognized by Optical Structure Recognition. |
static String | TYPE_PEPTIDE: Peptide notation, for instance Val-Gly-Ser-Ala. |
static String | TYPE_SMILES: SMILES string. |
static String | TYPE_SYMYX: Embedded Symyx/ISIS Draw structure. |
static String | TYPE_SYSTEMATIC: Systematic name. |
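
Several of the patent fields above (DOC_PATENT_ASSIGNEES, DOC_PATENT_INVENTORS, DOC_PATENT_IPC, DOC_PATENT_IPCR) pack multiple values into one string, separated by newline characters. The following is a minimal sketch of splitting such a value into individual entries. It assumes the raw string has already been obtained from wherever the field value is stored; the class and method names (`PatentFieldValues`, `splitEntries`) and the sample data are hypothetical and not part of this API.

```java
import java.util.ArrayList;
import java.util.List;

class PatentFieldValues {

    // Hypothetical helper: splits a newline-separated field value
    // (for example the value stored under DOC_PATENT_INVENTORS)
    // into its individual entries, skipping blank lines.
    static List<String> splitEntries(String value) {
        List<String> entries = new ArrayList<>();
        if (value == null) {
            return entries; // treat a missing field as "no entries"
        }
        // \R matches any line break sequence (\n, \r\n, \r).
        for (String part : value.split("\\R")) {
            String trimmed = part.trim();
            if (!trimmed.isEmpty()) {
                entries.add(trimmed);
            }
        }
        return entries;
    }

    public static void main(String[] args) {
        // Made-up sample value, for illustration only.
        String inventors = "Doe, Jane\nRoe, Richard";
        for (String inventor : splitEntries(inventors)) {
            System.out.println(inventor);
        }
    }
}
```

Trimming and skipping empty entries is a defensive choice here, not documented behavior; drop it if exact values matter.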
Constructor and Description |
---|
DocumentToStructure() |
Modifier and Type | Method and Description |
---|---|
static MolImporter | process(String text) |
static MolImporter | process(String text, String options): Creates a MolImporter instance to import structures for a given text. |
public static final String SOURCE_TEXT
public static final String DOCUMENT
public static final String PAGE
public static final String CHARACTER
public static final String END_CHARACTER
public static final String BYTE
public static final String DOC_AUTHOR
public static final String DOC_LAST_AUTHOR
public static final String DOC_TITLE
public static final String DOC_CREATION_DATE
public static final String DOC_PATENT_ID
public static final String DOC_PATENT_IPC
public static final String DOC_PATENT_IPCR
public static final String DOC_PATENT_ASSIGNEES
public static final String DOC_PATENT_INVENTORS
public static final String DOCUMENT_METADATA
public static final String CONFIDENCE
public static final String SECTION
public static final String CONTEXT
public static final String CONTEXT_INDEX
public static final String TYPE
public static final String TYPE_SYSTEMATIC
public static final String TYPE_COMMON
public static final String TYPE_GENERIC
public static final String TYPE_SMILES
public static final String TYPE_INCHI
public static final String TYPE_CAS
public static final String TYPE_ION
public static final String TYPE_PEPTIDE
public static final String TYPE_CDX
public static final String TYPE_MRV
public static final String TYPE_SYMYX
public static final String TYPE_OSR
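
CHARACTER and END_CHARACTER are described as character offsets from the beginning of the document, so together they delimit the span of a hit in a text-format source. The sketch below shows how such a pair of offsets could be used to recover the matched text from the document. It assumes zero-based offsets with an exclusive end, which this page does not confirm; adjust the arithmetic if the end offset turns out to be inclusive. The class and method names (`HitSpan`, `extract`) and the sample text are hypothetical.

```java
class HitSpan {

    // Hypothetical helper: returns the text between two character offsets.
    // ASSUMPTION: offsets are zero-based and 'end' is exclusive.
    static String extract(String documentText, int start, int end) {
        if (start < 0 || end > documentText.length() || start > end) {
            throw new IllegalArgumentException(
                "Offsets out of range: " + start + ".." + end);
        }
        return documentText.substring(start, end);
    }

    public static void main(String[] args) {
        // Made-up document text and offsets, for illustration only.
        String documentText = "Dissolve aspirin in ethanol.";
        int start = documentText.indexOf("aspirin");
        int end = start + "aspirin".length();
        System.out.println(extract(documentText, start, end)); // prints "aspirin"
    }
}
```

Comparing the extracted span with the value stored under SOURCE_TEXT is one way to check which offset convention applies to a given document.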
public static MolImporter process(String text)
public static MolImporter process(String text, String options)