Documentation PMechDB Article Contact Us

PMechDB Documentation

PMechDB overview


Agricultural chemistry / crop protection
Medicinal chemistry
Environmental chemistry
Protein Structure

The polar mechanism is the most common mechanism in organic and organometallic chemistry. These reactions result from the heterolytic cleavage of bonds and the formation of charged intermediates. Due to the high reactivity of the polar reactants, many of these types of reactions can occur at standard and physiological conditions. Therefore, polar reactions are frequently studied and used in organic, inorganic, organometallic, biological and environmental chemistry. Polar reactions are crucial to many of the most important biological processes, such as the synthesis of peptide bonds which form the backbone of proteins, and they are invaluable to the drug design process. Hence, polar organic reactions are of great importance and their mechanisms and outcomes are worthy of investigation

PMechDB is a live platform for aggregating, curating, and distributing chemcial reactions in the form of elementary polar steps to accelerate research in chemoinformatics and polar reaction modeling. The PMechDB platform is designed to facilitate training deep learning and other AI models in data-driven workflows using its tabular data, with no need for additional pre-processing steps It provides a unified model that ought to facilitate data sharing, model building, dissemination, and publications. We encourage the community to explore and use the PMechDB data and functionalities, and contribute to its expansion.


How to use PMechDB

To download the PMechDB data set, the user must enter his information and email address. Upon reading and agreeing to the terms of the CC-BY-NC-ND license, the user will receive an email containing the PMechDB data set. The PMechDB data set is a directory with five comma separated value (csv) files:

    manually_curated_train.csv:   All the manually curated polar elementary steps for training in the PMechDB data set.
    manually_curated_test_challenging.csv:   The challenging manually curated polar elementary steps used to test trained models.
    combinatorial_all.csv:   All the combinatorially generated polar elementary steps in the PmechDB data set.
    combinatorial_train.csv:   The training split of combinatorial elementary steps.
    combinatorial_test.csv:   The testing split of combinatorial elementary steps.


For the manually curated data, the csv file has one column containing the SMIRKS and arrow codes of the elementary step, and one column containing the orbital pair classification. For the combinatorial data, the csv files has five columns:
    1. The SMIKRS and arrow codes of elementary steps.
    2. The solvent of the elementary step.
    3. The temperature (K) of the elementary step.
    4. The sN(N+E) value.
    5. The orbital pair classification.

Read this documentation on how to use Ketcher to draw molecules and reactions.

Upload multiple reaction steps

You must create a comma separated value (.csv) file containing all the steps. Each row of the file must represent a reaction with four columns:

(1) Reaction SMIRKS
(2) Arrow codes
(3) Original source
(4) Auxilary information (e.g. initial energy, special condition such as low pressure)

Here is a file sample that could be uploaded to PMechDB:

Arrow Codes Description:


(1) The "=" character represents where the electrons flow by separating the initial position of the electrons from the final position of the electrons after an arrow is applied.
(2) Any time we have a "," character between two atoms, this represents a bond between the two atoms. For example, "20,21" means a bond between atom 20 and atom 21.
(3) The ";" character separates one arrow from the next arrow in the arrow pushing mechanism.

Here is an example of a PMechDB reaction in the correct format:

Atom Mapping Description:


All polar reaction steps involve formation of a new bond between a nucleophilic functional group and an electrophilic functional group. Resonance interconversions are treated as reaction steps. The NEW bond between the functional groups is formed between an atom mapped as 10 in the nucleophile and 20 in the electrophile. All atoms in the nucleophile that are involved in arrow-pushing are mapped sequentially, with respect to electron flow specification, as 10, 11, 12, 13, etc. All atoms in the electrophile that are involved in arrow-pushing are mapped sequentially, with respect to electron flow specification, as 20, 21, 22, 23, etc. The electron flow represented with curved arrows should follow a chain of mapped atoms that looks like, for example, 15>14>13>12>11>10>20>21>22>23>24>25.

If you could not upload the file you prepared, or the reaction data you collected does not fit the format mentioned above,

You can send the file via email to:

  • Pierre Baldi , Email: pfbaldi [at] uci [dot] edu.
  • Ryan Miller , Email: rjmille3 [at] uci [dot] edu.
The subject of the email must read Upload Reactions to PMechDB.
It would be helpful that within your email, you include the reason(s) why you did not upload the file.
After receveing your email, we perform automatic and manual examination of the reactions. Then, you will receive and email with information on weather the reaction was inserted into the PMechDB database or not.
You can contact us with your questions.