Monomers are the building blocks of biomolecules and HELM adopters need to decide what monomers they want to use very early in their journey with HELM. Until now there has been little guidance for monomer set creation and no recommended starting set.
The HELM project team has worked with Evan Bolton of PubChem and Anna Gaulton and Patricia Bento of ChEMBL to analyze their public datasets and identify the monomers that appear most frequently. Using a combination of metrics: the appearance of the monomer in biomolecules in PubChem, ChEMBL, the general literature, and patents, the team has identified a set of just over 300 peptide monomers and nearly 400 nucleotide monomers.
We commend these monomers to new users. The set enables you to represent a large number of biomolecules with a tractable number of monomers. As the set becomes established, it is hoped that it will reduce the need for translation of HELM strings from different sources.
For more information on HELM, see www.OpenHELM.org.
We welcome new members, if you would like to join us please email info@OpenHELM.org.