Molecular Sets (MOSES): a collaborative benchmarking platform for generative drug discovery

Published in

InsilicoMedicine

3 min readDec 1, 2018

As we are heading to the NIPS/NeurIPS conference in Montreal, we are very happy to announce our recent collaboration in generative drug discovery called MOSES. It is a benchmarking platform for molecular generation models. The code and the paper are available at the GitHub repository https://github.com/molecularsets/moses.

In a recent editorial for a special issue on Deep Learning for Drug Discovery and Biomarker Development at Molecular Pharmaceutics, I outlined the opportunities and challenges of the emerging generative chemistry industry.

The original benchmarking platform is a result of collaboration between Insilico Medicine, Neuromation, and Alán Aspuru-Guzik’s laboratory. The researchers and teams are invited to contribute their datasets and models to extend the benchmarking platform. MOSES implements several popular molecular generation models and ranks them, according to a predefined set of metrics. MOSES aims to increase the pace of drug discovery and facilitate sharing and comparison of new models. MOSES is supposed to boost AI-powered drug discovery, just as ImageNet boosted deep learning for imaging data.

The ongoing research in machine learning, in particular, deep learning, brings up the issues of reproducibility and fair comparison of different approaches. While there are multiple methods for generating novel molecular structures with machine learning models, there is no conventional way to run and evaluate the performance of these generative models. The MOSES platform provides a standardized benchmarking dataset, a set of open-sourced models with unified implementation, and metrics to evaluate and assess the results of generation.

Here are some of the quotes from myself and collaborators:

“When we started the journey in generative chemistry using GANs in 2015, it was rather exotic and unproven technology. Today it is experimentally-validated and the field is exploding with many groups joining and making meaningful contributions. We believe that it is important to develop a set of standards and benchmarks to help the community, to accelerate the delivery of AI-generated drugs to the patients”, said Alex Zhavoronkov, Ph.D., founder, and CEO of Insilico Medicine.

“At Insilico Medicine, we take reproducibility and fair evaluation of machine learning models very seriously. Growing popularity of generative AI applications in drug discovery dictates a need for a standardized benchmarking platform supported and maintained by the research community. With MOSES, we come one step closer to the ultimate goal of disrupting the industry with better drug compounds produced by advanced computational and machine learning methods”, said Alexander Zhebrak, CTO of Insilico Medicine.

“I am happy to announce this research collaboration on behalf of Neuromation. Here at Neuromation, we are trying to bring cutting edge results from deep learning to many fields, with healthcare being among our top priorities. MOSES is a benchmarking platform that has the potential to become an industry standard for generative models in biochemistry; developing it has been an exciting road for us and a wonderful opportunity to work together with Insilico Medicine,” said Sergey Nikolenko, Ph.D., Chief Research Officer of Neuromation.

The platform, described in the paper, will be presented at the NIPS Expo in Montreal on December 2, by Daniil Polykovskiy, Alexander Zhebrak, and Alex Zhavoronkov, co-authors of the paper.

Here is the original press release.

Molecular Sets (MOSES): a collaborative benchmarking platform for generative drug discovery

Written by Alex Zhavoronkov, PhD