Author
Listed:
- Lars A Bratholm
- Will Gerrard
- Brandon Anderson
- Shaojie Bai
- Sunghwan Choi
- Lam Dang
- Pavel Hanchar
- Addison Howard
- Sanghoon Kim
- Zico Kolter
- Risi Kondor
- Mordechai Kornbluth
- Youhan Lee
- Youngsoo Lee
- Jonathan P Mailoa
- Thanh Tu Nguyen
- Milos Popovic
- Goran Rakocevic
- Walter Reade
- Wonho Song
- Luka Stojanovic
- Erik H Thiede
- Nebojsa Tijanic
- Andres Torrubia
- Devin Willmott
- Craig P Butts
- David R Glowacki
Abstract
The rise of machine learning (ML) has created an explosion in the potential strategies for using data to make scientific predictions. For physical scientists wishing to apply ML strategies to a particular domain, it can be difficult to assess in advance what strategy to adopt within a vast space of possibilities. Here we outline the results of an online community-powered effort to swarm search the space of ML strategies and develop algorithms for predicting atomic-pairwise nuclear magnetic resonance (NMR) properties in molecules. Using an open-source dataset, we worked with Kaggle to design and host a 3-month competition which received 47,800 ML model predictions from 2,700 teams in 84 countries. Within 3 weeks, the Kaggle community produced models with comparable accuracy to our best previously published ‘in-house’ efforts. A meta-ensemble model constructed as a linear combination of the top predictions has a prediction accuracy which exceeds that of any individual model, 7-19x better than our previous state-of-the-art. The results highlight the potential of transformer architectures for predicting quantum mechanical (QM) molecular properties.
Suggested Citation
Lars A Bratholm & Will Gerrard & Brandon Anderson & Shaojie Bai & Sunghwan Choi & Lam Dang & Pavel Hanchar & Addison Howard & Sanghoon Kim & Zico Kolter & Risi Kondor & Mordechai Kornbluth & Youhan Le, 2021.
"A community-powered search of machine learning strategy space to find NMR property prediction models,"
PLOS ONE, Public Library of Science, vol. 16(7), pages 1-16, July.
Handle:
RePEc:plo:pone00:0253612
DOI: 10.1371/journal.pone.0253612
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0253612. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.