IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v14y2023i1d10.1038_s41467-023-43836-5.html
   My bibliography  Save this article

Language models and protocol standardization guidelines for accelerating synthesis planning in heterogeneous catalysis

Author

Listed:
  • Manu Suvarna

    (Institute for Chemical and Bioengineering, Department of Chemistry and Applied Biosciences, ETH Zurich)

  • Alain Claude Vaucher

    (IBM Research Europe)

  • Sharon Mitchell

    (Institute for Chemical and Bioengineering, Department of Chemistry and Applied Biosciences, ETH Zurich)

  • Teodoro Laino

    (IBM Research Europe)

  • Javier Pérez-Ramírez

    (Institute for Chemical and Bioengineering, Department of Chemistry and Applied Biosciences, ETH Zurich)

Abstract

Synthesis protocol exploration is paramount in catalyst discovery, yet keeping pace with rapid literature advances is increasingly time intensive. Automated synthesis protocol analysis is attractive for swiftly identifying opportunities and informing predictive models, however such applications in heterogeneous catalysis remain limited. In this proof-of-concept, we introduce a transformer model for this task, exemplified using single-atom heterogeneous catalysts (SACs), a rapidly expanding catalyst family. Our model adeptly converts SAC protocols into action sequences, and we use this output to facilitate statistical inference of their synthesis trends and applications, potentially expediting literature review and analysis. We demonstrate the model’s adaptability across distinct heterogeneous catalyst families, underscoring its versatility. Finally, our study highlights a critical issue: the lack of standardization in reporting protocols hampers machine-reading capabilities. Embracing digital advances in catalysis demands a shift in data reporting norms, and to this end, we offer guidelines for writing protocols, significantly improving machine-readability. We release our model as an open-source web application, inviting a fresh approach to accelerate heterogeneous catalysis synthesis planning.

Suggested Citation

  • Manu Suvarna & Alain Claude Vaucher & Sharon Mitchell & Teodoro Laino & Javier Pérez-Ramírez, 2023. "Language models and protocol standardization guidelines for accelerating synthesis planning in heterogeneous catalysis," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
  • Handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-43836-5
    DOI: 10.1038/s41467-023-43836-5
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-023-43836-5
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-023-43836-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Chris Stokel-Walker & Richard Van Noorden, 2023. "What ChatGPT and generative AI mean for science," Nature, Nature, vol. 614(7947), pages 214-216, February.
    2. Alain C. Vaucher & Federico Zipoli & Joppe Geluykens & Vishnu H. Nair & Philippe Schwaller & Teodoro Laino, 2020. "Automated extraction of chemical synthesis actions from experimental procedures," Nature Communications, Nature, vol. 11(1), pages 1-11, December.
    3. Sharon Mitchell & Javier Pérez-Ramírez, 2020. "Single atom catalysis: a decade of stunning progress and the promise for a bright future," Nature Communications, Nature, vol. 11(1), pages 1-3, December.
    4. Alain C. Vaucher & Philippe Schwaller & Joppe Geluykens & Vishnu H. Nair & Anna Iuliano & Teodoro Laino, 2021. "Inferring experimental procedures from text-based representations of chemical reactions," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nathaniel H. Park & Matteo Manica & Jannis Born & James L. Hedrick & Tim Erdmann & Dmitry Yu. Zubarev & Nil Adell-Mill & Pedro L. Arrechea, 2023. "Artificial intelligence driven design of catalysts and materials for ring opening polymerization using a domain-specific language," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    2. Turgut Karakose & Murat Demirkol & Ramazan Yirci & Hakan Polat & Tuncay Yavuz Ozdemir & Tijen Tülübaş, 2023. "A Conversation with ChatGPT about Digital Leadership and Technology Integration: Comparative Analysis Based on Human–AI Collaboration," Administrative Sciences, MDPI, vol. 13(7), pages 1-19, June.
    3. Ion-Danut LIXANDRU, 2024. "The Use of Artificial Intelligence for Qualitative Data Analysis: ChatGPT," Informatica Economica, Academy of Economic Studies - Bucharest, Romania, vol. 28(1), pages 57-67.
    4. Charles E. Creissen & Marc Fontecave, 2022. "Keeping sight of copper in single-atom catalysts for electrochemical carbon dioxide reduction," Nature Communications, Nature, vol. 13(1), pages 1-4, December.
    5. Chong Lan & Yongsheng Wang & Chengze Wang & Shirong Song & Zheng Gong, 2023. "Application of ChatGPT-Based Digital Human in Animation Creation," Future Internet, MDPI, vol. 15(9), pages 1-18, September.
    6. Brady D. Lund & Ting Wang & Nishith Reddy Mannuru & Bing Nie & Somipam Shimray & Ziang Wang, 2023. "ChatGPT and a new academic reality: Artificial Intelligence‐written research papers and the ethics of the large language models in scholarly publishing," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 74(5), pages 570-581, May.
    7. Longsheng Cao & Fernando A. Soto & Dan Li & Tao Deng & Enyuan Hu & Xiner Lu & David A. Cullen & Nico Eidson & Xiao-Qing Yang & Kai He & Perla B. Balbuena & Chunsheng Wang, 2024. "Pd-Ru pair on Pt surface for promoting hydrogen oxidation and evolution in alkaline media," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    8. Limei Qin & Jie Gan & Dechao Niu & Yueqiang Cao & Xuezhi Duan & Xing Qin & Hao Zhang & Zheng Jiang & Yongjun Jiang & Sheng Dai & Yongsheng Li & Jianlin Shi, 2022. "Interfacial-confined coordination to single-atom nanotherapeutics," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    9. Tomaž Kosar & Dragana Ostojić & Yu David Liu & Marjan Mernik, 2024. "Computer Science Education in ChatGPT Era: Experiences from an Experiment in a Programming Course for Novice Programmers," Mathematics, MDPI, vol. 12(5), pages 1-22, February.
    10. Zihao Zhang & Jinshu Tian & Yubing Lu & Shize Yang & Dong Jiang & Weixin Huang & Yixiao Li & Jiyun Hong & Adam S. Hoffman & Simon R. Bare & Mark H. Engelhard & Abhaya K. Datye & Yong Wang, 2023. "Memory-dictated dynamics of single-atom Pt on CeO2 for CO oxidation," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    11. Kim, Jungkeun & Kim, Jeong Hyun & Kim, Changju & Park, Jooyoung, 2023. "Decisions with ChatGPT: Reexamining choice overload in ChatGPT recommendations," Journal of Retailing and Consumer Services, Elsevier, vol. 75(C).
    12. Wahyono, Budi & Rapih, Subroto & Boungou, Whelsy, 2023. "Unleashing the wordsmith: Analysing the stock market reactions to the launch of ChatGPT in the US Education sector," Finance Research Letters, Elsevier, vol. 58(PC).
    13. Daniel Souza & Aldo Geuna & Jeff Rodr'iguez, 2024. "How Small is Big Enough? Open Labeled Datasets and the Development of Deep Learning," Papers 2408.10359, arXiv.org.
    14. Giordano, Vito & Spada, Irene & Chiarello, Filippo & Fantoni, Gualtiero, 2024. "The impact of ChatGPT on human skills: A quantitative study on twitter data," Technological Forecasting and Social Change, Elsevier, vol. 203(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-43836-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.