IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-024-50677-3.html
   My bibliography  Save this article

Multimodal deep learning using on-chip diffractive optics with in situ training capability

Author

Listed:
  • Junwei Cheng

    (Huazhong University of Science and Technology)

  • Chaoran Huang

    (The Chinese University of Hong Kong)

  • Jialong Zhang

    (Huazhong University of Science and Technology)

  • Bo Wu

    (Huazhong University of Science and Technology)

  • Wenkai Zhang

    (Huazhong University of Science and Technology)

  • Xinyu Liu

    (Huazhong University of Science and Technology)

  • Jiahui Zhang

    (Huazhong University of Science and Technology)

  • Yiyi Tang

    (Huazhong University of Science and Technology)

  • Hailong Zhou

    (Huazhong University of Science and Technology)

  • Qiming Zhang

    (University of Shanghai for Science and Technology)

  • Min Gu

    (University of Shanghai for Science and Technology)

  • Jianji Dong

    (Huazhong University of Science and Technology
    Optics Valley Laboratory)

  • Xinliang Zhang

    (Huazhong University of Science and Technology
    Optics Valley Laboratory)

Abstract

Multimodal deep learning plays a pivotal role in supporting the processing and learning of diverse data types within the realm of artificial intelligence generated content (AIGC). However, most photonic neuromorphic processors for deep learning can only handle a single data modality (either vision or audio) due to the lack of abundant parameter training in optical domain. Here, we propose and demonstrate a trainable diffractive optical neural network (TDONN) chip based on on-chip diffractive optics with massive tunable elements to address these constraints. The TDONN chip includes one input layer, five hidden layers, and one output layer, and only one forward propagation is required to obtain the inference results without frequent optical-electrical conversion. The customized stochastic gradient descent algorithm and the drop-out mechanism are developed for photonic neurons to realize in situ training and fast convergence in the optical domain. The TDONN chip achieves a potential throughput of 217.6 tera-operations per second (TOPS) with high computing density (447.7 TOPS/mm2), high system-level energy efficiency (7.28 TOPS/W), and low optical latency (30.2 ps). The TDONN chip has successfully implemented four-class classification in different modalities (vision, audio, and touch) and achieve 85.7% accuracy on multimodal test sets. Our work opens up a new avenue for multimodal deep learning with integrated photonic processors, providing a potential solution for low-power AI large models using photonic technology.

Suggested Citation

  • Junwei Cheng & Chaoran Huang & Jialong Zhang & Bo Wu & Wenkai Zhang & Xinyu Liu & Jiahui Zhang & Yiyi Tang & Hailong Zhou & Qiming Zhang & Min Gu & Jianji Dong & Xinliang Zhang, 2024. "Multimodal deep learning using on-chip diffractive optics with in situ training capability," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-50677-3
    DOI: 10.1038/s41467-024-50677-3
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-50677-3
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-50677-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. G. Mourgias-Alexandris & M. Moralis-Pegios & A. Tsakyridis & S. Simos & G. Dabos & A. Totovic & N. Passalis & M. Kirtas & T. Rutirawut & F. Y. Gardes & A. Tefas & N. Pleros, 2022. "Noise-resilient and high-speed deep learning with coherent silicon photonics," Nature Communications, Nature, vol. 13(1), pages 1-7, December.
    2. Weipeng Zhang & Alexander Tait & Chaoran Huang & Thomas Ferreira de Lima & Simon Bilodeau & Eric C. Blow & Aashu Jha & Bhavin J. Shastri & Paul Prucnal, 2023. "Broadband physical layer cognitive radio with an integrated photonic processor for blind source separation," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    3. Michael Moor & Oishi Banerjee & Zahra Shakeri Hossein Abad & Harlan M. Krumholz & Jure Leskovec & Eric J. Topol & Pranav Rajpurkar, 2023. "Foundation models for generalist medical artificial intelligence," Nature, Nature, vol. 616(7956), pages 259-265, April.
    4. Chen Sun & Mark T. Wade & Yunsup Lee & Jason S. Orcutt & Luca Alloatti & Michael S. Georgas & Andrew S. Waterman & Jeffrey M. Shainline & Rimas R. Avizienis & Sen Lin & Benjamin R. Moss & Rajesh Kumar, 2015. "Single-chip microprocessor that communicates directly using light," Nature, Nature, vol. 528(7583), pages 534-538, December.
    5. J. Feldmann & N. Youngblood & M. Karpov & H. Gehring & X. Li & M. Stappers & M. Gallo & X. Fu & A. Lukashchuk & A. S. Raja & J. Liu & C. D. Wright & A. Sebastian & T. J. Kippenberg & W. H. P. Pernice , 2021. "Publisher Correction: Parallel convolutional processing using an integrated photonic tensor core," Nature, Nature, vol. 591(7849), pages 13-13, March.
    6. Bowen Bai & Qipeng Yang & Haowen Shu & Lin Chang & Fenghe Yang & Bitao Shen & Zihan Tao & Jing Wang & Shaofu Xu & Weiqiang Xie & Weiwen Zou & Weiwei Hu & John E. Bowers & Xingjun Wang, 2023. "Microcomb-based integrated photonic processing unit," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    7. H. Zhang & M. Gu & X. D. Jiang & J. Thompson & H. Cai & S. Paesani & R. Santagati & A. Laing & Y. Zhang & M. H. Yung & Y. Z. Shi & F. K. Muhammad & G. Q. Lo & X. S. Luo & B. Dong & D. L. Kwong & L. C., 2021. "An optical neural chip for implementing complex-valued neural network," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    8. Tingzhao Fu & Yubin Zang & Yuyao Huang & Zhenmin Du & Honghao Huang & Chengyang Hu & Minghua Chen & Sigang Yang & Hongwei Chen, 2023. "Photonic machine learning with on-chip diffractive optics," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    9. Xingyuan Xu & Mengxi Tan & Bill Corcoran & Jiayang Wu & Andreas Boes & Thach G. Nguyen & Sai T. Chu & Brent E. Little & Damien G. Hicks & Roberto Morandotti & Arnan Mitchell & David J. Moss, 2021. "11 TOPS photonic convolutional accelerator for optical neural networks," Nature, Nature, vol. 589(7840), pages 44-51, January.
    10. J. Feldmann & N. Youngblood & M. Karpov & H. Gehring & X. Li & M. Stappers & M. Gallo & X. Fu & A. Lukashchuk & A. S. Raja & J. Liu & C. D. Wright & A. Sebastian & T. J. Kippenberg & W. H. P. Pernice , 2021. "Parallel convolutional processing using an integrated photonic tensor core," Nature, Nature, vol. 589(7840), pages 52-58, January.
    11. J. Feldmann & N. Youngblood & C. D. Wright & H. Bhaskaran & W. H. P. Pernice, 2019. "All-optical spiking neurosynaptic networks with self-learning capabilities," Nature, Nature, vol. 569(7755), pages 208-214, May.
    12. Amir H. Atabaki & Sajjad Moazeni & Fabio Pavanello & Hayk Gevorgyan & Jelena Notaros & Luca Alloatti & Mark T. Wade & Chen Sun & Seth A. Kruger & Huaiyu Meng & Kenaish Al Qubaisi & Imbert Wang & Bohan, 2018. "Publisher Correction: Integrating photonics with silicon nanoelectronics for the next generation of systems on a chip," Nature, Nature, vol. 560(7716), pages 4-4, August.
    13. Amir H. Atabaki & Sajjad Moazeni & Fabio Pavanello & Hayk Gevorgyan & Jelena Notaros & Luca Alloatti & Mark T. Wade & Chen Sun & Seth A. Kruger & Huaiyu Meng & Kenaish Al Qubaisi & Imbert Wang & Bohan, 2018. "Integrating photonics with silicon nanoelectronics for the next generation of systems on a chip," Nature, Nature, vol. 556(7701), pages 349-354, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wen Zhou & Bowei Dong & Nikolaos Farmakidis & Xuan Li & Nathan Youngblood & Kairan Huang & Yuhan He & C. David Wright & Wolfram H. P. Pernice & Harish Bhaskaran, 2023. "In-memory photonic dot-product engine with electrically programmable weight banks," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    2. Bowen Bai & Qipeng Yang & Haowen Shu & Lin Chang & Fenghe Yang & Bitao Shen & Zihan Tao & Jing Wang & Shaofu Xu & Weiqiang Xie & Weiwen Zou & Weiwei Hu & John E. Bowers & Xingjun Wang, 2023. "Microcomb-based integrated photonic processing unit," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    3. Xiangyan Meng & Guojie Zhang & Nuannuan Shi & Guangyi Li & José Azaña & José Capmany & Jianping Yao & Yichen Shen & Wei Li & Ninghua Zhu & Ming Li, 2023. "Compact optical convolution processing unit based on multimode interference," Nature Communications, Nature, vol. 14(1), pages 1-9, December.
    4. Chen-Guang Wang & Wuyue Xu & Chong Li & Lili Shi & Junliang Jiang & Tingting Guo & Wen-Cheng Yue & Tianyu Li & Ping Zhang & Yang-Yang Lyu & Jiazheng Pan & Xiuhao Deng & Ying Dong & Xuecou Tu & Sining , 2024. "Integrated and DC-powered superconducting microcomb," Nature Communications, Nature, vol. 15(1), pages 1-7, December.
    5. Xuan-Kun Li & Jian-Xu Ma & Xiang-Yu Li & Jun-Jie Hu & Chuan-Yang Ding & Feng-Kai Han & Xiao-Min Guo & Xi Tan & Xian-Min Jin, 2024. "High-efficiency reinforcement learning with hybrid architecture photonic integrated circuit," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    6. Han Zhao & Bingzhao Li & Huan Li & Mo Li, 2022. "Enabling scalable optical computing in synthetic frequency dimension using integrated cavity acousto-optics," Nature Communications, Nature, vol. 13(1), pages 1-7, December.
    7. Guangwei Cong & Noritsugu Yamamoto & Takashi Inoue & Yuriko Maegami & Morifumi Ohno & Shota Kita & Shu Namiki & Koji Yamada, 2022. "On-chip bacterial foraging training in silicon photonic circuits for projection-enabled nonlinear classification," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    8. Steven Becker & Dirk Englund & Birgit Stiller, 2024. "An optoacoustic field-programmable perceptron for recurrent neural networks," Nature Communications, Nature, vol. 15(1), pages 1-8, December.
    9. Liuting Shan & Qizhen Chen & Rengjian Yu & Changsong Gao & Lujian Liu & Tailiang Guo & Huipeng Chen, 2023. "A sensory memory processing system with multi-wavelength synaptic-polychromatic light emission for multi-modal information recognition," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    10. G. Mourgias-Alexandris & M. Moralis-Pegios & A. Tsakyridis & S. Simos & G. Dabos & A. Totovic & N. Passalis & M. Kirtas & T. Rutirawut & F. Y. Gardes & A. Tefas & N. Pleros, 2022. "Noise-resilient and high-speed deep learning with coherent silicon photonics," Nature Communications, Nature, vol. 13(1), pages 1-7, December.
    11. Wenting Wang & Ping-Keng Lu & Abhinav Kumar Vinod & Deniz Turan & James F. McMillan & Hao Liu & Mingbin Yu & Dim-Lee Kwong & Mona Jarrahi & Chee Wei Wong, 2022. "Coherent terahertz radiation with 2.8-octave tunability through chip-scale photomixed microresonator optical parametric oscillation," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    12. H. H. Zhu & J. Zou & H. Zhang & Y. Z. Shi & S. B. Luo & N. Wang & H. Cai & L. X. Wan & B. Wang & X. D. Jiang & J. Thompson & X. S. Luo & X. H. Zhou & L. M. Xiao & W. Huang & L. Patrick & M. Gu & L. C., 2022. "Space-efficient optical computing with an integrated chip diffractive neural network," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    13. Bitao Shen & Haowen Shu & Weiqiang Xie & Ruixuan Chen & Zhi Liu & Zhangfeng Ge & Xuguang Zhang & Yimeng Wang & Yunhao Zhang & Buwen Cheng & Shaohua Yu & Lin Chang & Xingjun Wang, 2023. "Harnessing microcomb-based parallel chaos for random number generation and optical decision making," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    14. Xiaoyun Yuan & Yong Wang & Zhihao Xu & Tiankuang Zhou & Lu Fang, 2023. "Training large-scale optoelectronic neural networks with dual-neuron optical-artificial learning," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    15. Xuguang Zhang & Zixuan Zhou & Yijun Guo & Minxue Zhuang & Warren Jin & Bitao Shen & Yujun Chen & Jiahui Huang & Zihan Tao & Ming Jin & Ruixuan Chen & Zhangfeng Ge & Zhou Fang & Ning Zhang & Yadong Liu, 2024. "High-coherence parallelization in integrated photonics," Nature Communications, Nature, vol. 15(1), pages 1-9, December.
    16. Yiwei Li & Ning An & Zheyi Lu & Yuchen Wang & Bing Chang & Teng Tan & Xuhan Guo & Xizhen Xu & Jun He & Handing Xia & Zhaohui Wu & Yikai Su & Yuan Liu & Yunjiang Rao & Giancarlo Soavi & Baicheng Yao, 2022. "Nonlinear co-generation of graphene plasmons for optoelectronic logic operations," Nature Communications, Nature, vol. 13(1), pages 1-7, December.
    17. Jingwei Ling & Zhengdong Gao & Shixin Xue & Qili Hu & Mingxiao Li & Kaibo Zhang & Usman A. Javid & Raymond Lopez-Rios & Jeremy Staffa & Qiang Lin, 2024. "Electrically empowered microcomb laser," Nature Communications, Nature, vol. 15(1), pages 1-8, December.
    18. Mitsumasa Nakajima & Katsuma Inoue & Kenji Tanaka & Yasuo Kuniyoshi & Toshikazu Hashimoto & Kohei Nakajima, 2022. "Physical deep learning with biologically inspired training method: gradient-free approach for physical hardware," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    19. Miltiadis Moralis-Pegios & George Giamougiannis & Apostolos Tsakyridis & David Lazovsky & Nikos Pleros, 2024. "Perfect linear optics using silicon photonics," Nature Communications, Nature, vol. 15(1), pages 1-8, December.
    20. Ming Deng & Michele Cotrufo & Jian Wang & Jianji Dong & Zhichao Ruan & Andrea Alù & Lin Chen, 2024. "Broadband angular spectrum differentiation using dielectric metasurfaces," Nature Communications, Nature, vol. 15(1), pages 1-10, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-50677-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.