IDEAS home Printed from https://ideas.repec.org/a/gam/jagris/v13y2023i4p768-d1107983.html
   My bibliography  Save this article

Insights into Cottonseed Cultivar Identification Using Raman Spectroscopy and Explainable Machine Learning

Author

Listed:
  • Jianan Chi

    (School of Information Engineering, Tarim University, Alaer 843300, China
    Southern Xinjiang Research Center for Information Technology in Agriculture, Tarim University, Alaer 843300, China
    Henan Kaifeng College of Science Technology and Communication, Kaifeng 475000, China
    These authors contributed equally to this work.)

  • Xiangxin Bu

    (Southern Xinjiang Research Center for Information Technology in Agriculture, Tarim University, Alaer 843300, China
    These authors contributed equally to this work.)

  • Xiao Zhang

    (School of Information Engineering, Tarim University, Alaer 843300, China
    Southern Xinjiang Research Center for Information Technology in Agriculture, Tarim University, Alaer 843300, China)

  • Lijun Wang

    (School of Information Engineering, Tarim University, Alaer 843300, China
    Analysis and Testing Center, Tarim University, Alar 843300, China)

  • Nannan Zhang

    (School of Information Engineering, Tarim University, Alaer 843300, China
    Southern Xinjiang Research Center for Information Technology in Agriculture, Tarim University, Alaer 843300, China)

Abstract

Securing authentic cottonseed identity information is crucial for preserving the livelihoods of farmers. Traditional seed identification methods are generally time-consuming, and have a high degree of difficulty. Raman spectroscopy, in combination with machine learning (ML), has opened up new avenues for seed identification. In this study, we explored the feasibility of using Raman spectroscopy combined with ML for cottonseed identification. Using Raman confocal microscopy, we constructed fingerprints of cottonseeds and analyzed their important Raman peaks. We integrated two feature exploration methods (Principal Component Analysis and Harris Hawk optimization) and three ML algorithms (Support Vector Machine, eXtreme Gradient Boosting, and Multi-Layer Perceptron) into a Raman spectroscopy analysis framework to accurately identify cottonseed cultivars. Through the utilization of SHapley Additive exPlanations (SHAP), we provide an in-depth explanation of the model’s decision-making process. Our results demonstrate that XGBoost, a tree-based model, exhibits outstanding accuracy (overall accuracy of 0.94–0.88) in cottonseed identification. Notably, lignin emerged as a pivotal factor that strongly influenced the model’s prediction of cottonseed cultivars, as revealed by the XGBoost interpretation. Overall, our study illustrates the effectiveness of combining Raman spectroscopy with ML to precisely identify cottonseed cultivars. The SHAP framework used in our study enables seed-related personnel to better comprehend the model’s prediction mechanism. These valuable insights are expected to enhance seed planting and management practices in the future.

Suggested Citation

  • Jianan Chi & Xiangxin Bu & Xiao Zhang & Lijun Wang & Nannan Zhang, 2023. "Insights into Cottonseed Cultivar Identification Using Raman Spectroscopy and Explainable Machine Learning," Agriculture, MDPI, vol. 13(4), pages 1-17, March.
  • Handle: RePEc:gam:jagris:v:13:y:2023:i:4:p:768-:d:1107983
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2077-0472/13/4/768/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2077-0472/13/4/768/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ke-Lin Du & Chi-Sing Leung & Wai Ho Mow & M. N. S. Swamy, 2022. "Perceptron: Learning, Generalization, Model Selection, Fault Tolerance, and Role in the Deep Learning Era," Mathematics, MDPI, vol. 10(24), pages 1-46, December.
    2. Irfan Afzal & Talha Javed & Masoume Amirkhani & Alan G. Taylor, 2020. "Modern Seed Technology: Seed Coating Delivery Systems for Enhancing Seed and Crop Performance," Agriculture, MDPI, vol. 10(11), pages 1-20, November.
    3. Tin Ko Oo & Noppol Arunrat & Sukanya Sereenonchai & Achara Ussawarujikulchai & Uthai Chareonwong & Winai Nutmagul, 2022. "Comparing Four Machine Learning Algorithms for Land Cover Classification in Gold Mining: A Case Study of Kyaukpahto Gold Mine, Northern Myanmar," Sustainability, MDPI, vol. 14(17), pages 1-15, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Siyu He & Ying Zang & Zishun Huang & Wanyan Tao & He Xing & Wei Qin & Youcong Jiang & Zaiman Wang, 2022. "Design of and Experiment on a Cleaning Mechanism of the Pneumatic Single Seed Metering Device for Coated Hybrid Rice," Agriculture, MDPI, vol. 12(8), pages 1-17, August.
    2. Ali Najem Alkawaz & Jeevan Kanesan & Anis Salwa Mohd Khairuddin & Irfan Anjum Badruddin & Sarfaraz Kamangar & Mohamed Hussien & Maughal Ahmed Ali Baig & N. Ameer Ahammad, 2023. "Training Multilayer Neural Network Based on Optimal Control Theory for Limited Computational Resources," Mathematics, MDPI, vol. 11(3), pages 1-15, February.
    3. Alan G. Taylor & Masoume Amirkhani & Hank Hill, 2021. "Modern Seed Technology," Agriculture, MDPI, vol. 11(7), pages 1-6, July.
    4. Talha Javed & Irfan Afzal & Rosario Paolo Mauro, 2021. "Seed Coating in Direct Seeded Rice: An Innovative and Sustainable Approach to Enhance Grain Yield and Weed Management under Submerged Conditions," Sustainability, MDPI, vol. 13(4), pages 1-13, February.
    5. Hilary Mayton & Masoume Amirkhani & Daibin Yang & Stephen Donovan & Alan G. Taylor, 2021. "Tomato Seed Coat Permeability: Optimal Seed Treatment Chemical Properties for Targeting the Embryo with Implications for Internal Seed-Borne Pathogen Control," Agriculture, MDPI, vol. 11(3), pages 1-11, February.
    6. Hilary Mayton & Masoume Amirkhani & Michael Loos & Burton Johnson & John Fike & Chuck Johnson & Kevin Myers & Jennifer Starr & Gary C. Bergstrom & Alan Taylor, 2022. "Evaluation of Industrial Hemp Seed Treatments for Management of Damping-Off for Enhanced Stand Establishment," Agriculture, MDPI, vol. 12(5), pages 1-12, April.
    7. Niamat Ullah Khan & Aftab Ahmad Khan & Muhammad Arif Goheer & Izwa Shafique & Sadam Hussain & Saddam Hussain & Talha Javed & Maliha Naz & Rubab Shabbir & Ali Raza & Faisal Zulfiqar & Freddy Mora-Poble, 2021. "Effect of Zero and Minimum Tillage on Cotton Productivity and Soil Characteristics under Different Nitrogen Application Rates," Sustainability, MDPI, vol. 13(24), pages 1-13, December.
    8. Ke-Lin Du & M. N. S. Swamy & Zhang-Quan Wang & Wai Ho Mow, 2023. "Matrix Factorization Techniques in Machine Learning, Signal Processing, and Statistics," Mathematics, MDPI, vol. 11(12), pages 1-50, June.
    9. Wacław Jarecki & Justyna Wietecha, 2021. "Effect of seed coating on the yield of soybean Glycine max (L.) Merr," Plant, Soil and Environment, Czech Academy of Agricultural Sciences, vol. 67(8), pages 468-473.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jagris:v:13:y:2023:i:4:p:768-:d:1107983. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.