
Domain Heuristic Fusion of Multi-Word Embeddings for Nutrient Value Prediction

Author

Listed:
  • Gordana Ispirova

    (Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia
    Jožef Stefan International Postgraduate School, 1000 Ljubljana, Slovenia)

  • Tome Eftimov

    (Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia)

  • Barbara Koroušić Seljak

    (Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia)

Abstract

Being both a poison and a cure for many lifestyle and non-communicable diseases, food is becoming a prime focus of precision medicine. Monitoring a few groups of nutrients is crucial for some patients, and methods that ease the calculation of nutrient values are emerging. Our proposed machine learning pipeline predicts nutrient values from learned vector representations of short texts, namely recipe names. In this study, we explored how the prediction results change when, instead of the vector representation of the recipe description, we use the embeddings of the list of ingredients. The nutrient content of a food depends on its ingredients; therefore, the text of the ingredients contains more relevant information. We define a domain-specific heuristic for merging the embeddings of the ingredients that incorporates the quantity of each ingredient, and we use the merged embeddings as features in machine learning models for nutrient prediction. The experimental results indicate that predictions improve when the domain-specific heuristic is used. The models for protein prediction were highly effective, with accuracies of up to 97.98%. Implementing a domain-specific heuristic for combining multi-word embeddings yields better results than conventional merging heuristics, with up to 60% higher accuracy in some cases.
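
The contrast between a conventional merging heuristic and a quantity-aware one can be illustrated with a minimal sketch. The abstract does not give the exact formula, so the quantity-weighted average below is only an assumption about what such a heuristic might look like, not the authors' method. The function names, the toy three-dimensional vectors, and the gram quantities are all hypothetical.

```python
from typing import Dict, List, Sequence

Vector = List[float]


def average_merge(vectors: Sequence[Vector]) -> Vector:
    """Conventional heuristic: unweighted mean of the ingredient vectors."""
    n = len(vectors)
    return [sum(component) / n for component in zip(*vectors)]


def quantity_weighted_merge(vectors: Sequence[Vector],
                            grams: Sequence[float]) -> Vector:
    """Assumed domain heuristic: mean weighted by each ingredient's quantity.

    This is an illustrative guess, not the formula from the paper.
    """
    total = sum(grams)
    if total <= 0:
        raise ValueError("total ingredient quantity must be positive")
    return [
        sum(w * x for w, x in zip(grams, component)) / total
        for component in zip(*vectors)
    ]


# Toy embeddings with made-up values; real ones would come from a trained model.
toy_embeddings: Dict[str, Vector] = {
    "flour": [1.0, 0.0, 0.0],
    "butter": [0.0, 1.0, 0.0],
    "salt": [0.0, 0.0, 1.0],
}

# Hypothetical recipe: ingredient name and quantity in grams.
recipe = [("flour", 300.0), ("butter", 150.0), ("salt", 50.0)]

vectors = [toy_embeddings[name] for name, _ in recipe]
grams = [quantity for _, quantity in recipe]

plain_features = average_merge(vectors)
weighted_features = quantity_weighted_merge(vectors, grams)
```

In this sketch the plain average treats salt and flour as equally important, while the weighted version lets the 300 g of flour dominate the recipe vector. Either vector could then serve as the feature input to a prediction model.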

Suggested Citation

  • Gordana Ispirova & Tome Eftimov & Barbara Koroušić Seljak, 2021. "Domain Heuristic Fusion of Multi-Word Embeddings for Nutrient Value Prediction," Mathematics, MDPI, vol. 9(16), pages 1-15, August.
  • Handle: RePEc:gam:jmathe:v:9:y:2021:i:16:p:1941-:d:614372

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/9/16/1941/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/9/16/1941/
    Download Restriction: no