IDEAS home Printed from https://ideas.repec.org/a/spr/elmark/v32y2022i4d10.1007_s12525-022-00612-5.html
   My bibliography  Save this article

Global reconstruction of language models with linguistic rules – Explainable AI for online consumer reviews

Author

Listed:
  • Markus Binder

    (University of Regensburg, Germany, at the Faculty of Informatics and Data Science)

  • Bernd Heinrich

    (University of Regensburg, Germany, at the Faculty of Informatics and Data Science)

  • Marcus Hopf

    (University of Regensburg, Germany, at the Faculty of Informatics and Data Science)

  • Alexander Schiller

    (University of Regensburg, Germany, at the Faculty of Informatics and Data Science)

Abstract

Analyzing textual data by means of AI models has been recognized as highly relevant in information systems research and practice, since a vast amount of data on eCommerce platforms, review portals or social media is given in textual form. Here, language models such as BERT, which are deep learning AI models, constitute a breakthrough and achieve leading-edge results in many applications of text analytics such as sentiment analysis in online consumer reviews. However, these language models are “black boxes”: It is unclear how they arrive at their predictions. Yet, applications of language models, for instance, in eCommerce require checks and justifications by means of global reconstruction of their predictions, since the decisions based thereon can have large impacts or are even mandatory due to regulations such as the GDPR. To this end, we propose a novel XAI approach for global reconstructions of language model predictions for token-level classifications (e.g., aspect term detection) by means of linguistic rules based on NLP building blocks (e.g., part-of-speech). The approach is analyzed on different datasets of online consumer reviews and NLP tasks. Since our approach allows for different setups, we further are the first to analyze the trade-off between comprehensibility and fidelity of global reconstructions of language model predictions. With respect to this trade-off, we find that our approach indeed allows for balanced setups for global reconstructions of BERT’s predictions. Thus, our approach paves the way for a thorough understanding of language model predictions in text analytics. In practice, our approach can assist businesses in their decision-making and supports compliance with regulatory requirements.

Suggested Citation

  • Markus Binder & Bernd Heinrich & Marcus Hopf & Alexander Schiller, 2022. "Global reconstruction of language models with linguistic rules – Explainable AI for online consumer reviews," Electronic Markets, Springer;IIM University of St. Gallen, vol. 32(4), pages 2123-2138, December.
  • Handle: RePEc:spr:elmark:v:32:y:2022:i:4:d:10.1007_s12525-022-00612-5
    DOI: 10.1007/s12525-022-00612-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12525-022-00612-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s12525-022-00612-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. James O’Donovan & Hannes F Wagner & Stefan Zeume, 2019. "The Value of Offshore Secrets: Evidence from the Panama Papers," The Review of Financial Studies, Society for Financial Studies, vol. 32(11), pages 4117-4155.
    2. Shrestha, Yash Raj & Krishna, Vaibhav & von Krogh, Georg, 2021. "Augmenting organizational decision-making with deep learning algorithms: Principles, promises, and challenges," Journal of Business Research, Elsevier, vol. 123(C), pages 588-603.
    3. Andreas J. Steur & Fabian Fritzsche & Mischa Seiter, 2022. "It’s all about the text: An experimental investigation of inconsistent reviews on restaurant booking platforms," Electronic Markets, Springer;IIM University of St. Gallen, vol. 32(3), pages 1187-1220, September.
    4. Chatterjee, Swagato & Goyal, Divesh & Prakash, Atul & Sharma, Jiwan, 2021. "Exploring healthcare/health-product ecommerce satisfaction: A text mining and machine learning application," Journal of Business Research, Elsevier, vol. 131(C), pages 815-825.
    5. Bernd Heinrich & Marcus Hopf & Daniel Lohninger & Alexander Schiller & Michael Szubartowicz, 2021. "Data quality in recommender systems: the impact of completeness of item content data on prediction accuracy of recommender systems," Electronic Markets, Springer;IIM University of St. Gallen, vol. 31(2), pages 389-409, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Christian Meske & Babak Abedin & Mathias Klier & Fethi Rabhi, 2022. "Explainable and responsible artificial intelligence," Electronic Markets, Springer;IIM University of St. Gallen, vol. 32(4), pages 2103-2106, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Menkhoff, Lukas & Miethe, Jakob, 2019. "Tax evasion in new disguise? Examining tax havens' international bank deposits," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 176, pages 53-78.
    2. de Jong, Jeroen P.J. & Ben-Menahem, Shiko M. & Franke, Nikolaus & Füller, Johann & von Krogh, Georg, 2021. "Treading new ground in household sector innovation research: Scope, emergence, business implications, and diffusion," Research Policy, Elsevier, vol. 50(8).
    3. Verena K. Dutt & Christopher A. Ludwig & Katharina Nicolay & Heiko Vay & Johannes Voget, 2019. "Increasing tax transparency: investor reactions to the country-by-country reporting requirement for EU financial institutions," International Tax and Public Finance, Springer;International Institute of Public Finance, vol. 26(6), pages 1259-1290, December.
    4. Fabrizio Colella & Keith Maskus & Alessandro Peri, 2024. "Unintended Consequences of Money-Laundering Regulations," RF Berlin - CReAM Discussion Paper Series 2403, Rockwool Foundation Berlin (RF Berlin) - Centre for Research and Analysis of Migration (CReAM).
    5. Christian Janiesch & Patrick Zschech & Kai Heinrich, 2021. "Machine learning and deep learning," Electronic Markets, Springer;IIM University of St. Gallen, vol. 31(3), pages 685-695, September.
    6. Ashton, John & Burnett, Tim & Diaz-Rainey, Ivan & Ormosi, Peter, 2021. "Known unknowns: How much financial misconduct is detected and deterred?," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 74(C).
    7. Müller, Raphael & Spengel, Christoph & Vay, Heiko, 2020. "On the determinants and effects of corporate tax transparency: Review of an emerging literature," ZEW Discussion Papers 20-063, ZEW - Leibniz Centre for European Economic Research.
    8. Fangjun Wang & Shuolei Xu & Junqin Sun & Charles P. Cullinan, 2020. "Corporate Tax Avoidance: A Literature Review And Research Agenda," Journal of Economic Surveys, Wiley Blackwell, vol. 34(4), pages 793-811, September.
    9. Do, Quoc-Anh & Galbiati, Roberto & Marx, Benjamin & Ortiz Serrano, Miguel A., 2024. "J'Accuse! Antisemitism and financial markets in the time of the Dreyfus Affair," Journal of Financial Economics, Elsevier, vol. 154(C).
    10. Colonnelli, Emanuele & Lagaras, Spyridon & Ponticelli, Jacopo & Prem, Mounu & Tsoutsoura, Margarita, 2022. "Revealing corruption: Firm and worker level evidence from Brazil," Journal of Financial Economics, Elsevier, vol. 143(3), pages 1097-1119.
    11. Konda, Laura & Patel, Elena & Seegert, Nathan, 2022. "Tax enforcement and the intended and unintended consequences of information disclosure," Journal of Public Economics, Elsevier, vol. 212(C).
    12. Nan Yang & Nikolaos Korfiatis & Dimitris Zissis & Konstantina Spanaki, 2024. "Incorporating topic membership in review rating prediction from unstructured data: a gradient boosting approach," Annals of Operations Research, Springer, vol. 339(1), pages 631-662, August.
    13. Azadi, Majid & Yousefi, Saeed & Farzipoor Saen, Reza & Shabanpour, Hadi & Jabeen, Fauzia, 2023. "Forecasting sustainability of healthcare supply chains using deep learning and network data envelopment analysis," Journal of Business Research, Elsevier, vol. 154(C).
    14. Gavrilova, Evelina & Polakova, Aija, 2018. "Stairway to (Secrecy) Heaven: Market Attitudes towards Secrecy Shopping," Discussion Papers 2018/19, Norwegian School of Economics, Department of Business and Management Science.
    15. Sharafutdinova,Gulnaz & Lokshin,Michael M., 2020. "Hide and Protect : A Role of Global Financial Secrecy in Shaping Domestic Institutions," Policy Research Working Paper Series 9348, The World Bank.
    16. Araz Zirar, 2023. "Can artificial intelligence’s limitations drive innovative work behaviour?," Review of Managerial Science, Springer, vol. 17(6), pages 2005-2034, August.
    17. Christina M. Lewellen, 2023. "Tax haven incorporation and financial reporting transparency," Review of Accounting Studies, Springer, vol. 28(3), pages 1811-1855, September.
    18. Laila Ait Bihi Ouali, 2020. "Effects of signalling tax evasion on redistribution and voting preferences: Evidence from the Panama Papers," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-22, March.
    19. Kern, Andreas & Nosrati, Elias & Reinsberg, Bernhard & Sevinc, Dilek, 2023. "Crash for cash: Offshore financial destinations and IMF programs," European Journal of Political Economy, Elsevier, vol. 78(C).
    20. Wen Zhang & Qiang Wang & Jian Li & Zhenzhong Ma & Gokul Bhandari & Rui Peng, 2023. "What makes deceptive online reviews? A linguistic analysis perspective," Palgrave Communications, Palgrave Macmillan, vol. 10(1), pages 1-14, December.

    More about this item

    Keywords

    Explainable AI; Text analytics; Language models; BERT; Linguistic rules; Online consumer reviews;
    All these keywords.

    JEL classification:

    • C80 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:elmark:v:32:y:2022:i:4:d:10.1007_s12525-022-00612-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.