Printed from https://ideas.repec.org/a/gam/jftint/v15y2023i12p397-d1297089.html

Methodological Approach for Identifying Websites with Infringing Content via Text Transformers and Dense Neural Networks

Authors

Listed:
  • Aldo Hernandez-Suarez

    (Instituto Politecnico Nacional, ESIME Culhuacan, Mexico City 04440, Mexico)

  • Gabriel Sanchez-Perez

    (Instituto Politecnico Nacional, ESIME Culhuacan, Mexico City 04440, Mexico)

  • Linda Karina Toscano-Medina

    (Instituto Politecnico Nacional, ESIME Culhuacan, Mexico City 04440, Mexico)

  • Hector Manuel Perez-Meana

    (Instituto Politecnico Nacional, ESIME Culhuacan, Mexico City 04440, Mexico)

  • Jose Portillo-Portillo

    (Instituto Politecnico Nacional, ESIME Culhuacan, Mexico City 04440, Mexico)

  • Jesus Olivares-Mercado

    (Instituto Politecnico Nacional, ESIME Culhuacan, Mexico City 04440, Mexico)

Abstract

The rapid evolution of the Internet of Everything (IoE) has significantly enhanced global connectivity and multimedia content sharing, simultaneously escalating the unauthorized distribution of multimedia content, posing risks to intellectual property rights. In 2022 alone, about 130 billion accesses to potentially non-compliant websites were recorded, underscoring the challenges for industries reliant on copyright-protected assets. Amidst prevailing uncertainties and the need for technical and AI-integrated solutions, this study introduces two pivotal contributions. First, it establishes a novel taxonomy aimed at safeguarding and identifying IoE-based content infringements. Second, it proposes an innovative architecture combining IoE components with automated sensors to compile a dataset reflective of potential copyright breaches. This dataset is analyzed using a Bidirectional Encoder Representations from Transformers-based advanced Natural Language Processing (NLP) algorithm, further fine-tuned by a dense neural network (DNN), achieving a remarkable 98.71% accuracy in pinpointing websites that violate copyright.
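The pipeline the abstract describes (a text representation passed to a dense classifier) can be illustrated with a minimal sketch. This is not the authors' implementation. The `embed` function is a hypothetical stand-in that uses a hashed bag-of-words instead of a BERT encoder, and the page texts, labels, layer sizes, and learning rate are all invented for illustration.

```python
import math
import random
import zlib


def embed(text, dim=16):
    """Hypothetical stand-in for a transformer encoder (NOT BERT):
    hashed bag-of-words, L2-normalised to a fixed-size vector."""
    vec = [0.0] * dim
    for token in text.lower().split():
        vec[zlib.crc32(token.encode("utf-8")) % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


class DenseClassifier:
    """One tanh hidden layer followed by a single sigmoid output unit."""

    def __init__(self, dim, hidden=8, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(dim)]
                   for _ in range(hidden)]
        self.b1 = [0.0] * hidden
        self.w2 = [rng.uniform(-0.5, 0.5) for _ in range(hidden)]
        self.b2 = 0.0

    def forward(self, x):
        h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
             for row, b in zip(self.w1, self.b1)]
        z = sum(w * hj for w, hj in zip(self.w2, h)) + self.b2
        return h, 1.0 / (1.0 + math.exp(-z))

    def predict_proba(self, x):
        return self.forward(x)[1]

    def train_step(self, x, y, lr=0.5):
        """One stochastic gradient step on binary cross-entropy."""
        h, p = self.forward(x)
        dz = p - y  # gradient of the loss w.r.t. the output logit
        for j, hj in enumerate(h):
            dh = dz * self.w2[j] * (1.0 - hj * hj)  # back through tanh
            self.w2[j] -= lr * dz * hj
            for i, xi in enumerate(x):
                self.w1[j][i] -= lr * dh * xi
            self.b1[j] -= lr * dh
        self.b2 -= lr * dz


def mean_bce(model, xs, ys):
    """Mean binary cross-entropy over a labelled set."""
    eps = 1e-12
    total = 0.0
    for x, y in zip(xs, ys):
        p = model.predict_proba(x)
        total -= y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps)
    return total / len(xs)


def accuracy(model, xs, ys):
    """Fraction of samples whose thresholded prediction matches the label."""
    hits = sum((model.predict_proba(x) >= 0.5) == bool(y)
               for x, y in zip(xs, ys))
    return hits / len(xs)


# Invented toy data: 1 = potentially infringing page, 0 = compliant page.
pages = [
    ("watch full movie free stream no signup", 1),
    ("download cracked album torrent mirror", 1),
    ("official store buy licensed soundtrack", 0),
    ("publisher catalogue subscribe to newsletter", 0),
]
xs = [embed(text) for text, _ in pages]
ys = [label for _, label in pages]

model = DenseClassifier(dim=16)
loss_before = mean_bce(model, xs, ys)
for _ in range(300):
    for x, y in zip(xs, ys):
        model.train_step(x, y)
loss_after = mean_bce(model, xs, ys)
train_accuracy = accuracy(model, xs, ys)
```

In a realistic setting, `embed` would be replaced by the pooled output of a pretrained transformer encoder, and accuracy would be measured on held-out pages rather than on the training set.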

Suggested Citation

  • Aldo Hernandez-Suarez & Gabriel Sanchez-Perez & Linda Karina Toscano-Medina & Hector Manuel Perez-Meana & Jose Portillo-Portillo & Jesus Olivares-Mercado, 2023. "Methodological Approach for Identifying Websites with Infringing Content via Text Transformers and Dense Neural Networks," Future Internet, MDPI, vol. 15(12), pages 1-31, December.
  • Handle: RePEc:gam:jftint:v:15:y:2023:i:12:p:397-:d:1297089

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/15/12/397/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/15/12/397/
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christian Peukert & Margaritha Windisch, 2023. "The Economics of Copyright in the Digital Age," CESifo Working Paper Series 10687, CESifo.
    2. Christophe Bellégo & Romain De Nijs, 2020. "The Unintended Consequences of Antipiracy Laws on Markets with Asymmetric Piracy: The Case of the French Movie Industry," Information Systems Research, INFORMS, vol. 31(4), pages 1064-1086, December.
    3. Marc Ivaldi & Ambre Nicolle & Frank Verboven & Jiekai Zhang, 2024. "Displacement and complementarity in the recorded music industry: evidence from France," Journal of Cultural Economics, Springer;The Association for Cultural Economics International, vol. 48(1), pages 43-94, March.
    4. Hong Luo & Julie Holland Mortimer, 2017. "Copyright Enforcement: Evidence from Two Field Experiments," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 26(2), pages 499-528, June.
    5. Kanazawa, Kyogo & Kawaguchi, Kohei, 2022. "Displacement effects of public libraries," Journal of the Japanese and International Economies, Elsevier, vol. 66(C).
    6. Peng, Shuxia & Li, Bo & Wu, Shuang, 2023. "Presence of piracy and legal protection: Decisions in the digital goods market under different contracts," European Journal of Operational Research, Elsevier, vol. 309(2), pages 578-596.
    7. Abbas, Jaffar & Balsalobre-Lorente, Daniel & Amjid, Muhammad Asif & Al-Sulaiti, Khalid & Al-Sulaiti, Ibrahim & Aldereai, Osama, 2024. "Financial innovation and digitalization promote business growth: The interplay of green technology innovation, product market competition and firm performance," Innovation and Green Development, Elsevier, vol. 3(1).
    8. Wojciech Hardy, 2022. "Brace yourselves, pirates are coming! the effects of Game of Thrones leak on TV viewership," Journal of Cultural Economics, Springer;The Association for Cultural Economics International, vol. 46(1), pages 27-55, March.
    9. Reis, Filipa & Godinho de Matos, Miguel & Ferreira, Pedro, 2024. "Controlling digital piracy via domain name system blocks: A natural experiment," Journal of Economic Behavior & Organization, Elsevier, vol. 218(C), pages 89-103.
    10. Jinglei Huang & Danxia Xie & Zhihao Xu, 2024. "Sequential innovation and contribution distribution: measurement from game live-streaming industry," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-10, December.
    11. Wojciech Hardy & Michal Krawczyk & Joanna Tyrowicz, 2014. "Internet piracy and book sales: A field experiment," Artefactual Field Experiments 00696, The Field Experiments Website.
    12. Peukert, Christian, 2024. "Copyright levies and cloud storage: Ex-ante policy evaluation with a field experiment," Research Policy, Elsevier, vol. 53(2).
    13. Bradley, Wendy A. & Kolev, Julian, 2023. "How does digital piracy affect innovation? Evidence from software firms," Research Policy, Elsevier, vol. 52(3).
    14. Tyrowicz, Joanna & Krawczyk, Michal & Hardy, Wojciech, 2020. "Friends or foes? A meta-analysis of the relationship between “online piracy” and the sales of cultural goods," Information Economics and Policy, Elsevier, vol. 53(C).
    15. Batikas, Michail & Claussen, Jörg & Peukert, Christian, 2017. "Follow The Money: Piracy and Online Advertising," 28th European Regional ITS Conference, Passau 2017 169448, International Telecommunications Society (ITS).
    16. Wojciech Hardy, 2018. "Pre-release leaks as one-time incentives for switching to unauthorised sources of cultural content," IBS Working Papers 03/2018, Instytut Badan Strukturalnych.
    17. Christian Peukert, 2019. "The next wave of digital technological change and the cultural industries," Journal of Cultural Economics, Springer;The Association for Cultural Economics International, vol. 43(2), pages 189-210, June.
    18. Tobias Kretschmer & Christian Peukert, 2020. "Video Killed the Radio Star? Online Music Videos and Recorded Music Sales," Information Systems Research, INFORMS, vol. 31(3), pages 776-800, September.
    19. Shaengchart, Yarnaphat & Kraiwanit, Tanpat & Butcharoen, Smich, 2023. "Factors influencing the effects of the Starlink Satellite Project on the internet service provider market in Thailand," Technology in Society, Elsevier, vol. 74(C).
    20. Bogdan Genchev & Julie Holland Mortimer, 2016. "Empirical Evidence on Conditional Pricing Practices," NBER Working Papers 22313, National Bureau of Economic Research, Inc.
