IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v12y2020i11p197-d445231.html

An Organized Repository of Ethereum Smart Contracts’ Source Codes and Metrics

Author

Listed:
  • Giuseppe Antonio Pierro

    (Inria Lille-Nord Europe Centre, 59650 Villeneuve d’Ascq, France)

  • Roberto Tonelli

    (Department of Mathematics and Computer Science, University of Cagliari, 09124 Cagliari, Italy)

  • Michele Marchesi

    (Department of Mathematics and Computer Science, University of Cagliari, 09124 Cagliari, Italy)

Abstract

Many empirical software engineering studies show that there is a need for repositories where source code is acquired, filtered and classified. During the last few years, Ethereum block explorer services have emerged as a popular way to explore and search Ethereum blockchain data such as transactions, addresses, tokens, smart contracts' source code, prices and other activities taking place on the Ethereum blockchain. Despite the availability of these services, retrieving specific information useful to empirical software engineering studies, such as the study of smart contracts' software metrics, may require many subtasks: searching for specific transactions in a block, parsing files in HTML format, and filtering the smart contracts to remove duplicated code or unused smart contracts. In this paper, we address this problem by creating Smart Corpus, a corpus of smart contracts in an organized, reasoned and up-to-date repository where Solidity source code and other metadata about Ethereum smart contracts can be retrieved easily and systematically. We present Smart Corpus's design and its initial implementation, and we show how the data set of smart contracts' source code in a variety of programming languages can be queried and processed to obtain useful information on smart contracts and their software metrics. Smart Corpus aims to create a smart-contract repository where smart-contract data (source code, application binary interface (ABI) and byte code) are freely and immediately available and are classified according to the main software metrics identified in the scientific literature. The smart contracts' source code has been validated by EtherScan, and each contract comes with its own associated software metrics as computed by the freely available software PASO. Moreover, Smart Corpus can easily be extended as the number of new smart contracts grows day by day.

Suggested Citation

  • Giuseppe Antonio Pierro & Roberto Tonelli & Michele Marchesi, 2020. "An Organized Repository of Ethereum Smart Contracts’ Source Codes and Metrics," Future Internet, MDPI, vol. 12(11), pages 1-15, November.
  • Handle: RePEc:gam:jftint:v:12:y:2020:i:11:p:197-:d:445231

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/12/11/197/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/12/11/197/
    Download Restriction: no

    References listed on IDEAS

    1. Simona Ibba & Andrea Pinna & Maria Ilaria Lunesu & Michele Marchesi & Roberto Tonelli, 2018. "Initial Coin Offerings and Agile Practices," Future Internet, MDPI, vol. 10(11), pages 1-21, October.
    2. Miguel Diogo & Bruno Cabral & Jorge Bernardino, 2019. "Consistency Models of NoSQL Databases," Future Internet, MDPI, vol. 11(2), pages 1-19, February.

    Citations



    Cited by:

    1. Francesco Buccafurri & Vincenzo De Angelis & Sara Lazzaro, 2022. "A Blockchain-Based Framework to Enhance Anonymous Services with Accountability Guarantees," Future Internet, MDPI, vol. 14(8), pages 1-20, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Andrea Pinna & Maria Ilaria Lunesu & Stefano Orrù & Roberto Tonelli, 2023. "Investigation on Self-Admitted Technical Debt in Open-Source Blockchain Projects," Future Internet, MDPI, vol. 15(7), pages 1-20, June.
    2. Dénes László Fekete & Attila Kiss, 2021. "A Survey of Ledger Technology-Based Databases," Future Internet, MDPI, vol. 13(8), pages 1-22, July.
    3. Giannis-Panagiotis Botilias & Spiridoula V. Margariti & Jeries Besarat & Dimitrios Salmas & George Pachoulas & Chrysostomos Stylios & Dimitris Skalkos, 2023. "Designing and Developing a Meat Traceability System: A Case Study for the Greek Meat Industry," Sustainability, MDPI, vol. 15(16), pages 1-18, August.
    4. Bernhard Standl & Nadine Schlomske-Bodenstein, 2021. "A Pattern Mining Method for Teaching Practices," Future Internet, MDPI, vol. 13(5), pages 1-14, April.
