IDEAS home Printed from https://ideas.repec.org/p/hal/journl/hal-04208864.html
   My bibliography  Save this paper

Fingerprinting Bitcoin entities using money flow representation learning

Author

Listed:
  • Natkamon Tovanich

    (CREST - Centre de Recherche en Économie et Statistique - ENSAI - Ecole Nationale de la Statistique et de l'Analyse de l'Information [Bruz] - X - École polytechnique - IP Paris - Institut Polytechnique de Paris - ENSAE Paris - École Nationale de la Statistique et de l'Administration Économique - CNRS - Centre National de la Recherche Scientifique, X - École polytechnique - IP Paris - Institut Polytechnique de Paris)

  • Rémy Cazabet

    (DM2L - Data Mining and Machine Learning - LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information - UL2 - Université Lumière - Lyon 2 - ECL - École Centrale de Lyon - Université de Lyon - UCBL - Université Claude Bernard Lyon 1 - Université de Lyon - INSA Lyon - Institut National des Sciences Appliquées de Lyon - Université de Lyon - INSA - Institut National des Sciences Appliquées - CNRS - Centre National de la Recherche Scientifique, LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information - UL2 - Université Lumière - Lyon 2 - ECL - École Centrale de Lyon - Université de Lyon - UCBL - Université Claude Bernard Lyon 1 - Université de Lyon - INSA Lyon - Institut National des Sciences Appliquées de Lyon - Université de Lyon - INSA - Institut National des Sciences Appliquées - CNRS - Centre National de la Recherche Scientifique, UCBL - Université Claude Bernard Lyon 1 - Université de Lyon, IXXI - Institut Rhône-Alpin des systèmes complexes - ENS de Lyon - École normale supérieure de Lyon - UL2 - Université Lumière - Lyon 2 - UJML - Université Jean Moulin - Lyon 3 - Université de Lyon - UCBL - Université Claude Bernard Lyon 1 - Université de Lyon - INSA Lyon - Institut National des Sciences Appliquées de Lyon - Université de Lyon - INSA - Institut National des Sciences Appliquées - Inria - Institut National de Recherche en Informatique et en Automatique - CNRS - Centre National de la Recherche Scientifique - UGA - Université Grenoble Alpes)

Abstract

Deanonymization is one of the major research challenges in the Bitcoin blockchain, as entities are pseudonymous and cannot be identified from the on-chain data. Various approaches exist to identify multiple addresses of the same entity, i.e., address clustering. But it is known that these approaches tend to find several clusters for the same actor. In this work, we propose to assign a fingerprint to entities based on the dynamic graph of the taint flow of money originating from them, with the idea that we could identify multiple clusters of addresses belonging to the same entity as having similar fingerprints. We experiment with different configurations to generate substructure patterns from taint flows before embedding them using representation learning models. To evaluate our method, we train classification models to identify entities from their fingerprints. Experiments show that our approach can accurately classify entities on three datasets. We compare different fingerprint strategies and show that including the temporality of transactions improves classification accuracy and that following the flow for too long impairs performance. Our work demonstrates that out-flow fingerprinting is a valid approach for recognizing multiple clusters of the same entity.

Suggested Citation

  • Natkamon Tovanich & Rémy Cazabet, 2023. "Fingerprinting Bitcoin entities using money flow representation learning," Post-Print hal-04208864, HAL.
  • Handle: RePEc:hal:journl:hal-04208864
    DOI: 10.1007/s41109-023-00591-2
    Note: View the original document on HAL open archive server: https://hal.science/hal-04208864
    as

    Download full text from publisher

    File URL: https://hal.science/hal-04208864/document
    Download Restriction: no

    File URL: https://libkey.io/10.1007/s41109-023-00591-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Keywords

    Bitcoin; Money flow; Taint analysis; Graph embedding;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-04208864. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.