IDEAS home Printed from https://ideas.repec.org/p/pra/mprapa/118435.html
   My bibliography  Save this paper

Detecting Pump-and-Dumps with Crypto-Assets: Dealing with Imbalanced Datasets and Insiders’ Anticipated Purchases

Author

Listed:
  • Fantazzini, Dean
  • Xiao, Yufeng

Abstract

Detecting pump-and-dump schemes involving cryptoassets with high-frequency data is challenging due to imbalanced datasets and the early occurrence of unusual trading volumes. To address these issues, we propose constructing synthetic balanced datasets using resampling methods and flagging a pump-and-dump from the moment of public announcement up to 60 min beforehand. We validated our proposals using data from Pumpolymp and the CryptoCurrency eXchange Trading Library to identify 351 pump signals relative to the Binance crypto exchange in 2021 and 2022. We found that the most effective approach was using the original imbalanced dataset with pump-and-dumps flagged 60 min in advance, together with a random forest model with data segmented into 30-s chunks and regressors computed with a moving window of 1 h. Our analysis revealed that a better balance between sensitivity and specificity could be achieved by simply selecting an appropriate probability threshold, such as setting the threshold close to the observed prevalence in the original dataset. Resampling methods were useful in some cases, but threshold-independent measures were not affected. Moreover, detecting pump-and-dumps in real-time involves high-dimensional data, and the use of resampling methods to build synthetic datasets can be time-consuming, making them less practical.

Suggested Citation

  • Fantazzini, Dean & Xiao, Yufeng, 2023. "Detecting Pump-and-Dumps with Crypto-Assets: Dealing with Imbalanced Datasets and Insiders’ Anticipated Purchases," MPRA Paper 118435, University Library of Munich, Germany.
  • Handle: RePEc:pra:mprapa:118435
    as

    Download full text from publisher

    File URL: https://mpra.ub.uni-muenchen.de/118435/1/Free_format_exchanges_REVISED_Repec.pdf
    File Function: original version
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Freeman, Elizabeth A. & Moisen, Gretchen G., 2008. "A comparison of the performance of threshold criteria for binary classification in terms of predicted prevalence and kappa," Ecological Modelling, Elsevier, vol. 217(1), pages 48-58.
    2. Anirudh Dhawan & Tālis J Putniņš, 2023. "A New Wolf in Town? Pump-and-Dump Manipulation in Cryptocurrency Markets," Review of Finance, European Finance Association, vol. 27(3), pages 935-975.
    3. Ouyang, Liangyi & Cao, Bolong, 2020. "Selective pump-and-dump: The manipulation of their top holdings by Chinese mutual funds around quarter-ends," Emerging Markets Review, Elsevier, vol. 44(C).
    4. Rosa A. Schiavo & David J. Hand, 2000. "Ten More Years of Error Rate Research," International Statistical Review, International Statistical Institute, vol. 68(3), pages 295-310, December.
    5. King, Gary & Zeng, Langche, 2001. "Logistic Regression in Rare Events Data," Political Analysis, Cambridge University Press, vol. 9(2), pages 137-163, January.
    6. Withanawasam, R.M. & Whigham, P.A. & Crack, T.F., 2013. "Characterising trader manipulation in a limit-order driven market," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 93(C), pages 43-52.
    7. Massimo La Morgia & Alessandro Mei & Francesco Sassi & Julinda Stefa, 2020. "Pump and Dumps in the Bitcoin Era: Real Time Detection of Cryptocurrency Market Manipulations," Papers 2005.06610, arXiv.org, revised Sep 2024.
    8. López-Ratón, Mónica & Rodríguez-Álvarez, María Xosé & Cadarso-Suárez, Carmen & Gude-Sampedro, Francisco, 2014. "OptimalCutpoints: An R Package for Selecting Optimal Cutpoints in Diagnostic Tests," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 61(i08).
    9. Gandal, Neil & Hamrick, JT & Rouhi, Farhang & Mukherjee, Arghya & Feder, Amir & Moore, Tyler & Vasek, Marie, 2018. "The Economics of Cryptocurrency Pump and Dump Schemes," CEPR Discussion Papers 13404, C.E.P.R. Discussion Papers.
    10. Jiahua Xu & Benjamin Livshits, 2018. "The Anatomy of a Cryptocurrency Pump-and-Dump Scheme," Papers 1811.10109, arXiv.org, revised Aug 2019.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sihao Hu & Zhen Zhang & Shengliang Lu & Bingsheng He & Zhao Li, 2022. "Sequence-Based Target Coin Prediction for Cryptocurrency Pump-and-Dump," Papers 2204.12929, arXiv.org, revised Apr 2023.
    2. Dun Li & Dezhi Han & Zibin Zheng & Tien-Hsiung Weng & Kuan-Ching Li & Ming Li & Shaokang Cai, 2024. "Does Short-and-Distort Scheme Really Exist? A Bitcoin Futures Audit Scheme through BIRCH & BPNN Approach," Computational Economics, Springer;Society for Computational Economics, vol. 63(4), pages 1649-1671, April.
    3. Mohammad Javad Rajaei & Qusay H. Mahmoud, 2023. "A Survey on Pump and Dump Detection in the Cryptocurrency Market Using Machine Learning," Future Internet, MDPI, vol. 15(8), pages 1-17, August.
    4. Kaihua Qin & Liyi Zhou & Yaroslav Afonin & Ludovico Lazzaretti & Arthur Gervais, 2021. "CeFi vs. DeFi -- Comparing Centralized to Decentralized Finance," Papers 2106.08157, arXiv.org, revised Jun 2021.
    5. Taro Tsuchiya, 2021. "Profitability of cryptocurrency Pump and Dump schemes," Digital Finance, Springer, vol. 3(2), pages 149-167, June.
    6. David Ardia & Keven Bluteau, 2023. "The Role of Twitter in Cryptocurrency Pump-and-Dumps," Papers 2306.02148, arXiv.org.
    7. F. Gauthier & D. Germain & B. Hétu, 2017. "Logistic models as a forecasting tool for snow avalanches in a cold maritime climate: northern Gaspésie, Québec, Canada," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 89(1), pages 201-232, October.
    8. Douglas Cumming & Lars Hornuf & Moein Karami & Denis Schweizer, 2023. "Disentangling Crowdfunding from Fraudfunding," Journal of Business Ethics, Springer, vol. 182(4), pages 1103-1128, February.
    9. Eunae Yoo & Elliot Rabinovich & Bin Gu, 2020. "The Growth of Follower Networks on Social Media Platforms for Humanitarian Operations," Production and Operations Management, Production and Operations Management Society, vol. 29(12), pages 2696-2715, December.
    10. Lo Turco, Alessia & Maggioni, Daniela, 2018. "Effects of Islamic religiosity on bilateral trust in trade: The case of Turkish exports," Journal of Comparative Economics, Elsevier, vol. 46(4), pages 947-965.
    11. Blackman, Allen & Guerrero, Santiago, 2012. "What drives voluntary eco-certification in Mexico?," Journal of Comparative Economics, Elsevier, vol. 40(2), pages 256-268.
    12. Alessandra Iannamorelli & Stefano Nobili & Antonio Scalia & Luana Zaccaria, 2024. "Asymmetric Information and Corporate Lending: Evidence from SME Bond Markets," Review of Finance, European Finance Association, vol. 28(1), pages 163-201.
    13. Václavík, Tomáš & Meentemeyer, Ross K., 2009. "Invasive species distribution modeling (iSDM): Are absence data and dispersal constraints needed to predict actual distributions?," Ecological Modelling, Elsevier, vol. 220(23), pages 3248-3258.
    14. Mehrez Ben Slama & Dhafer Saidane & Hassouna Fedhila, 2012. "How to identify targets in the M&A banking operations? Case of cross-border strategies in Europe by line of activity," Review of Quantitative Finance and Accounting, Springer, vol. 38(2), pages 209-240, February.
    15. Jeff Strnad, 2024. "Economic DAO Governance: A Contestable Control Approach," Papers 2403.16980, arXiv.org, revised Jun 2024.
    16. Lorenzo Cassi & Anne Plunket, 2014. "Proximity, network formation and inventive performance: in search of the proximity paradox," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 53(2), pages 395-422, September.
    17. Xinfu Xing & Chenglong Wu & Jinhui Li & Xueyou Li & Limin Zhang & Rongjie He, 2021. "Susceptibility assessment for rainfall-induced landslides using a revised logistic regression method," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 106(1), pages 97-117, March.
    18. Hwang, Seokyoun & Sarath, Bharat & Han, Seung-youb, 2022. "Auditor independence: The effect of auditors’ quality control efforts and corporate governance," Journal of International Accounting, Auditing and Taxation, Elsevier, vol. 47(C).
    19. Lahiri, Kajal & Yang, Liu, 2013. "Forecasting Binary Outcomes," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1025-1106, Elsevier.
    20. Eling, Martin & Jia, Ruo, 2018. "Business failure, efficiency, and volatility: Evidence from the European insurance industry," International Review of Financial Analysis, Elsevier, vol. 59(C), pages 58-76.

    More about this item

    Keywords

    pump-and-dump; crypto-assets; minority class; class imbalance; machine learning; random forests;
    All these keywords.

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C25 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Discrete Regression and Qualitative Choice Models; Discrete Regressors; Proportions; Probabilities
    • C35 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Discrete Regression and Qualitative Choice Models; Discrete Regressors; Proportions
    • C38 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Classification Methdos; Cluster Analysis; Principal Components; Factor Analysis
    • C51 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Construction and Estimation
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • C58 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Financial Econometrics
    • G17 - Financial Economics - - General Financial Markets - - - Financial Forecasting and Simulation
    • G32 - Financial Economics - - Corporate Finance and Governance - - - Financing Policy; Financial Risk and Risk Management; Capital and Ownership Structure; Value of Firms; Goodwill
    • K42 - Law and Economics - - Legal Procedure, the Legal System, and Illegal Behavior - - - Illegal Behavior and the Enforcement of Law

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pra:mprapa:118435. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Joachim Winter (email available below). General contact details of provider: https://edirc.repec.org/data/vfmunde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.