
Volume-Centred Range Bars: Novel Interpretable Representation of Financial Markets Designed for Machine Learning Applications

Author

Listed:
  • Artur Sokolovsky
  • Luca Arnaboldi
  • Jaume Bacardit
  • Thomas Gross

Abstract

Financial markets are a source of non-stationary multidimensional time series that have been drawing attention for decades. Each financial instrument has its own time-varying properties, which makes its analysis a complex task. Hence, a better understanding of markets and the development of more informative, generalisable market representations are essential for successful operation in financial markets, including risk assessment, diversification, trading, and order execution. In this study, we propose a volume-price-based market representation that makes financial time series more suitable for machine learning pipelines. We use a statistical approach to evaluate the representation. Through the research questions, we investigate i) whether the proposed representation allows more efficient design of machine learning models; ii) whether the proposed representation leads to increased performance over the price levels market pattern; iii) whether the proposed representation performs better on liquid markets; and iv) whether SHAP feature interactions are reliable enough to be used in the considered setting. Our analysis shows that the proposed volume-based method allows successful classification of financial time series patterns and leads to better classification performance than the price levels-based method, excelling specifically on more liquid financial instruments. Finally, we propose an approach for obtaining feature interactions directly from tree-based models and compare the outcomes to those of the SHAP method. The two methods produce significantly similar results; hence we claim that SHAP feature interactions are reliable enough to be used in the setting of financial markets.

Suggested Citation

  • Artur Sokolovsky & Luca Arnaboldi & Jaume Bacardit & Thomas Gross, 2021. "Volume-Centred Range Bars: Novel Interpretable Representation of Financial Markets Designed for Machine Learning Applications," Papers 2103.12419, arXiv.org, revised May 2022.
  • Handle: RePEc:arx:papers:2103.12419

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2103.12419
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Takaya Saito & Marc Rehmsmeier, 2015. "The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-21, March.
    2. Dixon, Matthew & Klabjan, Diego & Bang, Jin Hoon, 2017. "Classification-based financial markets prediction using deep neural networks," Algorithmic Finance, IOS Press, vol. 6(3-4), pages 67-77.
    3. Kirsten Martin, 2019. "Ethical Implications and Accountability of Algorithms," Journal of Business Ethics, Springer, vol. 160(4), pages 835-850, December.
    4. Artur Sokolovsky & Luca Arnaboldi, 2020. "A Generic Methodology for the Statistically Uniform & Comparable Evaluation of Automated Trading Platform Components," Papers 2009.09993, arXiv.org, revised Jun 2022.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Niyi Ogunbiyi & Artie Basukoski & Thierry Chaussalet, 2021. "An Exploration of Ethical Decision Making with Intelligence Augmentation," Social Sciences, MDPI, vol. 10(2), pages 1-14, February.
    2. Peng Zhu & Yuante Li & Yifan Hu & Qinyuan Liu & Dawei Cheng & Yuqi Liang, 2024. "LSR-IGRU: Stock Trend Prediction Based on Long Short-Term Relationships and Improved GRU," Papers 2409.08282, arXiv.org, revised Sep 2024.
    3. Christopher J Greenwood & George J Youssef & Primrose Letcher & Jacqui A Macdonald & Lauryn J Hagg & Ann Sanson & Jenn Mcintosh & Delyse M Hutchinson & John W Toumbourou & Matthew Fuller-Tyszkiewicz &, 2020. "A comparison of penalised regression methods for informing the selection of predictive markers," PLOS ONE, Public Library of Science, vol. 15(11), pages 1-14, November.
    4. Stephanie Kelley, 2022. "Employee Perceptions of the Effective Adoption of AI Principles," Journal of Business Ethics, Springer, vol. 178(4), pages 871-893, July.
    5. Jie-Huei Wang & Cheng-Yu Liu & You-Ruei Min & Zih-Han Wu & Po-Lin Hou, 2024. "Cancer Diagnosis by Gene-Environment Interactions via Combination of SMOTE-Tomek and Overlapped Group Screening Approaches with Application to Imbalanced TCGA Clinical and Genomic Data," Mathematics, MDPI, vol. 12(14), pages 1-24, July.
    6. Le, Hong Hanh & Viviani, Jean-Laurent, 2018. "Predicting bank failure: An improvement by implementing a machine-learning approach to classical financial ratios," Research in International Business and Finance, Elsevier, vol. 44(C), pages 16-25.
    7. João Chang Junior & Fábio Binuesa & Luiz Fernando Caneo & Aida Luiza Ribeiro Turquetto & Elisandra Cristina Trevisan Calvo Arita & Aline Cristina Barbosa & Alfredo Manoel da Silva Fernandes & Evelinda, 2020. "Improving preoperative risk-of-death prediction in surgery congenital heart defects using artificial intelligence model: A pilot study," PLOS ONE, Public Library of Science, vol. 15(9), pages 1-21, September.
    8. Maude Lavanchy & Patrick Reichert & Jayanth Narayanan & Krishna Savani, 2023. "Applicants’ Fairness Perceptions of Algorithm-Driven Hiring Procedures," Journal of Business Ethics, Springer, vol. 188(1), pages 125-150, November.
    9. Mathieu Chevrier & Vincent Teixeira, 2024. "Algorithm Delegation and Responsibility: Shifting Blame to the Programmer?," GREDEG Working Papers 2024-04, Groupe de REcherche en Droit, Economie, Gestion (GREDEG CNRS), Université Côte d'Azur, France, revised Sep 2024.
    10. Arthur De Sá Ferreira & Ney Meziat-Filho & Ana Paula Antunes Ferreira, 2021. "Double threshold receiver operating characteristic plot for three-modal continuous predictors," Computational Statistics, Springer, vol. 36(3), pages 2231-2245, September.
    11. Fan, Xudong & Wang, Xiaowei & Zhang, Xijin & ASCE Xiong (Bill) Yu, P.E.F., 2022. "Machine learning based water pipe failure prediction: The effects of engineering, geology, climate and socio-economic factors," Reliability Engineering and System Safety, Elsevier, vol. 219(C).
    12. Yang Qiao & Yiping Xia & Xiang Li & Zheng Li & Yan Ge, 2023. "Higher-order Graph Attention Network for Stock Selection with Joint Analysis," Papers 2306.15526, arXiv.org.
    13. Zhang, Han, 2021. "How Using Machine Learning Classification as a Variable in Regression Leads to Attenuation Bias and What to Do About It," SocArXiv 453jk, Center for Open Science.
    14. Zineb Lanbouri & Saaid Achchab, 2019. "A new approach for Trading based on Long-Short Term memory technique [Une nouvelle approche pour le Trading basée sur la technique Long-Short Term Memory]," Post-Print hal-02396905, HAL.
    15. Şirin Özlem & Omer Faruk Tan, 2022. "Predicting cash holdings using supervised machine learning algorithms," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-19, December.
    16. Etye Steinberg, 2022. "Run for Your Life: The Ethics of Behavioral Tracking in Insurance," Journal of Business Ethics, Springer, vol. 179(3), pages 665-682, September.
    17. Bonsón, Enrique & Lavorato, Domenica & Lamboglia, Rita & Mancini, Daniela, 2021. "Artificial intelligence activities and ethical approaches in leading listed companies in the European Union," International Journal of Accounting Information Systems, Elsevier, vol. 43(C).
    18. Zihao Zhang & Stefan Zohren & Stephen Roberts, 2018. "DeepLOB: Deep Convolutional Neural Networks for Limit Order Books," Papers 1808.03668, arXiv.org, revised Jan 2020.
    19. Masabho P Milali & Samson S Kiware & Nicodem J Govella & Fredros Okumu & Naveen Bansal & Serdar Bozdag & Jacques D Charlwood & Marta F Maia & Sheila B Ogoma & Floyd E Dowell & George F Corliss & Maggy, 2020. "An autoencoder and artificial neural network-based method to estimate parity status of wild mosquitoes from near-infrared spectra," PLOS ONE, Public Library of Science, vol. 15(6), pages 1-16, June.
    20. Daniel R Jeske, 2018. "Metrics Used When Evaluating the Performance of Statistical Classifiers," Biostatistics and Biometrics Open Access Journal, Juniper Publishers Inc., vol. 8(1), pages 7-9, August.

