
Automatic Financial Feature Construction

Author

Listed:
  • Jie Fang
  • Shutao Xia
  • Jianwu Lin
  • Yong Jiang

Abstract

In the task of automatic financial feature construction, the state-of-the-art technique uses reverse Polish expressions to represent features and then uses genetic programming (GP) to evolve them. In this paper, we propose a new framework based on neural networks, the alpha discovery neural network (ADNN). We make several contributions. First, we make full use of the neural network's overwhelming advantage in feature extraction to construct highly informative features. Second, we use domain knowledge to design the objective function, batch size, and sampling rules. Third, we use pre-training to replace the GP evolution process. According to the universal approximation theorem for neural networks, pre-training can conduct a more effective and explainable evolution process. Experiments show that ADNN produces markedly more diversified and more informative features than GP. In addition, ADNN can serve as a data augmentation algorithm: it further improves the performance of financial features constructed by GP.
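
As an illustration of the reverse Polish representation mentioned in the abstract, the following minimal sketch evaluates one such expression element-wise over price series. The operator set, the field names `open` and `close`, and the function `eval_rpn` are assumptions made for this example; they are not taken from the paper.

```python
from typing import Callable, Dict, List

Series = List[float]

# Hypothetical operator set, chosen for illustration only.
BINARY_OPS: Dict[str, Callable[[float, float], float]] = {
    "add": lambda a, b: a + b,
    "sub": lambda a, b: a - b,
    "mul": lambda a, b: a * b,
    # Protected division (returns 0.0 on a zero divisor), a common GP convention.
    "div": lambda a, b: a / b if b != 0 else 0.0,
}
UNARY_OPS: Dict[str, Callable[[float], float]] = {
    "neg": lambda a: -a,
    "abs": abs,
}


def eval_rpn(tokens: List[str], data: Dict[str, Series]) -> Series:
    """Evaluate a reverse Polish expression element-wise over equal-length series."""
    stack: List[Series] = []
    for tok in tokens:
        if tok in BINARY_OPS:
            if len(stack) < 2:
                raise ValueError("malformed expression")
            right = stack.pop()
            left = stack.pop()
            stack.append([BINARY_OPS[tok](a, b) for a, b in zip(left, right)])
        elif tok in UNARY_OPS:
            if not stack:
                raise ValueError("malformed expression")
            operand = stack.pop()
            stack.append([UNARY_OPS[tok](a) for a in operand])
        elif tok in data:
            stack.append(data[tok])
        else:
            raise ValueError(f"unknown token: {tok}")
    if len(stack) != 1:
        raise ValueError("malformed expression")
    return stack[0]


# Made-up sample data; the expression below encodes (close - open) / open.
data = {"open": [10.0, 20.0, 40.0], "close": [11.0, 19.0, 40.0]}
feature = eval_rpn(["close", "open", "sub", "open", "div"], data)
```

In a GP setting, the token list plays the role of the individual being mutated and recombined, which is why a flat postfix encoding is convenient: any edit that keeps the stack balanced still yields a valid expression.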

Suggested Citation

  • Jie Fang & Shutao Xia & Jianwu Lin & Yong Jiang, 2019. "Automatic Financial Feature Construction," Papers 1912.06236, arXiv.org, revised Oct 2020.
  • Handle: RePEc:arx:papers:1912.06236

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1912.06236
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Zura Kakushadze, 2016. "101 Formulaic Alphas," Papers 1601.00991, arXiv.org, revised Mar 2016.
    2. Changqing Cheng & Akkarapol Sa-Ngasoongsong & Omer Beyca & Trung Le & Hui Yang & Zhenyu (James) Kong & Satish T.S. Bukkapatnam, 2015. "Time series forecasting for nonlinear and non-stationary processes: a review and comparative study," IISE Transactions, Taylor & Francis Journals, vol. 47(10), pages 1053-1071, October.
    3. Gan, Lirong & Wang, Huamao & Yang, Zhaojun, 2020. "Machine learning solutions to challenges in finance: An application to the pricing of financial products," Technological Forecasting and Social Change, Elsevier, vol. 153(C).

    Citations



    Cited by:

    1. Gürdal Ertek & Lakshmi Kailas, 2021. "Analyzing a Decade of Wind Turbine Accident News with Topic Modeling," Sustainability, MDPI, vol. 13(22), pages 1-34, November.
    2. Jie Fang & Shutao Xia & Jianwu Lin & Zhikang Xia & Xiang Liu & Yong Jiang, 2019. "Alpha Discovery Neural Network based on Prior Knowledge," Papers 1912.11761, arXiv.org, revised Nov 2020.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ozdemir, Ali Can & Buluş, Kurtuluş & Zor, Kasım, 2022. "Medium- to long-term nickel price forecasting using LSTM and GRU networks," Resources Policy, Elsevier, vol. 78(C).
    2. Ariana Chang & Tian‐Shyug Lee & Hsiu‐Mei Lee, 2024. "Applying sustainable development goals in financial forecasting using machine learning techniques," Corporate Social Responsibility and Environmental Management, John Wiley & Sons, vol. 31(3), pages 2277-2289, May.
    3. Jie Fang & Jianwu Lin & Shutao Xia & Yong Jiang & Zhikang Xia & Xiang Liu, 2020. "Neural Network-based Automatic Factor Construction," Papers 2008.06225, arXiv.org, revised Oct 2020.
    4. Caterina De Lucia & Pasquale Pazienza & Mark Bartlett, 2020. "Does Good ESG Lead to Better Financial Performances by Firms? Machine Learning and Logistic Regression Models of Public Enterprises in Europe," Sustainability, MDPI, vol. 12(13), pages 1-29, July.
    5. Zura Kakushadze & Willie Yu, 2017. "Dead Alphas as Risk Factors," Papers 1709.06641, arXiv.org.
    6. Chi Chen & Li Zhao & Wei Cao & Jiang Bian & Chunxiao Xing, 2020. "Trimming the Sail: A Second-order Learning Paradigm for Stock Prediction," Papers 2002.06878, arXiv.org.
    7. Vaia I. Kontopoulou & Athanasios D. Panagopoulos & Ioannis Kakkos & George K. Matsopoulos, 2023. "A Review of ARIMA vs. Machine Learning Approaches for Time Series Forecasting in Data Driven Networks," Future Internet, MDPI, vol. 15(8), pages 1-31, July.
    8. Peiwei Cao & Xubiao He, 2024. "Machine Learning Solutions for Fast Real Estate Derivatives Pricing," Computational Economics, Springer;Society for Computational Economics, vol. 64(4), pages 2003-2032, October.
    9. Guo, Jingjun & Kang, Weiyi & Wang, Yubing, 2024. "Multi-perspective option price forecasting combining parametric and non-parametric pricing models with a new dynamic ensemble framework," Technological Forecasting and Social Change, Elsevier, vol. 204(C).
    10. Iraj Daizadeh, 2021. "Leveraging latent persistency in United States patent and trademark applications to gain insight into the evolution of an innovation-driven economy," Papers 2101.02588, arXiv.org, revised May 2021.
    11. Jun Lu & Joerg Osterrieder, 2022. "Feature Selection via the Intervened Interpolative Decomposition and its Application in Diversifying Quantitative Strategies," Papers 2209.14532, arXiv.org.
    12. Xiao Yang & Weiqing Liu & Dong Zhou & Jiang Bian & Tie-Yan Liu, 2020. "Qlib: An AI-oriented Quantitative Investment Platform," Papers 2009.11189, arXiv.org.
    13. Shuo Yu & Hongyan Xue & Xiang Ao & Feiyang Pan & Jia He & Dandan Tu & Qing He, 2023. "Generating Synergistic Formulaic Alpha Collections via Reinforcement Learning," Papers 2306.12964, arXiv.org.
    14. Huo, Da & Chaudhry, Hassan Rauf, 2021. "Using machine learning for evaluating global expansion location decisions: An analysis of Chinese manufacturing sector," Technological Forecasting and Social Change, Elsevier, vol. 163(C).
    15. Quechen Yang, 2024. "Blending Ensemble for Classification with Genetic-algorithm generated Alpha factors and Sentiments (GAS)," Papers 2411.03035, arXiv.org.
    16. Yongli Li & Tianchen Wang & Baiqing Sun & Chao Liu, 2022. "Detecting the lead–lag effect in stock markets: definition, patterns, and investment strategies," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-36, December.
    17. Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
    18. Chuqiao Zong & Chaojie Wang & Molei Qin & Lei Feng & Xinrun Wang & Bo An, 2024. "MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading," Papers 2406.14537, arXiv.org.
    19. Kristof Lommers & Ouns El Harzli & Jack Kim, 2021. "Confronting Machine Learning With Financial Research," Papers 2103.00366, arXiv.org, revised Mar 2021.
    20. George Kosgei Kiptum, 2022. "Relationship between Kenya’s economic growth and inflation," SN Business & Economics, Springer, vol. 2(12), pages 1-16, December.
