
Dimension reduction of open-high-low-close data in candlestick chart based on pseudo-PCA

Author

Listed:
  • Wenyang Huang
  • Huiwen Wang
  • Shanshan Wang

Abstract

Open-high-low-close (OHLC) data are the most common form of data in finance and the object of investigation in many kinds of technical analysis. As more features of OHLC data are collected, their useful information must be extracted in a comprehensible way that supports visualization and easy interpretation. The inherent constraints of OHLC data make this harder. This paper proposes a novel approach that characterizes the features of OHLC data in a dataset and then performs dimension reduction by integrating a feature information extraction method with principal component analysis. We refer to it as the pseudo-PCA method. Specifically, we first propose a new representation of OHLC data that removes the inherent constraints and simplifies further analysis. Moreover, there is a one-to-one correspondence between the original OHLC data and their feature-based representation, so results obtained on the feature-based data can be mapped back to the original OHLC data. Next, we develop the pseudo-PCA procedure for OHLC data, which effectively identifies important information and performs dimension reduction. Finally, the effectiveness and interpretability of the proposed method are investigated through finite simulations and spot data from China's agricultural product market.
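The pipeline outlined in the abstract (an unconstrained, invertible representation followed by PCA) can be illustrated with a minimal sketch. The abstract does not spell out the transformation, so the log/logit mapping below is an assumption chosen only because it is one-to-one and removes the constraints 0 < low <= open, close <= high. The function names, the synthetic data, and the choice of two components are likewise hypothetical, and ordinary PCA via SVD stands in for the paper's pseudo-PCA procedure.

```python
import numpy as np

def ohlc_to_features(ohlc):
    """Map (..., 4) arrays of (open, high, low, close) to 4 unconstrained values.

    ASSUMED transform (not taken from the abstract); needs low > 0 and
    low < open, close < high strictly.
    """
    o, h, l, c = np.moveaxis(np.asarray(ohlc, dtype=float), -1, 0)
    width = h - l
    lam_o = (o - l) / width          # relative position of open in [low, high]
    lam_c = (c - l) / width          # relative position of close in [low, high]
    return np.stack([np.log(l), np.log(width),
                     np.log(lam_o / (1 - lam_o)),
                     np.log(lam_c / (1 - lam_c))], axis=-1)

def features_to_ohlc(feats):
    """Inverse of ohlc_to_features; the output always satisfies the OHLC constraints."""
    y1, y2, y3, y4 = np.moveaxis(np.asarray(feats, dtype=float), -1, 0)
    l, width = np.exp(y1), np.exp(y2)
    lam_o = 1 / (1 + np.exp(-y3))
    lam_c = 1 / (1 + np.exp(-y4))
    return np.stack([l + lam_o * width, l + width, l, l + lam_c * width], axis=-1)

# Synthetic data: 200 periods, 3 hypothetical assets.
rng = np.random.default_rng(0)
n, p, k = 200, 3, 2
low = np.exp(rng.normal(3.0, 0.1, size=(n, p)))
width = np.exp(rng.normal(-1.0, 0.3, size=(n, p)))
lam_open = rng.uniform(0.05, 0.95, size=(n, p))
lam_close = rng.uniform(0.05, 0.95, size=(n, p))
ohlc = np.stack([low + lam_open * width, low + width,
                 low, low + lam_close * width], axis=-1)   # shape (n, p, 4)

# Ordinary PCA on the stacked feature matrix (n rows, 4*p columns).
X = ohlc_to_features(ohlc).reshape(n, 4 * p)
mean = X.mean(axis=0)
U, S, Vt = np.linalg.svd(X - mean, full_matrices=False)
scores = (X - mean) @ Vt[:k].T                             # reduced representation

# Rank-k reconstruction mapped back to valid OHLC values.
X_hat = scores @ Vt[:k] + mean
ohlc_hat = features_to_ohlc(X_hat.reshape(n, p, 4))
```

Because the inverse map builds high as low plus a positive width and places open and close at a sigmoid-valued fraction of that width, any point in feature space, including a low-rank PCA reconstruction, maps back to values that respect the OHLC ordering.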

Suggested Citation

  • Wenyang Huang & Huiwen Wang & Shanshan Wang, 2021. "Dimension reduction of open-high-low-close data in candlestick chart based on pseudo-PCA," Papers 2103.16908, arXiv.org.
  • Handle: RePEc:arx:papers:2103.16908

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2103.16908
    File Function: Latest version
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Leandro Maciel, 2020. "Technical analysis based on high and low stock prices forecasts: evidence for Brazil using a fractionally cointegrated VAR model," Empirical Economics, Springer, vol. 58(4), pages 1513-1540, April.
    2. Huiwen Wang & Wenyang Huang & Shanshan Wang, 2021. "Forecasting open-high-low-close data contained in candlestick chart," Papers 2104.00581, arXiv.org.
    3. Sun, Yuying & Zhang, Xinyu & Wan, Alan T.K. & Wang, Shouyang, 2022. "Model averaging for interval-valued data," European Journal of Operational Research, Elsevier, vol. 301(2), pages 772-784.
    4. Rui Luo & Jinpei Liu & Piao Wang & Zhifu Tao & Huayou Chen, 2024. "A multisource data‐driven combined forecasting model based on internet search keyword screening method for interval soybean futures price," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(2), pages 366-390, March.
    5. Bill M. Cai & Charlie X. Cai & Kevin Keasey, 2007. "Influence of cultural factors on price clustering and price resistance in China's stock markets," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 47(4), pages 623-641, December.
    6. Robert Brooks & Edwyna Harris & Yovina Joymungul, 2013. "Price clustering in Australian water markets," Applied Economics, Taylor & Francis Journals, vol. 45(6), pages 677-685, February.
    7. Meng, Lei & Verousis, Thanos & ap Gwilym, Owain, 2013. "A substitution effect between price clustering and size clustering in credit default swaps," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 24(C), pages 139-152.
    8. Mazza, Paolo, 2015. "Price dynamics and market liquidity: An intraday event study on Euronext," The Quarterly Review of Economics and Finance, Elsevier, vol. 56(C), pages 139-153.
    9. Narayan, Paresh Kumar & Smyth, Russell, 2013. "Has political instability contributed to price clustering on Fiji's stock market?," Journal of Asian Economics, Elsevier, vol. 28(C), pages 125-130.
    10. Basak, Suryoday & Kar, Saibal & Saha, Snehanshu & Khaidem, Luckyson & Dey, Sudeepa Roy, 2019. "Predicting the direction of stock market prices using tree-based classifiers," The North American Journal of Economics and Finance, Elsevier, vol. 47(C), pages 552-567.
    11. Ahn, Hee-Joon & Cai, Jun & Cheung, Yan Leung, 2005. "Price clustering on the limit-order book: Evidence from the Stock Exchange of Hong Kong," Journal of Financial Markets, Elsevier, vol. 8(4), pages 421-451, November.
    12. Guo, Wei & Liu, Qingfu & Luo, Zhidan & Tse, Yiuman, 2022. "Forecasts for international financial series with VMD algorithms," Journal of Asian Economics, Elsevier, vol. 80(C).
    13. Lukas Menkhoff & Mark P. Taylor, 2007. "The Obstinate Passion of Foreign Exchange Professionals: Technical Analysis," Journal of Economic Literature, American Economic Association, vol. 45(4), pages 936-972, December.
    14. Huang, Wenyang & Wang, Huiwen & Wei, Yigang, 2023. "Identifying the determinants of European carbon allowances prices: A novel robust partial least squares method for open-high-low-close data," International Review of Financial Analysis, Elsevier, vol. 90(C).
    15. Walid Omrane & Hervé Oppens, 2006. "The performance analysis of chart patterns: Monte Carlo simulation and evidence from the euro/dollar foreign exchange market," Empirical Economics, Springer, vol. 30(4), pages 947-971, January.
    16. Kwong Wing Chau & Danika Wright & Ervi Liusman, 2018. "The cost of a lucky price," ERES eres2018_240, European Real Estate Society (ERES).
    17. Caporale, Guglielmo Maria & Gil-Alana, Luis A. & Poza, Carlos, 2020. "High and low prices and the range in the European stock markets: A long-memory approach," Research in International Business and Finance, Elsevier, vol. 52(C).
    18. Huang, Wenyang & Wang, Huiwen & Qin, Haotong & Wei, Yigang & Chevallier, Julien, 2022. "Convolutional neural network forecasting of European Union allowances futures using a novel unconstrained transformation method," Energy Economics, Elsevier, vol. 110(C).
    19. Frédy Pokou & Jules Sadefo Kamdem & François Benhmad, 2024. "Hybridization of ARIMA with Learning Models for Forecasting of Stock Market Time Series," Computational Economics, Springer;Society for Computational Economics, vol. 63(4), pages 1349-1399, April.
    20. Narayan, Paresh Kumar & Narayan, Seema & Popp, Stephan & D'Rosario, Michael, 2011. "Share price clustering in Mexico," International Review of Financial Analysis, Elsevier, vol. 20(2), pages 113-119, April.
