IDEAS home Printed from https://ideas.repec.org/a/wly/jforec/v43y2024i2p366-390.html
   My bibliography  Save this article

A multisource data‐driven combined forecasting model based on internet search keyword screening method for interval soybean futures price

Author

Listed:
  • Rui Luo
  • Jinpei Liu
  • Piao Wang
  • Zhifu Tao
  • Huayou Chen

Abstract

Accurate soybean futures price prediction is critical to related agricultural production, warehousing, and trading. Interval forecasting can avoid the loss of fluctuation information and evaluate the uncertainty of futures prices. However, most previous studies only consider the single‐type auxiliary variable, which will cause the deficiency of valued information. Moreover, the research concentrating on internet search index ignores the search habits of investors, resulting in subjectivity in keyword selection. Therefore, a novel multisource data‐driven combined forecasting model is proposed that consists of four parts: unstructured data processing, interval multi‐scale decomposition, interval combination forecasting, and model evaluation. First, sentiment analysis technology is used to convert news text into sentiment scores. The internet search keyword screening method based on latent Dirichlet allocation is then constructed to achieve the quantification of investor attention. Second, a decomposition method is applied to decompose the original interval‐valued series into finite more stationary components. Third, interval prediction results are obtained by machine learning‐based multiple predictors. Finally, the model evaluation module comprising error evaluation indicators and comparison experiments is presented to verify the effectiveness. The experimental results show that the proposed model has higher prediction accuracy, which indicates that multisource data and designed keyword screening methods can enhance the forecasting performance of the model.

Suggested Citation

  • Rui Luo & Jinpei Liu & Piao Wang & Zhifu Tao & Huayou Chen, 2024. "A multisource data‐driven combined forecasting model based on internet search keyword screening method for interval soybean futures price," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(2), pages 366-390, March.
  • Handle: RePEc:wly:jforec:v:43:y:2024:i:2:p:366-390
    DOI: 10.1002/for.3035
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/for.3035
    Download Restriction: no

    File URL: https://libkey.io/10.1002/for.3035?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Yongmei Fang & Bo Guan & Shangjuan Wu & Saeed Heravi, 2020. "Optimal forecast combination based on ensemble empirical mode decomposition for agricultural commodity futures prices," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(6), pages 877-886, September.
    2. Ahumada, H. & Cornejo, M., 2016. "Forecasting food prices: The case of corn, soybeans and wheat," International Journal of Forecasting, Elsevier, vol. 32(3), pages 838-848.
    3. He, Kaijian & Yu, Lean & Lai, Kin Keung, 2012. "Crude oil price analysis and forecasting using wavelet decomposed ensemble model," Energy, Elsevier, vol. 46(1), pages 564-574.
    4. Ding, Song & Li, Ruojin & Wu, Shu & Zhou, Weijie, 2021. "Application of a novel structure-adaptative grey model with adjustable time power item for nuclear energy consumption forecasting," Applied Energy, Elsevier, vol. 298(C).
    5. Sun, Yuying & Han, Ai & Hong, Yongmiao & Wang, Shouyang, 2018. "Threshold autoregressive models for interval-valued time series data," Journal of Econometrics, Elsevier, vol. 206(2), pages 414-446.
    6. Du, Pei & Guo, Ju’e & Sun, Shaolong & Wang, Shouyang & Wu, Jing, 2021. "Multi-step metal prices forecasting based on a data preprocessing method and an optimized extreme learning machine by marine predators algorithm," Resources Policy, Elsevier, vol. 74(C).
    7. Charfeddine, Lanouar, 2016. "Breaks or long range dependence in the energy futures volatility: Out-of-sample forecasting and VaR analysis," Economic Modelling, Elsevier, vol. 53(C), pages 354-374.
    8. Niu, Zibo & Liu, Yuanyuan & Gao, Wang & Zhang, Hongwei, 2021. "The role of coronavirus news in the volatility forecasting of crude oil futures markets: Evidence from China," Resources Policy, Elsevier, vol. 73(C).
    9. Wu, Binrong & Wang, Lin & Wang, Sirui & Zeng, Yu-Rong, 2021. "Forecasting the U.S. oil markets based on social media information during the COVID-19 pandemic," Energy, Elsevier, vol. 226(C).
    10. Xiong, Tao & Li, Chongguang & Bao, Yukun, 2017. "Interval-valued time series forecasting using a novel hybrid HoltI and MSVR model," Economic Modelling, Elsevier, vol. 60(C), pages 11-23.
    11. Li, Jianping & Li, Guowen & Liu, Mingxi & Zhu, Xiaoqian & Wei, Lu, 2022. "A novel text-based framework for forecasting agricultural futures using massive online news headlines," International Journal of Forecasting, Elsevier, vol. 38(1), pages 35-50.
    12. Li, Sufang & Zhang, Hu & Yuan, Di, 2019. "Investor attention and crude oil prices: Evidence from nonlinear Granger causality tests," Energy Economics, Elsevier, vol. 84(C).
    13. Khoshalan, Hasel Amini & Shakeri, Jamshid & Najmoddini, Iraj & Asadizadeh, Mostafa, 2021. "Forecasting copper price by application of robust artificial intelligence techniques," Resources Policy, Elsevier, vol. 73(C).
    14. Wang, Bin & Wang, Jun, 2021. "Energy futures price prediction and evaluation model with deep bidirectional gated recurrent unit neural network and RIF-based algorithm," Energy, Elsevier, vol. 216(C).
    15. Liu, Li & Wan, Jieqiu, 2012. "A study of Shanghai fuel oil futures price volatility based on high frequency data: Long-range dependence, modeling and forecasting," Economic Modelling, Elsevier, vol. 29(6), pages 2245-2253.
    16. Sun, Shaolong & Sun, Yuying & Wang, Shouyang & Wei, Yunjie, 2018. "Interval decomposition ensemble approach for crude oil price forecasting," Energy Economics, Elsevier, vol. 76(C), pages 274-287.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jiang, He & Hu, Weiqiang & Xiao, Ling & Dong, Yao, 2022. "A decomposition ensemble based deep learning approach for crude oil price forecasting," Resources Policy, Elsevier, vol. 78(C).
    2. Wuyue An & Lin Wang & Yu‐Rong Zeng, 2023. "Text‐based soybean futures price forecasting: A two‐stage deep learning approach," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(2), pages 312-330, March.
    3. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    4. Yan, Zichun & Tian, Fangzhu & Sun, Yuying & Wang, Shouyang, 2024. "A time-frequency-based interval decomposition ensemble method for forecasting gasoil prices under the trend of low-carbon development," Energy Economics, Elsevier, vol. 134(C).
    5. Piao Wang & Shahid Hussain Gurmani & Zhifu Tao & Jinpei Liu & Huayou Chen, 2024. "Interval time series forecasting: A systematic literature review," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(2), pages 249-285, March.
    6. Li, Ning & Li, Jiaojiao & Wang, Qizhou & Yan, Dairong & Wang, Liguan & Jia, Mingtao, 2024. "A novel copper price forecasting ensemble method using adversarial interpretive structural model and sparrow search algorithm," Resources Policy, Elsevier, vol. 91(C).
    7. Wen, Danyan & Liu, Li & Wang, Yudong & Zhang, Yaojie, 2022. "Forecasting crude oil market returns: Enhanced moving average technical indicators," Resources Policy, Elsevier, vol. 76(C).
    8. Wuyue An & Lin Wang & Dongfeng Zhang, 2023. "Comprehensive commodity price forecasting framework using text mining methods," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(7), pages 1865-1888, November.
    9. Zou, Yingchao & Yu, Lean & Tso, Geoffrey K.F. & He, Kaijian, 2020. "Risk forecasting in the crude oil market: A multiscale Convolutional Neural Network approach," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 541(C).
    10. Hao, Jun & Feng, Qianqian & Yuan, Jiaxin & Sun, Xiaolei & Li, Jianping, 2022. "A dynamic ensemble learning with multi-objective optimization for oil prices prediction," Resources Policy, Elsevier, vol. 79(C).
    11. Sun, Yuying & Zhang, Xinyu & Wan, Alan T.K. & Wang, Shouyang, 2022. "Model averaging for interval-valued data," European Journal of Operational Research, Elsevier, vol. 301(2), pages 772-784.
    12. Peng, Lu & Wang, Lin & Xia, De & Gao, Qinglu, 2022. "Effective energy consumption forecasting using empirical wavelet transform and long short-term memory," Energy, Elsevier, vol. 238(PB).
    13. Zhu, Mengrui & Xu, Hua & Wang, Minggang & Tian, Lixin, 2024. "Carbon price interval prediction method based on probability density recurrence network and interval multi-layer perceptron," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 636(C).
    14. Li, Sufang & Xu, Qiufan & Lv, Yixue & Yuan, Di, 2022. "Public attention, oil and gold markets during the COVID-19: Evidence from time-frequency analysis," Resources Policy, Elsevier, vol. 78(C).
    15. Wenyang Huang & Huiwen Wang & Shanshan Wang, 2021. "Dimension reduction of open-high-low-close data in candlestick chart based on pseudo-PCA," Papers 2103.16908, arXiv.org.
    16. Zhang, Tingting & Tang, Zhenpeng & Wu, Junchuan & Du, Xiaoxu & Chen, Kaijie, 2021. "Multi-step-ahead crude oil price forecasting based on two-layer decomposition technique and extreme learning machine optimized by the particle swarm optimization algorithm," Energy, Elsevier, vol. 229(C).
    17. Gao, Feng & Shao, Xueyan, 2022. "A novel interval decomposition ensemble model for interval carbon price forecasting," Energy, Elsevier, vol. 243(C).
    18. Ghosh, Indranil & Chaudhuri, Tamal Datta & Alfaro-Cortés, Esteban & Gámez, Matías & García, Noelia, 2022. "A hybrid approach to forecasting futures prices with simultaneous consideration of optimality in ensemble feature selection and advanced artificial intelligence," Technological Forecasting and Social Change, Elsevier, vol. 181(C).
    19. Lin, Ling & Jiang, Yong & Xiao, Helu & Zhou, Zhongbao, 2020. "Crude oil price forecasting based on a novel hybrid long memory GARCH-M and wavelet analysis model," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 543(C).
    20. Liu, Yuanyuan & Niu, Zibo & Suleman, Muhammad Tahir & Yin, Libo & Zhang, Hongwei, 2022. "Forecasting the volatility of crude oil futures: The role of oil investor attention and its regime switching characteristics under a high-frequency framework," Energy, Elsevier, vol. 238(PA).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:jforec:v:43:y:2024:i:2:p:366-390. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www3.interscience.wiley.com/cgi-bin/jhome/2966 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.