IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v15y2018i12p2907-d191657.html
   My bibliography  Save this article

Correlation Analysis to Identify the Effective Data in Machine Learning: Prediction of Depressive Disorder and Emotion States

Author

Listed:
  • Sunil Kumar

    (Department of Information and Communications Engineering, Hankuk University of Foreign Studies, Seoul 02450, Korea)

  • Ilyoung Chong

    (Department of Information and Communications Engineering, Hankuk University of Foreign Studies, Seoul 02450, Korea)

Abstract

Correlation analysis is an extensively used technique that identifies interesting relationships in data. These relationships help us realize the relevance of attributes with respect to the target class to be predicted. This study has exploited correlation analysis and machine learning-based approaches to identify relevant attributes in the dataset which have a significant impact on classifying a patient’s mental health status. For mental health situations, correlation analysis has been performed in Weka, which involves a dataset of depressive disorder symptoms and situations based on weather conditions, as well as emotion classification based on physiological sensor readings. Pearson’s product moment correlation and other different classification algorithms have been utilized for this analysis. The results show interesting correlations in weather attributes for bipolar patients, as well as in features extracted from physiological data for emotional states.

Suggested Citation

  • Sunil Kumar & Ilyoung Chong, 2018. "Correlation Analysis to Identify the Effective Data in Machine Learning: Prediction of Depressive Disorder and Emotion States," IJERPH, MDPI, vol. 15(12), pages 1-24, December.
  • Handle: RePEc:gam:jijerp:v:15:y:2018:i:12:p:2907-:d:191657
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/15/12/2907/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/15/12/2907/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Syllignakis, Manolis N. & Kouretas, Georgios P., 2011. "Dynamic correlation analysis of financial contagion: Evidence from the Central and Eastern European markets," International Review of Economics & Finance, Elsevier, vol. 20(4), pages 717-732, October.
    2. Sandoval, Leonidas & Franca, Italo De Paula, 2012. "Correlation of financial markets in times of crisis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(1), pages 187-208.
    3. White, Halbert, 1980. "A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity," Econometrica, Econometric Society, vol. 48(4), pages 817-838, May.
    4. Breusch, T S & Pagan, A R, 1979. "A Simple Test for Heteroscedasticity and Random Coefficient Variation," Econometrica, Econometric Society, vol. 47(5), pages 1287-1294, September.
    5. Erdem, Orhan & Ceyhan, Elvan & Varli, Yusuf, 2014. "A new correlation coefficient for bivariate time-series data," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 414(C), pages 274-284.
    6. Muhammad Aslam Jarwar & Rabeeh Ayaz Abbasi & Mubashar Mushtaq & Onaiza Maqbool & Naif R. Aljohani & Ali Daud & Jalal S. Alowibdi & J.R. Cano & S. García & Ilyoung Chong, 2017. "CommuniMents: A Framework for Detecting Community Based Sentiments for Events," International Journal on Semantic Web and Information Systems (IJSWIS), IGI Global, vol. 13(2), pages 87-108, April.
    7. S. le Cessie & J. C. van Houwelingen, 1992. "Ridge Estimators in Logistic Regression," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 41(1), pages 191-201, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Chimango Nyasulu & Awa Diattara & Assitan Traore & Abdoulaye Deme & Cheikh Ba, 2022. "Towards Resilient Agriculture to Hostile Climate Change in the Sahel Region: A Case Study of Machine Learning-Based Weather Prediction in Senegal," Agriculture, MDPI, vol. 12(9), pages 1-23, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marijke Verpoorten & Lode Berlage, 2004. "Genocide and land scarcity: Can Rwandan rural households manage?," CSAE Working Paper Series 2004-15, Centre for the Study of African Economies, University of Oxford.
    2. Russell, Bill & Chowdhury, Rosen Azad, 2013. "Estimating United States Phillips curves with expectations consistent with the statistical process of inflation," Journal of Macroeconomics, Elsevier, vol. 35(C), pages 24-38.
    3. Joachim Zietz, 2006. "Detecting neglected parameter heterogeneity with Chow tests," Applied Economics Letters, Taylor & Francis Journals, vol. 13(6), pages 369-374.
    4. Pedro Delicado & Juan Romo, 1998. "Constant coefficient tests for random coefficient regression," Economics Working Papers 329, Department of Economics and Business, Universitat Pompeu Fabra.
    5. Kendix, Michael & Walls, W.D., 2010. "Oil industry consolidation and refined product prices: Evidence from US wholesale gasoline terminals," Energy Policy, Elsevier, vol. 38(7), pages 3498-3507, July.
    6. Seren Firat & Esat Dasdemir, 2021. "Application of Quantity Theory of Money in Cryptocurrencies: Example of Bitcoin and the Impact of Covid-19," Istanbul Journal of Economics-Istanbul Iktisat Dergisi, Istanbul University, Faculty of Economics, vol. 71(1), pages 81-102, June.
    7. LE GALLO, Julie, 2000. "Econométrie spatiale 2 -Hétérogénéité spatiale," LATEC - Document de travail - Economie (1991-2003) 2001-01, LATEC, Laboratoire d'Analyse et des Techniques EConomiques, CNRS UMR 5118, Université de Bourgogne.
    8. David I Stern, 2014. "High-Ranked Social Science Journal Articles Can Be Identified from Early Citation Information," PLOS ONE, Public Library of Science, vol. 9(11), pages 1-11, November.
    9. Olivier Damette & Philippe Delacote, 2009. "The environmental resource curse hypothesis: the forest case," Working Papers - Cahiers du LEF 2009-04, Laboratoire d'Economie Forestiere, AgroParisTech-INRA.
    10. Zaman, Asad, 1995. "On the inconsistency of the Breusch-Pagan test," MPRA Paper 9904, University Library of Munich, Germany.
    11. Julie Le Gallo, 2004. "Hétérogénéité spatiale : principes et méthodes," Économie et Prévision, Programme National Persée, vol. 162(1), pages 151-172.
    12. Gonzalez, Elena & Stephen, Bruce & Infield, David & Melero, Julio J., 2019. "Using high-frequency SCADA data for wind turbine performance monitoring: A sensitivity study," Renewable Energy, Elsevier, vol. 131(C), pages 841-853.
    13. Li, Zhaoyuan & Yao, Jianfeng, 2019. "Testing for heteroscedasticity in high-dimensional regressions," Econometrics and Statistics, Elsevier, vol. 9(C), pages 122-139.
    14. Cem Ertur & Julie Le Gallo & Catherine Baumont, 2006. "The European Regional Convergence Process, 1980-1995: Do Spatial Regimes and Spatial Dependence Matter?," International Regional Science Review, , vol. 29(1), pages 3-34, January.
    15. Dufour, Jean-Marie & Khalaf, Lynda & Bernard, Jean-Thomas & Genest, Ian, 2004. "Simulation-based finite-sample tests for heteroskedasticity and ARCH effects," Journal of Econometrics, Elsevier, vol. 122(2), pages 317-347, October.
    16. Jacqueline Karlsson & Helena Melin & Kevin Cullinane, 2018. "The impact of potential Brexit scenarios on German car exports to the UK: an application of the gravity model," Journal of Shipping and Trade, Springer, vol. 3(1), pages 1-22, December.
    17. Miomir Jovanović & Ljiljana Kašćelan & Aleksandra Despotović & Vladimir Kašćelan, 2015. "The Impact of Agro-Economic Factors on GHG Emissions: Evidence from European Developing and Advanced Economies," Sustainability, MDPI, vol. 7(12), pages 1-21, December.
    18. Romano, Joseph P. & Wolf, Michael, 2017. "Resurrecting weighted least squares," Journal of Econometrics, Elsevier, vol. 197(1), pages 1-19.
    19. Baldauf, Markus & Santos Silva, J.M.C., 2012. "On the use of robust regression in econometrics," Economics Letters, Elsevier, vol. 114(1), pages 124-127.
    20. Moinas, Sophie & Nguyen, Minh & Valente, Giorgio, 2017. "Funding Constraints and Market Illiquidity in the European Treasury Bond Market," TSE Working Papers 17-814, Toulouse School of Economics (TSE).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:15:y:2018:i:12:p:2907-:d:191657. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.