IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v15y2018i12p2907-d191657.html
   My bibliography  Save this article

Correlation Analysis to Identify the Effective Data in Machine Learning: Prediction of Depressive Disorder and Emotion States

Author

Listed:
  • Sunil Kumar

    (Department of Information and Communications Engineering, Hankuk University of Foreign Studies, Seoul 02450, Korea)

  • Ilyoung Chong

    (Department of Information and Communications Engineering, Hankuk University of Foreign Studies, Seoul 02450, Korea)

Abstract

Correlation analysis is an extensively used technique that identifies interesting relationships in data. These relationships help us realize the relevance of attributes with respect to the target class to be predicted. This study has exploited correlation analysis and machine learning-based approaches to identify relevant attributes in the dataset which have a significant impact on classifying a patient’s mental health status. For mental health situations, correlation analysis has been performed in Weka, which involves a dataset of depressive disorder symptoms and situations based on weather conditions, as well as emotion classification based on physiological sensor readings. Pearson’s product moment correlation and other different classification algorithms have been utilized for this analysis. The results show interesting correlations in weather attributes for bipolar patients, as well as in features extracted from physiological data for emotional states.

Suggested Citation

  • Sunil Kumar & Ilyoung Chong, 2018. "Correlation Analysis to Identify the Effective Data in Machine Learning: Prediction of Depressive Disorder and Emotion States," IJERPH, MDPI, vol. 15(12), pages 1-24, December.
  • Handle: RePEc:gam:jijerp:v:15:y:2018:i:12:p:2907-:d:191657
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/15/12/2907/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/15/12/2907/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Erdem, Orhan & Ceyhan, Elvan & Varli, Yusuf, 2014. "A new correlation coefficient for bivariate time-series data," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 414(C), pages 274-284.
    2. Syllignakis, Manolis N. & Kouretas, Georgios P., 2011. "Dynamic correlation analysis of financial contagion: Evidence from the Central and Eastern European markets," International Review of Economics & Finance, Elsevier, vol. 20(4), pages 717-732, October.
    3. Sandoval, Leonidas & Franca, Italo De Paula, 2012. "Correlation of financial markets in times of crisis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(1), pages 187-208.
    4. White, Halbert, 1980. "A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity," Econometrica, Econometric Society, vol. 48(4), pages 817-838, May.
    5. Breusch, T S & Pagan, A R, 1979. "A Simple Test for Heteroscedasticity and Random Coefficient Variation," Econometrica, Econometric Society, vol. 47(5), pages 1287-1294, September.
    6. Muhammad Aslam Jarwar & Rabeeh Ayaz Abbasi & Mubashar Mushtaq & Onaiza Maqbool & Naif R. Aljohani & Ali Daud & Jalal S. Alowibdi & J.R. Cano & S. García & Ilyoung Chong, 2017. "CommuniMents: A Framework for Detecting Community Based Sentiments for Events," International Journal on Semantic Web and Information Systems (IJSWIS), IGI Global, vol. 13(2), pages 87-108, April.
    7. S. le Cessie & J. C. van Houwelingen, 1992. "Ridge Estimators in Logistic Regression," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 41(1), pages 191-201, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Chimango Nyasulu & Awa Diattara & Assitan Traore & Abdoulaye Deme & Cheikh Ba, 2022. "Towards Resilient Agriculture to Hostile Climate Change in the Sahel Region: A Case Study of Machine Learning-Based Weather Prediction in Senegal," Agriculture, MDPI, vol. 12(9), pages 1-23, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jean-Paul Azam & Catherine Bonjean, 1995. "La formation du prix du riz : théorie et application au cas d'Antananarivo (Madagascar) ," Revue Économique, Programme National Persée, vol. 46(4), pages 1145-1166.
    2. Stern, David I. & Gerlagh, Reyer & Burke, Paul J., 2017. "Modeling the emissions–income relationship using long-run growth rates," Environment and Development Economics, Cambridge University Press, vol. 22(6), pages 699-724, December.
    3. Tsimpanos, Apostolos & Tsimbos, Cleon & Kalogirou, Stamatis, 2018. "Assessing spatial variation and heterogeneity of fertility in Greece at local authority level," MPRA Paper 100406, University Library of Munich, Germany.
    4. Caldara, Dario & Iacoviello, Matteo & Molligo, Patrick & Prestipino, Andrea & Raffo, Andrea, 2020. "The economic effects of trade policy uncertainty," Journal of Monetary Economics, Elsevier, vol. 109(C), pages 38-59.
    5. Marijke Verpoorten & Lode Berlage, 2004. "Genocide and land scarcity: Can Rwandan rural households manage?," CSAE Working Paper Series 2004-15, Centre for the Study of African Economies, University of Oxford.
    6. Machado, Jose A. F. & Silva, J. M. C. Santos, 2000. "Glejser's test revisited," Journal of Econometrics, Elsevier, vol. 97(1), pages 189-202, July.
    7. Katarzyna Jabłońska, 2018. "Dealing With Heteroskedasticity Within The Modeling Of The Quality Of Life Of Older People," Statistics in Transition New Series, Polish Statistical Association, vol. 19(3), pages 423-452, September.
    8. Michael O'Connor Keefe & David Gallagher, 2014. "Does the effect of revealed private information on initial public offering (IPO) first trading day return differ by IPO market heat?," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 54(3), pages 921-964, September.
    9. Guilhem Bascle, 2008. "Controlling for endogeneity with instrumental variables in strategic management research," Post-Print hal-00576795, HAL.
    10. Richard H. Spady & Sami Stouli, 2018. "Simultaneous Mean-Variance Regression," Bristol Economics Discussion Papers 18/697, School of Economics, University of Bristol, UK.
    11. Russell, Bill & Chowdhury, Rosen Azad, 2013. "Estimating United States Phillips curves with expectations consistent with the statistical process of inflation," Journal of Macroeconomics, Elsevier, vol. 35(C), pages 24-38.
    12. Olivier Damette & Philippe Delacote, 2009. "The environmental resource curse hypothesis : the forest case [L'hypothèse de malédiction environnemental des ressources : le cas des forêts]," Working Papers hal-01189378, HAL.
    13. Joachim Zietz, 2006. "Detecting neglected parameter heterogeneity with Chow tests," Applied Economics Letters, Taylor & Francis Journals, vol. 13(6), pages 369-374.
    14. Christopher F Baum & Mark E. Schaffer & Steven Stillman, 2003. "Instrumental variables and GMM: Estimation and testing," Stata Journal, StataCorp LP, vol. 3(1), pages 1-31, March.
    15. Pedro Delicado & Juan Romo, 1998. "Constant coefficient tests for random coefficient regression," Economics Working Papers 329, Department of Economics and Business, Universitat Pompeu Fabra.
    16. Russell Davidson & Victoria Zinde‐Walsh, 2017. "Advances in specification testing," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 50(5), pages 1595-1631, December.
    17. Dufour, Jean-Marie & Khalaf, Lynda & Bernard, Jean-Thomas & Genest, Ian, 2004. "Simulation-based finite-sample tests for heteroskedasticity and ARCH effects," Journal of Econometrics, Elsevier, vol. 122(2), pages 317-347, October.
    18. Kendix, Michael & Walls, W.D., 2010. "Oil industry consolidation and refined product prices: Evidence from US wholesale gasoline terminals," Energy Policy, Elsevier, vol. 38(7), pages 3498-3507, July.
    19. Christopher F Baum & Arthur Lewbel, 2019. "Advice on using heteroskedasticity-based identification," Stata Journal, StataCorp LP, vol. 19(4), pages 757-767, December.
    20. Seren Firat & Esat Dasdemir, 2021. "Application of Quantity Theory of Money in Cryptocurrencies: Example of Bitcoin and the Impact of Covid-19," Istanbul Journal of Economics-Istanbul Iktisat Dergisi, Istanbul University, Faculty of Economics, vol. 71(1), pages 81-102, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:15:y:2018:i:12:p:2907-:d:191657. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.