IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v101y2010i7p1559-1573.html

Entropy based constrained inference for some HDLSS genomic models: UI tests in a Chen-Stein perspective

Author

Listed:
  • Tsai, Ming-Tien
  • Sen, Pranab Kumar

Abstract

For qualitative data models, the Gini-Simpson index and Shannon entropy are commonly used in statistical analysis. In high-dimensional, low-sample-size (HDLSS) categorical models, which are abundant in genomics and bioinformatics, the Gini-Simpson index, as extended to Hamming distance in a pseudo-marginal setup, facilitates drawing suitable statistical conclusions. Under Lorenz ordering, it is shown that Shannon entropy and its multivariate analogues proposed here appear to be more informative than the Gini-Simpson index. The nested subset monotonicity prospect, along with the subgroup decomposability of some of the proposed measures, is exploited. The usual jackknife (or bootstrap) methods may not work well for HDLSS constrained models. Hence, we consider a permutation method that incorporates the union-intersection (UI) principle and the Chen-Stein theorem to formulate suitable hypothesis testing procedures for gene classification. Some applications are included for illustration.
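
For orientation, the two diversity measures named in the abstract have standard textbook definitions: for category proportions p_1, ..., p_K, the Gini-Simpson index is 1 - sum(p_k^2) and Shannon entropy is -sum(p_k log p_k). The Python sketch below computes both for a single categorical site, together with the Hamming distance between two equal-length sequences. It is only an illustration of these standard definitions, not the authors' pseudo-marginal construction or their test procedure; the function names and the toy data are hypothetical.

```python
import math
from collections import Counter


def category_proportions(labels):
    """Return the observed proportion of each category in `labels`."""
    counts = Counter(labels)
    n = len(labels)
    return [c / n for c in counts.values()]


def gini_simpson(labels):
    """Gini-Simpson index: 1 - sum of squared category proportions."""
    return 1.0 - sum(p * p for p in category_proportions(labels))


def shannon_entropy(labels):
    """Shannon entropy (natural log): -sum of p * log(p)."""
    return -sum(p * math.log(p) for p in category_proportions(labels))


def hamming_distance(seq_a, seq_b):
    """Number of positions at which two equal-length sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    return sum(a != b for a, b in zip(seq_a, seq_b))


# Toy data (hypothetical): one nucleotide site observed in four samples.
site = ["A", "A", "C", "G"]
print(gini_simpson(site))                # 0.625
print(shannon_entropy(site))             # 1.5 * ln(2), about 1.0397
print(hamming_distance("ACGT", "AGGA"))  # 2
```

Both measures are zero when every sample falls in one category and grow as the proportions become more even.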

Suggested Citation

  • Tsai, Ming-Tien & Sen, Pranab Kumar, 2010. "Entropy based constrained inference for some HDLSS genomic models: UI tests in a Chen-Stein perspective," Journal of Multivariate Analysis, Elsevier, vol. 101(7), pages 1559-1573, August.
  • Handle: RePEc:eee:jmvana:v:101:y:2010:i:7:p:1559-1573

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047-259X(10)00054-0
    Download Restriction: Full text for ScienceDirect subscribers only


    References listed on IDEAS

    1. Tzeng J-Y. & Byerley W. & Devlin B. & Roeder K. & Wasserman L., 2003. "Outlier Detection and False Discovery Rates for Whole-Genome DNA Matching," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 236-246, January.
    2. Sen, Pranab K. & Tsai, Ming-Tien & Jou, Yuh-Shan, 2007. "High-Dimension, Low-Sample Size Perspectives in Constrained Statistical Inference: The SARSCoV RNA Genome in Illustration," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 686-694, June.
    3. Masaaki Sibuya, 1959. "Bivariate extreme statistics, I," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 11(2), pages 195-210, June.
    4. Tsai, Ming-Tien & Sen, Pranab Kumar, 2005. "Asymptotically optimal tests for parametric functions against ordered functional alternatives," Journal of Multivariate Analysis, Elsevier, vol. 95(1), pages 37-49, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sen, Pranab K. & Kang, Moonsu, 2013. "Bivariate high-level exceedance and the Chen–Stein theorem in genomics multiple hypothesis testing perspectives," Statistics & Probability Letters, Elsevier, vol. 83(7), pages 1725-1730.
    2. Pinheiro, Aluísio & Sen, Pranab Kumar & Pinheiro, Hildete Prisco, 2009. "Decomposability of high-dimensional diversity measures: Quasi-U-statistics, martingales and nonstandard asymptotics," Journal of Multivariate Analysis, Elsevier, vol. 100(8), pages 1645-1656, September.
    3. Markus Haas, 2018. "A note on the absolute moments of the bivariate normal distribution," Economics Bulletin, AccessEcon, vol. 38(1), pages 650-656.
    4. Moore, Kyle & Zhou, Chen, 2014. "The determinants of systemic importance," LSE Research Online Documents on Economics 59289, London School of Economics and Political Science, LSE Library.
    5. Hofert, Marius & Vrins, Frédéric, 2013. "Sibuya copulas," Journal of Multivariate Analysis, Elsevier, vol. 114(C), pages 318-337.
    6. Victor Chernozhukov & Ivan Fernandez-Val & Siyi Luo, 2023. "Distribution regression with sample selection and UK wage decomposition," CeMMAP working papers 09/23, Institute for Fiscal Studies.
    7. Tiwari, Aviral Kumar & Trabelsi, Nader & Alqahtani, Faisal & Raheem, Ibrahim D., 2020. "Systemic risk spillovers between crude oil and stock index returns of G7 economies: Conditional value-at-risk and marginal expected shortfall approaches," Energy Economics, Elsevier, vol. 86(C).
    8. Tankov, Peter, 2016. "Tails of weakly dependent random vectors," Journal of Multivariate Analysis, Elsevier, vol. 145(C), pages 73-86.
    9. Monica Billio & Lorenzo Frattarolo & Dominique Guegan, 2017. "Multivariate Reflection Symmetry of Copula Functions," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-01592147, HAL.
    10. Kenneth Rice & David Spiegelhalter, 2006. "A Simple Diagnostic Plot Connecting Robust Estimation, Outlier Detection, and False Discovery Rates," Journal of Applied Statistics, Taylor & Francis Journals, vol. 33(10), pages 1131-1147.
    11. Yang, Xipei & Frees, Edward W. & Zhang, Zhengjun, 2011. "A generalized beta copula with applications in modeling multivariate long-tailed data," Insurance: Mathematics and Economics, Elsevier, vol. 49(2), pages 265-284, September.
    12. Matias Heikkila & Yves Dominicy & Sirkku Pauliina Ilmonen, 2015. "Multivariate extremes based on a notion of radius," Working Papers ECARES ECARES 2015-49, ULB -- Universite Libre de Bruxelles.
    13. Zhang, Zhengjun & Zhu, Bin, 2016. "Copula structured M4 processes with application to high-frequency financial data," Journal of Econometrics, Elsevier, vol. 194(2), pages 231-241.
    14. Furman, Edward & Kuznetsov, Alexey & Su, Jianxi & Zitikis, Ričardas, 2016. "Tail dependence of the Gaussian copula revisited," Insurance: Mathematics and Economics, Elsevier, vol. 69(C), pages 97-103.
    15. Victor Chernozhukov & Ivan Fernandez-Val & Siyi Luo, 2018. "Distribution regression with sample selection, with an application to wage decompositions in the UK," CeMMAP working papers CWP68/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    16. Russell Brook T. & Hogan Paul, 2018. "Analyzing dependence matrices to investigate relationships between national football league combine event performances," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 14(4), pages 201-212, December.
    17. Moore, Kyle & Zhou, Chen, 2013. ""Too big to fail" or "Too non-traditional to fail"?: The determinants of banks' systemic importance," MPRA Paper 45589, University Library of Munich, Germany.
    18. Huang, J.S. & Dou, Xiaoling & Kuriki, Satoshi & Lin, G.D., 2013. "Dependence structure of bivariate order statistics with applications to Bayramoglu’s distributions," Journal of Multivariate Analysis, Elsevier, vol. 114(C), pages 201-208.
    19. Dalia Ghanem & Désiré Kédagni & Ismael Mourifié, 2023. "Evaluating the Impact of Regulatory Policies on Social Welfare in Difference-in-Difference Settings," Papers 2306.04494, arXiv.org, revised Jun 2023.
    20. Echaust, Krzysztof, 2021. "Asymmetric tail dependence between stock market returns and implied volatility," The Journal of Economic Asymmetries, Elsevier, vol. 23(C).
