IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v55y2011i1p366-374.html
   My bibliography  Save this article

Generalized additive models and inflated type I error rates of smoother significance tests

Author

Listed:
  • Young, Robin L.
  • Weinberg, Janice
  • Vieira, Verónica
  • Ozonoff, Al
  • Webster, Thomas F.

Abstract

Generalized additive models (GAMs) have distinct advantages over generalized linear models as they allow investigators to make inferences about associations between outcomes and predictors without placing parametric restrictions on the associations. The variable of interest is often smoothed using a locally weighted scatterplot smoothing (LOESS) and the optimal span (degree of smoothing) can be determined by minimizing the Akaike Information Criterion (AIC). A natural hypothesis when using GAMs is to test whether the smoothing term is necessary or if a simpler model would suffice. The statistic of interest is the difference in deviances between models including and excluding the smoothed term. As approximate chi-square tests of this hypothesis are known to be biased, permutation tests are a reasonable alternative. We compare the type I error rates of the chi-square test and of three permutation test methods using synthetic data generated under the null hypothesis. In each permutation method a distribution of differences in deviances is obtained from 999 permuted datasets and the null hypothesis is rejected if the observed statistic falls in the upper 5% of the distribution. One test is a conditional permutation test using the optimal span size for the observed data; this span size is held constant for all permutations. This test is shown to have an inflated type I error rate. Alternatively, the span size can be fixed a priori such that the span selection technique is not reliant on the observed data. This test is shown to be unbiased; however, the choice of span size is not clear. A third method is an unconditional permutation test where the optimal span size is selected for observed and permuted datasets. This test is unbiased though computationally intensive.

Suggested Citation

  • Young, Robin L. & Weinberg, Janice & Vieira, Verónica & Ozonoff, Al & Webster, Thomas F., 2011. "Generalized additive models and inflated type I error rates of smoother significance tests," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 366-374, January.
  • Handle: RePEc:eee:csdana:v:55:y:2011:i:1:p:366-374
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(10)00191-X
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Clifford M. Hurvich & Jeffrey S. Simonoff & Chih‐Ling Tsai, 1998. "Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(2), pages 271-293.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Pearl Anne Ante-Testard & Francois Rerolle & Anna T. Nguyen & Sania Ashraf & Sarker Masud Parvez & Abu Mohammed Naser & Tarik Benmarhnia & Mahbubur Rahman & Stephen P. Luby & Jade Benjamin-Chung & Ben, 2024. "WASH interventions and child diarrhea at the interface of climate and socioeconomic position in Bangladesh," Nature Communications, Nature, vol. 15(1), pages 1-13, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shuichi Kawano, 2014. "Selection of tuning parameters in bridge regression models via Bayesian information criterion," Statistical Papers, Springer, vol. 55(4), pages 1207-1223, November.
    2. Don Harding, 2010. "Applying shape and phase restrictions in generalized dynamic categorical models of the business cycle," NCER Working Paper Series 58, National Centre for Econometric Research.
    3. Michael S. Delgado & Daniel J. Henderson & Christopher F. Parmeter, 2014. "Does Education Matter for Economic Growth?," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 76(3), pages 334-359, June.
    4. Juan Manuel Julio & Norberto Rodríguez & Héctor Manuel Zárate, 2005. "Estimating the COP Exchange Rate Volatility Smile and the Market Effect of Central Bank Interventions: A CHARN Approach," Borradores de Economia 2605, Banco de la Republica.
    5. Malloy, Elizabeth J. & Spiegelman, Donna & Eisen, Ellen A., 2009. "Comparing measures of model selection for penalized splines in Cox models," Computational Statistics & Data Analysis, Elsevier, vol. 53(7), pages 2605-2616, May.
    6. Karimu, Amin & Brännlund, Runar, 2013. "Functional form and aggregate energy demand elasticities: A nonparametric panel approach for 17 OECD countries," Energy Economics, Elsevier, vol. 36(C), pages 19-27.
    7. Liao, Jun & Zou, Guohua, 2020. "Corrected Mallows criterion for model averaging," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    8. Lu, Jun & Lin, Lu, 2018. "Feature screening for multi-response varying coefficient models with ultrahigh dimensional predictors," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 242-254.
    9. Chu, Chi-Yang & Henderson, Daniel J. & Parmeter, Christopher F., 2017. "On discrete Epanechnikov kernel functions," Computational Statistics & Data Analysis, Elsevier, vol. 116(C), pages 79-105.
    10. Salvatore Ingrassia & Simona Minotti & Giorgio Vittadini, 2012. "Local Statistical Modeling via a Cluster-Weighted Approach with Elliptical Distributions," Journal of Classification, Springer;The Classification Society, vol. 29(3), pages 363-401, October.
    11. Maria Sassi, 2010. "OLS and GWR Approaches to Agricultural Convergence in the EU-15," International Advances in Economic Research, Springer;International Atlantic Economic Society, vol. 16(1), pages 96-108, February.
    12. Nagler Thomas & Schellhase Christian & Czado Claudia, 2017. "Nonparametric estimation of simplified vine copula models: comparison of methods," Dependence Modeling, De Gruyter, vol. 5(1), pages 99-120, January.
    13. Arturo Bujanda & Thomas M. Fullerton, 2017. "Impacts of transportation infrastructure on single-family property values," Applied Economics, Taylor & Francis Journals, vol. 49(51), pages 5183-5199, November.
    14. Costas Milas & Ruthira Naraidoo, 2009. "Financial Market Conditions, Real Time, Nonlinearity and European Central Bank Monetary Policy: In-Sample and Out-of-Sample Assessment," Working Papers 200923, University of Pretoria, Department of Economics.
    15. Cai, Zongwu & Xu, Xiaoping, 2009. "Nonparametric Quantile Estimations for Dynamic Smooth Coefficient Models," Journal of the American Statistical Association, American Statistical Association, vol. 104(485), pages 371-383.
    16. Frölich, Markus & Huber, Martin & Wiesenfarth, Manuel, 2017. "The finite sample performance of semi- and non-parametric estimators for treatment effects and policy evaluation," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 91-102.
    17. Víctor M. Guerrero & Daniela Cortés Toto & Hortensia J. Reyes Cervantes, 2018. "Effect of autocorrelation when estimating the trend of a time series via penalized least squares with controlled smoothness," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(1), pages 109-130, March.
    18. Yanagihara, Hirokazu & Satoh, Kenichi, 2010. "An unbiased Cp criterion for multivariate ridge regression," Journal of Multivariate Analysis, Elsevier, vol. 101(5), pages 1226-1238, May.
    19. Centorrino Samuele & Feve Frederique & Florens Jean-Pierre, 2017. "Additive Nonparametric Instrumental Regressions: A Guide to Implementation," Journal of Econometric Methods, De Gruyter, vol. 6(1), pages 1-25, January.
    20. Simon Lineu Umbach, 2020. "Forecasting with supervised factor models," Empirical Economics, Springer, vol. 58(1), pages 169-190, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:55:y:2011:i:1:p:366-374. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.