IDEAS home Printed from https://ideas.repec.org/p/iim/iimawp/wp02028.html
   My bibliography  Save this paper

Inference on Categorical Survey Response: A Predictive Approach

Author

Listed:
  • Adhya Sumanta
  • Banerjee, Tathagata
  • Chattopadhyay, G.

Abstract

We consider the estimation of finite population proportions of categorical survey responses obtained by probability sampling. The customary design-based estimator does not make use of the auxiliary data available for all the population units at the estimation stage. We adopt a model-based predictive approach to incorporate this information and make the estimates more efficient. In the first part of our paper we consider a multinomial logit type model when logit function is a known parametric function of the covariates. We then use it for the prediction of non-sampled responses. This together with sampled responses is used to obtain the estimates of the proportions. The asymptotic biases and variances of these estimators are obtained. The main drawback of this approach is, being a parametric model it may suffer from model misspecification and thus, may lose it’s efficiencies over the usual design-based estimates. To overcome this drawback, in the next part of this paper we replace the multinomial logit type model by a nonparametric model using recently developed random coefficients splines models. Finally, we carry out a simulation study. It shows that the nonparametric approach may lead to an appreciable improvement over both parametric and design-based approaches when the regression function is quite different from multinomial logit.

Suggested Citation

  • Adhya Sumanta & Banerjee, Tathagata & Chattopadhyay, G., 2007. "Inference on Categorical Survey Response: A Predictive Approach," IIMA Working Papers WP2007-05-07, Indian Institute of Management Ahmedabad, Research and Publication Department.
  • Handle: RePEc:iim:iimawp:wp02028
    as

    Download full text from publisher

    File URL: https://www.iima.ac.in/sites/default/files/rnpfiles/2007-05-07_TBanerjee.pdf
    File Function: English Version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. J. G. Booth & J. P. Hobert, 1999. "Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo EM algorithm," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(1), pages 265-285.
    2. Ruppert,David & Wand,M. P. & Carroll,R. J., 2003. "Semiparametric Regression," Cambridge Books, Cambridge University Press, number 9780521780506, October.
    3. M. P. Wand, 2003. "Smoothing and mixed models," Computational Statistics, Springer, vol. 18(2), pages 223-249, July.
    4. Jelle J. Goeman & Saskia le Cessie, 2006. "A Goodness-of-Fit Test for Multinomial Logistic Regression," Biometrics, The International Biometric Society, vol. 62(4), pages 980-985, December.
    5. D. Firth & K. E. Bennett, 1998. "Robust models in probability sampling," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(1), pages 3-21.
    6. Qingxia Chen & Joseph G. Ibrahim, 2006. "Semiparametric Models for Missing Covariate and Response Data in Regression Models," Biometrics, The International Biometric Society, vol. 62(1), pages 177-184, March.
    7. X. Lin & D. Zhang, 1999. "Inference in generalized additive mixed modelsby using smoothing splines," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(2), pages 381-400, April.
    8. Ruppert,David & Wand,M. P. & Carroll,R. J., 2003. "Semiparametric Regression," Cambridge Books, Cambridge University Press, number 9780521785167, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sumanta Adhya & Tathagata Banerjee & Gaurangadeb Chattopadhyay, 2012. "Inference on finite population categorical response: nonparametric regression-based predictive approach," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 96(1), pages 69-98, January.
    2. Nadja Klein & Michel Denuit & Stefan Lang & Thomas Kneib, 2013. "Nonlife Ratemaking and Risk Management with Bayesian Additive Models for Location, Scale and Shape," Working Papers 2013-24, Faculty of Economics and Statistics, Universität Innsbruck.
    3. Göran Kauermann & Tatyana Krivobokova & Ludwig Fahrmeir, 2009. "Some asymptotic results on generalized penalized spline smoothing," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(2), pages 487-503, April.
    4. Nicole H. Augustin & Stefan Lang & Monica Musio & Klaus Von Wilpert, 2007. "A spatial model for the needle losses of pine‐trees in the forests of Baden‐Württemberg: an application of Bayesian structured additive regression," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 56(1), pages 29-50, January.
    5. Takuma Yoshida, 2016. "Asymptotics and smoothing parameter selection for penalized spline regression with various loss functions," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 70(4), pages 278-303, November.
    6. Rodríguez-Álvarez, María Xosé & Lee, Dae-Jin & Kneib, Thomas & Durbán, María & Eilers, Paul, 2013. "Fast algorithm for smoothing parameter selection in multidimensional generalized P-splines," DES - Working Papers. Statistics and Econometrics. WS ws133026, Universidad Carlos III de Madrid. Departamento de Estadística.
    7. Klein, Nadja & Denuit, Michel & Lang, Stefan & Kneib, Thomas, 2013. "Nonlife Ratemaking and Risk Management with Bayesian Additive Models for Location, Scale and Shape," LIDAM Discussion Papers ISBA 2013045, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    8. Denuit, Michel & Lang, Stefan, 2004. "Non-life rate-making with Bayesian GAMs," Insurance: Mathematics and Economics, Elsevier, vol. 35(3), pages 627-647, December.
    9. Thomas Kneib & Ludwig Fahrmeir, 2006. "Structured Additive Regression for Categorical Space–Time Data: A Mixed Model Approach," Biometrics, The International Biometric Society, vol. 62(1), pages 109-118, March.
    10. Brezger, Andreas & Lang, Stefan, 2006. "Generalized structured additive regression based on Bayesian P-splines," Computational Statistics & Data Analysis, Elsevier, vol. 50(4), pages 967-991, February.
    11. Tatiyana V. Apanasovich & David Ruppert & Joanne R. Lupton & Natasa Popovic & Nancy D. Turner & Robert S. Chapkin & Raymond J. Carroll, 2008. "Aberrant Crypt Foci and Semiparametric Modeling of Correlated Binary Data," Biometrics, The International Biometric Society, vol. 64(2), pages 490-500, June.
    12. Michael Wegener & Göran Kauermann, 2017. "Forecasting in nonlinear univariate time series using penalized splines," Statistical Papers, Springer, vol. 58(3), pages 557-576, September.
    13. Xiao Ni & Daowen Zhang & Hao Helen Zhang, 2010. "Variable Selection for Semiparametric Mixed Models in Longitudinal Studies," Biometrics, The International Biometric Society, vol. 66(1), pages 79-88, March.
    14. Strasak, Alexander M. & Umlauf, Nikolaus & Pfeiffer, Ruth M. & Lang, Stefan, 2011. "Comparing penalized splines and fractional polynomials for flexible modelling of the effects of continuous predictor variables," Computational Statistics & Data Analysis, Elsevier, vol. 55(4), pages 1540-1551, April.
    15. Skaug, Hans J. & Fournier, David A., 2006. "Automatic approximation of the marginal likelihood in non-Gaussian hierarchical models," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 699-709, November.
    16. Tatyana Krivobokova & Göran Kauermann & Theofanis Archontakis, 2006. "Estimating the term structure of interest rates using penalized splines," Statistical Papers, Springer, vol. 47(3), pages 443-459, June.
    17. Lee, Wang-Sheng, 2014. "Is the BMI a Relic of the Past?," IZA Discussion Papers 8637, Institute of Labor Economics (IZA).
    18. Kuhlenkasper, Torben & Steinhardt, Max Friedrich, 2017. "Who leaves and when? Selective outmigration of immigrants from Germany," Economic Systems, Elsevier, vol. 41(4), pages 610-621.
    19. Basile, Roberto & Durbán, María & Mínguez, Román & María Montero, Jose & Mur, Jesús, 2014. "Modeling regional economic dynamics: Spatial dependence, spatial heterogeneity and nonlinearities," Journal of Economic Dynamics and Control, Elsevier, vol. 48(C), pages 229-245.
    20. Belitz, Christiane & Lang, Stefan, 2008. "Simultaneous selection of variables and smoothing parameters in structured additive regression models," Computational Statistics & Data Analysis, Elsevier, vol. 53(1), pages 61-81, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:iim:iimawp:wp02028. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/eciimin.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.