IDEAS home Printed from https://ideas.repec.org/a/spr/metrik/v82y2019i3d10.1007_s00184-018-0703-y.html
   My bibliography  Save this article

Kernel density estimation from complex surveys in the presence of complete auxiliary information

Author

Listed:
  • Sayed A. Mostafa

    (Indiana University
    North Carolina A&T State University)

  • Ibrahim A. Ahmad

    (Oklahoma State University)

Abstract

Auxiliary information is widely used in survey sampling to enhance the precision of estimators of finite population parameters, such as the finite population mean, percentiles, and distribution function. In the context of complex surveys, we show how auxiliary information can be used effectively in kernel estimation of the superpopulation density function of a given study variable. We propose two classes of “model-assisted” kernel density estimators that make efficient use of auxiliary information. For one class we assume that the functional relationship between the study variable Y and the auxiliary variable X is known, while for the other class the relationship is assumed unknown and is estimated using kernel smoothing techniques. Under the first class, we show that if the functional relationship can be written as a simple linear regression model with constant error variance, the mean of the proposed density estimator will be identical to the well-known regression estimator of the finite population mean. If we drop the intercept from the linear model and allow the error variance to be proportional to the auxiliary variable, the mean of the proposed density estimator matches the ratio estimator of the finite population mean. The properties of the new density estimators are studied under a combined design-model-based inference framework, which accounts for the underlying superpopulation model as well as the randomization distribution induced by the sampling design. Moreover, the asymptotic normality of each estimator is derived under both design-based and combined inference frameworks when the sampling design is simple random sampling without replacement. For the practical implementation of these estimators, we discuss how data-driven bandwidth estimators can be obtained. The finite sample properties of the proposed estimators are addressed via simulations and an example that mimics a real survey. These simulations show that the new estimators perform very well compared to standard kernel estimators which do not utilize the auxiliary information.

Suggested Citation

  • Sayed A. Mostafa & Ibrahim A. Ahmad, 2019. "Kernel density estimation from complex surveys in the presence of complete auxiliary information," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 82(3), pages 295-338, April.
  • Handle: RePEc:spr:metrik:v:82:y:2019:i:3:d:10.1007_s00184-018-0703-y
    DOI: 10.1007/s00184-018-0703-y
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00184-018-0703-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00184-018-0703-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hansen, Bruce E., 2008. "Uniform Convergence Rates For Kernel Estimation With Dependent Data," Econometric Theory, Cambridge University Press, vol. 24(3), pages 726-748, June.
    2. Torsten Harms & Pierre Duchesne, 2010. "On kernel nonparametric regression designed for complex survey data," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 72(1), pages 111-138, July.
    3. Scott, David W., 2004. "Multivariate Density Estimation and Visualization," Papers 2004,16, Humboldt University of Berlin, Center for Applied Statistics and Economics (CASE).
    4. Breunig, Robert, 2008. "Nonparametric density estimation for stratified samples," Statistics & Probability Letters, Elsevier, vol. 78(14), pages 2194-2200, October.
    5. Robert Breunig, 2001. "Density Estimation For Clustered Data," Econometric Reviews, Taylor & Francis Journals, vol. 20(3), pages 353-367.
    6. Yao, Qiwei & Tong, Howell, 1994. "Quantifying the influence of initial values on nonlinear prediction," LSE Research Online Documents on Economics 19426, London School of Economics and Political Science, LSE Library.
    7. Hayfield, Tristen & Racine, Jeffrey S., 2008. "Nonparametric Econometrics: The np Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 27(i05).
    8. Ingrid K. Glad & Nils Lid Hjort & Nikolai G. Ushakov, 2003. "Correction of Density Estimators that are not Densities," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 30(2), pages 415-427, June.
    9. F. J. Breidt & G. Claeskens & J. D. Opsomer, 2005. "Model-assisted estimation for complex surveys using penalised splines," Biometrika, Biometrika Trust, vol. 92(4), pages 831-846, December.
    10. Daniel Bonnéry & F. Jay Breidt & François Coquet, 2017. "Kernel estimation for a superpopulation probability density function under informative selection," METRON, Springer;Sapienza Università di Roma, vol. 75(3), pages 301-318, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sayed A. Mostafa & Ibrahim A. Ahmad, 2021. "Kernel Density Estimation Based on the Distinct Units in Sampling with Replacement," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(2), pages 507-547, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gadea Rivas, María Dolores & Gonzalo, Jesús, 2020. "Trends in distributional characteristics: Existence of global warming," Journal of Econometrics, Elsevier, vol. 214(1), pages 153-174.
    2. Juan Carlos Escanciano & Juan Carlos Pardo-Fernández & Ingrid Van Keilegom, 2017. "Semiparametric Estimation of Risk–Return Relationships," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 35(1), pages 40-52, January.
    3. Kraus, Daniel & Czado, Claudia, 2017. "D-vine copula based quantile regression," Computational Statistics & Data Analysis, Elsevier, vol. 110(C), pages 1-18.
    4. Daniel J. Henderson & Christopher F. Parmeter & R. Robert Russell, 2008. "Modes, weighted modes, and calibrated modes: evidence of clustering using modality tests," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 23(5), pages 607-638.
    5. Sayed A. Mostafa & Ibrahim A. Ahmad, 2021. "Kernel Density Estimation Based on the Distinct Units in Sampling with Replacement," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(2), pages 507-547, November.
    6. David H. Bernstein & Christopher F. Parmeter, 2017. "Returns to Scale in Electricity Generation: Revisited and Replicated," Working Papers 2017-08, University of Miami, Department of Economics.
    7. Nishiyama, Yoshihiko & Hitomi, Kohtaro & Kawasaki, Yoshinori & Jeong, Kiho, 2011. "A consistent nonparametric test for nonlinear causality—Specification in time series regression," Journal of Econometrics, Elsevier, vol. 165(1), pages 112-127.
    8. Bodory, Hugo & Huber, Martin, 2018. "The causalweight package for causal inference in R," FSES Working Papers 493, Faculty of Economics and Social Sciences, University of Freiburg/Fribourg Switzerland.
    9. Patrick Saart & Jiti Gao & Nam Hyun Kim, 2014. "Semiparametric methods in nonlinear time series analysis: a selective review," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 26(1), pages 141-169, March.
    10. Besstremyannaya, Galina, 2015. "Measuring the effect of health insurance companies on the quality of healthcare systems with kernel and parametric regressions," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 38(2), pages 3-20.
    11. Michael S. Delgado & Daniel J. Henderson & Christopher F. Parmeter, 2014. "Does Education Matter for Economic Growth?," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 76(3), pages 334-359, June.
    12. Hoderlein, Stefan & Su, Liangjun & White, Halbert & Yang, Thomas Tao, 2016. "Testing for monotonicity in unobservables under unconfoundedness," Journal of Econometrics, Elsevier, vol. 193(1), pages 183-202.
    13. Kim, Kun Ho & Chao, Shih-Kang & Härdle, Wolfgang Karl, 2020. "Simultaneous Inference of the Partially Linear Model with a Multivariate Unknown Function," IRTG 1792 Discussion Papers 2020-008, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    14. Ye, Yuxiang & Koch, Steven F., 2021. "Measuring energy poverty in South Africa based on household required energy consumption," Energy Economics, Elsevier, vol. 103(C).
    15. Bonilla, Jorge & Coria, Jessica & Sterner, Thomas, 2012. "Synergies and Trade-offs between Climate and Local Air Pollution: Policies in Sweden," Working Papers in Economics 529, University of Gothenburg, Department of Economics.
    16. Bhattacharya, Debopam & Dupas, Pascaline, 2012. "Inferring welfare maximizing treatment assignment under budget constraints," Journal of Econometrics, Elsevier, vol. 167(1), pages 168-196.
    17. Gao, Jiti & Kanaya, Shin & Li, Degui & Tjøstheim, Dag, 2015. "Uniform Consistency For Nonparametric Estimators In Null Recurrent Time Series," Econometric Theory, Cambridge University Press, vol. 31(5), pages 911-952, October.
    18. Masayuki Hirukawa & Mari Sakudo, 2016. "Testing Symmetry of Unknown Densities via Smoothing with the Generalized Gamma Kernels," Econometrics, MDPI, vol. 4(2), pages 1-27, June.
    19. Naveed Ahmad Khan & Andrija Mihoci & Silke Michalk & Kirill Sarachuk & Hafiz Ali Javed, 2022. "Employee Performance Measures Appraised by Training and Labor Market: Evidence from the Banking Sector of Germany," Administrative Sciences, MDPI, vol. 12(4), pages 1-13, October.
    20. Bang, Minji & Gao, Wayne Yuan & Postlewaite, Andrew & Sieg, Holger, 2023. "Using monotonicity restrictions to identify models with partially latent covariates," Journal of Econometrics, Elsevier, vol. 235(2), pages 892-921.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:metrik:v:82:y:2019:i:3:d:10.1007_s00184-018-0703-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.