IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v70y2008i4p803-823.html
   My bibliography  Save this article

Improving semiparametric estimation by using surrogate data

Author

Listed:
  • Song Xi Chen
  • Denis H. Y. Leung
  • Jing Qin

Abstract

Summary. The paper considers estimating a parameter β that defines an estimating function U(y, x, β) for an outcome variable y and its covariate x when the outcome is missing in some of the observations. We assume that, in addition to the outcome and the covariate, a surrogate outcome is available in every observation. The efficiency of existing estimators for β depends critically on correctly specifying the conditional expectation of U given the surrogate and the covariate. When the conditional expectation is not correctly specified, which is the most likely scenario in practice, the efficiency of estimation can be severely compromised even if the propensity function (of missingness) is correctly specified. We propose an estimator that is robust against the choice of the conditional expectation via an empirical likelihood. We demonstrate that the estimator proposed achieves a gain in efficiency whether the conditional score is correctly specified or not. When the conditional score is correctly specified, the estimator reaches the semiparametric variance bound within the class of estimating functions that are generated by U. The practical performance of the estimator is evaluated by using simulation and a data set that is based on the 1996 US presidential election.

Suggested Citation

  • Song Xi Chen & Denis H. Y. Leung & Jing Qin, 2008. "Improving semiparametric estimation by using surrogate data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(4), pages 803-823, September.
  • Handle: RePEc:bla:jorssb:v:70:y:2008:i:4:p:803-823
    DOI: 10.1111/j.1467-9868.2008.00662.x
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1467-9868.2008.00662.x
    Download Restriction: no

    File URL: https://libkey.io/10.1111/j.1467-9868.2008.00662.x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Stuart G. Baker & Grant Izmirlian & Victor Kipnis, 2005. "Resolving paradoxes involving surrogate end points," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 168(4), pages 753-762, November.
    2. Myoung-Jae Lee, 2005. "Monotonicity Conditions and Inequality Imputation for Sample-Selection and Non-Response Problems," Econometric Reviews, Taylor & Francis Journals, vol. 24(2), pages 175-194.
    3. Yi‐Hau Chen & Hung Chen, 2000. "A unified approach to regression analysis under double‐sampling designs," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 62(3), pages 449-460.
    4. Newey, Whitney K, 1990. "Semiparametric Efficiency Bounds," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 5(2), pages 99-135, April-Jun.
    5. Chen S.X. & Leung D.H.Y. & Qin J., 2003. "Information Recovery in a Study With Surrogate Endpoints," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 1052-1062, January.
    6. C. B. Begg & D. H. Y. Leung, 2000. "On the use of surrogate end points in randomized trials," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 163(1), pages 15-28.
    7. Sanders, Mitchell S., 2001. "Uncertainty and Turnout," Political Analysis, Cambridge University Press, vol. 9(1), pages 45-57, January.
    8. Riker, William H. & Ordeshook, Peter C., 1968. "A Theory of the Calculus of Voting," American Political Science Review, Cambridge University Press, vol. 62(1), pages 25-42, March.
    9. Riker, William H. & Ordeshook, Peter C., 1968. "A Theory of the Calculus of Voting," American Political Science Review, Cambridge University Press, vol. 62(1), pages 25-42, March.
    10. Schenker, Nathaniel & Taylor, Jeremy M. G., 1996. "Partially parametric techniques for multiple imputation," Computational Statistics & Data Analysis, Elsevier, vol. 22(4), pages 425-446, August.
    11. Xiaohong Chen & Han Hong & Elie Tamer, 2005. "Measurement Error Models with Auxiliary Data," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 72(2), pages 343-366.
    12. John Filer & Lawrence Kenny, 1980. "Voter turnout and the benefits of voting," Public Choice, Springer, vol. 35(5), pages 575-585, January.
    13. Denis Heng‐Yan Leung, 2001. "Statistical methods for clinical studies in the presence of surrogate end points," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 164(3), pages 485-503.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Tang, Cheng Yong & Leng, Chenlei, 2012. "An empirical likelihood approach to quantile regression with auxiliary information," Statistics & Probability Letters, Elsevier, vol. 82(1), pages 29-36.
    2. Peisong Han & Linglong Kong & Jiwei Zhao & Xingcai Zhou, 2019. "A general framework for quantile estimation with incomplete data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 81(2), pages 305-333, April.
    3. Zhong, Ping-Shou & Chen, Sixia, 2014. "Jackknife empirical likelihood inference with regression imputation and survey data," Journal of Multivariate Analysis, Elsevier, vol. 129(C), pages 193-205.
    4. Denis Heng Yan Leung & Ken Yamada & Biao Zhang, 2015. "Enriching Surveys with Supplementary Data and its Application to Studying Wage Regression," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(1), pages 155-179, March.
    5. Peisong Han, 2014. "Multiply Robust Estimation in Regression Analysis With Missing Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 1159-1173, September.
    6. Han, Peisong, 2012. "A note on improving the efficiency of inverse probability weighted estimator using the augmentation term," Statistics & Probability Letters, Elsevier, vol. 82(12), pages 2221-2228.
    7. Hamori, Shigeyuki & Motegi, Kaiji & Zhang, Zheng, 2019. "Calibration estimation of semiparametric copula models with data missing at random," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 85-109.
    8. Yacoubou Djima, Ismael & Kilic, Talip, 2024. "Attenuating measurement errors in agricultural productivity analysis by combining objective and self-reported survey data," Journal of Development Economics, Elsevier, vol. 168(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Denis Heng Yan Leung & Ken Yamada & Biao Zhang, 2015. "Enriching Surveys with Supplementary Data and its Application to Studying Wage Regression," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(1), pages 155-179, March.
    2. François Facchini & Abel François, 2005. "Territorial captivity and voter participation in national election: a theoretical and empirical analysis," Post-Print hal-00270739, HAL.
    3. Valentino Larcinese, 2007. "Does political knowledge increase turnout? Evidence from the 1997 British general election," Public Choice, Springer, vol. 131(3), pages 387-411, June.
    4. Christine Fauvelle-Aymar & Abel François, 2018. "Place of registration and place of residence: the non-linear detrimental impact of transportation cost on electoral participation," Public Choice, Springer, vol. 176(3), pages 405-440, September.
    5. Tim Powlowski & Dennis Coates, 2013. "The habit for voting, “civic duty” and travel distance," UMBC Economics Department Working Papers 13-05, UMBC Department of Economics.
    6. Valentino Larcinese, 2009. "Information Acquisition, Ideology and Turnout: Theory and Evidence From Britain," Journal of Theoretical Politics, , vol. 21(2), pages 237-276, April.
    7. Fred Thompson, 1982. "Closeness counts in horseshoes and dancing ... and elections," Public Choice, Springer, vol. 38(3), pages 305-316, January.
    8. Francesco Armillei & Enrico Cavallotti, 2021. "Concurrent elections and voting behaviour: evidence from an Italian referendum," BAFFI CAREFIN Working Papers 21164, BAFFI CAREFIN, Centre for Applied Research on International Markets Banking Finance and Regulation, Universita' Bocconi, Milano, Italy.
    9. Christine Fauvelle-Aymar & Abel François, 2015. "Mobilization, cost of voting and turnout: a natural randomized experiment with double elections," Public Choice, Springer, vol. 162(1), pages 183-199, January.
    10. Cantoni, Enrico & Gazzè, Ludovica & Schafer, Jerome, 2021. "Turnout in concurrent elections: Evidence from two quasi-experiments in Italy," European Journal of Political Economy, Elsevier, vol. 70(C).
    11. Robbett, Andrea & Matthews, Peter Hans, 2018. "Partisan bias and expressive voting," Journal of Public Economics, Elsevier, vol. 157(C), pages 107-120.
    12. León, Gianmarco, 2017. "Turnout, political preferences and information: Experimental evidence from Peru," Journal of Development Economics, Elsevier, vol. 127(C), pages 56-71.
    13. Ni, Xinwen, 2019. "Voting for Health Insurance Policy: the U.S. versus Europe," IRTG 1792 Discussion Papers 2019-012, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    14. Abraham Aldama & Mateo Vásquez-Cortés & Lauren Elyssa Young, 2019. "Fear and citizen coordination against dictatorship," Journal of Theoretical Politics, , vol. 31(1), pages 103-125, January.
    15. Lirong Xia, 2020. "How Likely Are Large Elections Tied?," Papers 2011.03791, arXiv.org, revised Jul 2021.
    16. Ming Li & Dipjyoti Majumdar, 2010. "A Psychologically Based Model of Voter Turnout," Journal of Public Economic Theory, Association for Public Economic Theory, vol. 12(5), pages 979-1002, October.
    17. Schnellenbach, Jan & Schubert, Christian, 2015. "Behavioral political economy: A survey," European Journal of Political Economy, Elsevier, vol. 40(PB), pages 395-417.
    18. Alastair Smith & Bruce Bueno de Mesquita & Tom LaGatta, 2017. "Group incentives and rational voting1," Journal of Theoretical Politics, , vol. 29(2), pages 299-326, April.
    19. Stephen Coate & Michael Conlin, 2002. "Voter Turnout: Theory and Evidence from Texas Liquor Referenda," NBER Working Papers 8720, National Bureau of Economic Research, Inc.
    20. Battaglini, Marco, 2005. "Sequential voting with abstention," Games and Economic Behavior, Elsevier, vol. 51(2), pages 445-463, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:70:y:2008:i:4:p:803-823. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.