IDEAS home Printed from https://ideas.repec.org/a/taf/jnlasa/v109y2014i505p11-23.html
   My bibliography  Save this article

A New Estimation Approach for Combining Epidemiological Data From Multiple Sources

Author

Listed:
  • Hui Huang
  • Xiaomei Ma
  • Rasmus Waagepetersen
  • Theodore R. Holford
  • Rong Wang
  • Harvey Risch
  • Lloyd Mueller
  • Yongtao Guan

Abstract

We propose a novel two-step procedure to combine epidemiological data obtained from diverse sources with the aim to quantify risk factors affecting the probability that an individual develops certain disease such as cancer. In the first step, we derive all possible unbiased estimating functions based on a group of cases and a group of controls each time. In the second step, we combine these estimating functions efficiently to make full use of the information contained in data. Our approach is computationally simple and flexible. We illustrate its efficacy through simulation and apply it to investigate pancreatic cancer risks based on data obtained from the Connecticut Tumor Registry, a population-based case--control study, and the Behavioral Risk Factor Surveillance System which is a state-based system of health surveys. Supplementary materials for this article are available online.

Suggested Citation

  • Hui Huang & Xiaomei Ma & Rasmus Waagepetersen & Theodore R. Holford & Rong Wang & Harvey Risch & Lloyd Mueller & Yongtao Guan, 2014. "A New Estimation Approach for Combining Epidemiological Data From Multiple Sources," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 11-23, March.
  • Handle: RePEc:taf:jnlasa:v:109:y:2014:i:505:p:11-23
    DOI: 10.1080/01621459.2013.870904
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/01621459.2013.870904
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/01621459.2013.870904?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hansen, Lars Peter, 1982. "Large Sample Properties of Generalized Method of Moments Estimators," Econometrica, Econometric Society, vol. 50(4), pages 1029-1054, July.
    2. Guido W. Imbens & Tony Lancaster, 1994. "Combining Micro and Macro Data in Microeconometric Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 61(4), pages 655-680.
    3. Crowder, Martin, 1986. "On Consistency and Inconsistency of Estimating Equations," Econometric Theory, Cambridge University Press, vol. 2(3), pages 305-330, December.
    4. Peter J. Diggle & Barry S. Rowlingson, 1994. "A Conditional Approach to Point Process Modelling of Elevated Risk," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 157(3), pages 433-440, May.
    5. Diggle, Peter J. & Guan, Yongtao & Hart, Anthony C. & Paize, Fauzia & Stanton, Michelle, 2010. "Estimating Individual-Level Risk in Spatial Epidemiology Using Spatially Aggregated Information on the Population at Risk," Journal of the American Statistical Association, American Statistical Association, vol. 105(492), pages 1394-1402.
    6. Jon Wakefield, 2004. "Ecological inference for 2 × 2 tables," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 167(3), pages 385-425, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Steven Berry & James Levinsohn & Ariel Pakes, 2004. "Differentiated Products Demand Systems from a Combination of Micro and Macro Data: The New Car Market," Journal of Political Economy, University of Chicago Press, vol. 112(1), pages 68-105, February.
    2. Lancaster, Tony & Imbens, Guido, 1996. "Case-control studies with contaminated controls," Journal of Econometrics, Elsevier, vol. 71(1-2), pages 145-160.
    3. Guell, Maia & Hu, Luojia, 2006. "Estimating the probability of leaving unemployment using uncompleted spells from repeated cross-section data," Journal of Econometrics, Elsevier, vol. 133(1), pages 307-341, July.
    4. Xiao, Wei, 2008. "The Competitive and Welfare Effects of New Product Introduction: The Case of Crystal Pepsi," Research Reports 149938, University of Connecticut, Food Marketing Policy Center.
    5. Lee, Seojeong, 2014. "Asymptotic refinements of a misspecification-robust bootstrap for generalized method of moments estimators," Journal of Econometrics, Elsevier, vol. 178(P3), pages 398-413.
    6. Qian, Hailong & Schmidt, Peter, 1999. "Improved instrumental variables and generalized method of moments estimators," Journal of Econometrics, Elsevier, vol. 91(1), pages 145-169, July.
    7. Bera, Anil K. & Bilias, Yannis, 2002. "The MM, ME, ML, EL, EF and GMM approaches to estimation: a synthesis," Journal of Econometrics, Elsevier, vol. 107(1-2), pages 51-86, March.
    8. Takahiro Hoshino & Ryosuke Igari, 2017. "Quasi-Bayesian Inference for Latent Variable Models with External Information: Application to generalized linear mixed models for biased data," Keio-IES Discussion Paper Series 2017-014, Institute for Economics Studies, Keio University.
    9. Nevo, Aviv, 2003. "Using Weights to Adjust for Sample Selection When Auxiliary Information Is Available," Journal of Business & Economic Statistics, American Statistical Association, vol. 21(1), pages 43-52, January.
    10. Heinz König & Michael Lechner, 1994. "Some Recent Developments in Microeconometrics - A Survey," Swiss Journal of Economics and Statistics (SJES), Swiss Society of Economics and Statistics (SSES), vol. 130(III), pages 299-331, September.
    11. van Dijk, Bram & Paap, Richard, 2008. "Explaining individual response using aggregated data," Journal of Econometrics, Elsevier, vol. 146(1), pages 1-9, September.
    12. Yuan, Xiaohui & Liu, Tianqing & Lin, Nan & Zhang, Baoxue, 2010. "Combining conditional and unconditional moment restrictions with missing responses," Journal of Multivariate Analysis, Elsevier, vol. 101(10), pages 2420-2433, November.
    13. Maia Guell & Luojia Hu, 2003. "Estimating the Probability of Leaving Unemployment Using Uncompleted Spells from Repeated Cross-section Data," Working Papers 854, Princeton University, Department of Economics, Industrial Relations Section..
    14. Amil Petrin, 2001. "Quantifying the Benefits of New Products: The Case of the Minivan," NBER Working Papers 8227, National Bureau of Economic Research, Inc.
    15. Naoki Wakamori, 2011. "Portfolio Considerations in Differentiated Product Purchases: An Application to the Japanese Automobile Market," Staff Working Papers 11-27, Bank of Canada.
    16. Paulo Albuquerque & Bart J. Bronnenberg, 2009. "Estimating Demand Heterogeneity Using Aggregated Data: An Application to the Frozen Pizza Category," Marketing Science, INFORMS, vol. 28(2), pages 356-372, 03-04.
    17. Xiaohui Chang & Rasmus Waagepetersen & Herbert Yu & Xiaomei Ma & Theodore R. Holford & Rong Wang & Yongtao Guan, 2015. "Disease risk estimation by combining case–control data with aggregated information on the population at risk," Biometrics, The International Biometric Society, vol. 71(1), pages 114-121, March.
    18. Jayawardhana, Jayani, 2013. "Direct-to-consumer advertising and consumer welfare," International Journal of Industrial Organization, Elsevier, vol. 31(2), pages 164-180.
    19. Denis Heng Yan Leung & Ken Yamada & Biao Zhang, 2015. "Enriching Surveys with Supplementary Data and its Application to Studying Wage Regression," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(1), pages 155-179, March.
    20. Mihai Giurcanu & Brett Presnell, 2018. "Bootstrap inference for misspecified moment condition models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 70(3), pages 605-630, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:jnlasa:v:109:y:2014:i:505:p:11-23. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/UASA20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.