IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2408.09187.html
   My bibliography  Save this paper

Externally Valid Selection of Experimental Sites via the k-Median Problem

Author

Listed:
  • Jos'e Luis Montiel Olea
  • Brenda Prallon
  • Chen Qiu
  • Jorg Stoye
  • Yiwei Sun

Abstract

We present a decision-theoretic justification for viewing the question of how to best choose where to experiment in order to optimize external validity as a k-median (clustering) problem, a popular problem in computer science and operations research. We present conditions under which minimizing the worst-case, welfare-based regret among all nonrandom schemes that select k sites to experiment is approximately equal - and sometimes exactly equal - to finding the k most central vectors of baseline site-level covariates. The k-median problem can be formulated as a linear integer program. Two empirical applications illustrate the theoretical and computational benefits of the suggested procedure.

Suggested Citation

  • Jos'e Luis Montiel Olea & Brenda Prallon & Chen Qiu & Jorg Stoye & Yiwei Sun, 2024. "Externally Valid Selection of Experimental Sites via the k-Median Problem," Papers 2408.09187, arXiv.org.
  • Handle: RePEc:arx:papers:2408.09187
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2408.09187
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Tetenov, Aleksey, 2012. "Statistical treatment choice based on asymmetric minimax regret criteria," Journal of Econometrics, Elsevier, vol. 166(1), pages 157-165.
    2. Jean N. Lee & Jonathan Morduch & Saravana Ravindran & Abu Shonchoy & Hassan Zaman, 2021. "Poverty and Migration in the Digital Age: Experimental Evidence on Mobile Banking in Bangladesh," American Economic Journal: Applied Economics, American Economic Association, vol. 13(1), pages 38-71, January.
    3. Eva Vivalt, 2020. "How Much Can We Generalize From Impact Evaluations?," Journal of the European Economic Association, European Economic Association, vol. 18(6), pages 3045-3089.
    4. Abadie, Alberto & Diamond, Alexis & Hainmueller, Jens, 2010. "Synthetic Control Methods for Comparative Case Studies: Estimating the Effect of California’s Tobacco Control Program," Journal of the American Statistical Association, American Statistical Association, vol. 105(490), pages 493-505.
    5. Stoye, Jörg, 2012. "Minimax regret treatment choice with covariates or with limited validity of experiments," Journal of Econometrics, Elsevier, vol. 166(1), pages 138-156.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Manski, Charles F., 2023. "Probabilistic prediction for binary treatment choice: With focus on personalized medicine," Journal of Econometrics, Elsevier, vol. 234(2), pages 647-663.
    2. Eric Mbakop & Max Tabord‐Meehan, 2021. "Model Selection for Treatment Choice: Penalized Welfare Maximization," Econometrica, Econometric Society, vol. 89(2), pages 825-848, March.
    3. Charles F. Manski & Aleksey Tetenov, 2014. "The Quantile Performance of Statistical Treatment Rules Using Hypothesis Tests to Allocate a Population to Two Treatments," CeMMAP working papers CWP44/14, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    4. Susan Athey & Stefan Wager, 2021. "Policy Learning With Observational Data," Econometrica, Econometric Society, vol. 89(1), pages 133-161, January.
    5. Keisuke Hirano & Jack R. Porter, 2016. "Panel Asymptotics and Statistical Decision Theory," The Japanese Economic Review, Japanese Economic Association, vol. 67(1), pages 33-49, March.
    6. Neil Christy & Amanda Ellen Kowalski, 2024. "Counting Defiers in Health Care with a Design-Based Likelihood for the Joint Distribution of Potential Outcomes," Papers 2412.16352, arXiv.org.
    7. Charles F. Manski & Aleksey Tetenov, 2015. "Clinical trial design enabling ε-optimal treatment rules," CeMMAP working papers 60/15, Institute for Fiscal Studies.
    8. Thomas M. Russell, 2020. "Policy Transforms and Learning Optimal Policies," Papers 2012.11046, arXiv.org.
    9. Charles F. Manski & Aleksey Tetenov, 2015. "Clinical trial design enabling epsilon-optimal treatment rules," Carlo Alberto Notebooks 430, Collegio Carlo Alberto.
    10. Haitian Xie, 2020. "Finite-Sample Average Bid Auction," Papers 2008.10217, arXiv.org, revised Feb 2022.
    11. Daido Kido, 2023. "Locally Asymptotically Minimax Statistical Treatment Rules Under Partial Identification," Papers 2311.08958, arXiv.org.
    12. Davide Viviano & Jelena Bradic, 2020. "Fair Policy Targeting," Papers 2005.12395, arXiv.org, revised Jun 2022.
    13. Michael Gechter & Keisuke Hirano & Jean Lee & Mahreen Mahmud & Orville Mondal & Jonathan Morduch & Saravana Ravindran & Abu S. Shonchoy, 2024. "Selecting Experimental Sites for External Validity," Papers 2405.13241, arXiv.org.
    14. Davide Viviano & Jess Rudder, 2020. "Policy design in experiments with unknown interference," Papers 2011.08174, arXiv.org, revised May 2024.
    15. Anders Bredahl Kock & David Preinerstorfer & Bezirgen Veliyev, 2022. "Functional Sequential Treatment Allocation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 117(539), pages 1311-1323, September.
    16. Charles F. Manski & Aleksey Tetenov, 2023. "Statistical decision theory respecting stochastic dominance," The Japanese Economic Review, Springer, vol. 74(4), pages 447-469, October.
    17. Kitagawa, Toru & Wang, Guanyi, 2023. "Who should get vaccinated? Individualized allocation of vaccines over SIR network," Journal of Econometrics, Elsevier, vol. 232(1), pages 109-131.
    18. Davide Viviano, 2019. "Policy Targeting under Network Interference," Papers 1906.10258, arXiv.org, revised Apr 2024.
    19. Neil Christy & A. E. Kowalski, 2024. "Starting Small: Prioritizing Safety over Efficacy in Randomized Experiments Using the Exact Finite Sample Likelihood," Papers 2407.18206, arXiv.org.
    20. Chunrong Ai & Yue Fang & Haitian Xie, 2024. "Data-driven Policy Learning for Continuous Treatments," Papers 2402.02535, arXiv.org, revised Nov 2024.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2408.09187. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.