IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0159496.html
   My bibliography  Save this article

Copula-Based Approach to Synthetic Population Generation

Author

Listed:
  • Byungduk Jeong
  • Wonjoon Lee
  • Deok-Soo Kim
  • Hayong Shin

Abstract

Generating synthetic baseline populations is a fundamental step of agent-based modeling and simulation, which is growing fast in a wide range of socio-economic areas including transportation planning research. Traditionally, in many commercial and non-commercial microsimulation systems, the iterative proportional fitting (IPF) procedure has been used for creating the joint distribution of individuals when combining a reference joint distribution with target marginal distributions. Although IPF is simple, computationally efficient, and rigorously founded, it is unclear whether IPF well preserves the dependence structure of the reference joint table sufficiently when fitting it to target margins. In this paper, a novel method is proposed based on the copula concept in order to provide an alternative approach to the problem that IPF resolves. The dependency characteristic measures were computed and the results from the proposed method and IPF were compared. In most test cases, the proposed method outperformed IPF in preserving the dependence structure of the reference joint distribution.

Suggested Citation

  • Byungduk Jeong & Wonjoon Lee & Deok-Soo Kim & Hayong Shin, 2016. "Copula-Based Approach to Synthetic Population Generation," PLOS ONE, Public Library of Science, vol. 11(8), pages 1-28, August.
  • Handle: RePEc:plo:pone00:0159496
    DOI: 10.1371/journal.pone.0159496
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0159496
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0159496&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0159496?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Pinjari, Abdul Rawoof & Bhat, Chandra R. & Hensher, David A., 2009. "Residential self-selection effects in an activity time-use behavior model," Transportation Research Part B: Methodological, Elsevier, vol. 43(7), pages 729-748, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jason Hawkins & Khandker Nurul Habib, 2023. "A multi-source data fusion framework for joint population, expenditure, and time use synthesis," Transportation, Springer, vol. 50(4), pages 1323-1346, August.
    2. Till Koebe & Alejandra Arias-Salazar & Timo Schmid, 2023. "Releasing survey microdata with exact cluster locations and additional privacy safeguards," Palgrave Communications, Palgrave Macmillan, vol. 10(1), pages 1-13, December.
    3. Nejad, Mohammad Motalleb & Erdogan, Sevgi & Cirillo, Cinzia, 2021. "A statistical approach to small area synthetic population generation as a basis for carless evacuation planning," Journal of Transport Geography, Elsevier, vol. 90(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Houshmand Masoumi, 2021. "Residential Location Choice in Istanbul, Tehran, and Cairo: The Importance of Commuting to Work," Sustainability, MDPI, vol. 13(10), pages 1-18, May.
    2. Bhat, Chandra R. & Astroza, Sebastian & Sidharthan, Raghuprasad & Alam, Mohammad Jobair Bin & Khushefati, Waleed H., 2014. "A joint count-continuous model of travel behavior with selection based on a multinomial probit residential density choice model," Transportation Research Part B: Methodological, Elsevier, vol. 68(C), pages 31-51.
    3. Kaplan, Sigal & Shiftan, Yoram & Bekhor, Shlomo, 2012. "Development and estimation of a semi-compensatory model with a flexible error structure," Transportation Research Part B: Methodological, Elsevier, vol. 46(2), pages 291-304.
    4. Gershenson, Seth, 2013. "The causal effect of commute time on labor supply: Evidence from a natural experiment involving substitute teachers," Transportation Research Part A: Policy and Practice, Elsevier, vol. 54(C), pages 127-140.
    5. Lin, Tao & Wang, Donggen, 2015. "Tradeoffs between in- and out-of-residential neighborhood locations for discretionary activities and time use: do social contexts matter?," Journal of Transport Geography, Elsevier, vol. 47(C), pages 119-127.
    6. Sener, Ipek N. & Pendyala, Ram M. & Bhat, Chandra R., 2011. "Accommodating spatial correlation across choice alternatives in discrete choice models: an application to modeling residential location choice behavior," Journal of Transport Geography, Elsevier, vol. 19(2), pages 294-303.
    7. Chatman, Daniel, 2014. "Estimating the effect of land use and transportation planning on travel patterns: Three problems in controlling for residential self-selection," The Journal of Transport and Land Use, Center for Transportation Studies, University of Minnesota, vol. 7(3), pages 47-56.
    8. Sweet, Matthias N., 2014. "Do firms flee traffic congestion?," Journal of Transport Geography, Elsevier, vol. 35(C), pages 40-49.
    9. Chandra Bhat & Konstadinos Goulias & Ram Pendyala & Rajesh Paleti & Raghuprasad Sidharthan & Laura Schmitt & Hsi-Hwa Hu, 2013. "A household-level activity pattern generation model with an application for Southern California," Transportation, Springer, vol. 40(5), pages 1063-1086, September.
    10. Kumar Dey, Bibhas & Anowar, Sabreena & Eluru, Naveen, 2021. "A framework for estimating bikeshare origin destination flows using a multiple discrete continuous system," Transportation Research Part A: Policy and Practice, Elsevier, vol. 144(C), pages 119-133.
    11. Jian, Sisi & Rashidi, Taha Hossein & Dixit, Vinayak, 2017. "An analysis of carsharing vehicle choice and utilization patterns using multiple discrete-continuous extreme value (MDCEV) models," Transportation Research Part A: Policy and Practice, Elsevier, vol. 103(C), pages 362-376.
    12. Rezaei, Ali & Patterson, Zachary, 2018. "Preference stability in household location choice: Using cross-sectional data from three censuses," Research in Transportation Economics, Elsevier, vol. 67(C), pages 44-53.
    13. Limanond, Thirayoot & Jomnonkwao, Sajjakaj & Watthanaklang, Duangdao & Ratanavaraha, Vatanavongs & Siridhara, Siradol, 2011. "How vehicle ownership affect time utilization on study, leisure, social activities, and academic performance of university students? A case study of engineering freshmen in a rural university in Thail," Transport Policy, Elsevier, vol. 18(5), pages 719-726, September.
    14. Olaru, Doina & Smith, Brett & Taplin, John H.E., 2011. "Residential location and transit-oriented development in a new rail corridor," Transportation Research Part A: Policy and Practice, Elsevier, vol. 45(3), pages 219-237, March.
    15. Ho, Chinh Q. & Hensher, David A. & Ellison, Richard, 2017. "Endogenous treatment of residential location choices in transport and land use models: Introducing the MetroScan framework," Journal of Transport Geography, Elsevier, vol. 64(C), pages 120-131.
    16. Naveen Eluru & Chandra Bhat & Ram Pendyala & Karthik Konduri, 2010. "A joint flexible econometric model system of household residential location and vehicle fleet composition/usage choices," Transportation, Springer, vol. 37(4), pages 603-626, July.
    17. Ipek Sener & Chandra Bhat, 2012. "Modeling the spatial and temporal dimensions of recreational activity participation with a focus on physical activities," Transportation, Springer, vol. 39(3), pages 627-656, May.
    18. Bhat, Chandra R. & Astroza, Sebastian & Bhat, Aarti C. & Nagel, Kai, 2016. "Incorporating a multiple discrete-continuous outcome in the generalized heterogeneous data model: Application to residential self-selection effects analysis in an activity time-use behavior model," Transportation Research Part B: Methodological, Elsevier, vol. 91(C), pages 52-76.
    19. Sharma, Ishant & Mishra, Sabyasachee & Golias, Mihalis M. & Welch, Timothy F. & Cherry, Christopher R., 2020. "Equity of transit connectivity in Tennessee cities," Journal of Transport Geography, Elsevier, vol. 86(C).
    20. Pinjari, Abdul Rawoof & Bhat, Chandra, 2010. "A multiple discrete-continuous nested extreme value (MDCNEV) model: Formulation and application to non-worker activity time-use and timing behavior on weekdays," Transportation Research Part B: Methodological, Elsevier, vol. 44(4), pages 562-583, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0159496. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.