IDEAS home Printed from https://ideas.repec.org/p/cpr/ceprdp/15852.html
   My bibliography  Save this paper

A Cross-verified Database of Notable People, 3500BC-2018AD

Author

Listed:
  • Wasmer, Etienne
  • Laouenan, Morgane
  • Bhargava, Palaash
  • Eymeoud, Jean Benoit
  • Plique, Guillaume

Abstract

We add to the literature on notable individuals (famous, prominent, distinguished) in collecting first a massive amount of data from various editions of Wikipedia and Wikidata along with deduplication techniques; and then using these partially overlapping sources to cross-verify each retrieved information. This strategy results in a cross-verified database of 2.2 million individuals, including a third who are not present in the English edition of Wikipedia. An extension to 4.7 million entries is currently not recommended given the inaccuracy of the information and discrepancies between Wikidata and other sources. A non-negligible fraction of newly-added individuals were collected from non-English editions of Wikipedia. We adopt a social science approach: data collection is driven by specific social questions on gender, economic and cul- tural development and quantitative exploration of cultural trends, that we document in this paper. A sample of 100,000 individuals is available here http://medialab.github.io/bhht-datascape, together with the most recent version of this paper.

Suggested Citation

  • Wasmer, Etienne & Laouenan, Morgane & Bhargava, Palaash & Eymeoud, Jean Benoit & Plique, Guillaume, 2021. "A Cross-verified Database of Notable People, 3500BC-2018AD," CEPR Discussion Papers 15852, C.E.P.R. Discussion Papers.
  • Handle: RePEc:cpr:ceprdp:15852
    as

    Download full text from publisher

    File URL: https://cepr.org/publications/DP15852
    Download Restriction: CEPR Discussion Papers are free to download for our researchers, subscribers and members. If you fall into one of these categories but have trouble downloading our papers, please contact us at subscribers@cepr.org
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Philippe Aghion & Nick Bloom & Richard Blundell & Rachel Griffith & Peter Howitt, 2005. "Competition and Innovation: an Inverted-U Relationship," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 120(2), pages 701-728.
    2. Alberto Alesina & Johann Harnoss & Hillel Rapoport, 2016. "Birthplace diversity and economic prosperity," Journal of Economic Growth, Springer, vol. 21(2), pages 101-138, June.
    3. Oded Galor & Omer Moav, 2004. "From Physical to Human Capital Accumulation: Inequality and the Process of Development," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 71(4), pages 1001-1026.
    4. Reenu, 2008. "Role of Economic Reform in the Growth of Indian Economy," Journal of Commerce and Trade, Society for Advanced Management Studies, vol. 3(1), pages 19-22, April.
    5. Dave Donaldson & Richard Hornbeck, 2016. "Railroads and American Economic Growth: A "Market Access" Approach," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 131(2), pages 799-858.
    6. David de la Croix & Omar Licandro, 2015. "The longevity of famous people from Hammurabi to Einstein," Journal of Economic Growth, Springer, vol. 20(3), pages 263-303, September.
    7. Michel Serafinelli & Guido Tabellini, 2022. "Creativity over time and space," Journal of Economic Growth, Springer, vol. 27(1), pages 1-43, March.
    8. Robert B. Ekelund, Jr. & Robert F. Hebert & Robert D. Tollison, 2002. "An Economic Analysis of the Protestant Reformation," Journal of Political Economy, University of Chicago Press, vol. 110(3), pages 646-671, June.
    9. La Porta, Rafael & Lopez-de-Silanes, Florencio & Shleifer, Andrei & Vishny, Robert, 1999. "The Quality of Government," The Journal of Law, Economics, and Organization, Oxford University Press, vol. 15(1), pages 222-279, April.
    10. Kristian Behrens & Gilles Duranton & Frédéric Robert-Nicoud, 2014. "Productive Cities: Sorting, Selection, and Agglomeration," Journal of Political Economy, University of Chicago Press, vol. 122(3), pages 507-553.
    11. Alex Bell & Raj Chetty & Xavier Jaravel & Neviana Petkova & John Van Reenen, 2019. "Who Becomes an Inventor in America? The Importance of Exposure to Innovation," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 134(2), pages 647-713.
    12. Roland G. Fryer & Steven D. Levitt, 2004. "The Causes and Consequences of Distinctively Black Names," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 119(3), pages 767-805.
    13. Gojko Barjamovic & Thomas Chaney & Kerem Cosar & Ali Hortacsu, 2019. "Trade, Merchants and the Lost Cities of the Bronze Age," SciencePo Working papers hal-03261799, HAL.
    14. Gojko Barjamovic & Thomas Chaney & Kerem Coşar & Ali Hortaçsu, 2019. "Trade, Merchants, and the Lost Cities of the Bronze Age," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 134(3), pages 1455-1503.
    15. Amy Finkelstein & Matthew Gentzkow & Heidi Williams, 2021. "Place-Based Drivers of Mortality: Evidence from Migration," American Economic Review, American Economic Association, vol. 111(8), pages 2697-2735, August.
    16. Dao, Thu Hien & Docquier, Frédéric & Parsons, Chris & Peri, Giovanni, 2018. "Migration and development: Dissecting the anatomy of the mobility transition," Journal of Development Economics, Elsevier, vol. 132(C), pages 88-101.
    17. Becker, Sascha O. & Pfaff, Steven & Rubin, Jared, 2016. "Causes and consequences of the Protestant Reformation," Explorations in Economic History, Elsevier, vol. 62(C), pages 1-25.
    18. Claudia Goldin, 2014. "A Grand Gender Convergence: Its Last Chapter," American Economic Review, American Economic Association, vol. 104(4), pages 1091-1119, April.
    19. Alberto Alesina & Paola Giuliano, 2015. "Culture and Institutions," Journal of Economic Literature, American Economic Association, vol. 53(4), pages 898-944, December.
    20. Edward L. Glaeser & Rafael La Porta & Florencio Lopez-de-Silanes & Andrei Shleifer, 2004. "Do Institutions Cause Growth?," Journal of Economic Growth, Springer, vol. 9(3), pages 271-303, September.
    21. Davide Cantoni, 2015. "The Economic Effects Of The Protestant Reformation: Testing The Weber Hypothesis In The German Lands," Journal of the European Economic Association, European Economic Association, vol. 13(4), pages 561-598, August.
    22. Oded Galor, 2011. "Unified Growth Theory and Comparative Development," Rivista di Politica Economica, SIPI Spa, issue 2, pages 9-21, April-Jun.
    23. Allen,Robert C., 2009. "The British Industrial Revolution in Global Perspective," Cambridge Books, Cambridge University Press, number 9780521868273, September.
    24. C Jara-Figueroa & Amy Z Yu & César A Hidalgo, 2019. "How the medium shapes the message: Printing and the rise of the arts and sciences," PLOS ONE, Public Library of Science, vol. 14(2), pages 1-14, February.
    25. Raj Chetty & Nathaniel Hendren & Patrick Kline & Emmanuel Saez, 2014. "Where is the land of Opportunity? The Geography of Intergenerational Mobility in the United States," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 129(4), pages 1553-1623.
    26. Daron Acemoglu & Simon Johnson, 2005. "Unbundling Institutions," Journal of Political Economy, University of Chicago Press, vol. 113(5), pages 949-995, October.
    27. Marianne Bertrand, 2020. "Gender in the Twenty-First Century," AEA Papers and Proceedings, American Economic Association, vol. 110, pages 1-24, May.
    28. Beine, Michel & Docquier, Frederic & Rapoport, Hillel, 2001. "Brain drain and economic growth: theory and evidence," Journal of Development Economics, Elsevier, vol. 64(1), pages 275-289, February.
    29. Crafts, Nicholas, 2011. "Explaining the first Industrial Revolution: two views," European Review of Economic History, Cambridge University Press, vol. 15(1), pages 153-168, April.
    30. Bisin, Alberto & Verdier, Thierry, 2001. "The Economics of Cultural Transmission and the Dynamics of Preferences," Journal of Economic Theory, Elsevier, vol. 97(2), pages 298-319, April.
    31. repec:hal:spmain:info:hdl:2441/c5agcqnoa9vg8sm5q63mold8p is not listed on IDEAS
    32. Oded Galor, 2011. "Unified Growth Theory," Economics Books, Princeton University Press, edition 1, number 9477.
    33. Mark Aguiar & Erik Hurst, 2007. "Measuring Trends in Leisure: The Allocation of Time Over Five Decades," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 122(3), pages 969-1006.
    34. Nathaniel Baum-Snow, 2007. "Did Highways Cause Suburbanization?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 122(2), pages 775-805.
    35. Gojko Barjamovic & Thomas Chaney & Kerem Cosar & Ali Hortacsu, 2019. "Trade, Merchants and the Lost Cities of the Bronze Age," SciencePo Working papers Main hal-03261799, HAL.
    36. Michael Kremer, 1993. "Population Growth and Technological Change: One Million B.C. to 1990," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 108(3), pages 681-716.
    37. repec:hal:pseose:hal-01304131 is not listed on IDEAS
    38. Hunt, Jennifer & Garant, Jean-Philippe & Herman, Hannah & Munroe, David J., 2013. "Why are women underrepresented amongst patentees?," Research Policy, Elsevier, vol. 42(4), pages 831-843.
    39. Paul J. J. Welfens, 2008. "ICT – productivity and economic growth in Europe," Springer Books, in: Paul J. J. Welfens & Ellen Walther-Klaus (ed.), Digital Excellence, pages 13-39, Springer.
    40. Ran Abramitzky & Leah Boustan & Elisa Jacome & Santiago Perez, 2021. "Intergenerational Mobility of Immigrants in the United States over Two Centuries," American Economic Review, American Economic Association, vol. 111(2), pages 580-608, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bennett, Daniel L. & Faria, Hugo J. & Gwartney, James D. & Morales, Daniel R., 2017. "Economic Institutions and Comparative Economic Development: A Post-Colonial Perspective," World Development, Elsevier, vol. 96(C), pages 503-519.
    2. Johnson, Noel D. & Koyama, Mark, 2017. "Jewish communities and city growth in preindustrial Europe," Journal of Development Economics, Elsevier, vol. 127(C), pages 339-354.
    3. Anastasia Litina, 2016. "Natural land productivity, cooperation and comparative development," Journal of Economic Growth, Springer, vol. 21(4), pages 351-408, December.
    4. Combes, Pierre-Philippe & Gobillon, Laurent & Zylberberg, Yanos, 2022. "Urban economics in a historical perspective: Recovering data with machine learning," Regional Science and Urban Economics, Elsevier, vol. 94(C).
    5. Quamrul Ashraf & Oded Galor, 2011. "Cultural Diversity, Geographical Isolation, and the Origin of the Wealth of Nations," Department of Economics Working Papers 2011-15, Department of Economics, Williams College.
    6. Braunfels, Elias, 2016. "Further Unbundling Institutions," Discussion Paper Series in Economics 13/2016, Norwegian School of Economics, Department of Economics.
    7. Hanlon, W.Walker & Heblich, Stephan, 2022. "History and urban economics," Regional Science and Urban Economics, Elsevier, vol. 94(C).
    8. Fiaschi, Davide & Fioroni, Tamara, 2019. "Transition to modern growth in Great Britain: The role of technological progress, adult mortality and factor accumulation," Structural Change and Economic Dynamics, Elsevier, vol. 51(C), pages 472-490.
    9. Oyèkọ́lá, Ọláyínká, 2021. "Where do people live longer?," Research in Economics, Elsevier, vol. 75(1), pages 21-44.
    10. Oded Galor & Quamrul Ashraf, 2007. "Cultural Assimilation, Cultural Diffusion and the Origin of the Wealth of Nations," Working Papers 2007-3, Brown University, Department of Economics.
    11. Canning, David & Mabeu, Marie Christelle & Pongou, Roland, 2020. "Colonial origins and fertility: can the market overcome history?," MPRA Paper 112496, University Library of Munich, Germany.
    12. Francesco Cinnirella & Jochen Streb, 2017. "The role of human capital and innovation in economic development: evidence from post-Malthusian Prussia," Journal of Economic Growth, Springer, vol. 22(2), pages 193-227, June.
    13. Boikos, Spyridon & Bucci, Alberto & Stengos, Thanasis, 2022. "Leisure and innovation in horizontal R&D-based growth," Economic Modelling, Elsevier, vol. 107(C).
    14. Litina, Anastasia, 2012. "Unfavorable land endowment, cooperation, and reversal of fortune," MPRA Paper 39702, University Library of Munich, Germany.
    15. Cemal Eren Arbatlı & Quamrul H. Ashraf & Oded Galor & Marc Klemp, 2020. "Diversity and Conflict," Econometrica, Econometric Society, vol. 88(2), pages 727-797, March.
    16. Quamrul H. Ashraf & Francesco Cinnirella & Oded Galor & Boris Gershman & Erik Hornung, 2017. "Capital-Skill Complementarity and the Emergence of Labor Emancipation," Working Papers 2017-1, Brown University, Department of Economics.
    17. Holger Strulik, 2016. "Secularization And Long-Run Economic Growth," Economic Inquiry, Western Economic Association International, vol. 54(1), pages 177-200, January.
    18. Broadberry, Stephen & Ghosal, Sayantan & Proto, Eugenio, 2017. "Anonymity, efficiency wages and technological progress," Journal of Development Economics, Elsevier, vol. 127(C), pages 379-394.
    19. Becker, Sascha O. & Rubin, Jared & Woessmann, Ludger, 2020. "Religion in Economic History: A Survey," CEPR Discussion Papers 14894, C.E.P.R. Discussion Papers.
    20. Quamrul Ashraf & Oded Galor, 2013. "The 'Out of Africa' Hypothesis, Human Genetic Diversity, and Comparative Economic Development," American Economic Review, American Economic Association, vol. 103(1), pages 1-46, February.

    More about this item

    Keywords

    Notable individuals; Creative class; Urban economics; Economic history;
    All these keywords.

    JEL classification:

    • N01 - Economic History - - General - - - Development of the Discipline: Historiographical; Sources and Methods
    • N9 - Economic History - - Regional and Urban History
    • R00 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General - - - General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cpr:ceprdp:15852. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://www.cepr.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.