IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v389y2010i11p2280-2283.html
   My bibliography  Save this article

Measures of lexical distance between languages

Author

Listed:
  • Petroni, Filippo
  • Serva, Maurizio

Abstract

The idea of measuring distance between languages seems to have its roots in the work of the French explorer Dumont D’Urville (1832) [13]. He collected comparative word lists for various languages during his voyages aboard the Astrolabe from 1826 to 1829 and, in his work concerning the geographical division of the Pacific, he proposed a method for measuring the degree of relation among languages. The method used by modern glottochronology, developed by Morris Swadesh in the 1950s, measures distances from the percentage of shared cognates, which are words with a common historical origin. Recently, we proposed a new automated method which uses the normalized Levenshtein distances among words with the same meaning and averages on the words contained in a list. Recently another group of scholars, Bakker et al. (2009) [8] and Holman et al. (2008) [9], proposed a refined version of our definition including a second normalization. In this paper we compare the information content of our definition with the refined version in order to decide which of the two can be applied with greater success to resolve relationships among languages.

Suggested Citation

  • Petroni, Filippo & Serva, Maurizio, 2010. "Measures of lexical distance between languages," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(11), pages 2280-2283.
  • Handle: RePEc:eee:phsmap:v:389:y:2010:i:11:p:2280-2283
    DOI: 10.1016/j.physa.2010.02.004
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437110001081
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2010.02.004?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Russell D. Gray & Quentin D. Atkinson, 2003. "Language-tree divergence times support the Anatolian theory of Indo-European origin," Nature, Nature, vol. 426(6965), pages 435-439, November.
    2. Russell D. Gray & Fiona M. Jordan, 2000. "Language trees support the express-train sequence of Austronesian expansion," Nature, Nature, vol. 405(6790), pages 1052-1055, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Isphording, Ingo E. & Piopiunik, Marc & Rodríguez-Planas, Núria, 2016. "Speaking in numbers: The effect of reading performance on math performance among immigrants," Economics Letters, Elsevier, vol. 139(C), pages 52-56.
    2. repec:zbw:hohpro:352 is not listed on IDEAS
    3. Isphording, Ingo E. & Otten, Sebastian, 2014. "Linguistic barriers in the destination language acquisition of immigrants," Journal of Economic Behavior & Organization, Elsevier, vol. 105(C), pages 30-50.
    4. Ingo Eduard Isphording & Sebastian Otten, 2013. "The Costs of Babylon—Linguistic Distance in Applied Economics," Review of International Economics, Wiley Blackwell, vol. 21(2), pages 354-369, May.
    5. repec:zbw:rwirep:0337 is not listed on IDEAS
    6. Espitia, Diego & Larralde, Hernán, 2020. "Universal and non-universal text statistics: Clustering coefficient for language identification," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 553(C).
    7. Gamallo, Pablo & Pichel, José Ramom & Alegria, Iñaki, 2017. "From language identification to language distance," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 484(C), pages 152-162.
    8. repec:old:wpaper:352 is not listed on IDEAS
    9. Ibrahim Bousmah & Gilles Grenier & David M. Gray, 2021. "Linguistic Distance, Languages of Work and Wages of Immigrants in Montreal," Journal of Labor Research, Springer, vol. 42(1), pages 1-28, March.
    10. Erkan Gören, 2013. "Economic Effects of Domestic and Neighbouring Countries’ Cultural Diversity," Working Papers V-352-13, University of Oldenburg, Department of Economics, revised Mar 2013.
    11. Ingo Eduard Isphording & Sebastian Otten, 2013. "The Costs of Babylon—Linguistic Distance in Applied Economics," Review of International Economics, Wiley Blackwell, vol. 21(2), pages 354-369, 05.
    12. Lorraine Wong, 2023. "The effect of linguistic proximity on the labour market outcomes of the asylum population," Journal of Population Economics, Springer;European Society for Population Economics, vol. 36(2), pages 609-652, April.
    13. Mehri, Ali & Jamaati, Maryam, 2021. "Statistical metrics for languages classification: A case study of the Bible translations," Chaos, Solitons & Fractals, Elsevier, vol. 144(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matthew J. Baker, 2021. "Foundations of the Age-Area Hypothesis," Palgrave Communications, Palgrave Macmillan, vol. 8(1), pages 1-17, December.
    2. Klaus Desmet & Ignacio Ortuño-Ortín & Romain Wacziarg, 2009. "The political economy of ethnolinguistic cleavages," Working Papers 2009-17, Instituto Madrileño de Estudios Avanzados (IMDEA) Ciencias Sociales.
    3. Victor Ginsburgh & Shlomo Weber, 2020. "The Economics of Language," Journal of Economic Literature, American Economic Association, vol. 58(2), pages 348-404, June.
    4. Gamallo, Pablo & Pichel, José Ramom & Alegria, Iñaki, 2017. "From language identification to language distance," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 484(C), pages 152-162.
    5. Joseph Flavian Gomes, 2020. "The health costs of ethnic distance: evidence from sub-Saharan Africa," Journal of Economic Growth, Springer, vol. 25(2), pages 195-226, June.
    6. Aparicio Fenoll, Ainoa & Kuehn, Zoë, 2016. "Education Policies and Migration across European Countries," IZA Discussion Papers 9755, Institute of Labor Economics (IZA).
    7. Ainhoa Aparicio Fenoll & Zoë Kuehn, 2017. "Compulsory Schooling Laws and Migration Across European Countries," Demography, Springer;Population Association of America (PAA), vol. 54(6), pages 2181-2200, December.
    8. Stanisz, Tomasz & Drożdż, Stanisław & Kwapień, Jarosław, 2023. "Universal versus system-specific features of punctuation usage patterns in major Western languages," Chaos, Solitons & Fractals, Elsevier, vol. 168(C).
    9. Desmet, Klaus & Ortuño-Ortín, Ignacio & Wacziarg, Romain, 2012. "The political economy of linguistic cleavages," Journal of Development Economics, Elsevier, vol. 97(2), pages 322-338.
    10. Ginsburgh, Victor & Weber, Shlomo, 2015. "Linguistic Distances and their Use in Economics," CEPR Discussion Papers 10640, C.E.P.R. Discussion Papers.
    11. Simone Pompei & Vittorio Loreto & Francesca Tria, 2011. "On the Accuracy of Language Trees," PLOS ONE, Public Library of Science, vol. 6(6), pages 1-11, June.
    12. Kandler, Anne & Laland, Kevin N., 2009. "An investigation of the relationship between innovation and cultural diversity," Theoretical Population Biology, Elsevier, vol. 76(1), pages 59-67.
    13. Stelios Michalopoulos, 2012. "The Origins of Ethnolinguistic Diversity," American Economic Review, American Economic Association, vol. 102(4), pages 1508-1539, June.
    14. Carl Müller-Crepon & Yannick Pengl & Nils-Christian Bormann, 2022. "Linking Ethnic Data from Africa (LEDA)," Journal of Peace Research, Peace Research Institute Oslo, vol. 59(3), pages 425-435, May.
    15. Stelios Michalopoulos, 2008. "The Origins of Ethnolinguistic Diversity: Theory and Evidence," Discussion Papers Series, Department of Economics, Tufts University 0725, Department of Economics, Tufts University.
    16. Nico Neureiter & Peter Ranacher & Nour Efrat-Kowalsky & Gereon A. Kaiping & Robert Weibel & Paul Widmer & Remco R. Bouckaert, 2022. "Detecting contact in language trees: a Bayesian phylogenetic model with horizontal transfer," Palgrave Communications, Palgrave Macmillan, vol. 9(1), pages 1-14, December.
    17. Aguilar, Elliot & Ghirlanda, Stefano, 2015. "Modeling the genealogy of a cultural trait," Theoretical Population Biology, Elsevier, vol. 101(C), pages 1-8.
    18. Victor Zitian Chen & John Cantwell, 2022. "An evolutionary view of institutional complexity," Journal of Evolutionary Economics, Springer, vol. 32(3), pages 1071-1090, July.
    19. Klaus Desmet & Ignacio Ortuño-Ortín & Shlomo Weber, 2017. "Peripheral diversity: transfers versus public goods," Social Choice and Welfare, Springer;The Society for Social Choice and Welfare, vol. 49(3), pages 787-823, December.
    20. Luke J Matthews & Sam Passmore & Paul M Richard & Russell D Gray & Quentin D Atkinson, 2016. "Shared Cultural History as a Predictor of Political and Economic Changes among Nation States," PLOS ONE, Public Library of Science, vol. 11(4), pages 1-18, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:389:y:2010:i:11:p:2280-2283. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.