
Measures of lexical distance between languages

Author

Listed:
  • Petroni, Filippo
  • Serva, Maurizio

Abstract

The idea of measuring distance between languages seems to have its roots in the work of the French explorer Dumont D’Urville (1832) [13]. He collected comparative word lists for various languages during his voyages aboard the Astrolabe from 1826 to 1829 and, in his work on the geographical division of the Pacific, proposed a method for measuring the degree of relation among languages. The method used by modern glottochronology, developed by Morris Swadesh in the 1950s, measures distances from the percentage of shared cognates, which are words with a common historical origin. Recently, we proposed a new automated method which uses the normalized Levenshtein distance between words with the same meaning and averages over the words contained in a list. Another group of scholars, Bakker et al. (2009) [8] and Holman et al. (2008) [9], then proposed a refined version of our definition that includes a second normalization. In this paper we compare the information content of our definition with that of the refined version in order to decide which of the two can be applied with greater success to resolve relationships among languages.
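The two measures compared in the abstract can be illustrated with a minimal Python sketch. The abstract does not spell out the formulas, so the details here are assumptions: the first normalization divides each Levenshtein distance by the length of the longer word, and the second normalization divides the averaged distance by the average normalized distance between words with different meanings. All function names are hypothetical and are not taken from the paper.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions, or substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def normalized_distance(a: str, b: str) -> float:
    """Levenshtein distance divided by the longer word's length (assumed normalization)."""
    longest = max(len(a), len(b))
    return levenshtein(a, b) / longest if longest else 0.0


def lexical_distance(list_a: list[str], list_b: list[str]) -> float:
    """Average normalized distance over word pairs with the same meaning.

    list_a[i] and list_b[i] are assumed to express the same meaning.
    """
    if len(list_a) != len(list_b) or not list_a:
        raise ValueError("word lists must be non-empty and of equal length")
    total = sum(normalized_distance(a, b) for a, b in zip(list_a, list_b))
    return total / len(list_a)


def doubly_normalized_distance(list_a: list[str], list_b: list[str]) -> float:
    """Divide the averaged distance by a chance-similarity baseline.

    The baseline is the mean normalized distance between words with
    different meanings (assumed form of the second normalization).
    """
    same = lexical_distance(list_a, list_b)
    m = len(list_a)
    if m < 2:
        raise ValueError("need at least two meanings for the baseline")
    off_diagonal = [normalized_distance(list_a[i], list_b[j])
                    for i in range(m) for j in range(m) if i != j]
    baseline = sum(off_diagonal) / len(off_diagonal)
    if baseline == 0:
        raise ValueError("baseline is zero; second normalization undefined")
    return same / baseline
```

With real data, `list_a` and `list_b` would be aligned word lists (for example, Swadesh-style lists) for two languages, transcribed in a common alphabet.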

Suggested Citation

  • Petroni, Filippo & Serva, Maurizio, 2010. "Measures of lexical distance between languages," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(11), pages 2280-2283.
  • Handle: RePEc:eee:phsmap:v:389:y:2010:i:11:p:2280-2283
    DOI: 10.1016/j.physa.2010.02.004

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437110001081
    Download Restriction: Full text for ScienceDirect subscribers only. The journal offers the option of making the article available online on ScienceDirect for a fee of $3,000.

    File URL: https://libkey.io/10.1016/j.physa.2010.02.004?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    References listed on IDEAS

    1. Russell D. Gray & Fiona M. Jordan, 2000. "Language trees support the express-train sequence of Austronesian expansion," Nature, Nature, vol. 405(6790), pages 1052-1055, June.
    2. Russell D. Gray & Quentin D. Atkinson, 2003. "Language-tree divergence times support the Anatolian theory of Indo-European origin," Nature, Nature, vol. 426(6965), pages 435-439, November.

    Citations



    Cited by:

    1. Isphording, Ingo E. & Piopiunik, Marc & Rodríguez-Planas, Núria, 2016. "Speaking in numbers: The effect of reading performance on math performance among immigrants," Economics Letters, Elsevier, vol. 139(C), pages 52-56.
    2. Isphording, Ingo E. & Otten, Sebastian, 2014. "Linguistic barriers in the destination language acquisition of immigrants," Journal of Economic Behavior & Organization, Elsevier, vol. 105(C), pages 30-50.
    3. Ingo Eduard Isphording & Sebastian Otten, 2013. "The Costs of Babylon—Linguistic Distance in Applied Economics," Review of International Economics, Wiley Blackwell, vol. 21(2), pages 354-369, May.
    4. Espitia, Diego & Larralde, Hernán, 2020. "Universal and non-universal text statistics: Clustering coefficient for language identification," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 553(C).
    5. Gamallo, Pablo & Pichel, José Ramom & Alegria, Iñaki, 2017. "From language identification to language distance," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 484(C), pages 152-162.
    6. Ibrahim Bousmah & Gilles Grenier & David M. Gray, 2021. "Linguistic Distance, Languages of Work and Wages of Immigrants in Montreal," Journal of Labor Research, Springer, vol. 42(1), pages 1-28, March.
    7. Erkan Gören, 2013. "Economic Effects of Domestic and Neighbouring Countries’ Cultural Diversity," Working Papers V-352-13, University of Oldenburg, Department of Economics, revised Mar 2013.
    8. Lorraine Wong, 2023. "The effect of linguistic proximity on the labour market outcomes of the asylum population," Journal of Population Economics, Springer;European Society for Population Economics, vol. 36(2), pages 609-652, April.
    9. Mehri, Ali & Jamaati, Maryam, 2021. "Statistical metrics for languages classification: A case study of the Bible translations," Chaos, Solitons & Fractals, Elsevier, vol. 144(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matthew J. Baker, 2021. "Foundations of the Age-Area Hypothesis," Palgrave Communications, Palgrave Macmillan, vol. 8(1), pages 1-17, December.
    2. Klaus Desmet & Ignacio Ortuño-Ortín & Romain Wacziarg, 2009. "The political economy of ethnolinguistic cleavages," Working Papers 2009-17, Instituto Madrileño de Estudios Avanzados (IMDEA) Ciencias Sociales.
    3. Victor Ginsburgh & Shlomo Weber, 2020. "The Economics of Language," Journal of Economic Literature, American Economic Association, vol. 58(2), pages 348-404, June.
    4. Aparicio Fenoll, Ainoa & Kuehn, Zoë, 2016. "Education Policies and Migration across European Countries," IZA Discussion Papers 9755, Institute of Labor Economics (IZA).
    5. Ainhoa Aparicio Fenoll & Zoë Kuehn, 2017. "Compulsory Schooling Laws and Migration Across European Countries," Demography, Springer;Population Association of America (PAA), vol. 54(6), pages 2181-2200, December.
    6. Stanisz, Tomasz & Drożdż, Stanisław & Kwapień, Jarosław, 2023. "Universal versus system-specific features of punctuation usage patterns in major Western languages," Chaos, Solitons & Fractals, Elsevier, vol. 168(C).
    7. Stelios Michalopoulos, 2012. "The Origins of Ethnolinguistic Diversity," American Economic Review, American Economic Association, vol. 102(4), pages 1508-1539, June.
    8. Carl Müller-Crepon & Yannick Pengl & Nils-Christian Bormann, 2022. "Linking Ethnic Data from Africa (LEDA)," Journal of Peace Research, Peace Research Institute Oslo, vol. 59(3), pages 425-435, May.
    9. Nico Neureiter & Peter Ranacher & Nour Efrat-Kowalsky & Gereon A. Kaiping & Robert Weibel & Paul Widmer & Remco R. Bouckaert, 2022. "Detecting contact in language trees: a Bayesian phylogenetic model with horizontal transfer," Palgrave Communications, Palgrave Macmillan, vol. 9(1), pages 1-14, December.
    10. Aguilar, Elliot & Ghirlanda, Stefano, 2015. "Modeling the genealogy of a cultural trait," Theoretical Population Biology, Elsevier, vol. 101(C), pages 1-8.
    11. Victor Zitian Chen & John Cantwell, 2022. "An evolutionary view of institutional complexity," Journal of Evolutionary Economics, Springer, vol. 32(3), pages 1071-1090, July.
    12. Marcelo A Montemurro & Damián H Zanette, 2011. "Universal Entropy of Word Ordering Across Linguistic Families," PLOS ONE, Public Library of Science, vol. 6(5), pages 1-9, May.
    13. Taraka Rama, 2013. "Phonotactic Diversity Predicts the Time Depth of the World’s Language Families," PLOS ONE, Public Library of Science, vol. 8(5), pages 1-9, May.
    14. Arthur J. Robson, 2010. "A bioeconomic view of the Neolithic transition to agriculture," Canadian Journal of Economics, Canadian Economics Association, vol. 43(1), pages 280-300, February.
    15. Marc Allassonnière-Tang & Olof Lundgren & Maja Robbers & Sandra Cronhamn & Filip Larsson & One-Soon Her & Harald Hammarström & Gerd Carling, 2021. "Expansion by migration and diffusion by contact is a source to the global diversity of linguistic nominal categorization systems," Palgrave Communications, Palgrave Macmillan, vol. 8(1), pages 1-6, December.
    16. Joseph Flavian Gomes, 2020. "The health costs of ethnic distance: evidence from sub-Saharan Africa," Journal of Economic Growth, Springer, vol. 25(2), pages 195-226, June.
    17. Victor Ginsburgh & Shlomo Weber, 2016. "Linguistic Distances and Ethno-Linguistic Fractionalisation and Disenfranchisement Indices," Working Papers ECARES ECARES 2016-25, ULB -- Universite Libre de Bruxelles.
    18. Job Schepens & Ton Dijkstra & Franc Grootjen & Walter J B van Heuven, 2013. "Cross-Language Distributions of High Frequency and Phonetically Similar Cognates," PLOS ONE, Public Library of Science, vol. 8(5), pages 1-15, May.
    19. Paola Giuliano, 2016. "Review of Cultural Evolution: Society, Technology, Language, and Religion Edited by Peter J. Richerson and Morten H. Christiansen," Journal of Economic Literature, American Economic Association, vol. 54(2), pages 522-533, June.
    20. Seán Roberts & James Winters, 2013. "Linguistic Diversity and Traffic Accidents: Lessons from Statistical Studies of Cultural Traits," PLOS ONE, Public Library of Science, vol. 8(8), pages 1-13, August.
