IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v391y2012i4p1515-1518.html
   My bibliography  Save this article

Fitting Chinese syllable-to-character mapping spectrum by the beta rank function

Author

Listed:
  • Li, Wentian

Abstract

We define the syllable-to-character mapping spectrum in Chinese as the normalized number of characters per syllable ranked from high to low. This spectrum provides a statistical characterization of the relationship between spoken and written Chinese. We have shown that two functions, the logarithmic function and the beta rank function, fit the syllable-to-character mapping spectrum well. The beta rank function is even better than the logarithmic function judged by two measures of data-fitting performance: the sum of square errors, and Akaike information criterion. We comment on why the beta rank function is a good fitting function for many range-limited ranking data, whereas for range-open data it may be out-performed by other functions, such as a power-law function in the case of Zipf’s law.

Suggested Citation

  • Li, Wentian, 2012. "Fitting Chinese syllable-to-character mapping spectrum by the beta rank function," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(4), pages 1515-1518.
  • Handle: RePEc:eee:phsmap:v:391:y:2012:i:4:p:1515-1518
    DOI: 10.1016/j.physa.2011.08.024
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437111006522
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2011.08.024?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Mansilla, R. & Köppen, E. & Cocho, G. & Miramontes, P., 2007. "On the behavior of journal impact factor rank-order distribution," Journal of Informetrics, Elsevier, vol. 1(2), pages 155-160.
    2. Ricardo Baeza–Yates & Gonzalo Navarro, 2000. "Block addressing indices for approximate text retrieval," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 51(1), pages 69-82.
    3. Urzua, Carlos M., 2000. "A simple and efficient test for Zipf's law," Economics Letters, Elsevier, vol. 66(3), pages 257-260, March.
    4. Beltrán del Río, M. & Cocho, G. & Mansilla, R., 2011. "General model of subtraction of stochastic variables. Attractor and stability analysis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(2), pages 154-160.
    5. Naumis, G.G. & Cocho, G., 2008. "Tail universalities in rank distributions as an algebraic problem: The beta-like function," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 387(1), pages 84-96.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Alvarez-Martínez, R. & Cocho, G. & Rodríguez, R.F. & Martínez-Mekler, G., 2014. "Birth and death master equation for the evolution of complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 402(C), pages 198-208.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Valerio Ficcadenti & Roy Cerqueti & Ciro Hosseini Varde’i, 2023. "A rank-size approach to analyse soccer competitions and teams: the case of the Italian football league “Serie A"," Annals of Operations Research, Springer, vol. 325(1), pages 85-113, June.
    2. Espitia, Diego & Larralde, Hernán, 2020. "Universal and non-universal text statistics: Clustering coefficient for language identification," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 553(C).
    3. Cerqueti, Roy & Lupi, Claudio & Pietrovito, Filomena & Pozzolo, Alberto Franco, 2022. "Rank–size distributions for banks: A cross-country analysis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 585(C).
    4. Bertoli-Barsotti, Lucio & Lando, Tommaso, 2019. "How mean rank and mean size may determine the generalised Lorenz curve: With application to citation analysis," Journal of Informetrics, Elsevier, vol. 13(1), pages 387-396.
    5. Elio Roca-Flores & Gerardo G. Naumis, 2021. "Assessing statistical hurricane risks: nonlinear regression and time-window analysis of North Atlantic annual accumulated cyclonic energy rank profile," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 108(3), pages 2455-2465, September.
    6. Campanario, Juan Miguel, 2015. "Providing impact: The distribution of JCR journals according to references they contribute to the 2-year and 5-year journal impact factors," Journal of Informetrics, Elsevier, vol. 9(2), pages 398-407.
    7. Segarra, Agustí & Teruel, Mercedes, 2012. "An appraisal of firm size distribution: Does sample size matter?," Journal of Economic Behavior & Organization, Elsevier, vol. 82(1), pages 314-328.
    8. B Ian Hutchins & Xin Yuan & James M Anderson & George M Santangelo, 2016. "Relative Citation Ratio (RCR): A New Metric That Uses Citation Rates to Measure Influence at the Article Level," PLOS Biology, Public Library of Science, vol. 14(9), pages 1-25, September.
    9. Waltman, Ludo & van Eck, Nees Jan, 2009. "Some comments on Egghe's derivation of the impact factor distribution," Journal of Informetrics, Elsevier, vol. 3(4), pages 363-366.
    10. Marcel Ausloos & Roy Cerqueti, 2016. "Studies on Regional Wealth Inequalities: the case of Italy," Papers 1602.05356, arXiv.org.
    11. Ausloos, Marcel & Cerqueti, Roy & Lupi, Claudio, 2017. "Long-range properties and data validity for hydrogeological time series: The case of the Paglia river," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 470(C), pages 39-50.
    12. Francisco Goerlich, 2013. "A simple and efficient test for the Pareto law," Empirical Economics, Springer, vol. 45(3), pages 1367-1381, December.
    13. L. Benguigui & E. Blumenfeld-Lieberthal, 2011. "The end of a paradigm: is Zipf’s law universal?," Journal of Geographical Systems, Springer, vol. 13(1), pages 87-100, March.
    14. Brzezinski, Michal, 2014. "Empirical modeling of the impact factor distribution," Journal of Informetrics, Elsevier, vol. 8(2), pages 362-368.
    15. Juan Miguel Campanario, 2011. "Empirical study of journal impact factors obtained using the classical two-year citation window versus a five-year citation window," Scientometrics, Springer;Akadémiai Kiadó, vol. 87(1), pages 189-204, April.
    16. Urzúa, Carlos M., 2011. "Testing for Zipf's law: A common pitfall," Economics Letters, Elsevier, vol. 112(3), pages 254-255, September.
    17. Sangwal, Keshra, 2013. "Citation and impact factor distributions of scientific journals published in individual countries," Journal of Informetrics, Elsevier, vol. 7(2), pages 487-504.
    18. Sarabia, José María & Prieto, Faustino, 2009. "The Pareto-positive stable distribution: A new descriptive model for city size data," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 388(19), pages 4179-4191.
    19. Luckstead, Jeff & Devadoss, Stephen, 2014. "Do the world’s largest cities follow Zipf’s and Gibrat’s laws?," Economics Letters, Elsevier, vol. 125(2), pages 182-186.
    20. Shuntaro Takahashi & Kumiko Tanaka-Ishii, 2017. "Do neural nets learn statistical laws behind natural language?," PLOS ONE, Public Library of Science, vol. 12(12), pages 1-17, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:391:y:2012:i:4:p:1515-1518. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.