IDEAS home Printed from https://ideas.repec.org/a/bla/jorssc/v67y2018i5p1103-1145.html
   My bibliography  Save this article

The statistical analysis of acoustic phonetic data: exploring differences between spoken Romance languages

Author

Listed:
  • Davide Pigoli
  • Pantelis Z. Hadjipantelis
  • John S. Coleman
  • John A. D. Aston

Abstract

The historical and geographical spread from older to more modern languages has long been studied by examining textual changes and in terms of changes in phonetic transcriptions. However, it is more difficult to analyse language change from an acoustic point of view, although this is usually the dominant mode of transmission. We propose a novel analysis approach for acoustic phonetic data, where the aim will be to model the acoustic properties of spoken words statistically. We explore phonetic variation and change by using a time–frequency representation, namely the log‐spectrograms of speech recordings. We identify time and frequency covariance functions as a feature of the language; in contrast, mean spectrograms depend mostly on the particular word that has been uttered. We build models for the mean and covariances (taking into account the restrictions placed on the statistical analysis of such objects) and use these to define a phonetic transformation that models how an individual speaker would sound in a different language, allowing the exploration of phonetic differences between languages. Finally, we map back these transformations to the domain of sound recordings, enabling us to listen to the output of the statistical analysis. The approach proposed is demonstrated by using recordings of the words corresponding to the numbers from 1 to 10 as pronounced by speakers from five different Romance languages.

Suggested Citation

  • Davide Pigoli & Pantelis Z. Hadjipantelis & John S. Coleman & John A. D. Aston, 2018. "The statistical analysis of acoustic phonetic data: exploring differences between spoken Romance languages," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 67(5), pages 1103-1145, November.
  • Handle: RePEc:bla:jorssc:v:67:y:2018:i:5:p:1103-1145
    DOI: 10.1111/rssc.12258
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssc.12258
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssc.12258?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Mark Fiecas & Hernando Ombao, 2016. "Modeling the Evolution of Dynamic Brain Processes During an Associative Learning Experiment," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1440-1453, October.
    2. Davide Pigoli & John A. D. Aston & Ian L. Dryden & Piercesare Secchi, 2014. "Distances and inference for covariance operators," Biometrika, Biometrika Trust, vol. 101(2), pages 409-422.
    3. Simon N. Wood, 2003. "Thin plate regression splines," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 65(1), pages 95-114, February.
    4. Gerda Claeskens & Mia Hubert & Leen Slaets & Kaveh Vakili, 2014. "Multivariate Functional Halfspace Depth," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 411-423, March.
    5. Garcia, Damien, 2010. "Robust smoothing of gridded data in one and higher dimensions with missing values," Computational Statistics & Data Analysis, Elsevier, vol. 54(4), pages 1167-1178, April.
    6. Wensheng Guo, 2002. "Functional Mixed Effects Models," Biometrics, The International Biometric Society, vol. 58(1), pages 121-128, March.
    7. Yao, Fang & Muller, Hans-Georg & Wang, Jane-Ling, 2005. "Functional Data Analysis for Sparse Longitudinal Data," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 577-590, June.
    8. Scott A. Bruce & Martica H. Hall & Daniel J. Buysse & Robert T. Krafty, 2018. "Conditional adaptive Bayesian spectral analysis of nonstationary biomedical time series," Biometrics, The International Biometric Society, vol. 74(1), pages 260-269, March.
    9. Antonio Cuevas & Manuel Febrero & Ricardo Fraiman, 2007. "Robust estimation and classification for functional data via projection-based depth notions," Computational Statistics, Springer, vol. 22(3), pages 481-496, September.
    10. Geoff K. Nicholls & Russell D. Gray, 2008. "Dated ancestral trees from binary trait data and their application to the diversification of languages," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(3), pages 545-566, July.
    11. P. Constantinou & P. Kokoszka & M. Reimherr, 2017. "Testing separability of space-time functional processes," Biometrika, Biometrika Trust, vol. 104(2), pages 425-437.
    12. Robert T. Krafty & Martica Hall & Wensheng Guo, 2011. "Functional mixed effects spectral analysis," Biometrika, Biometrika Trust, vol. 98(3), pages 583-598.
    13. Victor Ginsburgh & Shlomo Weber, 2011. "How Many Languages Do We Need? The Economics of Linguistic Diversity," Economics Books, Princeton University Press, edition 1, number 9481.
    14. Feng, Qing & Jiang, Meilei & Hannig, Jan & Marron, J.S., 2018. "Angle-based joint and individual variation explained," Journal of Multivariate Analysis, Elsevier, vol. 166(C), pages 241-265.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Piotr Osiński & Adam Deptuła & Anna M. Deptuła, 2023. "Analysis of the Gear Pump’s Acoustic Properties Taking into Account the Classification of Induction Trees," Energies, MDPI, vol. 16(11), pages 1-28, May.
    2. Aranya Koshy & Shahin Tavakoli, 2022. "Exploring British accents: Modelling the trap–bath split with functional data analysis," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(4), pages 773-805, August.
    3. Soham Sarkar & Victor M. Panaretos, 2022. "CovNet: Covariance networks for functional data on multidimensional domains," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(5), pages 1785-1820, November.
    4. Anthony Ebert & Kerrie Mengersen & Fabrizio Ruggeri & Paul Wu, 2021. "Curve Registration of Functional Data for Approximate Bayesian Computation," Stats, MDPI, vol. 4(3), pages 1-14, September.
    5. T Masak & S Sarkar & V M Panaretos, 2023. "Separable expansions for covariance estimation via the partial inner product," Biometrika, Biometrika Trust, vol. 110(1), pages 225-247.
    6. Pedro Galeano & Daniel Peña, 2019. "Data science, big data and statistics," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(2), pages 289-329, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nagy, Stanislav & Ferraty, Frédéric, 2019. "Data depth for measurable noisy random functions," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 95-114.
    2. Chau, Van Vinh & von Sachs, Rainer, 2016. "Functional mixed effects wavelet estimation for spectra of replicated time series," LIDAM Discussion Papers ISBA 2016013, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    3. Elías, Antonio & Jiménez, Raúl & Shang, Han Lin, 2022. "On projection methods for functional time series forecasting," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    4. Ana-Maria Staicu & Yingxing Li & Ciprian M. Crainiceanu & David Ruppert, 2014. "Likelihood Ratio Tests for Dependent Data with Applications to Longitudinal and Functional Data Analysis," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 41(4), pages 932-949, December.
    5. Mia Hubert & Peter Rousseeuw & Pieter Segaert, 2015. "Multivariate functional outlier detection," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 177-202, July.
    6. Febrero-Bande, Manuel & González-Manteiga, Wenceslao & Prallon, Brenda & Saporito, Yuri F., 2023. "Functional classification of bitcoin addresses," Computational Statistics & Data Analysis, Elsevier, vol. 181(C).
    7. Li, Pai-Ling & Chiou, Jeng-Min, 2011. "Identifying cluster number for subspace projected functional data clustering," Computational Statistics & Data Analysis, Elsevier, vol. 55(6), pages 2090-2103, June.
    8. repec:cte:wsrepe:24606 is not listed on IDEAS
    9. Li, Pai-Ling & Chiou, Jeng-Min & Shyr, Yu, 2017. "Functional data classification using covariate-adjusted subspace projection," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 21-34.
    10. Zhuo Qu & Wenlin Dai & Marc G. Genton, 2021. "Robust functional multivariate analysis of variance with environmental applications," Environmetrics, John Wiley & Sons, Ltd., vol. 32(1), February.
    11. Zhang, Shibin, 2020. "Nonparametric Bayesian inference for the spectral density based on irregularly spaced data," Computational Statistics & Data Analysis, Elsevier, vol. 151(C).
    12. Reiss Philip T. & Huang Lei & Mennes Maarten, 2010. "Fast Function-on-Scalar Regression with Penalized Basis Expansions," The International Journal of Biostatistics, De Gruyter, vol. 6(1), pages 1-30, August.
    13. Huaihou Chen & Yuanjia Wang, 2011. "A Penalized Spline Approach to Functional Mixed Effects Model Analysis," Biometrics, The International Biometric Society, vol. 67(3), pages 861-870, September.
    14. Chen, Ziqi & Hu, Jianhua & Zhu, Hongtu, 2020. "Surface functional models," Journal of Multivariate Analysis, Elsevier, vol. 180(C).
    15. Boente, Graciela & Rodriguez, Daniela & Sued, Mariela, 2019. "The spatial sign covariance operator: Asymptotic results and applications," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 115-128.
    16. Minhee Kim & Todd Allen & Kaibo Liu, 2023. "Covariate Dependent Sparse Functional Data Analysis," INFORMS Joural on Data Science, INFORMS, vol. 2(1), pages 81-98, April.
    17. Serfling, Robert & Wijesuriya, Uditha, 2017. "Depth-based nonparametric description of functional data, with emphasis on use of spatial depth," Computational Statistics & Data Analysis, Elsevier, vol. 105(C), pages 24-45.
    18. Antonio Elías & Raúl Jiménez & Han Lin Shang, 2023. "Depth-based reconstruction method for incomplete functional data," Computational Statistics, Springer, vol. 38(3), pages 1507-1535, September.
    19. Jeff Goldsmith & Vadim Zipunnikov & Jennifer Schrack, 2015. "Generalized multilevel function-on-scalar regression and principal component analysis," Biometrics, The International Biometric Society, vol. 71(2), pages 344-353, June.
    20. Yakun Wang & Zeda Li & Scott A. Bruce, 2023. "Adaptive Bayesian sum of trees model for covariate‐dependent spectral analysis," Biometrics, The International Biometric Society, vol. 79(3), pages 1826-1839, September.
    21. Carlo Sguera & Sara López-Pintado, 2021. "A notion of depth for sparse functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(3), pages 630-649, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssc:v:67:y:2018:i:5:p:1103-1145. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.