IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v599y2022ics037843712200334x.html
   My bibliography  Save this article

On similarity

Author

Listed:
  • Costa, Luciano da F.

Abstract

The objective quantification of similarity between two mathematical or physical structures, from scalars to graphs, constitutes a central issue in the physical sciences and technology. In the present work, we develop a principled and systematic approach that adopts the Kronecker delta function of two scalar real values as the prototypical reference for fully strict similarity quantification. We then consider other approaches, namely the cosine similarity, correlation, Sørensen–Dice, and Jaccard indices, and show that they provide successively more strict similarity quantifications. Multiset-based generalizations of these indices to take into account real values are then adopted in order to progressively extend the indices to multisets, vectors, and functions in real spaces. Several important results are reported, including the multiset formulation of similarity indices, as well as the formal derivation of the Jaccard index from the Kronecker delta function. When generalized to real functions, the described similarity indices become respective functionals, which can then be employed to obtain operations analogous to convolution and correlation. Complete application examples involving the recognition of patterns through template matching between two 1D functions as well as the identification of multiples instances of objects in 2D scalar fields (images) in presence of noise are also reported which well-illustrate the potential of the proposed concepts and methods. The characterization of the eigenmodes of successive convolutions are also addressed, with interesting results substantiating the enhanced potential of the coincidence index for accurate and stable similarity quantification.

Suggested Citation

  • Costa, Luciano da F., 2022. "On similarity," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 599(C).
  • Handle: RePEc:eee:phsmap:v:599:y:2022:i:c:s037843712200334x
    DOI: 10.1016/j.physa.2022.127456
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S037843712200334X
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2022.127456?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Stefania Albanesi & Domonkos F. Vamossy, 2019. "Predicting Consumer Default: A Deep Learning Approach," Papers 1908.11498, arXiv.org, revised Oct 2019.
    2. András Schubert & András Telcs, 2014. "A note on the Jaccardized Czekanowski similarity index," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(2), pages 1397-1399, February.
    3. Kim, Yunmi & Kim, Tae-Hwan & Ergün, Tolga, 2015. "The instability of the Pearson correlation coefficient in the presence of coincidental outliers," Finance Research Letters, Elsevier, vol. 13(C), pages 243-257.
    4. Michael Brusco & J Dennis Cradit & Douglas Steinley, 2021. "A comparison of 71 binary similarity coefficients: The effect of base rates," PLOS ONE, Public Library of Science, vol. 16(4), pages 1-19, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Vitaly Meursault & Daniel Moulton & Larry Santucci & Nathan Schor, 2022. "One Threshold Doesn’t Fit All: Tailoring Machine Learning Predictions of Consumer Default for Lower-Income Areas," Working Papers 22-39, Federal Reserve Bank of Philadelphia.
    2. Domonkos F. Vamossy, 2024. "Social Media Emotions and Market Behavior," Papers 2404.03792, arXiv.org.
    3. Van Loo, Ellen J. & Caputo, Vincenzina & Lusk, Jayson L., 2020. "Consumer preferences for farm-raised meat, lab-grown meat, and plant-based meat alternatives: Does information or brand matter?," Food Policy, Elsevier, vol. 95(C).
    4. Andrés Alonso Robisco & José Manuel Carbó Martínez, 2022. "Measuring the model risk-adjusted performance of machine learning algorithms in credit default prediction," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-35, December.
    5. Domonkos F. Vamossy, 2023. "Social Media Emotions and IPO Returns," Papers 2306.12602, arXiv.org, revised Apr 2024.
    6. Stefania Albanesi & Domonkos F. Vamossy, 2024. "Credit Scores: Performance and Equity," NBER Working Papers 32917, National Bureau of Economic Research, Inc.
    7. Oskar Kowalewski & Pawel Pisany & Emil Slazak, 2021. "What determines cross-country differences in fintech and bigtech credit markets?," Working Papers 2021-ACF-02, IESEG School of Management.
    8. Nigmonov, Asror & Shams, Syed & Alam, Khorshed, 2022. "Macroeconomic determinants of loan defaults: Evidence from the U.S. peer-to-peer lending market," Research in International Business and Finance, Elsevier, vol. 59(C).
    9. David D. J. Antia, 2023. "Desalination of Saline Irrigation Water Using Hydrophobic, Metal–Polymer Hydrogels," Sustainability, MDPI, vol. 15(9), pages 1-32, April.
    10. Nicola Branzoli & Ilaria Supino, 2020. "FinTech credit: a critical review of empirical research," Questioni di Economia e Finanza (Occasional Papers) 549, Bank of Italy, Economic Research and International Relations Area.
    11. Bastien Lextrait, 2021. "Scaling up SME's credit scoring scope with LightGBM," EconomiX Working Papers 2021-25, University of Paris Nanterre, EconomiX.
    12. Li, Huan & Wu, Weixing, 2024. "Loan default predictability with explainable machine learning," Finance Research Letters, Elsevier, vol. 60(C).
    13. Contreras-Aso, Gonzalo & Criado, Regino & Vera de Salas, Guillermo & Yang, Jinling, 2023. "Detecting communities in higher-order networks by using their derivative graphs," Chaos, Solitons & Fractals, Elsevier, vol. 177(C).
    14. Criado-Alonso, Ángeles & Aleja, David & Romance, Miguel & Criado, Regino, 2022. "Derivative of a hypergraph as a tool for linguistic pattern analysis," Chaos, Solitons & Fractals, Elsevier, vol. 163(C).
    15. Hoang, Daniel & Wiegratz, Kevin, 2022. "Machine learning methods in finance: Recent applications and prospects," Working Paper Series in Economics 158, Karlsruhe Institute of Technology (KIT), Department of Economics and Management.
    16. Nicola Branzoli & Edoardo Rainone & Ilaria Supino, 2023. "The role of banks' technology adoption in credit markets during the pandemic," Temi di discussione (Economic working papers) 1406, Bank of Italy, Economic Research and International Relations Area.
    17. Saira Maqbool & Rabia Bahoo & Muhammad Shakir, 2019. "Exploring Determiners of Achievements of Degree Students (Pakistan) in Academic English," Global Regional Review, Humanity Only, vol. 4(3), pages 64-72, September.
    18. Rumen Iliev & Will Bennis, 2023. "The Convergence of Positivity: Are Happy People All Alike?," Journal of Happiness Studies, Springer, vol. 24(5), pages 1643-1662, June.
    19. LaVoice, Jessica & Vamossy, Domonkos F., 2024. "Racial disparities in debt collection," Journal of Banking & Finance, Elsevier, vol. 164(C).
    20. Michael Karpe, 2020. "An overall view of key problems in algorithmic trading and recent progress," Papers 2006.05515, arXiv.org.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:599:y:2022:i:c:s037843712200334x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.