IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v599y2022ics037843712200334x.html
   My bibliography  Save this article

On similarity

Author

Listed:
  • Costa, Luciano da F.

Abstract

The objective quantification of similarity between two mathematical or physical structures, from scalars to graphs, constitutes a central issue in the physical sciences and technology. In the present work, we develop a principled and systematic approach that adopts the Kronecker delta function of two scalar real values as the prototypical reference for fully strict similarity quantification. We then consider other approaches, namely the cosine similarity, correlation, Sørensen–Dice, and Jaccard indices, and show that they provide successively more strict similarity quantifications. Multiset-based generalizations of these indices to take into account real values are then adopted in order to progressively extend the indices to multisets, vectors, and functions in real spaces. Several important results are reported, including the multiset formulation of similarity indices, as well as the formal derivation of the Jaccard index from the Kronecker delta function. When generalized to real functions, the described similarity indices become respective functionals, which can then be employed to obtain operations analogous to convolution and correlation. Complete application examples involving the recognition of patterns through template matching between two 1D functions as well as the identification of multiples instances of objects in 2D scalar fields (images) in presence of noise are also reported which well-illustrate the potential of the proposed concepts and methods. The characterization of the eigenmodes of successive convolutions are also addressed, with interesting results substantiating the enhanced potential of the coincidence index for accurate and stable similarity quantification.

Suggested Citation

  • Costa, Luciano da F., 2022. "On similarity," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 599(C).
  • Handle: RePEc:eee:phsmap:v:599:y:2022:i:c:s037843712200334x
    DOI: 10.1016/j.physa.2022.127456
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S037843712200334X
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2022.127456?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Stefania Albanesi & Domonkos F. Vamossy, 2019. "Predicting Consumer Default: A Deep Learning Approach," NBER Working Papers 26165, National Bureau of Economic Research, Inc.
    2. Michael Brusco & J Dennis Cradit & Douglas Steinley, 2021. "A comparison of 71 binary similarity coefficients: The effect of base rates," PLOS ONE, Public Library of Science, vol. 16(4), pages 1-19, April.
    3. András Schubert & András Telcs, 2014. "A note on the Jaccardized Czekanowski similarity index," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(2), pages 1397-1399, February.
    4. Kim, Yunmi & Kim, Tae-Hwan & Ergün, Tolga, 2015. "The instability of the Pearson correlation coefficient in the presence of coincidental outliers," Finance Research Letters, Elsevier, vol. 13(C), pages 243-257.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. LaVoice, Jessica & Vamossy, Domonkos F., 2024. "Racial disparities in debt collection," Journal of Banking & Finance, Elsevier, vol. 164(C).
    2. Michael Karpe, 2020. "An overall view of key problems in algorithmic trading and recent progress," Papers 2006.05515, arXiv.org.
    3. Vitaly Meursault & Daniel Moulton & Larry Santucci & Nathan Schor, 2022. "One Threshold Doesn’t Fit All: Tailoring Machine Learning Predictions of Consumer Default for Lower-Income Areas," Working Papers 22-39, Federal Reserve Bank of Philadelphia.
    4. Adebayo Oshingbesan & Eniola Ajiboye & Peruth Kamashazi & Timothy Mbaka, 2022. "Model-Free Reinforcement Learning for Asset Allocation," Papers 2209.10458, arXiv.org.
    5. Fraisse, Henri & Laporte, Matthias, 2022. "Return on investment on artificial intelligence: The case of bank capital requirement," Journal of Banking & Finance, Elsevier, vol. 138(C).
    6. Giacomo De Giorgi & Costanza Naguib, 2022. "Life after Default: Credit Hardship and its Effects," Diskussionsschriften dp2206, Universitaet Bern, Departement Volkswirtschaft.
    7. Domonkos F. Vamossy, 2024. "Social Media Emotions and Market Behavior," Papers 2404.03792, arXiv.org.
    8. Rodrigues, João L. & Bolognesi, Hugo M. & Melo, Joel D. & Heymann, Fabian & Soares, F.J., 2019. "Spatiotemporal model for estimating electric vehicles adopters," Energy, Elsevier, vol. 183(C), pages 788-802.
    9. Jae Min Lee, 2021. "Understanding volume and correlations of automated walk count: Predictors for necessary, optional, and social activities in Dilworth Park," Environment and Planning B, , vol. 48(2), pages 331-347, February.
    10. González-Sánchez, Mariano, 2021. "Is there a relationship between the time scaling property of asset returns and the outliers? Evidence from international financial markets," Finance Research Letters, Elsevier, vol. 38(C).
    11. Van Loo, Ellen J. & Caputo, Vincenzina & Lusk, Jayson L., 2020. "Consumer preferences for farm-raised meat, lab-grown meat, and plant-based meat alternatives: Does information or brand matter?," Food Policy, Elsevier, vol. 95(C).
    12. Andrés Alonso Robisco & José Manuel Carbó Martínez, 2022. "Measuring the model risk-adjusted performance of machine learning algorithms in credit default prediction," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-35, December.
    13. Giacomo De Giorgi & Matthew Harding & Gabriel Vasconcelos, 2021. "Predicting Mortality from Credit Reports," Papers 2111.03662, arXiv.org.
    14. De Giorgi, Giacomo & Naguib, Costanza, 2024. "Life after (soft) default," European Economic Review, Elsevier, vol. 167(C).
    15. Andrés Alonso & José Manuel Carbó, 2021. "Understanding the performance of machine learning models to predict credit default: a novel approach for supervisory evaluation," Working Papers 2105, Banco de España.
    16. Domonkos F. Vamossy, 2023. "Social Media Emotions and IPO Returns," Papers 2306.12602, arXiv.org, revised Apr 2024.
    17. Stefania Albanesi & Domonkos F. Vamossy, 2024. "Credit Scores: Performance and Equity," NBER Working Papers 32917, National Bureau of Economic Research, Inc.
    18. Alessandro Bitetto & Stefano Filomeni & Michele Modina, 2021. "Understanding corporate default using Random Forest: The role of accounting and market information," DEM Working Papers Series 205, University of Pavia, Department of Economics and Management.
    19. Eccles, Peter & Grout, Paul & Siciliani, Paolo & Zalewska, Anna, 2021. "The impact of machine learning and big data on credit markets," Bank of England working papers 930, Bank of England.
    20. Alonso-Robisco, Andrés & Carbó, José Manuel, 2022. "Can machine learning models save capital for banks? Evidence from a Spanish credit portfolio," International Review of Financial Analysis, Elsevier, vol. 84(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:599:y:2022:i:c:s037843712200334x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.