IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v646y2024ics0378437124004187.html
   My bibliography  Save this article

Scaling in Deep and Shallow Learning Architectures

Author

Listed:
  • Koresh, Ella
  • Halevi, Tal
  • Meir, Yuval
  • Dilmoney, Dolev
  • Dror, Tamar
  • Gross, Ronit
  • Tevet, Ofek
  • Hodassman, Shiri
  • Kanter, Ido

Abstract

The realization of classification tasks using deep learning is a primary goal of artificial intelligence; however, its possible universal behavior remains unexplored. Herein, we demonstrate a scaling behavior for the test error, ϵ, as a function of the number of classified labels, K. For trained utmost deep architectures on CIFAR-100 ϵ(K)∝Kρ with ρ∼1, and in case of reduced deep architectures, ρ continuously decreases until a crossover to ϵ(K)∝log(K) is observed for shallow architectures. A similar crossover is observed for shallow architectures, where the number of filters in the convolutional layers is proportionally increased. This unified the scaling behavior of deep and shallow architectures, which yields a reduced latency method. The dependence of Δϵ/ΔK on the trained architecture is expected to be crucial in learning scenarios involving dynamic number of labels.

Suggested Citation

  • Koresh, Ella & Halevi, Tal & Meir, Yuval & Dilmoney, Dolev & Dror, Tamar & Gross, Ronit & Tevet, Ofek & Hodassman, Shiri & Kanter, Ido, 2024. "Scaling in Deep and Shallow Learning Architectures," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 646(C).
  • Handle: RePEc:eee:phsmap:v:646:y:2024:i:c:s0378437124004187
    DOI: 10.1016/j.physa.2024.129909
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437124004187
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2024.129909?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Levy, Moshe & Solomon, Sorin, 1997. "New evidence for the power-law distribution of wealth," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 242(1), pages 90-94.
    2. Blank, Aharon & Solomon, Sorin, 2000. "Power laws in cities population, financial markets and internet sites (scaling in systems with a variable number of components)," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 287(1), pages 279-288.
    3. Tevet, Ofek & Gross, Ronit D. & Hodassman, Shiri & Rogachevsky, Tal & Tzach, Yarden & Meir, Yuval & Kanter, Ido, 2024. "Efficient shallow learning mechanism as an alternative to deep learning," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 635(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Solomon, Sorin & Richmond, Peter, 2001. "Power laws of wealth, market order volumes and market returns," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 299(1), pages 188-197.
    2. Jan Schulz & Mishael Milaković, 2023. "How Wealthy are the Rich?," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 69(1), pages 100-123, March.
    3. Segarra, Agustí & Teruel, Mercedes, 2012. "An appraisal of firm size distribution: Does sample size matter?," Journal of Economic Behavior & Organization, Elsevier, vol. 82(1), pages 314-328.
    4. Kwame Boamah‐Addo & Tomasz J. Kozubowski & Anna K. Panorska, 2023. "A discrete truncated Zipf distribution," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 77(2), pages 156-187, May.
    5. E. Samanidou & E. Zschischang & D. Stauffer & T. Lux, 2001. "Microscopic Models of Financial Markets," Papers cond-mat/0110354, arXiv.org.
    6. Rama Cont & Jean-Philippe Bouchaud, 1997. "Herd behavior and aggregate fluctuations in financial markets," Science & Finance (CFM) working paper archive 500028, Science & Finance, Capital Fund Management.
    7. Marco Raberto & Silvano Cincotti & Sergio Focardi & Michele Marchesi, 2003. "Traders' Long-Run Wealth in an Artificial Financial Market," Computational Economics, Springer;Society for Computational Economics, vol. 22(2), pages 255-272, October.
    8. Zhou, Bin & Yan, Xiao-Yong & Xu, Xiao-Ke & Xu, Xiao-Ting & Wang, Nianxin, 2018. "Evolutionary of online social networks driven by pareto wealth distribution and bidirectional preferential attachment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 507(C), pages 427-434.
    9. Wu, Yahao & Wang, Xiao-Tian & Wu, Min, 2009. "Fractional-moment CAPM with loss aversion," Chaos, Solitons & Fractals, Elsevier, vol. 42(3), pages 1406-1414.
    10. Castaldi, Carolina & Milakovic, Mishael, 2007. "Turnover activity in wealth portfolios," Journal of Economic Behavior & Organization, Elsevier, vol. 63(3), pages 537-552, July.
    11. Bucsa, G. & Jovanovic, F. & Schinckus, C., 2011. "A unified model for price return distributions used in econophysics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(20), pages 3435-3443.
    12. Cornelia Metzig & Mirta Gordon, 2012. "Heterogeneous Enterprises in a Macroeconomic Agent-Based Model," Papers 1211.5575, arXiv.org.
    13. Andrea Bonaccorsi & Maurizio Martinelli & Cristina Rossi & Irma Serrecchia, 2002. "Measuring and modelling Internet diffusion using second level domains: the case of Italy," LEM Papers Series 2002/17, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.
    14. Philip Vermeulen, 2018. "How Fat is the Top Tail of the Wealth Distribution?," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 64(2), pages 357-387, June.
    15. Malevergne, Y. & Saichev, A. & Sornette, D., 2013. "Zipf's law and maximum sustainable growth," Journal of Economic Dynamics and Control, Elsevier, vol. 37(6), pages 1195-1212.
    16. Becker, Bo & Cronqvist, Henrik & Fahlenbrach, Rüdiger, 2011. "Estimating the Effects of Large Shareholders Using a Geographic Instrument," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 46(4), pages 907-942, August.
    17. Ren, F. & Zhang, Y.C., 2008. "Trading model with pair pattern strategies," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 387(22), pages 5523-5534.
    18. Misha Perepelitsa, 2018. "A model of adaptive, market behavior generating positive returns, volatility and system risk," Papers 1809.09601, arXiv.org.
    19. Arun Advani & George Bangham & Jack Leslie, 2021. "The UK's wealth distribution and characteristics of high‐wealth households," Fiscal Studies, John Wiley & Sons, vol. 42(3-4), pages 397-430, September.
    20. Pierpaolo Andriani & Bill McKelvey, 2009. "Perspective ---From Gaussian to Paretian Thinking: Causes and Implications of Power Laws in Organizations," Organization Science, INFORMS, vol. 20(6), pages 1053-1071, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:646:y:2024:i:c:s0378437124004187. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.