IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v8y2023i10p146-d1248449.html
   My bibliography  Save this article

Synthetic Data Generation for Data Envelopment Analysis

Author

Listed:
  • Andrey V. Lychev

    (College of Information Technologies and Computer Sciences, National University of Science and Technology “MISIS”, 4 Leninsky Ave., Bldg. 1, 119049 Moscow, Russia)

Abstract

The paper is devoted to the problem of generating artificial datasets for data envelopment analysis (DEA), which can be used for testing DEA models and methods. In particular, the papers that applied DEA to big data often used synthetic data generation to obtain large-scale datasets because real datasets of large size, available in the public domain, are extremely rare. This paper proposes the algorithm which takes as input some real dataset and complements it by artificial efficient and inefficient units. The generation process extends the efficient part of the frontier by inserting artificial efficient units, keeping the original efficient frontier unchanged. For this purpose, the algorithm uses the assurance region method and consistently relaxes weight restrictions during the iterations. This approach produces synthetic datasets that are closer to real ones, compared to other algorithms that generate data from scratch. The proposed algorithm is applied to a pair of small real-life datasets. As a result, the datasets were expanded to 50K units. Computational experiments show that artificially generated DMUs preserve isotonicity and do not increase the collinearity of the original data as a whole.

Suggested Citation

  • Andrey V. Lychev, 2023. "Synthetic Data Generation for Data Envelopment Analysis," Data, MDPI, vol. 8(10), pages 1-25, September.
  • Handle: RePEc:gam:jdataj:v:8:y:2023:i:10:p:146-:d:1248449
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/8/10/146/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/8/10/146/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Sueyoshi, Toshiyuki & Sekitani, Kazuyuki, 2007. "Measurement of returns to scale using a non-radial DEA model: A range-adjusted measure approach," European Journal of Operational Research, Elsevier, vol. 176(3), pages 1918-1946, February.
    2. Krivonozhko, Vladimir E. & Førsund, Finn R. & Lychev, Andrey V., 2012. "A note on imposing strong complementary slackness conditions in DEA," European Journal of Operational Research, Elsevier, vol. 220(3), pages 716-721.
    3. Sueyoshi, Toshiyuki & Sekitani, Kazuyuki, 2007. "The measurement of returns to scale under a simultaneous occurrence of multiple solutions in a reference set and a supporting hyperplane," European Journal of Operational Research, Elsevier, vol. 181(2), pages 549-570, September.
    4. Sueyoshi, Toshiyuki & Sekitani, Kazuyuki, 2009. "An occurrence of multiple projections in DEA-based measurement of technical efficiency: Theoretical comparison among DEA models from desirable properties," European Journal of Operational Research, Elsevier, vol. 196(2), pages 764-794, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Krivonozhko, Vladimir E. & Førsund, Finn R. & Lychev, Andrey V., 2012. "A note on imposing strong complementary slackness conditions in DEA," European Journal of Operational Research, Elsevier, vol. 220(3), pages 716-721.
    2. Sueyoshi, Toshiyuki & Goto, Mika, 2011. "Methodological comparison between two unified (operational and environmental) efficiency measurements for environmental assessment," European Journal of Operational Research, Elsevier, vol. 210(3), pages 684-693, May.
    3. Sueyoshi, Toshiyuki & Goto, Mika, 2012. "DEA radial and non-radial models for unified efficiency under natural and managerial disposability: Theoretical extension by strong complementary slackness conditions," Energy Economics, Elsevier, vol. 34(3), pages 700-713.
    4. Sueyoshi, Toshiyuki & Goto, Mika, 2010. "Measurement of a linkage among environmental, operational, and financial performance in Japanese manufacturing firms: A use of Data Envelopment Analysis with strong complementary slackness condition," European Journal of Operational Research, Elsevier, vol. 207(3), pages 1742-1753, December.
    5. Sueyoshi, Toshiyuki & Goto, Mika, 2012. "Efficiency-based rank assessment for electric power industry: A combined use of Data Envelopment Analysis (DEA) and DEA-Discriminant Analysis (DA)," Energy Economics, Elsevier, vol. 34(3), pages 634-644.
    6. Sueyoshi, Toshiyuki & Goto, Mika, 2012. "Returns to scale and damages to scale under natural and managerial disposability: Strategy, efficiency and competitiveness of petroleum firms," Energy Economics, Elsevier, vol. 34(3), pages 645-662.
    7. Krivonozhko, Vladimir E. & Førsund, Finn R. & Lychev, Andrey V., 2014. "Measurement of returns to scale using non-radial DEA models," European Journal of Operational Research, Elsevier, vol. 232(3), pages 664-670.
    8. Sueyoshi, Toshiyuki & Goto, Mika, 2012. "Returns to Scale, Damages to Scale, Marginal Rate of Transformation and Rate of Substitution in DEA Environmental Assessment," Energy Economics, Elsevier, vol. 34(4), pages 905-917.
    9. Sueyoshi, Toshiyuki & Goto, Mika, 2012. "Returns to scale and damages to scale on U.S. fossil fuel power plants: Radial and non-radial approaches for DEA environmental assessment," Energy Economics, Elsevier, vol. 34(6), pages 2240-2259.
    10. Mehdiloozad, Mahmood & Mirdehghan, S. Morteza & Sahoo, Biresh K. & Roshdi, Israfil, 2015. "On the identification of the global reference set in data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 245(3), pages 779-788.
    11. Sueyoshi, Toshiyuki & Goto, Mika, 2011. "Measurement of Returns to Scale and Damages to Scale for DEA-based operational and environmental assessment: How to manage desirable (good) and undesirable (bad) outputs?," European Journal of Operational Research, Elsevier, vol. 211(1), pages 76-89, May.
    12. Sueyoshi, Toshiyuki & Goto, Mika, 2012. "Returns to Scale and Damages to Scale with Strong Complementary Slackness Conditions in DEA Assessment: Japanese Corporate Effort on Environment Protection," Energy Economics, Elsevier, vol. 34(5), pages 1422-1434.
    13. Sueyoshi, Toshiyuki & Goto, Mika, 2011. "Operational synergy in the US electric utility industry under an influence of deregulation policy: A linkage to financial performance and corporate value," Energy Policy, Elsevier, vol. 39(2), pages 699-713, February.
    14. Sueyoshi, Toshiyuki & Goto, Mika, 2015. "Environmental assessment on coal-fired power plants in U.S. north-east region by DEA non-radial measurement," Energy Economics, Elsevier, vol. 50(C), pages 125-139.
    15. Aparicio, Juan & Mahlberg, Bernhard & Pastor, Jesus T. & Sahoo, Biresh K., 2014. "Decomposing technical inefficiency using the principle of least action," European Journal of Operational Research, Elsevier, vol. 239(3), pages 776-785.
    16. Sueyoshi, Toshiyuki & Goto, Mika & Ueno, Takahiro, 2010. "Performance analysis of US coal-fired power plants by measuring three DEA efficiencies," Energy Policy, Elsevier, vol. 38(4), pages 1675-1688, April.
    17. Sueyoshi, Toshiyuki & Shang, Jennifer & Chiang, Wen-Chyuan, 2009. "A decision support framework for internal audit prioritization in a rental car company: A combined use between DEA and AHP," European Journal of Operational Research, Elsevier, vol. 199(1), pages 219-231, November.
    18. Sahoo, Biresh K. & Tone, Kaoru, 2013. "Non-parametric measurement of economies of scale and scope in non-competitive environment with price uncertainty," Omega, Elsevier, vol. 41(1), pages 97-111.
    19. Sueyoshi, Toshiyuki & Sekitani, Kazuyuki, 2009. "DEA congestion and returns to scale under an occurrence of multiple optimal projections," European Journal of Operational Research, Elsevier, vol. 194(2), pages 592-607, April.
    20. Halická, Margaréta & Trnovská, Mária, 2021. "A unified approach to non-radial graph models in data envelopment analysis: common features, geometry, and duality," European Journal of Operational Research, Elsevier, vol. 289(2), pages 611-627.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:8:y:2023:i:10:p:146-:d:1248449. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.