IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i22p4338-d977472.html
   My bibliography  Save this article

A Differential Privacy Budget Allocation Algorithm Based on Out-of-Bag Estimation in Random Forest

Author

Listed:
  • Xin Li

    (School of Cyberspace Security, Xi’an University of Posts and Telecommunications, Xi’an 710121, China)

  • Baodong Qin

    (School of Cyberspace Security, Xi’an University of Posts and Telecommunications, Xi’an 710121, China)

  • Yiyuan Luo

    (School of Computer Science and Engineering, Huizhou University, Huizhou 516007, China)

  • Dong Zheng

    (School of Cyberspace Security, Xi’an University of Posts and Telecommunications, Xi’an 710121, China
    School of Computer Science, Qinghai Normal University, Xining 810008, China)

Abstract

The issue of how to improve the usability of data publishing under differential privacy has become one of the top questions in the field of machine learning privacy protection, and the key to solving this problem is to allocate a reasonable privacy protection budget. To solve this problem, we design a privacy budget allocation algorithm based on out-of-bag estimation in random forest. The algorithm firstly calculates the decision tree weights and feature weights by the out-of-bag data under differential privacy protection. Secondly, statistical methods are introduced to classify features into best feature set, pruned feature set, and removable feature set. Then, pruning is performed using the pruned feature set to avoid decision trees over-fitting when constructing an ϵ -differential privacy random forest. Finally, the privacy budget is allocated proportionally based on the decision tree weights and feature weights in the random forest. We conducted experimental comparisons with real data sets from Adult and Mushroom to demonstrate that this algorithm not only protects data security and privacy, but also improves model classification accuracy and data availability.

Suggested Citation

  • Xin Li & Baodong Qin & Yiyuan Luo & Dong Zheng, 2022. "A Differential Privacy Budget Allocation Algorithm Based on Out-of-Bag Estimation in Random Forest," Mathematics, MDPI, vol. 10(22), pages 1-15, November.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:22:p:4338-:d:977472
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/22/4338/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/22/4338/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:22:p:4338-:d:977472. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.