IDEAS home Printed from https://ideas.repec.org/a/rsr/journl/v65y2017i4p67-76.html
   My bibliography  Save this article

Statistical Disclosure Control for Tabular Data in R

Author

Listed:
  • Kazuhiro Minami

    (The Institute of Statistical Mathematics / National Statistics Center, Japan)

  • Yutaka Abe

    (National Statistics Center, Japan)

Abstract

To perform statistical disclosure control (SDC) on tabular data is a challenging task because we need to ensure that every suppressed cell of a table has a sufficient width of a confidentiality interval under the presence of linear relations among cell variables. However, we find that the existing SDC tool (i.e., t-ARGUS) does not effectively support an output checking process of the on-site use program in Japan. We therefore develop a new SDC tool in R, which produces safe tabular data with auxiliary information that is necessary for an output checker to verify its safety. In this paper, we describe the major features of our SDC tool and discuss possible extensions in the future. Our SDC tool performs primary suppressions on a frequency table and a magnitude table with the minimum frequency rule and an occupancy rule (e.g., (n,k)-rule), respectively. We implement the optimal secondary suppression mechanism based on the technique of Benders decomposition.

Suggested Citation

  • Kazuhiro Minami & Yutaka Abe, 2017. "Statistical Disclosure Control for Tabular Data in R," Romanian Statistical Review, Romanian Statistical Review, vol. 65(4), pages 67-76, December.
  • Handle: RePEc:rsr:journl:v:65:y:2017:i:4:p:67-76
    as

    Download full text from publisher

    File URL: http://www.revistadestatistica.ro/wp-content/uploads/2017/11/RRS-4_2017_A05.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Castro, Jordi, 2012. "Recent advances in optimization techniques for statistical tabular data protection," European Journal of Operational Research, Elsevier, vol. 216(2), pages 257-269.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Daniel Baena & Jordi Castro & Antonio Frangioni, 2020. "Stabilized Benders Methods for Large-Scale Combinatorial Optimization, with Application to Data Privacy," Management Science, INFORMS, vol. 66(7), pages 3051-3068, July.
    2. Sage, Andrew J. & Wright, Stephen E., 2016. "Obtaining cell counts for contingency tables from rounded conditional frequencies," European Journal of Operational Research, Elsevier, vol. 250(1), pages 91-100.

    More about this item

    Keywords

    Statistical Disclosure Control; Output checking; Cell suppressions; Linear programming;
    All these keywords.

    JEL classification:

    • Z00 - Other Special Topics - - General - - - General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:rsr:journl:v:65:y:2017:i:4:p:67-76. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Adrian Visoiu (email available below). General contact details of provider: https://edirc.repec.org/data/stagvro.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.