IDEAS home Printed from https://ideas.repec.org/p/osf/socarx/ustxg_v1.html
   My bibliography  Save this paper

Fairer machine learning in the real world: Mitigating discrimination without collecting sensitive data

Author

Listed:
  • Veale, Michael

    (University College London)

  • Binns, Reuben

Abstract

Cite as: Veale, Michael and Binns, Reuben (2017) Fairer machine learning in the real world: Mitigating discrimination without collecting sensitive data. Big Data & Society 4(2). doi:10.1177/2053951717743530 Decisions based on algorithmic, machine learning models can be unfair, reproducing biases in historical data used to train them. While computational techniques are emerging to address aspects of these concerns through communities such as discrimination-aware data mining (DADM) and fair, accountable and transparent machine learning (FATML), their practical implementation faces real-world challenges. For legal, institutional or commercial reasons, organisations might not hold the data on sensitive attributes such as gender, ethnicity, sexuality or disability needed to diagnose and mitigate emergent indirect discrimination-by-proxy, such as redlining. Such organisations might also lack the knowledge and capacity to identify and manage fairness issues that are emergent properties of complex sociotechnical systems. This paper presents and discusses three potential approaches to deal with such knowledge and information deficits in the context of fairer machine learning. Trusted third parties could selectively store data necessary for performing discrimination discovery and incorporating fairness constraints into model-building in a privacy-preserving manner. Collaborative online platforms would allow diverse organisations to record, share and access contextual and experiential knowledge to promote fairness in machine learning systems. Finally, unsupervised learning and pedagogically interpretable algorithms might allow fairness hypotheses to be built for further selective testing and exploration. Real-world fairness challenges in machine learning are not abstract, constrained optimisation problems, but are institutionally and contextually grounded. Computational fairness tools are useful, but must be researched and developed in and with the messy contexts that will shape their deployment, rather than just for imagined situations. Not doing so risks real, near-term algorithmic harm.

Suggested Citation

  • Veale, Michael & Binns, Reuben, 2017. "Fairer machine learning in the real world: Mitigating discrimination without collecting sensitive data," SocArXiv ustxg_v1, Center for Open Science.
  • Handle: RePEc:osf:socarx:ustxg_v1
    DOI: 10.31219/osf.io/ustxg_v1
    as

    Download full text from publisher

    File URL: https://osf.io/download/59f3559c9ad5a1026d107902/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/ustxg_v1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. repec:elg:eebook:14251 is not listed on IDEAS
    2. Munawer Sultan Khwaja & Rajul Awasthi & Jan Loeprick, 2011. "Risk-Based Tax Audits : Approaches and Country Experiences," World Bank Publications - Books, The World Bank Group, number 2314.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dina Pomeranz & José Vila-Belda, 2019. "Taking State-Capacity Research to the Field: Insights from Collaborations with Tax Authorities," Annual Review of Economics, Annual Reviews, vol. 11(1), pages 755-781, August.
    2. Dai, Zhixin & Hogarth, Robin M. & Villeval, Marie Claire, 2015. "Ambiguity on audits and cooperation in a public goods game," European Economic Review, Elsevier, vol. 74(C), pages 146-162.
    3. Henselmann, Klaus & Haller, Stefanie, 2017. "Potentielle Risikofaktoren für die Erhöhung der Betriebsprüfungswahrscheinlichkeit - Eine analytische und empirische Untersuchung auf Basis der E-Bilanz-Taxonomie 6.0 -," Working Papers in Accounting Valuation Auditing 2017-1, Friedrich-Alexander University Erlangen-Nuremberg, Chair of Accounting and Auditing.
    4. Eberhartinger, Eva & Safaei, Reyhaneh & Sureth, Caren & Wu, Yuchen, 2021. "Are risk-based tax audit stretegies rewarded? An analysis of corporate tax avoidance," arqus Discussion Papers in Quantitative Tax Research 267, arqus - Arbeitskreis Quantitative Steuerlehre.
    5. Semjén, András, 2017. "Az adózói magatartás különféle magyarázatai [Various explanations for tax compliance]," Közgazdasági Szemle (Economic Review - monthly of the Hungarian Academy of Sciences), Közgazdasági Szemle Alapítvány (Economic Review Foundation), vol. 0(2), pages 140-184.
    6. Saudin Terzić, 2017. "Model for determining subjective and objective factors of tax evasion," Notitia - journal for economic, business and social issues, Notitia Ltd., vol. 1(3), pages 49-62, December.
    7. Jaime Vázquez-Caro & Richard M. Bird, 2011. "Benchmarking Tax Administrations in Developing Countries: A Systemic Approach," International Center for Public Policy Working Paper Series, at AYSPS, GSU paper1104, International Center for Public Policy, Andrew Young School of Policy Studies, Georgia State University.
    8. Veale, Michael & Binns, Reuben, 2017. "Fairer machine learning in the real world: Mitigating discrimination without collecting sensitive data," SocArXiv ustxg, Center for Open Science.
    9. Serdar ÇİÇEK & Hüseyin Güçlü ÇİÇEK & Elif Ayşe ŞAHİN-İPEK, 2019. "Behavioral Approach to Tax Compliance Process: Taxpayer Behaviors and Typologies," Sosyoekonomi Journal, Sosyoekonomi Society.
    10. Okunogbe,Oyebola Motunrayo & Santoro,Fabrizio, 2021. "The Promise and Limitations of Information Technology for Tax Mobilization," Policy Research Working Paper Series 9848, The World Bank.
    11. Era Dabla-Norris & Florian Misch & Duncan Cleary & Munawer Khwaja, 2020. "The quality of tax administration and firm performance: evidence from developing countries," International Tax and Public Finance, Springer;International Institute of Public Finance, vol. 27(3), pages 514-551, June.
    12. von Haldenwang, Christian, 2020. "Digitalising the fiscal contract: An interdisciplinary framework for empirical inquiry," IDOS Discussion Papers 20/2020, German Institute of Development and Sustainability (IDOS).
    13. Petros Dellaportas & Evangelos Ioannidis & Christos Kotsogiannis, 2021. "Sample size determination for risk‐based tax auditing," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(2), pages 479-493, April.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:socarx:ustxg_v1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://arabixiv.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.