Themis ML
themis-ml
is a Python library built on top of pandas
and sklearn
that implements fairness-aware machine learning algorithms.
Fairness-aware Machine Learning
themis-ml
defines discrimination as the preference (bias) for or against a set of social groups that result in the unfair treatment of its members with respect to some outcome.
It defines fairness as the inverse of discrimination, and in the context of a machine learning algorithm, this is measured by the degree to which the algorithm's predictions favor one social group over another in relation to an outcome that holds socioeconomic, political, or legal importance, e.g. the denial/approval of a loan application.
A "fair" algorithm depends on how we define fairness. For example if we define fairness as statistical parity, a fair algorithm is one in which the proportion of approved loans among minorities is equal to the proportion of approved loans among white people.
Features
Here are a few of the discrimination discovery and fairness-aware techniques that this library implements.
Measuring Discrimination
- Mean difference
- Normalized mean difference
- Consistency
- Situation Test Score
Mitigating Discrimination
Preprocessing
- Relabelling (Massaging)
- Reweighting
- Sampling
Model Estimation
- Additive Counterfactually Fair Estimator
- Prejudice Remover Regularized Estimator
Postprocessing
- Reject Option Classification
- Discrimination-aware Ensemble Classification
Datasets
themis-ml
also provides utility functions for loading freely available datasets from a variety of sources.
- German Credit (source)
- Census Income (source)
- Taiwan Credit Default (source)
- Australian Credit Approval (source)
- Adult Census (source)
- Communities and Crime (source)
- Disabled Residents Expenditure (source)
Installation
The source code is currently hosted on GitHub: https://github.com/cosmicBboy/themis-ml. You can install the latest released version with conda
or pip
.
# conda
conda install -c cosmicbboy themis-ml
If you install with pip, you'll need to install scikit-learn, numpy, and pandas with either pip or conda. Version requirements:
- numpy (>= 1.9.0)
- scikit-learn (>= 0.19.1)
- pandas (>= 0.22.0)
# pip
pip install themis-ml
Documentation
Official documentation for this package can be found here
References
You can find a complete set of references for the discrimination discovery and fairness-aware methods implemented in themis-ml
in this paper.