Consumer Fairness in Recommender Systems: Contextualizing Definitions and Mitigations
This is the repository for the paper Consumer Fairness in Recommender Systems: Contextualizing Definitions and Mitigations, developed by Giacomo Medda, PhD student at University of Cagliari, with the support of Gianni Fenu, Full Professor at University of Cagliari, Mirko Marras, Non-tenure Track Assistant Professor at University of Cagliari, and Ludovico Boratto, Tenure Track Assistant Professor at University of Cagliari.
The goal of the paper is to reach a common understanding and practical benchmarks of how and when each procedure for consumer fairness in recommender systems can be used in comparison to the others.
Repository Organization
- reproducibility_study: contains the source code of each reproduced paper, identified by the author names of the respective paper:
- Ashokan and Haas: Fairness metrics and bias mitigation strategies for rating predictions
- Burke et al: Balanced Neighborhoods for Multi-sided Fairness in Recommendation
- Ekstrand et al: All The Cool Kids, How Do They Fit In? Popularity and Demographic Biases in Recommender Evaluation and Effectiveness
- Frisch et al: Co-clustering for fair recommendation
- Kamishima et al: Recommendation Independence
- Li et al: User-oriented Fairness in Recommendation
- Rastegarpanah et al: Fighting Fire with Fire: Using Antidote Data to Improve Polarization and Fairness of Recommender Systems
- Wu et al: Learning Fair Representations for Recommendation: A Graph-based Perspective
- Preprocessing: contains the scripts to preprocess the raw datasets and to generate the input data for each reproduced paper.
- Evaluation: contains the scripts to load the predictions of each reproduced paper, compute the metrics, and generate plots and tables in LaTeX and Markdown form.
- Other Folders: the folders not already mentioned are part of the codebase that supports the scripts contained in Preprocessing and Evaluation. These directories and their contents are described in README_codebase, since the structure and code inside them only support the reproducibility study and are independent of the specific implementation of each paper.
Reproducibility Pipeline
- Code Integration. The preprocessing of the raw datasets is performed by the following scripts:
- preprocess_ml1m: script to preprocess MovieLens 1M
- preprocess_lastfm1K: script to preprocess Last.FM 1K
The commands to preprocess each dataset are listed at the top of the related script, and the procedure is described in more detail in REPRODUCE.md. The preprocessed datasets will be saved in data/preprocessed_datasets.
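For illustration, a minimal sketch of how these scripts might be invoked from the repository root (the `.py` paths and the absence of flags are assumptions; the exact commands are listed at the top of each script and in REPRODUCE.md):

```bash
# Hypothetical invocations; check the top of each script for the real commands
python3 Preprocessing/preprocess_ml1m.py
python3 Preprocessing/preprocess_lastfm1K.py
```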
Once the MovieLens 1M and Last.FM 1K datasets have been preprocessed, we can move on to the generation of the input data for each reproduced paper:
- generate_input_data: script to generate the input data of each reproduced paper
The commands to generate the input data for each preprocessed dataset and sensitive attribute are listed at the top of the script, and the procedure is described in more detail in REPRODUCE.md. The generated input data will be saved in Preprocessing/input_data.
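As a hedged sketch, an invocation might look like the following (the flag names and values are assumptions, not the script's documented interface; the real commands are at the top of the script and in REPRODUCE.md):

```bash
# Hypothetical flags for one dataset/attribute combination
python3 Preprocessing/generate_input_data.py \
    --dataset movielens_1m \
    --sensitive_attribute gender
```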
- Mitigation Execution. Each paper folder listed in the subsection reproducibility_study of Repository Organization contains a REPRODUCE.md file that describes everything needed to set up, prepare, and run that reproduced paper. In particular, it provides instructions to install the dependencies and indicates the specific subfolders to fill with the input data generated in the previous step, in order to properly run the experiments of the selected paper. The procedure for each source code is described in detail in the already mentioned REPRODUCE.md file.
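To make the flow concrete, here is a minimal sketch of what the setup for one paper could look like (the folder name, the placeholder paths, and the input subfolder are assumptions; the paper's REPRODUCE.md gives the actual steps):

```bash
# Hypothetical per-paper setup; names below are illustrative only
cd reproducibility_study/Li_et_al
pip install -r requirements.txt
# copy the input data generated in the previous step into the subfolder
# indicated by the paper's REPRODUCE.md
cp -r ../../Preprocessing/input_data/<dataset>/ <input_subfolder>/
```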
- Relevance Estimation and Metrics Computation. The REPRODUCE.md file contained in each paper folder also describes where the predictions can be found at the end of the mitigation procedure, and guides the developer to follow the instructions of the REPRODUCE.md of Evaluation, which covers:
- metrics_reproduced: script that loads all the predictions of relevance scores and computes the metrics in the form of plots and LaTeX tables. This is the script that requires the most configuration, since the paths of the specific predictions of each paper and model may need to be copied and pasted inside the script if the filenames do not correspond to what the script expects. The already mentioned REPRODUCE.md describes these steps in more detail and specifies the commands to execute to get the desired results.
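For orientation, a hedged sketch of running the evaluation (the working directory and the absence of flags are assumptions; the Evaluation REPRODUCE.md gives the exact commands):

```bash
# Hypothetical run: edit the prediction paths at the top of the script
# first, then execute it from the Evaluation folder
cd Evaluation
python3 metrics_reproduced.py
```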
Installation
Considering the codebase and the different versions of the libraries used by each paper, multiple Python versions are required to execute this code properly.
The codebase (that is, the code not inside reproducibility_study, Preprocessing, and Evaluation) needs a Python 3.8 installation, and all the necessary dependencies can be installed with the requirements.txt file in the root of the repository, using the following command on Windows:

```bash
pip install -r requirements.txt
```

or on Linux:

```bash
pip3 install -r requirements.txt
```
The installation of each reproducible paper is thoroughly described in the REPRODUCE.md that you can find in each paper folder, and every folder contains a requirements.txt file that you can use to install the dependencies in the same way. We recommend using a virtual environment at least for each reproduced paper, since some require specific versions of Python (2, 3, 3.7), and a dedicated virtual environment per paper keeps the code organization in good order. Virtual environments can be created in different ways, depending on the Python version and on the system. The Python documentation describes the creation of virtual environments for Python >= 3.5, while the virtualenv website can be used for Python 2.
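As a minimal sketch, creating and activating a per-paper environment could look like this (the paper folder and the Python version are assumptions; check the paper's REPRODUCE.md for the version it actually requires):

```bash
# Hypothetical per-paper virtual environment (Python >= 3.5)
cd reproducibility_study/Kamishima_et_al
python3.7 -m venv venv
source venv/bin/activate   # on Windows: venv\Scripts\activate
pip install -r requirements.txt

# For papers that require Python 2, virtualenv can be used instead:
# virtualenv -p python2.7 venv
```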