Federated Learning Simulator (FLSim)
Federated Learning Simulator (FLSim) is a flexible, standalone library written in PyTorch that simulates FL settings with a minimal, easy-to-use API. FLSim is domain-agnostic and accommodates many use cases such as computer vision and natural text. Currently FLSim supports cross-device FL, where millions of clients' devices (e.g. phones) traing a model collaboratively together.
FLSim is scalable and fast. It supports differential privacy (DP), secure aggregation (secAgg), and variety of compression techniques.
In FL, a model is trained collaboratively by multiple clients that each have their own local data, and a central server moderates training, e.g. by aggregating model updates from multiple clients.
In FLSim, developers only need to define a dataset, model, and metrics reporter. All other aspects of FL training are handled internally by the FLSim core library.
FLSim
Library Structure
FLSim core components follow the same semantic as FedAvg. The server comprises three main features: selector, aggregator, and optimizer at a high level. The selector selects clients for training, and the aggregate aggregates client updates until a round is complete. Then, the optimizer optimizes the server model based on the aggregated gradients. The server communicates with the clients via the channel. The channel then compresses the message between the server and the clients. Locally, the client composes of a dataset and a local optimizer. This local optimizer can be SGD, FedProx, or a custom Pytorch optimizer.
Installation
The latest release of FLSim can be installed via pip
:
pip install flsim
You can also install directly from the source for the latest features (along with its quirks and potentially ocassional bugs):
git clone https://github.com/facebookresearch/FLSim.git
cd FLSim
pip install -e .
Getting started
To implement a central training loop in the FL setting using FLSim, a developer simply performs the following steps:
- Build their own data pipeline to assign individual rows of training data to client devices (to simulate data is distributed across client devices)
- Create a corresponding
nn/Module
model and wrap it in an FL model. - Define a custom metrics reporter that computes and collects metrics of interest (e.g., accuracy) throughout training.
- Set the desired hyperparameters in a config.
Usage Example
Tutorials
- Image classification with CIFAR-10
- Sentiment classification with LEAF's Sent140
- Compression for communication efficiency
- Adding a custom communication channel
To see the details, please refer to the tutorials that we have prepared.
Examples
We have prepared the runnable exampels for 2 of the tutorials above:
Contributing
See the CONTRIBUTING for how to contribute to this library.
License
This code is released under Apache 2.0, as found in the LICENSE file.