Byzantine-robust decentralized learning via self-centered clipping

In this paper, we study the challenging task of Byzantine-robust decentralized training on arbitrary communication graphs. Unlike federated learning, where workers communicate through a server, workers in the decentralized environment can only talk to their neighbors, making it harder to reach consensus. We identify a novel dissensus attack in which a few malicious nodes can take advantage of information bottlenecks in the topology to poison the collaboration. To address these issues, we propose a Self-Centered Clipping (SCClip) algorithm for Byzantine-robust consensus and optimization, which is the first to provably converge to an $O(\delta_{\max}\zeta^2/\gamma^2)$ neighborhood of a stationary point for non-convex objectives under standard assumptions. Finally, we demonstrate the encouraging empirical performance of SCClip under a large number of attacks.
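
As a rough illustration of the idea behind self-centered clipping, the sketch below assumes the aggregation step has each worker move toward its neighbors by clipped differences, i.e. x_i ← x_i + Σ_j w_ij · clip(x_j − x_i, τ). This form is an assumption made for illustration, and the names `clip` and `self_centered_clip`, the mixing weights, and the clipping radius `tau` are hypothetical. See codes/ for the actual implementation.

```python
import math


def clip(vec, tau):
    """Rescale vec so that its Euclidean norm is at most tau."""
    norm = math.sqrt(sum(v * v for v in vec))
    if norm <= tau or norm == 0.0:
        return list(vec)
    return [v * tau / norm for v in vec]


def self_centered_clip(x_self, neighbors, weights, tau):
    """One aggregation step for a single worker (illustrative sketch only).

    x_self:    this worker's own model, a list of floats
    neighbors: models received from the worker's neighbors
    weights:   one mixing weight per neighbor (hypothetical values)
    tau:       clipping radius (hypothetical value)
    """
    out = list(x_self)
    for x_j, w in zip(neighbors, weights):
        # Clip the difference to the worker's own model, not the raw message.
        diff = [a - b for a, b in zip(x_j, x_self)]
        clipped = clip(diff, tau)
        out = [o + w * c for o, c in zip(out, clipped)]
    return out


honest = [1.0, 1.0]
byzantine = [1000.0, -1000.0]  # an arbitrarily large malicious message
x_new = self_centered_clip([0.0, 0.0], [honest, byzantine], [0.25, 0.25], tau=2.0)
```

Because each neighbor's contribution is clipped relative to the worker's own model, a single malicious message can move the worker by at most its weight times `tau`, however large the message is.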

Code organization
Reproduction
License
Reference

Code organization

The structure of the repository is as follows:

codes/
- Source code.
outputs/
- Stores the output of the launcher scripts.
consensus.ipynb: Study the error of aggregators relative to the average consensus under the dissensus attack.
- This notebook generates Fig. 3 in the main text and Fig. 8 in the appendix.
dumbbell.py: Study how topology and heterogeneity influence the aggregators.
dumbbell_improvement.py: Study how to help aggregators cope with the influence of topology and heterogeneity.
dumbbell.ipynb: Plot the results of dumbbell.py and dumbbell_improvement.py.
- Generates Fig. 4 in the main text.
optimization_delta.py: Fix p and zeta^2 and vary the delta of the dissensus attack for the SCClip aggregator.
- Generates Fig. 5 in the main text.
honest_majority.py: Study the influence of the honest majority.
- Generates Fig. 6 in the main text.
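
The dumbbell experiments concern a topology with an information bottleneck. As a sketch of what such a graph might look like, the snippet below builds two fully connected cliques joined by a single edge. Whether dumbbell.py constructs its graph this way is an assumption, and `dumbbell_adjacency` is a hypothetical helper, not a function from codes/.

```python
def dumbbell_adjacency(clique_size):
    """Adjacency matrix of two cliques of clique_size nodes joined by one edge."""
    n = 2 * clique_size
    adj = [[0] * n for _ in range(n)]
    # Fully connect the nodes inside each of the two cliques.
    for offset in (0, clique_size):
        for i in range(offset, offset + clique_size):
            for j in range(offset, offset + clique_size):
                if i != j:
                    adj[i][j] = 1
    # A single bridge edge between the cliques forms the bottleneck.
    a, b = clique_size - 1, clique_size
    adj[a][b] = adj[b][a] = 1
    return adj


adj = dumbbell_adjacency(3)
# Count the edges that cross from the first clique to the second.
cut_edges = sum(adj[i][j] for i in range(3) for j in range(3, 6))
```

All communication between the two halves passes through the bridge edge, which is the kind of bottleneck an attacker could target.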

Reproduction

To reproduce the results in the paper, follow these steps:

Add codes/ to the PYTHONPATH environment variable.
Install the dependencies: pip install -r requirements.txt
Run bash run.sh and select an option from 2 to 9 to generate the results.
The output will be saved to the corresponding folders under outputs/.
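
For readers who prefer to set up the environment from Python, the first step can be sketched as below. The helper `env_with_codes_on_path` is hypothetical and not part of the repository; it only builds an environment dictionary with codes/ prepended to PYTHONPATH and does not run anything.

```python
import os


def env_with_codes_on_path(repo_root, base_env=None):
    """Return a copy of the environment with <repo_root>/codes prepended to PYTHONPATH."""
    env = dict(os.environ if base_env is None else base_env)
    codes_dir = os.path.join(repo_root, "codes")
    existing = env.get("PYTHONPATH", "")
    # Keep any existing entries after codes/ so they remain importable.
    env["PYTHONPATH"] = codes_dir + (os.pathsep + existing if existing else "")
    return env


env = env_with_codes_on_path(".", base_env={"PYTHONPATH": "/opt/lib"})
```

The resulting dictionary could then be passed as the `env` argument of `subprocess.run` when launching run.sh from the repository root.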

Note that if the GPU memory is small (e.g. less than 16 GB), running the previous commands may raise an out-of-memory exception. In this case, one can decrease the level of parallelism in the script by changing the order of the loops and reducing the number of parallel processes.

License

This repo is covered under The MIT License.

Reference

TODO


Owner

EPFL Machine Learning and Optimization Laboratory
