Simulation-based performance analysis of server-less Blockchain-enabled Federated Learning

Last update: Sep 27, 2022

Overview

Blockchain-enabled Server-less Federated Learning

Repository containing the files used to reproduce the results of the publication "Blockchain-enabled Server-less Federated Learning".

''BibTeX'' citation:

@article{wilhelmi2021blockchain,
  title={Blockchain-enabled Server-less Federated Learning},
  author={Wilhelmi, Francesc, Giupponi, Lorenza and Dini, Paolo},
  journal={arXiv preprint arXiv:2112.07938
},
  year={2021}
}

Authors
Publication's abstract
Repository description
Usage
Performance Evaluation
References
Contribute

Authors

Abstract

Motivated by the heterogeneous nature of devices participating in large-scale Federated Learning (FL) optimization, we focus on an asynchronous server-less FL solution empowered by Blockchain (BC) technology. In contrast to mostly adopted FL approaches, which assume synchronous operation, we advocate an asynchronous method whereby model aggregation is done as clients submit their local updates. The asynchronous setting fits well with the federated optimization idea in practical large-scale settings with heterogeneous clients. Thus, it potentially leads to higher efficiency in terms of communication overhead and idle periods. To evaluate the learning completion delay of BC-enabled FL, we provide an analytical model based on batch service queue theory. Furthermore, we provide simulation results to assess the performance of both synchronous and asynchronous mechanisms. Important aspects involved in the BC-enabled FL optimization, such as the network size, link capacity, or user requirements, are put together and analyzed. As our results show, the synchronous setting leads to higher prediction accuracy than the asynchronous case. Nevertheless, asynchronous federated optimization provides much lower latency in many cases, thus becoming an appealing FL solution when dealing with large data sets, tough timing constraints (e.g., near-real-time applications), or highly varying training data.

Repository description

This repository contains the resources used to generate the results included in the paper entitled "Blockchain-enabled Server-less Federated Learning". The files included in this repository are:

LaTeX files: contains the files used to generate the manuscript.
Code & Results: scripts and code used to generate the results included in the paper.

Queue code: scripts used to execute the Blockchain queuing delay simulations through the batch-service queue simulator.
TensorFlow code: python scripts used to execute the FL mechanisms through TensorFlowFederated.
Matlab code: matlab scripts used to process the results and plot the figures included in the manuscript.
Outputs: files containing the outputs from the different resources (queue simulator, TFF).
Figures: figures included in the manuscript and others with preliminary results.

Usage

Part 1: Batch service queue analysis

To generate the results related to the analysis of the queueing delay in the Blockchain, we used our batch-service queue simulator (commit: f846b66). Please, refer to that repository's documentation for installation/execution guidelines. As for the corresponding theoretical background, more details can be found in [1].

The obtained results from this part can be found at "Matlab code/output_queue_simulator". To reproduce them, execute the scripts from the "Batch service queue" folder in the batch-service queue simulator.

Part 2: FLchain analysis

Tensorflow Federated (TFF) has been used to evaluate the proposed s-FLchain and a-FLchain mechanisms in the manuscript. To get started with TF (and TFF), we strongly recommend using the tutorials in https://www.tensorflow.org/federated/tutorials/tutorials_overview.

Once the TFF environment has been setup, our results can be reproduced by using the scripts in "TensorFlow code":

centalized_baseline.py: centralized ML model for getting baseline results (upper/lower bounds).
sFLchain_vs_aFLchain.py: script generating the output for the comparison of the synchronous and the asynchronous models.

The output results from this part can be found at "Matlab code/output_tensorflow".

Part 3: End-to-end analysis framework

Finally, to gather all the resources together, we have used the end-to-end latency framework contained in this repository ("Matlab code/simulation_scripts"). Those files contain the communication and computation models used to calculate the total latency experienced by each considered Blockchain-enabled FL mechanism. Moreover, to get the end-to-end latency and accuracy results, the abovementioned scripts gather and process the outputs obtained from both batch-service queue simulator and TFF.

Content:

0_preliminary_results: evaluation of several FL parameters via TFF (out of the scope of this publication).
1_blockchain_analysis: evaluation of the Blockchain queuing delay (refer to Part 1: Batch service queue analysis).
2_flchain: evaluation of the FL accuracy (refer to Part 2: FLchain analysis) and end-to-end latency analysis. Includes models to compute communication and computation-related delays.

Performance Evaluation

Simulation parameters

The simulation parameters used in the publication are as follows:

	Parameter	Value
	Number of miners	19
	Transaction size	5 kbits
BC	Block header size	20 kbits
	Max. waiting time	1000 seconds
	Queue length	1000 packets
---------	---------------------------------------	----------------------
	Min/max distance Client-BS	0/4.15 meters
	Bandwidth.	180 kHz
	Min/max distance Client-BS	2 GHz
	Min/max distance Client-BS	0 dBi
Comm.	Loss at the reference distance (P_L0)	5 dB
	Path-loss exponent (α)	4.4
	Shadowing factor (σ)	9.5
	Obstacles factor (γ)	30
	Ground noise	-95 dBm
	Capacity P2P links	5 Mbps
---------	---------------------------------------	----------------------
	Learning algorithm	Neural Network
	Number of hidden layers	2
	Activation function	ReLU
	Optimizer	SGD
	Loss function	Cat. cross-entropy
ML	Learning rate (local/global)	0.01/1
	Epochs number	5
	Batch size	20
	CPU cycles to process a data point	10^-5
	Clients' clock speed	1 GHz

Simulation Results

In what follows, we present the results presented in the manuscript. First, we refer to the Blockchain queuing delay analysis, where we assess the sensitivity of the Blockchain on various parameters, including the block size, the mining rate, the traffic intensity, or the miners' communication capacity.

Next, we provide a broader vision of the Blockchain transaction confirmation latency by including other delays different than the queuing delay, such as transaction upload, block generation, or block propagation.

Finally, we present the results obtained for the evaluation of s-FLchain and a-FLchain in terms of learning accuracy and learning completion time:

References

[1] Wilhelmi, F., & Giupponi, L. (2021). Discrete-Time Analysis of Wireless Blockchain Networks. arXiv preprint arXiv:2104.05586.

Contribute

If you want to contribute, please contact to [email protected].

Simulation-based performance analysis of server-less Blockchain-enabled Federated Learning

Related tags

Overview

Blockchain-enabled Server-less Federated Learning

Table of Contents

Authors

Abstract

Repository description

Usage

Part 1: Batch service queue analysis

Part 2: FLchain analysis

Part 3: End-to-end analysis framework

Performance Evaluation

Simulation parameters

Simulation Results

References

Contribute

Owner

Francesc Wilhelmi

N-RPG - Novel role playing game da turfu

なりすまし検出(anti-spoof-mn3)のWebカメラ向けデモ

Source Code for AAAI 2022 paper "Graph Convolutional Networks with Dual Message Passing for Subgraph Isomorphism Counting and Matching"

Code for paper "Context-self contrastive pretraining for crop type semantic segmentation"

TACTO: A Fast, Flexible and Open-source Simulator for High-Resolution Vision-based Tactile Sensors

Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation

Unofficial PyTorch Implementation of Multi-Singer

An unofficial personal implementation of UM-Adapt, specifically to tackle joint estimation of panoptic segmentation and depth prediction for autonomous driving datasets.

Improving Transferability of Representations via Augmentation-Aware Self-Supervision

MoCap-Solver: A Neural Solver for Optical Motion Capture Data

PyTorch Live is an easy to use library of tools for creating on-device ML demos on Android and iOS.

Tensorflow implementation of "Learning Deconvolution Network for Semantic Segmentation"

Dynamic Bottleneck for Robust Self-Supervised Exploration

Minimal fastai code needed for working with pytorch

Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis

A computational block to solve entity alignment over textual attributes in a knowledge graph creation pipeline.

Code for "On Memorization in Probabilistic Deep Generative Models"

FairEdit: Preserving Fairness in Graph Neural Networks through Greedy Graph Editing

CBREN: Convolutional Neural Networks for Constant Bit Rate Video Quality Enhancement

A general-purpose, flexible, and easy-to-use simulator alongside an OpenAI Gym trading environment for MetaTrader 5 trading platform (Approved by OpenAI Gym)