Description

Code of paper 'Byzantine-Resilient Decentralized TD Learning with Linear Function Approximation'. (Arxiv preprint: https://arxiv.org/abs/2009.11146)

'robust_td.py' is the main code. 'plotter.py' and 'subplotter.py' are for plotting. Other codes are utilities.

Instruction

Check environment requirements: python(>=3.6), numpy, matplotlib, scipy, gym(0.10.5), tensorflow(>=2.0.0)
Install the multiagent env package in this file. cd into root of this file and run: pip install -e multiagent-particle-envs-master
An example to run the code in console: python robust_td.py --network renyi --attack 2 --lam 0. --lr 0.1 --diminish --plot

Running

Different Topologies

Figure: MSBE and MCE versus step k under sign flipping attacks in a complete network.

python robust_td.py --network complete --attack 2 --lam 0. --lr 0.1 --diminish --mc 10
python robust_td.py --network complete --attack 2 --lam 0.3 --lr 0.1 --diminish --mc 10
python robust_td.py --network complete --attack 2 --lam 0.6 --lr 0.1 --diminish --mc 10
python robust_td.py --network complete --attack 2 --lam 0.9 --lr 0.05 --diminish --mc 10
python subplotter.py --network complete --attack 2 --lams 0 0.3 0.6 0.9 --lnk

Figure: MSBE and MCE versus step k under sign flipping in a Erdos-Renyi network.

python robust_td.py --network renyi --attack 2 --lam 0. --lr 0.1 --diminish --mc 10
python robust_td.py --network renyi --attack 2 --lam 0.3 --lr 0.1 --diminish --mc 10
python robust_td.py --network renyi --attack 2 --lam 0.6 --lr 0.1 --diminish --mc 10
python robust_td.py --network renyi --attack 2 --lam 0.9 --lr 0.05 --diminish --mc 10
python subplotter.py --network renyi --attack 2 --lams 0 0.3 0.6 0.9 --lnk

Figure: MSBE and MCE versus step k under sign flipping in a H2B1 network.

python robust_td.py --network h2b1 --attack 2 --lam 0. --lr 0.1 --diminish --mc 10
python robust_td.py --network h2b1 --attack 2 --lam 0.3 --lr 0.1 --diminish --mc 10
python robust_td.py --network h2b1 --attack 2 --lam 0.6 --lr 0.1 --diminish --mc 10
python robust_td.py --network h2b1 --attack 2 --lam 0.9 --lr 0.05 --diminish --mc 10
python subplotter.py --network h2b1 --attack 2 --lams 0 0.3 0.6 0.9 --lnk

Figure: MSBE and MCE versus step k under sign flipping in a H3B1 network.

python robust_td.py --network h3b1 --attack 2 --lam 0. --lr 0.1 --diminish --mc 10
python robust_td.py --network h3b1 --attack 2 --lam 0.3 --lr 0.1 --diminish --mc 10
python robust_td.py --network h3b1 --attack 2 --lam 0.6 --lr 0.1 --diminish --mc 10
python robust_td.py --network h3b1 --attack 2 --lam 0.9 --lr 0.05 --diminish --mc 10
python subplotter.py --network h3b1 --attack 2 --lams 0 0.3 0.6 0.9 --lnk

Figure: MSBE and MCE versus step k under sign flipping in a H4B1 network.

python robust_td.py --network h4b1 --attack 2 --lam 0. --lr 0.1 --diminish --mc 10
python robust_td.py --network h4b1 --attack 2 --lam 0.3 --lr 0.1 --diminish --mc 10
python robust_td.py --network h4b1 --attack 2 --lam 0.6 --lr 0.1 --diminish --mc 10
python robust_td.py --network h4b1 --attack 2 --lam 0.9 --lr 0.05 --diminish --mc 10
python subplotter.py --network h4b1 --attack 2 --lams 0 0.3 0.6 0.9 --lnk

Different Byzantine attacks

Figure: MSBE and MCE versus step k under same value in a Erdos-Renyi network.

python robust_td.py --network renyi --attack 0 --lam 0. --lr 0.1 --diminish --mc 10
python robust_td.py --network renyi --attack 0 --lam 0.3 --lr 0.1 --diminish --mc 10
python robust_td.py --network renyi --attack 0 --lam 0.6 --lr 0.1 --diminish --mc 10
python robust_td.py --network renyi --attack 0 --lam 0.9 --lr 0.05 --diminish --mc 10
python subplotter.py --network renyi --attack 0 --lams 0 0.3 0.6 0.9 --lnk

Figure: MSBE and MCE versus step k under Gaussian noise in a Erdos-Renyi network.

python robust_td.py --network renyi --attack 1 --lam 0. --lr 0.1 --diminish --mc 10
python robust_td.py --network renyi --attack 1 --lam 0.3 --lr 0.1 --diminish --mc 10
python robust_td.py --network renyi --attack 1 --lam 0.6 --lr 0.1 --diminish --mc 10
python robust_td.py --network renyi --attack 1 --lam 0.9 --lr 0.05 --diminish --mc 10
python subplotter.py --network renyi --attack 1 --lams 0 0.3 0.6 0.9 --lnk

Different reward variation

Figure: Asymptotic MSBE under sign flipping in a H3B1 network.

python robust_td.py --network renyi --attack 2 --lam 0. --lr 0.05 --diminish --vars 0 0.5 1.0 1.5
python robust_td.py --network renyi --attack 2 --lam 0.3 --lr 0.05 --diminish --vars 0 0.5 1.0 1.5
python robust_td.py --network renyi --attack 2 --lam 0.6 --lr 0.05 --diminish --vars 0 0.5 1.0 1.5
python robust_td.py --network renyi --attack 2 --lam 0.9 --lr 0.05 --diminish --vars 0 0.5 1.0 1.5
python plotter_var.py --network renyi --attack 2 --lams 0 0.3 0.6 0.9 --vars 0.0 0.5 1.0 1.5

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
figure		figure
multiagent-particle-envs-master		multiagent-particle-envs-master
record		record
thesis-figure		thesis-figure
RenyiGraph.py		RenyiGraph.py
h2b1.txt		h2b1.txt
h3b1.txt		h3b1.txt
h4b1.txt		h4b1.txt
make_env.py		make_env.py
plotter.py		plotter.py
plotter_var.py		plotter_var.py
readme.md		readme.md
robust_td.py		robust_td.py
subplotter.py		subplotter.py
thesis-readme.md		thesis-readme.md
thesis_plotter_var.py		thesis_plotter_var.py
thesis_subplotter-slides.py		thesis_subplotter-slides.py
thesis_subplotter.py		thesis_subplotter.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Description

Instruction

Running

Different Topologies

Different Byzantine attacks

Different reward variation

About

Releases

Packages

Contributors 2

Languages

Zhaoxian-Wu/Byrd-TD

Folders and files

Latest commit

History

Repository files navigation

Description

Instruction

Running

Different Topologies

Different Byzantine attacks

Different reward variation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages