
SDE-HARL

This repo is the official implementation for SDE-HARL: Scalable Distributed Policy Execution for Heterogeneous-Agent Reinforcement Learning (AAAI 2026). This repository allows researchers and practitioners to easily reproduce our results on seven challenging benchmarks or apply SDE-HARL algorithms to their intended applications.

Edge-Server Architecture

[Figure: edge-server architecture]

SDE-HARL Framework

[Figure: SDE-HARL framework overview]

Installation

Install HARL

conda create -n sde-harl python=3.8
conda activate sde-harl
# Install pytorch>=1.9.0 (CUDA>=11.0) manually
git clone https://github.com/Restuccia-Group/SDE_HARL.git
cd SDE_HARL
pip install -e .
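As a quick sanity check that the editable install worked, you can try importing the package. This is only a sketch and assumes the package is importable as harl (matching the harl/configs paths used below), which is an assumption rather than an official instruction:

# Hypothetical sanity check: assumes the package name is `harl`
python -c "import harl; print('HARL import OK')"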

Install Environments

We implement SDE-HARL on SMAC, SMACv2, and Google Research Football. However, you can also install MAMuJoCo, MPE, Bi-DexterousHands, and Light Aircraft Game for your own research purposes.

Install Google Research Football

Please follow the official instructions to install Google Research Football.

Install SMAC

Please follow the official instructions to install SMAC. We use StarCraft II version 4.10 on Linux.
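For reference, here is a minimal sketch of fetching StarCraft II 4.10 on Linux, mirroring the official SMAC instructions; the download URL, EULA password, and install path should be verified against the SMAC repository before use:

# Download and extract StarCraft II 4.10 for Linux (the archive is EULA-protected)
wget http://blzdistsc2-a.akamaihd.net/Linux/SC2.4.10.zip
unzip -P iagreetotheeula SC2.4.10.zip -d ~/
# Point SMAC to the installation directory
export SC2PATH=~/StarCraftII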

Install SMACv2

Please follow the official instructions to install SMACv2.

How to Run

Training

To train an algorithm on a provided environment, modify the YAML configuration files of the corresponding algorithm and environment under harl/configs/algos_cfgs and harl/configs/envs_cfgs as desired, go to the examples folder, and start training with a one-liner: python train.py --algo <ALGO> --env <ENV> --exp_name <EXPERIMENT NAME> or python train.py --load_config <CONFIG FILE PATH> --exp_name <EXPERIMENT NAME>; the latter is mostly used when reproducing an experiment. We provide tuned configurations for each algorithm in each environment under the tuned_configs folder. Users can reproduce our results with python train.py --load_config <TUNED CONFIG PATH> --exp_name <EXPERIMENT NAME>, changing <TUNED CONFIG PATH> to the absolute path of the tuned config file on their machine.
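For instance, the two invocation styles might look like the following (the HAPPO/SMAC combination is illustrative, and the tuned config path must be replaced with the absolute path on your machine):

# Train HAPPO on SMAC with a custom experiment name
python train.py --algo happo --env smac --exp_name my_experiment
# Reproduce a result from a tuned config
python train.py --load_config <TUNED CONFIG PATH> --exp_name reproduce_happo_smac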

During training, users receive continuous logging feedback in the terminal.

After training, users can check the log file, TensorBoard output, experiment configuration, and saved models under the generated results folder. Users can also render the trained models by setting use_render: True and model_dir: <path to trained models> in the algorithm configuration file (for football, users also need to set render: True in the environment configuration file), and then running the same training command as above. For SMAC and SMACv2, rendering produces a video replay that is automatically saved to the StarCraftII/Replays folder (more details can be found here).
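As a sketch of the rendering workflow (the keys use_render, model_dir, and render come from the step above; the algorithm/environment choice and model path are placeholders):

# In the algorithm config under harl/configs/algos_cfgs, set:
#   use_render: True
#   model_dir: <path to trained models>
# For football, in the environment config under harl/configs/envs_cfgs, also set:
#   render: True
# Then rerun the same training command, e.g.:
python train.py --algo happo --env smac --exp_name render_check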

To enable batch running, we allow users to modify the YAML configs from the command line. For each training command, users specify the relevant parameters with the same names as in the config files. For example, to run HAPPO on SMAC tasks under three random seeds, you can customize the configs and replace train.sh with the following command:

python train.py --algo happo --env smac --exp_name test --seed $seed
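A minimal batch-run sketch, assuming a POSIX shell and the --seed option shown above, loops over the three seeds:

# Run HAPPO on SMAC under three random seeds
for seed in 1 2 3; do
    python train.py --algo happo --env smac --exp_name test --seed $seed
done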

Note that our code is modified to run heterogeneous-agent (HARL) algorithms, not homogeneous-agent ones. Currently, HAA2C, HAPPO, and HASAC work well, and we are updating the rest.

Real-world Prototype

To evaluate resource consumption, we implement a real-world prototype in which Raspberry Pi and Jetson Nano boards serve as low-power devices and a high-performance workstation serves as the server.

[Figure: real-world prototype setup]

Acknowledgements

This repo is based on the official implementation of Heterogeneous-Agent Reinforcement Learning at HARL, which provides the core HARL algorithms as well as the baselines. We also use the compression library CompressAI. Finally, our IAD and RAD approaches are inspired by the source code of CDS.

Thanks to the original authors for their work!

Citation

Please cite this work if you find it useful.

This repository also builds on the following works; please cite them where relevant:

@article{JMLR:v25:23-0488,
  author  = {Yifan Zhong and Jakub Grudzien Kuba and Xidong Feng and Siyi Hu and Jiaming Ji and Yaodong Yang},
  title   = {Heterogeneous-Agent Reinforcement Learning},
  journal = {Journal of Machine Learning Research},
  year    = {2024},
  volume  = {25},
  number  = {32},
  pages   = {1--67},
  url     = {http://jmlr.org/papers/v25/23-0488.html}
}
@inproceedings{liu2024maximum,
  title     = {Maximum Entropy Heterogeneous-Agent Reinforcement Learning},
  author    = {Jiarong Liu and Yifan Zhong and Siyi Hu and Haobo Fu and Qiang Fu and Xiaojun Chang and Yaodong Yang},
  booktitle = {The Twelfth International Conference on Learning Representations},
  year      = {2024},
  url       = {https://openreview.net/forum?id=tmqOhBC4a5}
}
@article{begaint2020compressai,
  title   = {CompressAI: a PyTorch library and evaluation platform for end-to-end compression research},
  author  = {B{\'e}gaint, Jean and Racap{\'e}, Fabien and Feltman, Simon and Pushparaja, Akshay},
  journal = {arXiv preprint arXiv:2011.03029},
  year    = {2020}
}
@article{chenghao2021celebrating,
  title   = {Celebrating diversity in shared multi-agent reinforcement learning},
  author  = {Li, Chenghao and Wang, Tonghan and Wu, Chengjie and Zhao, Qianchuan and Yang, Jun and Zhang, Chongjie},
  journal = {Advances in Neural Information Processing Systems},
  volume  = {34},
  year    = {2021}
}
