Skip to content

Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy

License

Notifications You must be signed in to change notification settings

brownirl/lambda_discrepancy

Repository files navigation

λ-Discrepancy

Code for the paper Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy.

For set up, make a virtual environment with Python version >=3.11, and

pip install -r requirements.txt

To run the analytical experiments, the entry point is batch_run_analytical.py.

The RNN experiments have the entry point batch_run_ppo.py.

Instructions to run hyperparameter sweeps are in scripts/experiments.md.

We also include the data used to generate our results in results/analytical_results.csv and results/rnn_results.csv.zip, which you can unzip for the CSV.

About

Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published