Files
candle/candle-examples/examples/reinforcement-learning
Laurent Mazare ad73e93da2 Detach the tensors on batch-norm eval. (#1702)
* Detach the tensors on batch-norm eval.

* Fix pyo3 bindings.

* Black tweak.

* Formatting.

* Also update the pyo3-onnx formatting.

* Apply black.
2024-02-13 14:26:32 +01:00
..
2023-12-17 05:20:05 -06:00

candle-reinforcement-learning

Reinforcement Learning examples for candle.

This has been tested with gymnasium version 0.29.1. You can install the Python package with:

pip install "gymnasium[accept-rom-license]"

In order to run the examples, use the following commands. Note the additional --package flag to ensure that there is no conflict with the candle-pyo3 crate.

For the Policy Gradient example:

cargo run --example reinforcement-learning --features=pyo3 --package candle-examples -- pg

For the Deep Deterministic Policy Gradient example:

cargo run --example reinforcement-learning --features=pyo3 --package candle-examples -- ddpg