huggingface/candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-16 10:38:54 +00:00

Commit Graph

Author	SHA1	Message	Date
s-casci	51e577a682	Add Policy Gradient to Reinforcement Learning examples (#1500) * added policy_gradient, modified main, ddpg and README * fixed typo in README * removed unnecessary imports * small refactor * Use clap for picking up the subcommand to run. --------- Co-authored-by: Laurent <laurent.mazare@gmail.com>	2023-12-30 09:01:29 +01:00
Laurent Mazare	29c7f2565d	Add some reinforcement learning example. (#1090) * Add some reinforcement learning example. * Python initialization. * Get the example to run. * Vectorized gym envs for the atari wrappers. * Get some simulation loop to run.	2023-10-14 16:46:43 +01:00

2 Commits