Cudnn support (#445)

* Add a cudnn feature to be used for conv2d. * Allocate the proper workspace. * Only create a single cudnn handle per cuda device. * Proper cudnn usage. * Bugfix.
2025-06-22 04:22:50 +00:00 · 2023-08-14 21:30:41 +01:00
parent c84883ecf2
commit 90374097dc
7 changed files with 195 additions and 12 deletions
--- a/README.md
+++ b/README.md
@ -28,7 +28,7 @@ Check out our [examples](./candle-examples/examples/):
 - [StarCoder](./candle-examples/examples/bigcode/): LLM specialized to code
  generation.
 - [Stable Diffusion](./candle-examples/examples/stable-diffusion/): text to
-  image generative model, only cpu support at the moment and on the slow side.
+  image generative model, yet to be optimized.

 Run them using the following commands:
 ```