Cudnn support (#445)

* Add a cudnn feature to be used for conv2d.

* Allocate the proper workspace.

* Only create a single cudnn handle per cuda device.

* Proper cudnn usage.

* Bugfix.
This commit is contained in:
Laurent Mazare
2023-08-14 21:30:41 +01:00
committed by GitHub
parent c84883ecf2
commit 90374097dc
7 changed files with 195 additions and 12 deletions

View File

@ -28,7 +28,7 @@ Check out our [examples](./candle-examples/examples/):
- [StarCoder](./candle-examples/examples/bigcode/): LLM specialized to code
generation.
- [Stable Diffusion](./candle-examples/examples/stable-diffusion/): text to
image generative model, only cpu support at the moment and on the slow side.
image generative model, yet to be optimized.
Run them using the following commands:
```