mirror of https://github.com/huggingface/candle.git synced 2025-06-16 02:38:10 +00:00

Files

Laurent Mazare a9101700b6 Add a kv-cache to the quantized llama example. (#466 )

* Add a kv-cache to the quantized llama example.

* Also print the prompt.

* Bugfix in q6k dequantizing.

* Another bugfix.

2023-08-16 14:28:42 +01:00

Cudnn support (#445 )

2023-08-14 21:30:41 +01:00

2023-08-16 14:28:42 +01:00

2023-08-16 12:41:07 +01:00

Cargo.toml

2023-08-15 10:48:57 +01:00

LICENSE

Refactor the hierarchy.

2023-06-27 11:57:27 +02:00

README.md

Refactor the hierarchy.

2023-06-27 11:57:27 +02:00

candle

Minimalist ML framework for Rust