candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-18 19:47:12 +00:00

Files

Laurent Mazare a9101700b6 Add a kv-cache to the quantized llama example. (#466 )

* Add a kv-cache to the quantized llama example.

* Also print the prompt.

* Bugfix in q6k dequantizing.

* Another bugfix.

2023-08-16 14:28:42 +01:00

main.rs

2023-08-16 14:28:42 +01:00