Mirror of https://github.com/huggingface/candle.git
Synced 2025-06-16 18:48:51 +00:00

* Separate the prompt stats from the post-prompt ones in the quantized example.
* Slightly nicer output printing.
* Line up with the llama.cpp implementation.
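The split described above can be sketched as follows. This is a minimal, hypothetical illustration (not the candle implementation): prompt processing and post-prompt token generation are timed with separate `Instant` clocks so each phase can report its own token/s rate, as llama.cpp does. The `process_token` closure is a stand-in for a real model forward pass.

```rust
use std::time::Instant;

/// Runs a fake prompt phase and generation phase, returning the
/// (prompt token/s, generation token/s) rates measured separately.
fn timed_phases(prompt_len: usize, gen_len: usize) -> (f64, f64) {
    // Placeholder for a model forward pass on one token.
    let process_token = |_t: usize| {
        std::thread::sleep(std::time::Duration::from_millis(1));
    };

    // Prompt phase: all prompt tokens are processed up front.
    let start_prompt = Instant::now();
    for t in 0..prompt_len {
        process_token(t);
    }
    let prompt_dt = start_prompt.elapsed();

    // Post-prompt phase: one forward pass per generated token.
    let start_gen = Instant::now();
    for t in 0..gen_len {
        process_token(t);
    }
    let gen_dt = start_gen.elapsed();

    (
        prompt_len as f64 / prompt_dt.as_secs_f64(),
        gen_len as f64 / gen_dt.as_secs_f64(),
    )
}

fn main() {
    let (prompt_rate, gen_rate) = timed_phases(4, 8);
    println!("{:4} prompt tokens processed: {:8.2} token/s", 4, prompt_rate);
    println!("{:4} tokens generated: {:8.2} token/s", 8, gen_rate);
}
```

Reporting the two rates separately matters because prompt processing is typically batched and much faster per token than autoregressive generation, so a single combined average obscures both numbers.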