Quantized support for stable-lm2. (#1654)

* Quantized support for stable-lm2.

* Quantized support for v2-zephyr.
Laurent Mazare
2024-02-04 11:57:05 +01:00
committed by GitHub
parent 58cc896e69
commit 50be8a98ba
3 changed files with 36 additions and 10 deletions


@@ -10,7 +10,9 @@ order to be able to use it.
 Other available models are Stable-Code-3B, StableLM-2 and Zephyr variants.
-StableLM-2 uses a Tiktoken based GPT-3.5/GPT-4 tokenizer not supported by Candle, so to run it you can download a somewhat compatible [tokenizer.json](https://huggingface.co/Xenova/gpt-4/resolve/main/tokenizer.json?download=true)
+StableLM-2 uses a Tiktoken based GPT-3.5/GPT-4 tokenizer not supported by
+Candle, so to run it you can download a somewhat compatible
+[tokenizer.json](https://huggingface.co/Xenova/gpt-4/resolve/main/tokenizer.json?download=true)
 and pass it via the --tokenizer-file argument.
 ## Running some example
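The tokenizer workaround described in this hunk can be sketched as a short shell session. The download URL and the --tokenizer-file flag come from the README text above; the example name `stablelm` and the prompt flag are assumptions about this repository's CLI, not confirmed by the diff.

```shell
# Fetch the somewhat-compatible GPT-4 tokenizer.json mentioned above.
curl -L -o tokenizer.json \
  "https://huggingface.co/Xenova/gpt-4/resolve/main/tokenizer.json?download=true"

# Point the example at the downloaded tokenizer. Only --tokenizer-file is
# documented in the text above; the example name and --prompt flag here are
# hypothetical.
cargo run --example stablelm --release -- \
  --tokenizer-file tokenizer.json \
  --prompt "What is the capital of France?"
```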