mirror of
https://github.com/huggingface/candle.git
synced 2025-06-20 12:06:35 +00:00
add quantized qwen2 (#2329)
* add quantized version of qwen2 and corresponding example for qwen2-instruct
* fix quantized qwen2 clippy error
11
candle-examples/examples/quantized-qwen2-instruct/README.md
Normal file
@@ -0,0 +1,11 @@
# candle-quantized-qwen2-instruct
[Qwen2](https://qwenlm.github.io/blog/qwen2/) is an upgraded version of Qwen1.5, released by Alibaba Cloud.
## Running the example
```bash
cargo run --example quantized-qwen2-instruct --release -- --prompt "Write a function to count prime numbers up to N."
```
The 0.5b, 1.5b, 7b and 72b models are available via the `--model` argument.
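
As a rough sketch of selecting a model size, the snippet below builds the invocation for the 7b variant and prints it for inspection. It assumes that `--model` takes the size label exactly as written above (for example `7b`); the variable names `MODEL`, `PROMPT` and `CMD` are only illustrative and not part of the example itself.

```bash
# Assumption: --model accepts the size label as listed (0.5b, 1.5b, 7b, 72b).
MODEL=7b
PROMPT="Write a function to count prime numbers up to N."

# Build the invocation as a string first so it can be checked before running.
CMD="cargo run --example quantized-qwen2-instruct --release -- --model $MODEL --prompt \"$PROMPT\""

# Print the command that would be executed.
echo "$CMD"
```

To actually run it, execute the printed command from the root of the candle repository.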