Files
candle/candle-examples/examples/quantized-qwen2-instruct
MaCAT 9ce4fe6194 Fix docs quantized qwen3 (#2955)
* fixed docs quantized-qwen3 README

* fixed docs quantized-qwen2-instruct README
2025-05-15 07:58:03 +02:00
..
2025-05-15 07:58:03 +02:00

candle-quantized-qwen2-instruct

Qwen2 is an upgraded version of Qwen1.5, released by Alibaba Cloud.

Running the example

cargo run --example quantized-qwen2-instruct --release -- --prompt "Write a function to count prime numbers up to N."

0.5b, 1.5b, 7b and 72b models are available via --which argument.

 cargo run --release --example quantized-qwen2-instruct --   --which 0.5b   --prompt "Write a function to count prime numbers up to N."