candle-quantized-qwen2-instruct
Qwen2 is an upgraded version of Qwen1.5, released by Alibaba Cloud.
Running the example
cargo run --release --example quantized-qwen2-instruct -- --prompt "Write a function to count prime numbers up to N."
The 0.5b, 1.5b, 7b, and 72b models can be selected with the --which argument:
cargo run --release --example quantized-qwen2-instruct -- --which 0.5b --prompt "Write a function to count prime numbers up to N."