mirror of
https://github.com/huggingface/candle.git
synced 2025-06-16 02:38:10 +00:00
549 B
549 B
candle-quantized-qwen3
Qwen3 is an upgraded version of Qwen2.5, released by Alibaba Cloud.
Running the example
cargo run --example quantized-qwen3 --release -- --prompt "Write a function to count prime numbers up to N."
0.6b is used by default, 1.7b, 4b, 8b, 14b, and 32b models are available via --which
argument.
cargo run --example quantized-qwen3 --release -- --which 4b --prompt "A train is travelling at 120mph, how far does it travel in 3 minutes 30 seconds?"