Fix docs quantized qwen3 (#2955)

* fixed docs quantized-qwen3 README

* fixed docs quantized-qwen2-instruct README
MaCAT
2025-05-15 14:58:03 +09:00
committed by GitHub
parent 450a49ed1a
commit 9ce4fe6194
2 changed files with 12 additions and 2 deletions

@@ -8,4 +8,8 @@
 cargo run --example quantized-qwen2-instruct --release -- --prompt "Write a function to count prime numbers up to N."
 ```
-0.5b, 1.5b, 7b and 72b models are available via `--model` argument.
+0.5b, 1.5b, 7b and 72b models are available via `--which` argument.
+```bash
+cargo run --release --example quantized-qwen2-instruct -- --which 0.5b --prompt "Write a function to count prime numbers up to N."
+```

@@ -8,4 +8,10 @@
 cargo run --example quantized-qwen3 --release -- --prompt "Write a function to count prime numbers up to N."
 ```
-0.6b is used by default, 1.7b, 4b, 8b, 14b, and 32b models are available via `--model` argument.
+0.6b is used by default, 1.7b, 4b, 8b, 14b, and 32b models are available via `--which` argument.
+```bash
+cargo run --example quantized-qwen3 --release -- --which 4b --prompt "A train is travelling at 120mph, how far does it travel in 3 minutes 30 seconds?"
+```