mirror of
https://github.com/huggingface/candle.git
synced 2025-06-20 12:06:35 +00:00
Quantized version of flux. (#2500)
* Quantized version of flux.
* More generic sampling.
* Hook the quantized model.
* Use the newly minted gguf file.
* Fix for the quantized model.
* Default to avoid the faster cuda kernels.
@@ -13,7 +13,7 @@ descriptions,
 ```bash
 cargo run --features cuda --example flux -r -- \
-    --height 1024 --width 1024
+    --height 1024 --width 1024 \
     --prompt "a rusty robot walking on a beach holding a small torch, the robot has the word "rust" written on it, high quality, 4k"
 ```
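The README change in this hunk only adds a trailing backslash, so it may help to see why that matters. The sketch below is an illustration, not part of the commit: `count_args` is a hypothetical stand-in for the real `flux` example binary, and the shortened prompt is an assumption made for brevity. It shows that without the line continuation the shell ends the command at the newline, so `--prompt` never reaches the program.

```bash
# Hypothetical stand-in for the example binary: prints how many arguments it received.
count_args() { echo "$#"; }

# With the trailing backslash, both lines are parsed as a single command.
joined=$(count_args --height 1024 --width 1024 \
    --prompt "a rusty robot")

# Without it, the command ends at the newline. The next line is then run as a
# separate command named "--prompt", which fails; the error is silenced here.
split=$(
    count_args --height 1024 --width 1024
    --prompt "a rusty robot" 2>/dev/null || true
)

echo "with backslash: $joined arguments"      # 6
echo "without backslash: $split arguments"    # 4
```

With the old README text, the example would therefore have started without a prompt argument and the shell would then have reported an unknown command for the `--prompt` line.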