mirror of
https://github.com/huggingface/candle.git
synced 2025-06-20 20:09:50 +00:00
Add the flux model for image generation. (#2390)
* Add the flux autoencoder. * Add the encoder down-blocks. * Upsampling in the decoder. * Sketch the flow matching model. * More flux model. * Add some of the positional embeddings. * Add the rope embeddings. * Add the sampling functions. * Add the flux example. * Fix the T5 bits. * Proper T5 tokenizer. * Clip encoder path fix. * Get the clip embeddings. * No configurable weights in layer norm. * More weights related fixes. * Yet another shape fix. * DType fix. * Fix a couple more shape issues. * DType fixes. * Fix the latent dims. * Fix more shape issues. * Autoencoder fixes. * Get some generations out. * Bugfix. * T5 padding. * Clippy fix. * Add the decode only mode. * Fix. * More fixes. * Finally get some generations to work. * Add readme.
This commit is contained in:
19
candle-examples/examples/flux/README.md
Normal file
19
candle-examples/examples/flux/README.md
Normal file
@ -0,0 +1,19 @@
|
||||
# candle-flux: image generation with latent rectified flow transformers
|
||||
|
||||

|
||||
|
||||
Flux is a 12B rectified flow transformer capable of generating images from text
|
||||
descriptions,
|
||||
[huggingface](https://huggingface.co/black-forest-labs/FLUX.1-schnell),
|
||||
[github](https://github.com/black-forest-labs/flux),
|
||||
[blog post](https://blackforestlabs.ai/announcing-black-forest-labs/).
|
||||
|
||||
|
||||
## Running the model
|
||||
|
||||
```bash
|
||||
cargo run --features cuda --example flux -r -- \
|
||||
--height 1024 --width 1024
|
||||
--prompt "a rusty robot walking on a beach holding a small torch, the robot has the word "rust" written on it, high quality, 4k"
|
||||
```
|
||||
|
Reference in New Issue
Block a user