Blip attention mask + readme (#1146)

* Add the attention mask to the blip model.

* Add a readme.
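
For readers unfamiliar with the first item: this page does not show the mask code itself, so the following is only a minimal sketch of what an attention mask in a text decoder typically does, assuming a causal (lower-triangular) mask. The function names `causal_mask`, `softmax`, and `masked_attention_weights` are hypothetical and use plain `Vec<f32>` rather than candle tensors; they are not taken from this commit.

```rust
/// Build a causal attention mask for `seq_len` positions (hypothetical helper).
/// Entry (i, j) is 0.0 when query position i may attend to key position j
/// (j <= i) and negative infinity otherwise, so that adding the mask to the
/// attention scores before the softmax gives future positions zero weight.
fn causal_mask(seq_len: usize) -> Vec<Vec<f32>> {
    (0..seq_len)
        .map(|i| {
            (0..seq_len)
                .map(|j| if j <= i { 0.0 } else { f32::NEG_INFINITY })
                .collect()
        })
        .collect()
}

/// Numerically stable softmax over one row of (already masked) scores.
fn softmax(row: &[f32]) -> Vec<f32> {
    let max = row.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    let exps: Vec<f32> = row.iter().map(|x| (x - max).exp()).collect();
    let sum: f32 = exps.iter().sum();
    exps.iter().map(|e| e / sum).collect()
}

/// Add the causal mask to square raw scores and normalise each row.
fn masked_attention_weights(scores: &[Vec<f32>]) -> Vec<Vec<f32>> {
    let mask = causal_mask(scores.len());
    scores
        .iter()
        .zip(mask.iter())
        .map(|(s, m)| {
            let masked: Vec<f32> = s.iter().zip(m.iter()).map(|(a, b)| a + b).collect();
            softmax(&masked)
        })
        .collect()
}
```

With such a mask, each generated token only attends to itself and earlier tokens, which is the usual requirement for autoregressive caption generation.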
Laurent Mazare
2023-10-21 22:44:13 +01:00
committed by GitHub
parent 2531b13bf8
commit 3115fe42e4
2 changed files with 68 additions and 13 deletions


@@ -0,0 +1,19 @@
# candle-blip

The
[blip-image-captioning](https://huggingface.co/Salesforce/blip-image-captioning-base)
model can generate captions for an input image.

## Running on an example

```bash
cargo run --example blip --release -- --image candle-examples/examples/yolo-v8/assets/bike.jpg
```
```
Running on CPU, to run on GPU, build this example with `--features cuda`
loaded image Tensor[dims 3, 384, 384; f32]
model built
several cyclists are riding down a road with cars behind them
```
![Leading group, Giro d'Italia 2021](../yolo-v8/assets/bike.jpg)