mirror of https://github.com/huggingface/candle.git synced 2025-06-16 18:48:51 +00:00

Santiago Medina ace282e5c2 Add flag to run Moondream in f16 precision (#2015)

* moondream implementation

* add moondream example

* change config default activation

* Add assets and integrate phi mixformer with example

* Make use of kv cache and fix seq_len bug; Clean up example code

* Add README link to example

* Remove pos_embed scaling; Remove assets; Add to README; Expand VisionConfig

* Delete image

* Use apply instead of forward

* Use latest release special token; Fix token/s accuracy; Use GeluPytorchTanh in VisionConfig v2

* Add flag to use f16

* Avoid breaking the quantized version on cuda.

---------

Co-authored-by: laurent <laurent.mazare@gmail.com>

2024-04-05 07:03:33 +02:00

examples

Add flag to run Moondream in f16 precision (#2015)

2024-04-05 07:03:33 +02:00

src

Normalize loudness of the generated audio (#1795)

2024-03-03 14:00:42 +01:00

build.rs

Pin the revision used for phi-v2 + make it the default. (#1572)

2024-01-12 09:19:30 +01:00

Cargo.toml

Update for pyo3 0.21. (#1985)

2024-04-01 17:07:02 +02:00

README.md

…

README.md

candle-examples