candle/candle-examples/examples
Santiago Medina ace282e5c2 Add flag to run Moondream in f16 precision (#2015)
* moondream implementation

* add moondream example

* change config default activation

* Add assets and integrate phi mixformer with example

* Make use of kv cache and fix seq_len bug; Clean up example code

* Add README link to example

* Remove pos_embed scaling; Remove assets; Add to README; Expand VisionConfig

* Delete image

* Use apply instead of forward

* Use latest release special token; Fix token/s accuracy; Use GeluPytorchTanh in VisionConfig v2

* Add flag to use f16

* Avoid breaking the quantized version on CUDA.

---------

Co-authored-by: laurent <laurent.mazare@gmail.com>
2024-04-05 07:03:33 +02:00