Mirror of https://github.com/huggingface/candle.git (synced 2025-06-14 01:48:08 +00:00)

candle-recurrent-gemma
This example corresponds to the 2B base version of the RecurrentGemma model; see the model's Hugging Face model card for details.
cargo run --features cuda -r --example recurrent-gemma -- \
--prompt "Write me a poem about Machine Learning."
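On a machine without an NVIDIA GPU, the same example can presumably be built without the `cuda` feature, in which case it should run on the CPU. The sketch below rests on that assumption and is not a command taken from this README. The `USE_CUDA` variable and the `recurrent_gemma_cmd` helper are hypothetical names, and the function only prints the command so it can be inspected before running.

```shell
#!/bin/sh
# Hypothetical helper: print the cargo command for the recurrent-gemma example.
# USE_CUDA=1 (an assumed variable name) adds the cuda feature; any other value omits it.
recurrent_gemma_cmd() {
    prompt="$1"
    features=""
    if [ "${USE_CUDA:-0}" = "1" ]; then
        features="--features cuda "
    fi
    # Only prints the command; nothing is built or executed here.
    printf 'cargo run %s-r --example recurrent-gemma -- --prompt "%s"\n' "$features" "$prompt"
}
```

Calling `recurrent_gemma_cmd "Write me a poem about Machine Learning."` prints the CPU variant, and prefixing the call with `USE_CUDA=1` prints the CUDA variant shown above on a single line.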