mirror of
https://github.com/huggingface/candle.git
synced 2025-06-15 02:16:37 +00:00

* Add a quantized version of recurrent-gemma. * Share the rglru part. * Get the quantized gemma model to work.
candle-recurrent-gemma
This model card corresponds to the 2B base version of the RecurrentGemma model huggingface model card.
cargo run --features cuda -r --example recurrent-gemma -- \
--prompt "Write me a poem about Machine Learning."