Laurent Mazare 50e49ecc5f Add a quantized version of recurrent-gemma. (#2054)
* Add a quantized version of recurrent-gemma.

* Share the rglru part.

* Get the quantized gemma model to work.
2024-04-13 20:07:01 +02:00

candle-recurrent-gemma

This example runs the 2B base version of the RecurrentGemma model. See the model's Hugging Face model card for details.

cargo run --features cuda -r --example recurrent-gemma -- \
    --prompt "Write me a poem about Machine Learning."