mirror of https://github.com/huggingface/candle.git synced 2025-06-15 02:16:37 +00:00

Files

Laurent Mazare 50e49ecc5f Add a quantized version of recurrent-gemma. (#2054 )

* Add a quantized version of recurrent-gemma.

* Share the rglru part.

* Get the quantized gemma model to work.

2024-04-13 20:07:01 +02:00

main.rs

2024-04-13 20:07:01 +02:00

README.md

2024-04-13 00:05:21 +02:00

candle-recurrent-gemma

This model card corresponds to the 2B base version of the RecurrentGemma model huggingface model card.

cargo run --features cuda -r --example recurrent-gemma -- \
    --prompt "Write me a poem about Machine Learning."