mirror of
https://github.com/huggingface/candle.git
synced 2025-06-21 04:10:46 +00:00

* Load t5 decoder * Run enc, dec, and lm head, but no cross attn * Cross-attention over key_value_states * New arg for decoder input ids * Add mask, don't forward position biases through decoder * Update t5 examples * Clippy + rustfmt