* Support for attention bias in gemma + refactor things a bit. * Fix the cuda tests.
Minimalist ML framework for Rust