Improved mamba model optimized for inference (#1694)

* Sketch the mamba model for inference.

* Complete the forward pass.

* Add the mamba example.

* Optimize the selective-scan part.

* Fix a couple shape mismatches and get inference to work.

* Tweak the readmes.

* More readme tweaks.
This commit is contained in:
Laurent Mazare
2024-02-11 17:04:57 +01:00
committed by GitHub
parent 74497e6bf7
commit 1e26d539d9
6 changed files with 533 additions and 2 deletions

View File

@ -2,6 +2,9 @@
This is based on [mamba-minimal](https://github.com/johnma2006/mamba-minimal).
Compared to the mamba example, this version can handle training but is much
slower.
## Running the example
```bash