
candle-mamba: Mamba implementation

Candle implementation of Mamba [1], for inference only. Mamba is an alternative to the transformer architecture. It leverages State Space Models (SSMs) with the goal of being computationally efficient on long sequences. The implementation is based on mamba.rs.

Compared to the mamba-minimal example, this version is far more efficient but only works for inference.
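To give some intuition for why a recurrent formulation suits inference, here is a minimal sketch of a state space recurrence in plain Rust. It is not the code used by this example: the `DiagonalSsm` type, its field names, and the single-channel diagonal layout are all hypothetical, and the parameters are fixed, whereas Mamba's selective SSM makes them depend on the input. The point is only that each new token updates a fixed-size state, so the cost per generated token does not grow with the sequence length.

```rust
/// Hypothetical single-channel SSM with a diagonal state matrix.
/// Names and shapes are illustrative assumptions, not candle's API.
struct DiagonalSsm {
    a: Vec<f32>, // per-state decay factor (already discretized)
    b: Vec<f32>, // input projection
    c: Vec<f32>, // output projection
}

impl DiagonalSsm {
    /// One recurrent step: h <- a * h + b * x, then y = sum(c * h).
    /// The state has a fixed size, so a step costs the same at any position.
    fn step(&self, state: &mut [f32], x: f32) -> f32 {
        let mut y = 0.0f32;
        for (((h, a), b), c) in state.iter_mut().zip(&self.a).zip(&self.b).zip(&self.c) {
            *h = a * *h + b * x;
            y += c * *h;
        }
        y
    }

    /// Processes a whole sequence by carrying the state from token to token.
    fn run(&self, xs: &[f32]) -> Vec<f32> {
        let mut state = vec![0.0f32; self.a.len()];
        xs.iter().map(|&x| self.step(&mut state, x)).collect()
    }
}
```

During generation, only `step` would be called for each new token, reusing the state left by the previous call instead of reprocessing the whole prompt.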

Running the example

$ cargo run --example mamba --release -- --prompt "Mamba is the"