mirror of
https://github.com/huggingface/candle.git
synced 2025-06-15 02:16:37 +00:00

* Add the mimi audio-tokenizer. * Formatting tweaks. * Add a full example. * Use the transformers names. * More renamings. * Get encoding and decoding to work. * Clippy fixes.
21 lines
781 B
Markdown
21 lines
781 B
Markdown
# candle-mimi
|
|
|
|
[Mimi](https://huggingface.co/kyutai/mimi) is a state of the art audio
|
|
compression model using an encoder/decoder architecture with residual vector
|
|
quantization. The candle implementation supports streaming meaning that it's
|
|
possible to encode or decode a stream of audio tokens on the flight to provide
|
|
low latency interaction with an audio model.
|
|
|
|
## Running one example
|
|
|
|
Generating some audio tokens from an audio files.
|
|
```bash
|
|
wget https://github.com/metavoiceio/metavoice-src/raw/main/assets/bria.mp3
|
|
cargo run --example mimi --features mimi --release -- audio-to-code bria.mp3 bria.safetensors
|
|
```
|
|
|
|
And decoding the audio tokens back into a sound file.
|
|
```bash
|
|
cargo run --example mimi --features mimi --release -- code-to-audio bria.safetensors bria.wav
|
|
```
|