mirror of
https://github.com/huggingface/candle.git
synced 2025-06-19 19:58:35 +00:00
Add the mimi audio-tokenizer. (#2488)
* Add the mimi audio-tokenizer. * Formatting tweaks. * Add a full example. * Use the transformers names. * More renamings. * Get encoding and decoding to work. * Clippy fixes.
This commit is contained in:
20
candle-examples/examples/mimi/README.md
Normal file
20
candle-examples/examples/mimi/README.md
Normal file
@ -0,0 +1,20 @@
|
||||
# candle-mimi
|
||||
|
||||
[Mimi](https://huggingface.co/kyutai/mimi) is a state of the art audio
|
||||
compression model using an encoder/decoder architecture with residual vector
|
||||
quantization. The candle implementation supports streaming meaning that it's
|
||||
possible to encode or decode a stream of audio tokens on the flight to provide
|
||||
low latency interaction with an audio model.
|
||||
|
||||
## Running one example
|
||||
|
||||
Generating some audio tokens from an audio files.
|
||||
```bash
|
||||
wget https://github.com/metavoiceio/metavoice-src/raw/main/assets/bria.mp3
|
||||
cargo run --example mimi --features mimi --release -- audio-to-code bria.mp3 bria.safetensors
|
||||
```
|
||||
|
||||
And decoding the audio tokens back into a sound file.
|
||||
```bash
|
||||
cargo run --example mimi --features mimi --release -- code-to-audio bria.safetensors bria.wav
|
||||
```
|
Reference in New Issue
Block a user