mirror of
https://github.com/huggingface/candle.git
synced 2025-06-19 03:54:56 +00:00

* Return the metadata in the gguf pyo3 bindings. * Read the metadata in the quantized llama example. * Get inference to work on gguf files.
From the candle-pyo3
directory, enable a virtual env where you will want the
candle package to be installed then run.
maturin develop
python test.py