mirror of
https://github.com/huggingface/candle.git
synced 2025-06-15 10:26:33 +00:00

Codellama requires bf16 for now (error to convert from bf16 to f16). Multiprocess demo not functional for it because flash-attn only supports f16 for now.