|
678f64dd27
|
Fix token generation in bilingual models (non-English outputs) (#1668)
Co-authored-by: Guoqing Bao <guoqing.bao@enflame-tech.com>
|
2024-02-06 12:03:53 +01:00 |
|
37c539f2b7
|
Helper function to load sharded safetensors files (#1481)
* Fix the quantized mistral example.
* Add a helper function to load sharded safetensors weights.
* Use the sharded loader.
|
2023-12-25 21:49:21 +01:00 |
|
7c3cfd1086
|
Use the llama weight names for the Yi example. (#1381)
|
2023-11-27 20:42:52 +00:00 |
|
a007f8fdb4
|
Add the Yi-6b and Yi-34b models. (#1320)
* Add the Yi-6b model.
* Add the 34b model.
* Add the yi example.
* Fix the weight file names.
|
2023-11-11 12:00:48 +01:00 |
|