|
3ad4770eb6
|
Use cat for faster MQA computation. (#2043)
* Use cat for faster MQA computation.
* Move the function to utils + use it in mistral.
* Use the shared repeat-kv in a few more models.
* Fix.
|
2024-04-12 09:15:10 +02:00 |
|
|
90fc82211f
|
Use a common with_tracing::RmsNorm in a few models. (#1871)
* Add RmsNorm with tracing.
* Use with_tracing::RmsNorm in some models.
|
2024-03-18 21:40:06 +01:00 |
|
|
7c3cfd1086
|
Use the llama weight names for the Yi example. (#1381)
|
2023-11-27 20:42:52 +00:00 |
|
|
a007f8fdb4
|
Add the Yi-6b and Yi-34b models. (#1320)
* Add the Yi-6b model.
* Add the 34b model.
* Add the yi example.
* Fix the weight file names.
|
2023-11-11 12:00:48 +01:00 |
|