| Commit | Message | Date |
| --- | --- | --- |
| c0bdd9c7a6 | Use the fast RmsNorm in the quantized model. (#1904) | 2024-03-21 18:49:35 +01:00 |
| ff03fd3fb3 | Expose some helper functions to create quantized models. (#1837) | 2024-03-12 11:30:24 +01:00 |
| dd00482ea3 | Quantized version of the metavoice model. (#1824)<br>* Quantized version of the metavoice model.<br>* Integrate the quantized version of metavoice. | 2024-03-09 11:06:04 +01:00 |
| a11af79e23 | Add a quantized blip model. (#1155)<br>* Add a quantized blip model.<br>* Integrate the quantized blip model to the actual example. | 2023-10-22 20:33:25 +01:00 |
| 902d0b9166 | More model cloning. (#1126)<br>* More model cloning.<br>* More cloning on quantized models. | 2023-10-18 21:55:46 +01:00 |
| 86e7d539d2 | Add the quantized mpt model. (#1123)<br>* Add the quantized mpt model.<br>* Support the quantized model for replit-code. | 2023-10-18 16:29:38 +01:00 |
| 392fe02fba | Move the common quantized-nn code to a shared module. (#1063) | 2023-10-09 06:22:22 +01:00 |