Logo
Explore Help
Register Sign In
huggingface/candle
1
0
Fork 0
You've already forked candle
mirror of https://github.com/huggingface/candle.git synced 2025-06-20 20:09:50 +00:00
Code Issues Packages Projects Releases Wiki Activity
Files
8ad822a9831ea3d0970783daa2ca20811b2f8de1
candle/candle-transformers/src
History
Laurent Mazare 8ad822a983 Add a function to clear the KV cache in falcon. (#2066)
* Add a function to clear the KV cache in falcon.

* Clippy.
2024-04-15 09:29:25 +02:00
..
generation
Include topk sampling in the quantized example. (#2005)
2024-04-04 09:27:54 +02:00
models
Add a function to clear the KV cache in falcon. (#2066)
2024-04-15 09:29:25 +02:00
pipelines
Sketch the candle-transformers crate. (#147)
2023-07-12 13:49:31 +01:00
lib.rs
Move the common quantized-nn code to a shared module. (#1063)
2023-10-09 06:22:22 +01:00
object_detection.rs
Move more models to candle-transformers (#796)
2023-09-10 10:20:18 +01:00
quantized_nn.rs
Use the fast RmsNorm in the quantized model. (#1904)
2024-03-21 18:49:35 +01:00
quantized_var_builder.rs
Add a quantized version of recurrent-gemma. (#2054)
2024-04-13 20:07:01 +02:00
utils.rs
Use cat for faster MQA computation. (#2043)
2024-04-12 09:15:10 +02:00
Powered by Gitea Version: 1.24.2 Page: 546ms Template: 5ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API