This website requires JavaScript.
Explore
Help
Register
Sign In
huggingface
/
candle
Watch
1
Star
0
Fork
0
You've already forked candle
mirror of
https://github.com/huggingface/candle.git
synced
2025-06-21 12:20:46 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
d54e02d73de3391b34d4511aa7add32f9cffd4f0
candle
/
candle-transformers
/
src
History
Laurent Mazare
d54e02d73d
Avoid a contiguous call in the quantized phi 3 model. (
#2209
)
...
* Simplify the KvCache api. * Avoid a contiguous call in the quantized phi3 model.
2024-05-23 21:24:55 +02:00
..
generation
Include topk sampling in the quantized example. (
#2005
)
2024-04-04 09:27:54 +02:00
models
Avoid a contiguous call in the quantized phi 3 model. (
#2209
)
2024-05-23 21:24:55 +02:00
pipelines
Sketch the candle-transformers crate. (
#147
)
2023-07-12 13:49:31 +01:00
lib.rs
Move the common quantized-nn code to a shared module. (
#1063
)
2023-10-09 06:22:22 +01:00
object_detection.rs
Move more models to candle-transformers (
#796
)
2023-09-10 10:20:18 +01:00
quantized_nn.rs
Use the fast RmsNorm in the quantized model. (
#1904
)
2024-03-21 18:49:35 +01:00
quantized_var_builder.rs
Add a quantized version of recurrent-gemma. (
#2054
)
2024-04-13 20:07:01 +02:00
utils.rs
Use cat for faster MQA computation. (
#2043
)
2024-04-12 09:15:10 +02:00