c930ab7e1a
upgrade half library to fix rand ( #2806 )
...
fix lints
2025-03-14 09:01:54 +01:00
62ced44ea9
Add a Context trait similar to anyhow::Context. ( #2676 )
...
* Add a Context trait similar to anyhow::Context.
* Switch two unwrap to context.
2024-12-22 09:18:13 +01:00
3159f91b90
20241118 docs ( #2629 )
...
* module docs
* varbuilder gguf docs
* add a link to gguf files
* small additonal mod doc titles
* safetensor docs
* more core docs
* more module docs in canlde_core
* 2 more link fixes
2024-11-19 04:07:07 +01:00
f48c07e242
Include topk sampling in the quantized example. ( #2005 )
...
* Include topk sampling in the quantized example.
* Also sample with top-k on the mistral side.
2024-04-04 09:27:54 +02:00
5e70821dd0
Allow for arbitrary temperature modifications.
2024-03-23 15:47:39 +01:00
a62a97340c
Add topk sampling. ( #1923 )
2024-03-23 15:26:09 +01:00
0a647875ec
Use softmax-last-dim in the quantized example. ( #848 )
2023-09-14 17:29:24 +01:00
805bf9ffa7
Implement top_p / nucleus sampling ( #819 )
...
* Implement top_p / nucleus sampling
* Update changelog
* rustfmt
* Add tests
* Fix clippy warning
* Fix another clippy error
2023-09-12 18:10:16 +02:00
912561614f
Better handling of zero temperatures. ( #532 )
2023-08-21 07:51:46 +01:00
3eb2bc6d07
Softmax numerical stability. ( #267 )
...
* Softmax numerical stability.
* Fix the flash-attn test.
2023-07-28 13:13:01 +01:00
ba35d895e7
Sketch the candle-transformers crate. ( #147 )
...
* Sketch the candle-transformers crate.
* Format the empty files.
2023-07-12 13:49:31 +01:00