Commit Graph

17 Commits

Author SHA1 Message Date
c3a0761e62 Add some tracing to the whisper example. (#375) 2023-08-09 19:58:36 +01:00
3eb2bc6d07 Softmax numerical stability. (#267)
* Softmax numerical stability.

* Fix the flash-attn test.
2023-07-28 13:13:01 +01:00
43c7223292 Rename the .r functions to .dims so as to be a bit more explicit. (#220) 2023-07-22 10:39:27 +01:00
eae646d322 Use arange in the examples. (#146) 2023-07-12 12:12:34 +01:00
b46c28a2ac VarBuilder path creation (#131)
* Use a struct for the safetensor+routing.

* Group the path and the var-builder together.

* Fix for the empty path case.
2023-07-10 22:37:34 +01:00
1aa7fbbc33 Move the var-builder in a central place. (#130) 2023-07-10 20:49:50 +01:00
89a5b602a6 Move the conv1d layer to candle_nn. (#117) 2023-07-10 11:02:06 +01:00
b06e1a7e54 [nn] Move the Embedding and Activation parts. (#116)
* Share the Embedding and Activation parts.

* Tweak some activations.
2023-07-10 10:24:52 +01:00
9ce0f1c010 Sketch the candle-nn crate. (#115)
* Sketch the candle-nn crate.

* Tweak the cuda dependencies.

* More cuda tweaks.
2023-07-10 08:50:09 +01:00
cd230d26fe Whisper tweaks (#85)
* Isolate the decoding bits of the whisper example.

* Decode -> Decoder.

* Add the suppress tokens filter.

* More suppress tokens.
2023-07-06 09:13:20 +01:00
d3418f1cff Add the original whisper names as comment. 2023-07-06 07:57:03 +01:00
19ab5ea411 Merge pull request #78 from LaurentMazare/whisper_update
Adapting whisper for Hub use.
2023-07-06 07:21:58 +01:00
2c3d871b2e Add a simpler way to specify the dim index for some ops. 2023-07-05 20:22:43 +01:00
e85573a4bd Adapting whisper for Hub use. 2023-07-05 14:35:27 +00:00
bae6d07b7e Fix the position embeddings size. 2023-07-05 13:43:34 +01:00
9694e35db0 Clean the decode loop of the whisper example. 2023-07-05 08:37:26 +01:00
8cf803d1a3 Split the model in a separate file. 2023-07-05 05:46:53 +01:00