|
94aa234dfd
|
Add the kv-cache to the whisper wasm version. (#689)
* Add the kv-cache to the whisper wasm version.
* Improve the handling of special tokens.
|
2023-08-31 09:37:44 +01:00 |
|
|
1d0bb48fae
|
Improve Whisper WASM UI example (#669)
* wip add module and js worker example
* params
* clean up, send error
* final UI with whisper webworker
* add simple instructions
|
2023-08-30 20:35:41 +02:00 |
|
|
72ebb12bca
|
Remove some dead-code annotations. (#629)
* Remove some dead-code annotations.
* More dead code removal.
* One more.
* CI fix.
|
2023-08-27 18:52:33 +01:00 |
|
|
c84883ecf2
|
Add a cuda kernel for upsampling. (#441)
* Add a cuda kernel for upsampling.
* Update for the latest tokenizers version.
|
2023-08-14 13:12:17 +01:00 |
|
|
3eb2bc6d07
|
Softmax numerical stability. (#267)
* Softmax numerical stability.
* Fix the flash-attn test.
|
2023-07-28 13:13:01 +01:00 |
|
|
1735e4831e
|
TP sharding v2
|
2023-07-27 09:58:14 +02:00 |
|
|
5a26cba733
|
Re-organize the wasm examples (#231)
* Move the whisper example.
* More renaming.
* Add llama2 as a new wasm example.
* Live generation.
* More of the llama wasm example.
* Formatting.
|
2023-07-24 12:36:02 +01:00 |
|