2d3fcad267
Simplify usage of the pool functions. ( #662 )
...
* Simplify usage of the pool functions.
* Small tweak.
* Attempt at using apply to simplify the convnet definition.
2023-08-29 19:12:16 +01:00
b31d41e26a
Add a convnet training example. ( #661 )
...
* Add a convnet example.
* Dataset fix.
* Randomize batches.
2023-08-29 18:23:01 +01:00
71221559d3
Fix the dilated convolutions. ( #659 )
2023-08-29 16:37:42 +01:00
a044907ffc
Dilated convolutions ( #657 )
...
* Add the dilation parameter.
* Restore the basic optimizer example.
* Dilation support in cudnn.
* Use the dilation parameter in the cpu backend.
* More dilation support.
* No support for dilation in transposed convolutions.
* Add dilation to a test.
* Remove a print.
* Helper function.
2023-08-29 16:12:11 +01:00
ee8bb1bde1
Add avx
implemenetations of q2k
, q3k
and q5k
vec-dot functions ( #654 )
...
* `q2k` avx implementation
* `q3k` avx implementation
* `q5k` avx implementation
* `avx` make masks constant
* clippy stuff
2023-08-29 13:35:56 +01:00
3d2d3c7edb
Merge pull request #658 from huggingface/upgrade_hf_hub2
...
Upgrading hf-hub (for windows support, removing symlink requirement).
2023-08-29 14:32:15 +02:00
1aca6fa291
Upgrading hf-hub.
2023-08-29 14:18:54 +02:00
4ed202447e
Upgrading hf-hub.
2023-08-29 14:14:26 +02:00
1d6bff53fc
Changelog update. ( #656 )
2023-08-29 12:55:56 +01:00
14b4d456e8
Merge pull request #439 from huggingface/training_hub_dataset
...
[Book] Add small error management + start training (with generic dataset inclusion).
2023-08-29 13:10:05 +02:00
2d5b7a735d
Update the book with new layout of datasets.
2023-08-29 12:51:59 +02:00
62ef494dc1
Use multiple transformer layer in the same cross-attn blocks. ( #653 )
...
* Use multiple transformer layer in the same cross-attn blocks.
* Make the context contiguous if required.
2023-08-29 11:13:43 +01:00
d0a330448d
Backprop support for pooling ops. ( #652 )
...
* Backprop support for pooling ops.
* max-pool gradient.
2023-08-29 10:17:59 +01:00
4b8d57ba15
AVX version of the q4k vecdot. ( #651 )
2023-08-29 09:41:17 +01:00
d5a525f7a7
Fix clippy + save_image.
2023-08-29 10:19:44 +02:00
33c23c19b6
Preliminary support for SDXL. ( #647 )
...
* Preliminary support for SDXL.
* More SDXL support.
* More SDXL.
* Use the proper clip config.
* Querying for existing tensors.
* More robust test.
2023-08-29 09:00:04 +01:00
49326fb925
Update .gitignore ( #649 )
2023-08-29 08:41:33 +01:00
fd3131a4ce
Fix the debug implementation. ( #648 )
2023-08-28 22:51:39 +01:00
037b41c9dc
Cuda conv transpose ( #645 )
...
* Cuda kernel for conv-transpose.
* Fix the cuda kernel.
* Fix the tests.
2023-08-28 20:58:49 +01:00
72fae3140c
Optimize the conv2d transpose cpu kernel. ( #644 )
...
* Optimize the conv2d transpose cpu kernel.
* Use multiple cores.
2023-08-28 20:06:31 +01:00
ca26198b95
Fix the cpu kernel for conv-transpose. ( #643 )
2023-08-28 16:45:12 +01:00
b292047882
Backprop for conv2d. ( #638 )
...
* Start adding backprop for conv2d.
* Backprop for conv2d.
* Bugfix + start adding a conv2d test.
* Conv2d backprop testing.
* More conv fixes.
2023-08-28 16:08:55 +01:00
09c5bd1881
Rebased
2023-08-28 15:47:03 +02:00
fe6c88713d
Fix waiting upgrade for SSL ?
2023-08-28 15:15:27 +02:00
6f3f9285e6
Remove image dep.
2023-08-28 15:15:27 +02:00
baca3cf69d
Fix deps.
2023-08-28 15:15:27 +02:00
d726484a6d
Re-enable local dir for mnist.
2023-08-28 15:15:27 +02:00
dd06d93d0b
Cleanup:
...
- Moved around book from `examples` to `candle-book` proper (overlapping
the book and the lib structures)
2023-08-28 15:15:26 +02:00
c109c93db7
Update candle-book/src/SUMMARY.md
2023-08-28 15:15:02 +02:00
d7a273be51
Training:
...
- Removed a lot of surface (SerializedFileReader ownership is really
painful).
- Moved example + vision to hf.co version.
- Removed feature gate.
2023-08-28 15:15:01 +02:00
dd02f589c0
Better training+hub
2023-08-28 15:14:43 +02:00
7602323667
[Book] Add small error management + start training (with generic dataset
...
inclusion).
2023-08-28 15:14:17 +02:00
9137c63175
Update README.md ( #640 )
2023-08-28 11:34:54 +01:00
3cca89cc70
Add conv-transpose. ( #635 )
...
* Add conv-transpose.
* Return zeros for now.
* Naive CPU implementation.
* Add a conv-transpose test + fix the cpu implementation.
* Add a second test.
2023-08-28 10:10:12 +01:00
26e1b40992
Repeat-penalty in the falcon example. ( #634 )
2023-08-28 08:13:40 +01:00
1da71a5da1
Neon optimized version of the q4k vecdot product. ( #632 )
2023-08-27 21:30:47 +01:00
24dda44c27
Add wasm support for yolo-v8 pose detection. ( #630 )
...
* Add wasm support for yolo-v8 pose detection.
* Better bbox handling.
* Add the pose model in the wasm example lib.
2023-08-27 19:49:24 +01:00
72ebb12bca
Remove some dead-code annotations. ( #629 )
...
* Remove some dead-code annotations.
* More dead code removal.
* One more.
* CI fix.
2023-08-27 18:52:33 +01:00
a3f97c143d
Bump the crate version + update CHANGELOG. ( #628 )
2023-08-27 18:17:11 +01:00
4c338b0cd9
VarBuilder cleanup ( #627 )
...
* VarBuilder cleanup.
* Implement the basic varbuilders.
* Add the sharded code.
* Proper support for tensor sharding.
2023-08-27 18:03:26 +01:00
be471d50ab
Llama quantization. ( #625 )
2023-08-27 14:08:15 +01:00
7151f2cf63
Add the quantize command. ( #624 )
...
* Add the quantize command.
* Bugfix for writing gguf files.
* And add a comment.
2023-08-27 11:35:19 +01:00
6e485f2deb
Add some optional repeat penalty. ( #623 )
...
* Add some optional repeat penalty.
* Add the missing files.
2023-08-27 10:48:45 +01:00
5320aa6b7d
Move the test-utils bits to a shared place. ( #619 )
2023-08-27 09:42:22 +01:00
a8b39dd7b7
Fix for q5_1 quantization. ( #617 )
...
* Fix for q5_1 quantization.
* Fix some typos.
2023-08-27 08:31:18 +01:00
fa0d75b18d
Quantization tests + fix some issues. ( #616 )
2023-08-27 08:17:38 +01:00
28658054ff
More missing quantized bits. ( #615 )
...
* Q4_1 support.
* Add Q5_1 quantization.
* Tweak.
2023-08-27 07:52:26 +01:00
ab36a7f3e3
Fix for when f16c is not available. ( #614 )
2023-08-27 07:19:52 +01:00
f704e39761
Missing quants ops ( #611 )
...
* Another transmute tweak.
* Changelog tweak.
* Add some missing quantized ops.
2023-08-26 20:09:04 +01:00
fdf15f0e05
Another transmute tweak. ( #610 )
...
* Another transmute tweak.
* Changelog tweak.
2023-08-26 13:00:24 +01:00