c2007ac88f
W fixes. ( #862 )
2023-09-15 15:11:11 +01:00
30be5b6660
Replication pad ( #861 )
...
* Add the embed mapper convolutions.
* Add the replication pad layer.
* Use the replication-pad op.
* Tweak a todo.
2023-09-15 14:06:21 +01:00
107d3d9530
Add the embed mapper convolutions. ( #860 )
2023-09-15 11:38:38 +02:00
2746f2c4be
DiffNeXt/unet ( #859 )
...
* DiffNeXt/unet
* Start adding the vae.
* VAE residual block.
* VAE forward pass.
* Add pixel shuffling.
* Actually use pixel shuffling.
2023-09-15 10:14:02 +01:00
81a36b8713
Add link error info ( #851 )
...
* add link error info
* grammar fix
2023-09-15 07:25:10 +01:00
0633c85514
Add leaky-relu in the activation enum. ( #858 )
2023-09-15 07:05:38 +01:00
39157346cb
Add SAM UI Demo ( #854 )
...
* fix tensor flattening
* send image data back
* sam ui worker example
* SAM example
* resize container
* no need for this
2023-09-15 06:31:58 +01:00
5cefbba757
minor UI fixes ( #856 )
...
* fixes
* remove listener
* remove event listener
2023-09-15 06:30:50 +01:00
130fe5a087
Add the upblocks. ( #853 )
2023-09-14 22:24:56 +01:00
91ec546feb
More DiffNeXt. ( #847 )
...
* More DiffNeXt.
* Down blocks.
2023-09-14 22:16:31 +02:00
0a647875ec
Use softmax-last-dim in the quantized example. ( #848 )
2023-09-14 17:29:24 +01:00
a0c6d5548c
Add the attention block. ( #846 )
...
* Add the attention block.
* Add more to clipnext.
2023-09-14 15:40:09 +01:00
286f01db14
Start adding the Wuerstchen diffusion pipeline ( #843 )
...
* Wuerstchen common bits.
* Add the prior layer.
* Start adding diffnext.
2023-09-14 10:56:07 +01:00
d6447ad635
Tensor based indexing. ( #842 )
2023-09-14 07:47:07 +01:00
49d3f7f708
Add support to flan-t5 ( #840 )
2023-09-13 19:27:20 +02:00
9a465e1b26
Add 1d upsampling. ( #839 )
...
* Add 1d upsampling.
* Add the interpolate functions.
2023-09-13 16:50:39 +01:00
31ab2ddaeb
Remove the padding. ( #838 )
2023-09-13 13:00:59 +01:00
b11a2a7b9d
Move the constant to avoid some unused warning. ( #837 )
2023-09-13 11:56:53 +01:00
1c09164021
Add CANDLE_NVCC_CCBIN support for candle-kernels, and eliminate warning. ( #836 )
2023-09-13 11:39:22 +01:00
3e94324012
Add some sentence similarity part to the t5 example. ( #835 )
...
* Add some sentence similarity part to the t5 example.
* Clippy fix.
2023-09-13 10:44:02 +01:00
e6f040d6e3
Readme gallery ( #834 )
...
* More readme tweaks.
* Update README.md
2023-09-13 09:05:47 +01:00
cbd36157ac
Add a gif to the quantized readme. ( #833 )
...
* Add a gif to the quantized readme.
* gif update.
2023-09-13 08:43:52 +01:00
18d3c803a8
Scalar support in minimum/maximum. ( #832 )
...
* Scalar support in minimum/maximum.
* Add a clamp method to tensors.
2023-09-13 08:24:58 +01:00
e4553fb355
T5 tweaks ( #831 )
...
* Use default values rather than options.
* Avoid exposing the device field.
* More tweaks.
2023-09-13 07:37:04 +01:00
d801e1d564
Clippy fix. ( #830 )
2023-09-13 07:16:20 +01:00
9daa6dbe87
Extract T5 module and add main function to use it ( #829 )
...
* Extract t5 out of musicgen
* Add main for t5 module
2023-09-13 07:14:05 +01:00
e82fcf1c59
Add more example readmes. ( #828 )
...
* Add more readmes.
* Add a readme for dinov2.
* Add some skeleton files for a couple more examples.
* More whisper details.
2023-09-12 17:21:24 +01:00
805bf9ffa7
Implement top_p / nucleus sampling ( #819 )
...
* Implement top_p / nucleus sampling
* Update changelog
* rustfmt
* Add tests
* Fix clippy warning
* Fix another clippy error
2023-09-12 18:10:16 +02:00
42da17694a
Segment Anything readme ( #827 )
...
* Add a readme for the segment-anything model.
* Add the original image.
* Clean-up the segment anything cli example.
* Also print the mask id in the outputs.
2023-09-12 14:35:55 +01:00
25aacda28e
Add useful libraries section ( #825 )
...
* Add useful libraries section
* Add link
2023-09-12 11:06:21 +01:00
7a62aad24a
Add a readme for yolo-v8. ( #824 )
2023-09-12 11:01:06 +01:00
bb23b90b1d
Add a small readme for the quantized example. ( #823 )
2023-09-12 10:17:31 +01:00
2257f4d475
Bump the crate version + update the changelog. ( #822 )
2023-09-12 06:39:24 +01:00
871efc0307
Bugfix for the conv2d cpu kernel. ( #820 )
2023-09-11 23:11:27 +01:00
c5a058b169
Use the module trait in stable-diffusion. ( #817 )
2023-09-11 20:40:07 +01:00
59e63d690c
Add weight, bias, and hidden_size methods ( #816 )
...
* Add weight, bias methods to Conv(1|2)
* Add hidden_size method to Embedding
* Expose hidden_size
2023-09-11 16:01:11 +01:00
dbd4561416
im2col version of the conv1d kernel. ( #815 )
...
* im2col version of the cuda conv1d kernel.
* im2col version of the conv1d cpu kernel.
2023-09-11 14:40:09 +01:00
5c35fbbb13
Stable-Diffusion readme ( #814 )
...
* Stable Diffusion readme.
* Fix the image path.
* Move the assets.
* Resize the sample image.
* Lower resolution.
2023-09-11 13:06:29 +01:00
70f38c2069
Proper error on unsupported dtypes when using gemm. ( #813 )
2023-09-11 12:10:51 +01:00
d7b9fec849
Move the stable-diffusion modeling code so that it's easier to re-use. ( #812 )
2023-09-11 11:45:57 +01:00
84ee870efd
Use softmax-last-dim in whisper. ( #810 )
2023-09-11 11:05:05 +01:00
df712ecf64
Handle the case where the kernel is not contiguous in the cuda backend. ( #809 )
2023-09-11 09:48:31 +01:00
6fb665004c
Enable im2col on the cpu side. ( #805 )
...
* Enable im2col on the cpu side.
* Hook im2col on the cpu backend.
* Use the kernel offset.
* Avoid an unnecessary copy.
* Handle non-contiguous kernels.
* Add a const to select the conv2d kernel.
2023-09-11 09:28:13 +01:00
1cd74129d4
Add Im2Col support on the gpu side. ( #808 )
...
* Add Im2Col support on the gpu side.
* Actually enable.
2023-09-11 08:52:33 +01:00
98d1242b8f
im2col based conv2d ( #802 )
...
* im2col implementation for conv2d.
* Fix for the im2col implementation to match the current conv2d.
* Small optimization.
* Add a cuda kernel.
* Handle arbitrary layouts.
* Im2Col cuda code.
2023-09-10 21:02:42 +01:00
18d6db2180
more doc fixes ( #804 )
2023-09-10 20:36:29 +01:00
4f18180fc7
Bugfix so that im2col produces the same results as conv2d. ( #801 )
2023-09-10 16:59:46 +01:00
559944146f
Add an im2col based benchmark. ( #800 )
...
* Add an im2col based benchmark.
* Reshape the final result.
2023-09-10 16:56:28 +01:00
3dd5804299
Fix typo in readme. ( #799 )
2023-09-10 13:49:47 +01:00
90e077e409
Return the low res mask in the wasm segment-anything module. ( #798 )
...
* Return the low res mask.
* Add some validations.
2023-09-10 13:03:02 +01:00