|
3aac1047fe
|
Sketch the conv1d op.
|
2023-07-04 10:52:34 +01:00 |
|
|
e6b01d0c18
|
Add the conv1d layer (but not the op).
|
2023-07-04 10:01:05 +01:00 |
|
|
d71b31144d
|
Add a weight extraction script.
|
2023-07-04 09:29:19 +01:00 |
|
|
c09aa4b0f4
|
Add more to the forward pass and fix the cross-attention.
|
2023-07-04 09:14:28 +01:00 |
|
|
998cfda9c4
|
Sinusoid embeddings.
|
2023-07-04 09:03:25 +01:00 |
|
|
6728a85676
|
Add more to the whisper inference.
|
2023-07-04 08:32:03 +01:00 |
|
|
512dd9e4d6
|
Flesh out the whisper example.
|
2023-07-04 08:01:06 +01:00 |
|
|
0ca2af6940
|
Add the residual attention block.
|
2023-07-04 07:43:36 +01:00 |
|
|
b1d42231fb
|
Start sketching the whisper model.
|
2023-07-04 07:23:02 +01:00 |
|
|
a8b38ff821
|
Merge pull request #65 from LaurentMazare/bert-batch
Add a batch dimension on the bert example.
|
2023-07-04 06:21:38 +01:00 |
|
|
a57b314780
|
Add a batch dimension on the bert example.
|
2023-07-04 06:10:52 +01:00 |
|
|
8e4d298c90
|
Merge pull request #64 from LaurentMazare/matmul-batch
Better handling of the batch dimension in matmul.
|
2023-07-03 22:56:13 +01:00 |
|
|
86d691c74c
|
Better handling of the batch dimension in matmul.
|
2023-07-03 22:51:40 +01:00 |
|
|
ee3f7c0269
|
Merge pull request #63 from LaurentMazare/minor-tweaks
Minor tweaks.
|
2023-07-03 18:41:38 +01:00 |
|
|
b6d179cc1c
|
Allow for batch dimensions in the embedding layer.
|
2023-07-03 18:37:40 +01:00 |
|
|
9784d1ed9f
|
Minor tweaks.
|
2023-07-03 18:31:55 +01:00 |
|
|
3905de4169
|
Merge pull request #61 from LaurentMazare/bert
Sketch some Bert model.
|
2023-07-03 17:14:46 +01:00 |
|
|
5524ca29cc
|
Remove the fixed length hack.
|
2023-07-03 17:13:23 +01:00 |
|
|
1ea6690557
|
Bugfix for transpose.
|
2023-07-03 17:06:23 +01:00 |
|
|
a7f03a7bb6
|
Fix the layer norm to properly handle bias.
|
2023-07-03 16:45:03 +01:00 |
|
|
f379b8feae
|
Get some embeddings out.
|
2023-07-03 16:11:16 +01:00 |
|
|
54850e7525
|
Get the tensors to be loaded properly.
|
2023-07-03 15:53:31 +01:00 |
|
|
ad52b0377c
|
Add the varbuilder + check shapes.
|
2023-07-03 15:32:20 +01:00 |
|
|
895805be92
|
Merge remote-tracking branch 'origin/main' into bert
|
2023-07-03 14:45:49 +01:00 |
|
|
fd682a94f8
|
Merge pull request #62 from LaurentMazare/safetensors_integration
Adding saving capabilities.
|
2023-07-03 15:40:00 +02:00 |
|
|
f74bddca31
|
Model creation.
|
2023-07-03 14:09:46 +01:00 |
|
|
12ac9e1460
|
Complete (?) the forward pass.
|
2023-07-03 13:33:32 +01:00 |
|
|
d796945ad8
|
Add more to the forward pass.
|
2023-07-03 13:04:41 +01:00 |
|
|
0b3cc215f1
|
Address comments.
|
2023-07-03 13:52:27 +02:00 |
|
|
5bc66c68fa
|
Adding saving capabilities.
|
2023-07-03 13:39:24 +02:00 |
|
|
2309c5fac5
|
Boilerplate code for Bert.
|
2023-07-03 12:17:06 +01:00 |
|
|
d87ff64de1
|
Merge pull request #60 from LaurentMazare/candle-examples
Move llama in a cargo-examples directory.
|
2023-07-03 11:38:07 +01:00 |
|
|
cb03364718
|
Fix the CI.
|
2023-07-03 11:34:02 +01:00 |
|
|
8215eacceb
|
Formatting.
|
2023-07-03 11:32:50 +01:00 |
|
|
fdb1acd2ff
|
Move llama in a cargo-examples directory.
|
2023-07-03 11:30:58 +01:00 |
|
|
d0d530dfdc
|
Merge pull request #59 from LaurentMazare/safety
Adding a bit more docs around safety.
|
2023-07-03 12:00:35 +02:00 |
|
|
81cec86e75
|
Adding a bit more docs around safety.
|
2023-07-03 11:55:54 +02:00 |
|
|
48089005f6
|
Merge pull request #58 from LaurentMazare/use-patched-gemm
Use the patched gemm for the time being.
|
2023-07-03 10:36:09 +01:00 |
|
|
639270b796
|
Use the patched gemm for the time being.
|
2023-07-03 10:29:15 +01:00 |
|
|
ec4871b8a4
|
Merge pull request #57 from LaurentMazare/safetensor-module2
Move more safetensors bits to the shared module.
|
2023-07-03 10:19:57 +01:00 |
|
|
899c76de75
|
Handle more types in safetensors.
|
2023-07-03 10:09:46 +01:00 |
|
|
783b7054ee
|
Move more safetensors bits to the shared module.
|
2023-07-03 09:34:08 +01:00 |
|
|
b036faf6a0
|
Merge pull request #56 from LaurentMazare/safetensor-module
Dedicated safetensor module
|
2023-07-03 08:47:25 +01:00 |
|
|
fe2c07e368
|
Add the ST error.
|
2023-07-03 08:44:00 +01:00 |
|
|
cf2789fb81
|
Move some safetensors bits in the candle-core crate.
|
2023-07-03 08:37:46 +01:00 |
|
|
9e419641fb
|
Merge pull request #55 from LaurentMazare/pyo3-device
Cuda support for the pyo3 bindings
|
2023-07-02 21:04:58 +01:00 |
|
|
5b0ee2e0ba
|
Get cuda to work on pyo3.
|
2023-07-02 21:04:11 +01:00 |
|
|
fbfe74caab
|
Preliminary pyo3 support for device.
|
2023-07-02 20:42:55 +01:00 |
|
|
eb6f7d30b6
|
Merge pull request #54 from LaurentMazare/more-pyo3-2
Add dtype support in the pyo3 bindings.
|
2023-07-02 20:21:37 +01:00 |
|
|
bdb257ceab
|
Add the tensor function.
|
2023-07-02 20:15:50 +01:00 |
|