963c75cb89
Adding offline mode.
2023-07-05 07:19:57 +00:00
43a007cba4
Upgrading bert example to work with bert-base-uncased
.
...
- Always take weights from the hub
- Optional `model_id` + `revision` to use safetensors version
potentially
- Optional loading for `bert-base-uncased` (`weight` vs `gamma`).
- Take the config from the hub.
2023-07-04 14:12:14 +00:00
a8b38ff821
Merge pull request #65 from LaurentMazare/bert-batch
...
Add a batch dimension on the bert example.
2023-07-04 06:21:38 +01:00
a57b314780
Add a batch dimension on the bert example.
2023-07-04 06:10:52 +01:00
8e4d298c90
Merge pull request #64 from LaurentMazare/matmul-batch
...
Better handling of the batch dimension in matmul.
2023-07-03 22:56:13 +01:00
86d691c74c
Better handling of the batch dimension in matmul.
2023-07-03 22:51:40 +01:00
ee3f7c0269
Merge pull request #63 from LaurentMazare/minor-tweaks
...
Minor tweaks.
2023-07-03 18:41:38 +01:00
b6d179cc1c
Allow for batch dimensions in the embedding layer.
2023-07-03 18:37:40 +01:00
9784d1ed9f
Minor tweaks.
2023-07-03 18:31:55 +01:00
3905de4169
Merge pull request #61 from LaurentMazare/bert
...
Sketch some Bert model.
2023-07-03 17:14:46 +01:00
5524ca29cc
Remove the fixed length hack.
2023-07-03 17:13:23 +01:00
1ea6690557
Bugfix for transpose.
2023-07-03 17:06:23 +01:00
a7f03a7bb6
Fix the layer norm to properly handle bias.
2023-07-03 16:45:03 +01:00
f379b8feae
Get some embeddings out.
2023-07-03 16:11:16 +01:00
54850e7525
Get the tensors to be loaded properly.
2023-07-03 15:53:31 +01:00
ad52b0377c
Add the varbuilder + check shapes.
2023-07-03 15:32:20 +01:00
895805be92
Merge remote-tracking branch 'origin/main' into bert
2023-07-03 14:45:49 +01:00
fd682a94f8
Merge pull request #62 from LaurentMazare/safetensors_integration
...
Adding saving capabilities.
2023-07-03 15:40:00 +02:00
f74bddca31
Model creation.
2023-07-03 14:09:46 +01:00
12ac9e1460
Complete (?) the forward pass.
2023-07-03 13:33:32 +01:00
d796945ad8
Add more to the forward pass.
2023-07-03 13:04:41 +01:00
0b3cc215f1
Address comments.
2023-07-03 13:52:27 +02:00
5bc66c68fa
Adding saving capabilities.
2023-07-03 13:39:24 +02:00
2309c5fac5
Boilerplate code for Bert.
2023-07-03 12:17:06 +01:00
d87ff64de1
Merge pull request #60 from LaurentMazare/candle-examples
...
Move llama in a cargo-examples directory.
2023-07-03 11:38:07 +01:00
cb03364718
Fix the CI.
2023-07-03 11:34:02 +01:00
8215eacceb
Formatting.
2023-07-03 11:32:50 +01:00
fdb1acd2ff
Move llama in a cargo-examples directory.
2023-07-03 11:30:58 +01:00
d0d530dfdc
Merge pull request #59 from LaurentMazare/safety
...
Adding a bit more docs around safety.
2023-07-03 12:00:35 +02:00
81cec86e75
Adding a bit more docs around safety.
2023-07-03 11:55:54 +02:00
48089005f6
Merge pull request #58 from LaurentMazare/use-patched-gemm
...
Use the patched gemm for the time being.
2023-07-03 10:36:09 +01:00
639270b796
Use the patched gemm for the time being.
2023-07-03 10:29:15 +01:00
ec4871b8a4
Merge pull request #57 from LaurentMazare/safetensor-module2
...
Move more safetensors bits to the shared module.
2023-07-03 10:19:57 +01:00
899c76de75
Handle more types in safetensors.
2023-07-03 10:09:46 +01:00
783b7054ee
Move more safetensors bits to the shared module.
2023-07-03 09:34:08 +01:00
b036faf6a0
Merge pull request #56 from LaurentMazare/safetensor-module
...
Dedicated safetensor module
2023-07-03 08:47:25 +01:00
fe2c07e368
Add the ST error.
2023-07-03 08:44:00 +01:00
cf2789fb81
Move some safetensors bits in the candle-core crate.
2023-07-03 08:37:46 +01:00
9e419641fb
Merge pull request #55 from LaurentMazare/pyo3-device
...
Cuda support for the pyo3 bindings
2023-07-02 21:04:58 +01:00
5b0ee2e0ba
Get cuda to work on pyo3.
2023-07-02 21:04:11 +01:00
fbfe74caab
Preliminary pyo3 support for device.
2023-07-02 20:42:55 +01:00
eb6f7d30b6
Merge pull request #54 from LaurentMazare/more-pyo3-2
...
Add dtype support in the pyo3 bindings.
2023-07-02 20:21:37 +01:00
bdb257ceab
Add the tensor function.
2023-07-02 20:15:50 +01:00
78871ffe38
Add dtype support.
2023-07-02 20:12:26 +01:00
65e069384c
Merge pull request #53 from LaurentMazare/more-pyo3
...
Add more pyo3 wrapping
2023-07-02 07:50:49 +01:00
d38897461b
Add to the example.
2023-07-02 07:37:17 +01:00
5b8c6764b0
Add matmul/where_cond.
2023-07-02 07:34:14 +01:00
9a9858bbe0
Expose a couple more ops.
2023-07-02 07:30:00 +01:00
dfe197f791
Handle more input types to create tensors.
2023-07-02 07:19:46 +01:00
4a28dcf828
Rename the method.
2023-07-02 07:08:11 +01:00