Make the Python Wrapper more Hackable and simplify Quantization (#1010)

mirror of https://github.com/huggingface/candle.git synced 2025-06-16 10:38:54 +00:00

* Some first `Module` implementations

* Add `state_dict` and `load_state_dict` functionality

* Move modules around and create `candle.nn.Linear`

* Add `nn.Embedding` and `nn.LayerNorm`

* Add BERT implementation

* Batch q-matmul

* Automatically dequantize `QTensors` if a `Tensor` is expected

* Add Module `.to()`, `.cuda()`, `cpu()` and `.type()` functionality

* Unittests for `Module`, `Tensor` and `candle.utils`

* Add `pytorch` like slicing to `Tensor`

* Cleanup and BERT fixes

* `black` formatting + unit-test for `nn.Linear`

* Refactor slicing implementation

This commit is contained in:

Lukas Kreussel

2023-10-06 20:01:07 +02:00

committed by

GitHub

parent b0442eff8a

commit 904bbdae65

25 changed files with 2426 additions and 182 deletions

1

candle-pyo3/.gitignore vendored

View File

 @ -1,3 +1,4 @@
 tests/_workdir
 # Byte-compiled / optimized / DLL files
 __pycache__/
 *.py[cod]

Make the Python Wrapper more Hackable and simplify Quantization (#1010)

1 candle-pyo3/.gitignore vendored Unescape Escape View File

1

candle-pyo3/.gitignore vendored

View File