0d9bb4eb18
Add the blip example. ( #1144 )
...
* Add the blip example.
* Tweak the example.
* Implement the cross-attn logic.
* Fix some shape mismatches.
* Get some logits out.
* Get a caption to be generated.
2023-10-21 20:05:02 +01:00
e8f760ee44
Add get_on_dim. ( #1142 )
2023-10-21 15:01:38 +01:00
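A rough sketch of the kind of indexing this adds: selecting a single index along an arbitrary dimension, which can also be expressed with the existing `narrow` and `squeeze` ops. The sketch below uses only those two ops; the exact `get_on_dim` signature is taken from the commit title and not assumed here.

```rust
use candle_core::{Device, Result, Tensor};

fn main() -> Result<()> {
    let t = Tensor::arange(0f32, 24., &Device::Cpu)?.reshape((2, 3, 4))?;
    // Select index 1 along dimension 1 (roughly t[:, 1, :] in NumPy terms),
    // built from narrow + squeeze, which is what an index-on-dim helper wraps.
    let slice = t.narrow(1, 1, 1)?.squeeze(1)?;
    assert_eq!(slice.dims(), &[2, 4]);
    println!("{slice}");
    Ok(())
}
```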
94e3373883
Blip forward pass ( #1141 )
...
* More forward methods for the blip model.
* Blipping continues.
2023-10-21 10:19:23 +01:00
34d9e91748
Add the blip image captioning model ( #1140 )
...
* Blip text model.
* Blip vision bits.
* Blippity.
* More blip.
2023-10-20 22:09:11 +01:00
cfb423ab76
PyO3: Add CI ( #1135 )
...
* Add PyO3 ci
* Update python.yml
* Format `bert.py`
2023-10-20 19:05:14 +01:00
7366aeac21
Make func cloneable. ( #1137 )
2023-10-20 16:28:50 +01:00
99cf13e8e2
Add the sequential layer. ( #1136 )
2023-10-20 16:08:50 +01:00
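A minimal sketch of how such a sequential container is typically used, assuming the builder-style `seq()` / `add` API suggested by the commit title; the layer prefixes "l1" and "l2" are illustrative.

```rust
use candle_core::{DType, Device, Result, Tensor};
use candle_nn::{linear, seq, Activation, Module, VarBuilder, VarMap};

fn main() -> Result<()> {
    let dev = Device::Cpu;
    let varmap = VarMap::new();
    let vb = VarBuilder::from_varmap(&varmap, DType::F32, &dev);
    // Chain layers into a single Module; each entry only needs to implement Module.
    let model = seq()
        .add(linear(4, 8, vb.pp("l1"))?)
        .add(Activation::Relu)
        .add(linear(8, 2, vb.pp("l2"))?);
    let xs = Tensor::zeros((1, 4), DType::F32, &dev)?;
    let ys = model.forward(&xs)?;
    assert_eq!(ys.dims(), &[1, 2]);
    Ok(())
}
```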
b43ab6cd1d
PyO3: Add `None` and `Tensor` indexing to `candle.Tensor` ( #1098 )
...
* Add proper `None` and `tensor` indexing
* Allow indexing via lists + allow tensor/list indexing outside of first dimension
2023-10-20 09:59:00 +01:00
31ca4897bb
Readme updates. ( #1134 )
2023-10-20 09:08:39 +01:00
55351ef57d
Add some vision transformers models ( #1132 )
...
* Start adding vision-transformers.
* Add self-attn.
* More vision transformers.
* vit-vit.
* Add the actual vit model.
* Add the example code for the vision transformers.
2023-10-19 22:24:18 +01:00
6684b7127a
PyO3: Add pytorch-like `.to()` operator to `candle.Tensor` ( #1100 )
...
* add `.to()` operator
* Only allow each value to be provided once via `args` or `kwargs`
2023-10-19 21:46:21 +01:00
93c25e8844
Expose the larger resnets (50/101/152) in the example. ( #1131 )
2023-10-19 13:48:28 +01:00
cd53c472df
Support ResNet 50/101/152. ( #1130 )
2023-10-19 10:48:31 +01:00
6f76383f38
Add a readme for the resnet example. ( #1129 )
2023-10-19 09:58:50 +01:00
8e773cc0c6
Experiment with resnet ( #1128 )
...
* Add some preliminary support for resnet.
* Add an actual resnet example.
2023-10-19 09:25:03 +01:00
87eb1658e1
Add pad_with_same. ( #1127 )
...
* More model cloning.
* More cloning on quantized models.
* Add pad-with-same.
* Add some tests.
2023-10-18 23:13:37 +01:00
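The idea behind same-padding is to extend a dimension by replicating its edge values rather than inserting zeros. A small sketch, assuming a `pad_with_same(dim, left, right)` signature mirroring the existing `pad_with_zeros`:

```rust
use candle_core::{Device, Result, Tensor};

fn main() -> Result<()> {
    let t = Tensor::new(&[[1f32, 2., 3.], [4., 5., 6.]], &Device::Cpu)?;
    // Replicate edge values along the last dimension: one column on the left,
    // two on the right, so the shape goes from (2, 3) to (2, 6).
    let padded = t.pad_with_same(1, 1, 2)?;
    assert_eq!(padded.dims(), &[2, 6]);
    // Expected rows: [1, 1, 2, 3, 3, 3] and [4, 4, 5, 6, 6, 6].
    println!("{padded}");
    Ok(())
}
```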
902d0b9166
More model cloning. ( #1126 )
...
* More model cloning.
* More cloning on quantized models.
2023-10-18 21:55:46 +01:00
185b54a33b
Make some model cloneable. ( #1125 )
2023-10-18 19:30:47 +01:00
620c94d12e
Add support for Zephyr-7b in the quantized model. ( #1124 )
2023-10-18 17:31:26 +01:00
86e7d539d2
Add the quantized mpt model. ( #1123 )
...
* Add the quantized mpt model.
* Support the quantized model for replit-code.
2023-10-18 16:29:38 +01:00
cb034506cd
Remove the unused pragma in mpt. ( #1122 )
2023-10-18 15:47:50 +01:00
63c204c79e
Add a mention to the replit-code model in the readme. ( #1121 )
2023-10-18 11:27:23 +01:00
767a6578f1
MPT alibi fixes. ( #1120 )
...
* MPT alibi fixes.
* Some more fixes.
* Finally get the model to return some sensible outputs.
* Add a readme.
2023-10-18 10:58:05 +01:00
662c186fd5
Better error message when overflowing in narrow. ( #1119 )
2023-10-18 08:40:14 +01:00
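For context, `narrow(dim, start, len)` takes a contiguous slice of length `len` starting at `start` along dimension `dim`, and the improved message covers the case where `start + len` runs past the end of that dimension. A minimal illustration:

```rust
use candle_core::{Device, Result, Tensor};

fn main() -> Result<()> {
    let t = Tensor::arange(0f32, 10., &Device::Cpu)?;
    // In range: elements 2..8 of a length-10 dimension.
    let ok = t.narrow(0, 2, 6)?;
    assert_eq!(ok.dims(), &[6]);
    // Out of range: start + len = 8 + 5 exceeds the dimension size 10,
    // so narrow returns a descriptive error rather than panicking.
    assert!(t.narrow(0, 8, 5).is_err());
    Ok(())
}
```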
2cd745a97c
MPT fixes. ( #1117 )
...
* MPT fixes.
* Another couple fixes.
* Another shape fix.
2023-10-17 21:53:31 +01:00
a72b50e2c0
Build alibi bias. ( #1115 )
...
* Build alibi bias.
* Apply the alibi attention bias.
* Add the replit-code example.
2023-10-17 20:41:37 +01:00
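The ALiBi scheme itself is compact: each attention head gets a fixed slope, and the attention logits receive an additive bias proportional to the negated distance between query and key positions. A self-contained sketch in candle terms; the helper name and the simplified slope schedule below are illustrative rather than MPT's exact internals:

```rust
use candle_core::{DType, Device, Result, Tensor};

/// Illustrative ALiBi-style bias: bias[h, i, j] = -slope_h * |i - j|.
fn alibi_bias(num_heads: usize, seq_len: usize, device: &Device) -> Result<Tensor> {
    // Simplified geometric slope schedule, one slope per head.
    let slopes: Vec<f32> = (1..=num_heads)
        .map(|h| 2f32.powf(-8.0 * h as f32 / num_heads as f32))
        .collect();
    let slopes = Tensor::from_vec(slopes, (num_heads, 1, 1), device)?;
    // Pairwise distances |i - j| as a (seq_len, seq_len) matrix.
    let pos = Tensor::arange(0u32, seq_len as u32, device)?.to_dtype(DType::F32)?;
    let i = pos.reshape((seq_len, 1))?.broadcast_as((seq_len, seq_len))?;
    let j = pos.reshape((1, seq_len))?.broadcast_as((seq_len, seq_len))?;
    let dist = (i - j)?.abs()?;
    // Scale per head and negate: shape (num_heads, seq_len, seq_len).
    slopes.broadcast_mul(&dist.unsqueeze(0)?)?.neg()
}

fn main() -> Result<()> {
    let bias = alibi_bias(4, 8, &Device::Cpu)?;
    assert_eq!(bias.dims(), &[4, 8, 8]);
    Ok(())
}
```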
872c3f14b0
Add the MPT model. ( #1114 )
...
* Add the MPT model.
* Add ffn and block.
* Forward pass for the mpt block.
* Repeat-kv.
2023-10-17 16:06:48 +01:00
f9e93f5b69
Extend stub.py to accept external typehinting ( #1102 )
2023-10-17 11:07:26 +01:00
b355ab4e2e
Always broadcast magic methods ( #1101 )
2023-10-17 10:57:12 +01:00
2fe24ac5b1
Rework the cuda casting bits. ( #1112 )
2023-10-17 09:44:51 +01:00
00948eb656
Formatting tweak. ( #1111 )
2023-10-16 21:02:53 +01:00
af67672207
Add support for Puffin-Phi-v2. ( #1110 )
...
* Add support for Puffin-Phi-v2.
* Tweak the file name.
* Support the config for puffin-phi-v2.
* Update the readme.
2023-10-16 20:54:21 +01:00
6c588c4792
Refactor the pth tensor extraction. ( #1109 )
2023-10-16 18:16:34 +01:00
122da87580
feat: add pth varbuilder ( #1108 )
2023-10-16 16:20:36 +01:00
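A short sketch of pulling weights straight out of a PyTorch checkpoint through a VarBuilder; the path and the "proj" tensor names are placeholders, and the `VarBuilder::from_pth` entry point is inferred from this commit, so treat the exact signature as an assumption:

```rust
use candle_core::{DType, Device, Result};
use candle_nn::{linear, VarBuilder};

fn main() -> Result<()> {
    let dev = Device::Cpu;
    // "model.pth" is a placeholder; the checkpoint is expected to contain
    // tensors named "proj.weight" and "proj.bias" for this layer to load.
    let vb = VarBuilder::from_pth("model.pth", DType::F32, &dev)?;
    let _proj = linear(768, 256, vb.pp("proj"))?;
    Ok(())
}
```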
75629981bc
feat: parse Cuda compute cap from env ( #1066 )
...
* feat: add support for multiple compute caps
* Revert to one compute cap
* fmt
* fix
2023-10-16 15:37:38 +01:00
0106b0b04c
Read all the tensors in a PyTorch pth file. ( #1106 )
2023-10-16 13:50:07 +01:00
588ad4835a
Fix the verbose prompt for phi. ( #1097 )
2023-10-15 10:53:25 +01:00
b73c35cc57
Improve the reshape error messages. ( #1096 )
...
* Improve the reshape error messages.
* Add the verbose-prompt flag to the phi example.
2023-10-15 10:43:10 +01:00
8f310cc666
Avoid trying to backprop through non-differentiable layers. ( #1094 )
2023-10-14 22:03:41 +01:00
8921d5027c
Add support for phi-1.0 ( #1093 )
...
* Add support for phi-1.0
* Update the readme.
2023-10-14 20:15:43 +01:00
29c7f2565d
Add a reinforcement learning example. ( #1090 )
...
* Add a reinforcement learning example.
* Python initialization.
* Get the example to run.
* Vectorized gym envs for the atari wrappers.
* Get some simulation loop to run.
2023-10-14 16:46:43 +01:00
9309cfc47d
Create a new curand instead of reseeding. ( #1089 )
2023-10-14 10:03:59 +01:00
a193bf5f60
Another gemm update. ( #1088 )
2023-10-14 09:36:52 +01:00
2c110ac7d9
Add the pooling operators to the pyo3 layer. ( #1086 )
2023-10-13 20:18:10 +01:00
75989fc3b7
Use an attention mask in the e5 padding case. ( #1085 )
2023-10-13 18:53:40 +01:00
07af87a1d8
Typos. ( #1084 )
2023-10-13 16:21:20 +01:00
eefad2b95f
Update to gemm 0.16.1 ( #1083 )
2023-10-13 06:40:20 +01:00
5e6df4a3f7
Update to gemm-0.16. ( #1082 )
...
* Update to gemm-0.16.
* Enable wasm-simd128.
2023-10-12 21:56:59 +01:00
7473c4ceca
Fix the npy read function and add some testing. ( #1080 )
2023-10-12 15:25:05 +02:00
c096f02411
Add a matvec cpu benchmark. ( #1076 )
2023-10-12 09:29:18 +01:00