candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-18 03:28:50 +00:00

Author	SHA1	Message	Date
Laurent Mazare	c7f92f985e	Further randn tweaks: use the appropriate rng rather than the f64 one, some cleanup. (#383 )	2023-08-10 05:48:19 +01:00
Lei	3bbc08a8df	Fix randn cpu (#382 ) * Change distributions Standard generates in [0, 1), Normal is correct. * Add test Not sure if this is the best place to put the test * Remove unnecessary use	2023-08-10 05:33:44 +01:00
LeeeSe	a5c5a893aa	add max_pool2d (#371 ) Co-authored-by: 赵理山 <ls@zhaolishandeMacBook-Air.local>	2023-08-09 18:05:26 +01:00
Laurent Mazare	cd225bd3b1	More testing for avg-pool2d. (#366 ) * More testing for avg-pool2d. * Another fix. * Add a max-pool test with non-divisible kernel sizes.	2023-08-09 16:12:23 +01:00
Laurent Mazare	b80348d22f	Bugfix for avg-pool + add some test. (#365 )	2023-08-09 15:44:16 +01:00
Laurent Mazare	dbc6f281c9	Conv1d test with padding. (#356 )	2023-08-09 05:45:38 +01:00
Laurent Mazare	608b2358c6	Add some conv1d test + bugfix using padding. (#349 )	2023-08-08 20:50:20 +01:00
Laurent Mazare	1e6dbeac01	Add some conv2d tests. (#347 ) * Add some conv2d tests. * Add a simpler conv2d test. * More conv2d testing + bugfix. * Add a todo.	2023-08-08 19:02:42 +01:00
Laurent Mazare	51e51da896	Rename the candle crate to candle-core (#301 ) * Rename to candle-core. * More candle-core renaming.	2023-08-02 08:20:22 +01:00
Laurent Mazare	4b3bd79fbd	Remove the embedding ops in favor of index-select. (#299 ) * Remove the embedding ops in favor of index-select. * Also remove the cuda kernels.	2023-08-02 05:42:11 +01:00
Laurent Mazare	cc76c63202	Use index-select for the embeddings as it supports backprop. (#298 )	2023-08-01 20:44:43 +01:00
Laurent Mazare	c950a5c6b1	Cuda support for the mnist training. (#277 ) * Cuda support for the mnist training. * min/max fix + testing. * Add the argmin/argmax tests. * More cuda support for argmin/argmax. * Cuda kernels for argmin and argmax.	2023-07-29 19:48:04 +01:00
Laurent Mazare	3eb2bc6d07	Softmax numerical stability. (#267 ) * Softmax numerical stability. * Fix the flash-attn test.	2023-07-28 13:13:01 +01:00
Laurent Mazare	f291065f6c	Do not panic on empty ranges. (#257 )	2023-07-27 09:28:47 +01:00
Laurent Mazare	944d70bd9a	Add a test for scatter add. (#238 ) * Add a test for scatter add (segfaults on gpus for now). * Bugfix for the scatter add cuda kernel.	2023-07-25 09:12:14 +01:00
Laurent Mazare	18cc73954a	Add some testing for index-add (#237 ) * Add some testing for index-add. * Fix the cpu implementation for index-add.	2023-07-25 08:38:33 +01:00
Laurent Mazare	581b104f97	Indexing cuda (#235 ) * Allow using uint8_t for indexing. * Revert the default cuda feature. * Add a cuda-kernel for index-select. * Add a test for gather.	2023-07-24 20:22:47 +01:00
Laurent Mazare	b50f932e7c	Add some cmp tests. (#233 ) * Add some cmp tests. * Add the cuda kernels for comparison operations.	2023-07-24 16:53:45 +01:00
Laurent Mazare	23827c49cd	Cleanup some todos. (#226 ) * Cleanup some todos. * Fix more todo. * Optimize for the contiguous case. * Add the IntDType trait. * Handle the intdtype trait for more ops. * Remove a todo. * Remove a todo.	2023-07-23 16:00:00 +01:00
Laurent Mazare	43c7223292	Rename the .r functions to .dims so as to be a bit more explicit. (#220 )	2023-07-22 10:39:27 +01:00
Laurent Mazare	5cc843550d	Add binary and ternary custom ops. (#217 )	2023-07-21 17:29:50 +01:00
Laurent Mazare	4a100875bf	Use a macro to handle the dtype pattern matching. (#215 )	2023-07-21 16:03:51 +01:00
Laurent Mazare	a6bcdfb269	Custom ops with a single argument (#214 ) * Add the CustomOp1 trait. * Add an example of custom op. * Polish the custom op example. * Add some backward pass test for custom ops.	2023-07-21 15:18:05 +01:00
Laurent Mazare	b02229ce92	Add some epsilon tolerance to grad tests so that they work on cuda / mkl. (#213 )	2023-07-21 12:45:14 +01:00
Laurent Mazare	c60831aad4	Add more gradient tests + bugfixes. (#211 ) * Add more gradient tests + bugfixes. * More tests and fixes. * More tests.	2023-07-21 06:52:39 +01:00
Laurent Mazare	4845d5cc64	More realistic training setup. (#210 ) * More realistic training setup. * Compute the model accuracy. * Very inefficient backprop for index select. * More backprop. * Fix some backprop issues. * Backprop fix. * Another broadcasting backprop fix. * Better backprop for reducing ops. * Training again. * Add some gradient tests. * Get the training to work.	2023-07-20 18:25:41 +01:00
Laurent Mazare	fa08fb3126	Add the index-select op. (#209 ) * Add the index-select op. * Cpu implementation of index-select. * Add the cpu implementation for index-select.	2023-07-20 14:01:03 +01:00
Laurent Mazare	76dcc7a381	Test the broadcasting binary ops. (#196 )	2023-07-19 06:18:36 +01:00
Laurent Mazare	18ea92d83b	Iteration over strided blocks (#175 ) * Introduce the strided blocks. * Use the strided blocks to fasten the copy. * Add more testing.	2023-07-15 21:30:35 +01:00
Laurent Mazare	a2f72edc0d	Simplify the parameters used by sum and sum_keepdim. (#165 )	2023-07-14 08:22:08 +01:00
Laurent Mazare	2bfa791336	Use the same default as pytorch for sum. (#164 )	2023-07-13 21:32:32 +01:00
Laurent Mazare	23e105cd94	Add the gradient for reduce-sum. (#162 ) * Add the gradient for reduce-sum. * And add the gradient for the broadcast ops. * Add some backprop tests. * Add some linear regression example.	2023-07-13 20:14:10 +01:00
Laurent Mazare	5ee3c95582	Move the variable creation to the variable module. (#159 ) * Move the variable creation to the variable module. * Make it possible to set a variable. * Add some basic gradient descent test. * Get the gradient descent test to work.	2023-07-13 16:55:40 +01:00
Laurent Mazare	50b0946a2d	Tensor mutability (#154 ) * Working towards tensor mutability. * Use a ref-cell to provide tensor mutability.	2023-07-13 11:04:40 +01:00
Laurent Mazare	8aab787384	Test the index op + bugfix. (#148 )	2023-07-12 15:42:36 +01:00
Laurent Mazare	e2807c78a4	Enable the doctests to run with mkl (though they are broken for now). (#126 )	2023-07-10 16:27:46 +01:00
Laurent Mazare	5c3864f9f7	Add more sum tests. (#110 ) * Add some tests for the sum. * More sum testing.	2023-07-08 13:15:36 +01:00
Laurent Mazare	f1e29cd405	Allow using mkl in tests. (#90 )	2023-07-06 13:25:05 +01:00
laurent	2741b39ad3	Use broadcasted scalars for const tensors.	2023-06-29 11:56:40 +01:00
laurent	caafef6cc1	Get the cpu tests to run.	2023-06-28 14:32:02 +01:00
laurent	0417d9cec8	Add more cuda testing again.	2023-06-28 08:33:43 +01:00
laurent	395c84e80a	Also run the backprop tests on cuda.	2023-06-28 08:15:03 +01:00
laurent	b0f5f2d22d	Add some display tests + bugfixes.	2023-06-27 21:37:28 +01:00
laurent	dbe3e4e7c0	Add some test utils module.	2023-06-27 16:20:28 +01:00
laurent	07a682c2ff	Run the tensor tests for the cuda backend too.	2023-06-27 15:37:01 +01:00
Nicolas Patry	d7f729fb8f	Refactor the hierarchy.	2023-06-27 11:57:27 +02:00

46 Commits