candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-16 10:38:54 +00:00

Author	SHA1	Message	Date
Nicolas Patry	14b4d456e8	Merge pull request #439 from huggingface/training_hub_dataset [Book] Add small error management + start training (with generic dataset inclusion).	2023-08-29 13:10:05 +02:00
Nicolas Patry	2d5b7a735d	Update the book with new layout of datasets.	2023-08-29 12:51:59 +02:00
Laurent Mazare	62ef494dc1	Use multiple transformer layer in the same cross-attn blocks. (#653) * Use multiple transformer layer in the same cross-attn blocks. * Make the context contiguous if required.	2023-08-29 11:13:43 +01:00
Laurent Mazare	d0a330448d	Backprop support for pooling ops. (#652) * Backprop support for pooling ops. * max-pool gradient.	2023-08-29 10:17:59 +01:00
Laurent Mazare	4b8d57ba15	AVX version of the q4k vecdot. (#651)	2023-08-29 09:41:17 +01:00
Nicolas Patry	d5a525f7a7	Fix clippy + save_image.	2023-08-29 10:19:44 +02:00
Laurent Mazare	33c23c19b6	Preliminary support for SDXL. (#647) * Preliminary support for SDXL. * More SDXL support. * More SDXL. * Use the proper clip config. * Querying for existing tensors. * More robust test.	2023-08-29 09:00:04 +01:00
Lei	49326fb925	Update .gitignore (#649)	2023-08-29 08:41:33 +01:00
Laurent Mazare	fd3131a4ce	Fix the debug implementation. (#648)	2023-08-28 22:51:39 +01:00
Laurent Mazare	037b41c9dc	Cuda conv transpose (#645) * Cuda kernel for conv-transpose. * Fix the cuda kernel. * Fix the tests.	2023-08-28 20:58:49 +01:00
Laurent Mazare	72fae3140c	Optimize the conv2d transpose cpu kernel. (#644) * Optimize the conv2d transpose cpu kernel. * Use multiple cores.	2023-08-28 20:06:31 +01:00
Laurent Mazare	ca26198b95	Fix the cpu kernel for conv-transpose. (#643)	2023-08-28 16:45:12 +01:00
Laurent Mazare	b292047882	Backprop for conv2d. (#638) * Start adding backprop for conv2d. * Backprop for conv2d. * Bugfix + start adding a conv2d test. * Conv2d backprop testing. * More conv fixes.	2023-08-28 16:08:55 +01:00
Nicolas Patry	09c5bd1881	Rebased	2023-08-28 15:47:03 +02:00
Nicolas Patry	fe6c88713d	Fix waiting upgrade for SSL ?	2023-08-28 15:15:27 +02:00
Nicolas Patry	6f3f9285e6	Remove image dep.	2023-08-28 15:15:27 +02:00
Nicolas Patry	baca3cf69d	Fix deps.	2023-08-28 15:15:27 +02:00
Nicolas Patry	d726484a6d	Re-enable local dir for mnist.	2023-08-28 15:15:27 +02:00
Nicolas Patry	dd06d93d0b	Cleanup: - Moved around book from `examples` to `candle-book` proper (overlapping the book and the lib structures)	2023-08-28 15:15:26 +02:00
Nicolas Patry	c109c93db7	Update candle-book/src/SUMMARY.md	2023-08-28 15:15:02 +02:00
Nicolas Patry	d7a273be51	Training: - Removed a lot of surface (SerializedFileReader ownership is really painful). - Moved example + vision to hf.co version. - Removed feature gate.	2023-08-28 15:15:01 +02:00
Nicolas Patry	dd02f589c0	Better training+hub	2023-08-28 15:14:43 +02:00
Nicolas Patry	7602323667	[Book] Add small error management + start training (with generic dataset inclusion).	2023-08-28 15:14:17 +02:00
Laurent Mazare	9137c63175	Update README.md (#640)	2023-08-28 11:34:54 +01:00
Laurent Mazare	3cca89cc70	Add conv-transpose. (#635) * Add conv-transpose. * Return zeros for now. * Naive CPU implementation. * Add a conv-transpose test + fix the cpu implementation. * Add a second test.	2023-08-28 10:10:12 +01:00
Laurent Mazare	26e1b40992	Repeat-penalty in the falcon example. (#634)	2023-08-28 08:13:40 +01:00
Laurent Mazare	1da71a5da1	Neon optimized version of the q4k vecdot product. (#632)	2023-08-27 21:30:47 +01:00
Laurent Mazare	24dda44c27	Add wasm support for yolo-v8 pose detection. (#630) * Add wasm support for yolo-v8 pose detection. * Better bbox handling. * Add the pose model in the wasm example lib.	2023-08-27 19:49:24 +01:00
Laurent Mazare	72ebb12bca	Remove some dead-code annotations. (#629) * Remove some dead-code annotations. * More dead code removal. * One more. * CI fix.	2023-08-27 18:52:33 +01:00
Laurent Mazare	a3f97c143d	Bump the crate version + update CHANGELOG. (#628)	2023-08-27 18:17:11 +01:00
Laurent Mazare	4c338b0cd9	VarBuilder cleanup (#627) * VarBuilder cleanup. * Implement the basic varbuilders. * Add the sharded code. * Proper support for tensor sharding.	2023-08-27 18:03:26 +01:00
Laurent Mazare	be471d50ab	Llama quantization. (#625)	2023-08-27 14:08:15 +01:00
Laurent Mazare	7151f2cf63	Add the quantize command. (#624) * Add the quantize command. * Bugfix for writing gguf files. * And add a comment.	2023-08-27 11:35:19 +01:00
Laurent Mazare	6e485f2deb	Add some optional repeat penalty. (#623) * Add some optional repeat penalty. * Add the missing files.	2023-08-27 10:48:45 +01:00
Laurent Mazare	5320aa6b7d	Move the test-utils bits to a shared place. (#619)	2023-08-27 09:42:22 +01:00
Laurent Mazare	a8b39dd7b7	Fix for q5_1 quantization. (#617) * Fix for q5_1 quantization. * Fix some typos.	2023-08-27 08:31:18 +01:00
Laurent Mazare	fa0d75b18d	Quantization tests + fix some issues. (#616)	2023-08-27 08:17:38 +01:00
Laurent Mazare	28658054ff	More missing quantized bits. (#615) * Q4_1 support. * Add Q5_1 quantization. * Tweak.	2023-08-27 07:52:26 +01:00
Laurent Mazare	ab36a7f3e3	Fix for when f16c is not available. (#614)	2023-08-27 07:19:52 +01:00
Laurent Mazare	f704e39761	Missing quants ops (#611) * Another transmute tweak. * Changelog tweak. * Add some missing quantized ops.	2023-08-26 20:09:04 +01:00
Laurent Mazare	fdf15f0e05	Another transmute tweak. (#610) * Another transmute tweak. * Changelog tweak.	2023-08-26 13:00:24 +01:00
Laurent Mazare	06b37ea7ad	Avoid using tmp values. (#609)	2023-08-26 12:28:28 +01:00
Lukas Kreussel	c72eb3d75b	Add reference implementation for `q4k` and `q5k` (#586) * add `q2k` vec-dot * `q3k` vec-dot + quantization bugfix * `q4k` vec-dot * `q5k` vec-dot * Validate against GGML unit test results. * Remove some more `transmutes`	2023-08-26 12:07:54 +01:00
Radamés Ajna	864227edbf	[WIP] Improve Yolo WASM UI example (#591) * return detections with classes names * ignore .DS_Store * example how to load wasm module * add param to set model size * add param for model size * accept iou and confidence threshold on run * conf and iou thresholds * clamp only * remove images from branch * a couple of renamings, add readme with instructions * final design * minor font + border update	2023-08-26 11:40:41 +01:00
Nicolas Patry	b23b347b35	Merge pull request #601 from huggingface/repair_bf16_f16_cast Repairing cast bf16/f16	2023-08-26 12:34:41 +02:00
Patrick von Platen	71518caeee	Align tensor device print more with PyTorch (#590) * Improve tensor print * Use CudaDevice only if enabled with cuda feature * run rust fmt * up * improve * rustfmt	2023-08-26 11:20:22 +01:00
Laurent Mazare	6559eae72c	Avoid some transmutes. (#607)	2023-08-25 18:21:37 +01:00
Laurent Mazare	46eb225ba5	Add some missing entries to the changelog. (#606)	2023-08-25 18:01:38 +01:00
Nicolas Patry	aa67e5107d	Merge pull request #600 from huggingface/codellama_gpu_support Adding support for codellama in examples.	2023-08-25 18:25:26 +02:00
Nicolas Patry	c105550405	s/panic/bail/	2023-08-25 18:05:07 +02:00


1333 Commits