Commit Graph

296 Commits

SHA1 Message Date
19042962d5 Whisper fix (#711)
* Remove unnecessary file.

* Whisper fix.
2023-09-01 20:04:07 +01:00
7529531056 Add the optimizer trait. (#702) 2023-09-01 12:55:39 +01:00
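
A minimal sketch of what an optimizer trait could look like, in the spirit of #702. The trait name, the `Sgd` struct, and the `step` signature over plain `f32` slices are assumptions for illustration, not the actual API added in that commit.

```rust
// Illustrative sketch only: a minimal optimizer trait over plain f32 slices.
// The names `Optimizer`, `Sgd`, and `step` are assumptions for this example,
// not necessarily the exact API introduced in #702.
trait Optimizer {
    fn step(&mut self, params: &mut [f32], grads: &[f32]);
}

struct Sgd {
    learning_rate: f32,
}

impl Optimizer for Sgd {
    fn step(&mut self, params: &mut [f32], grads: &[f32]) {
        // Plain gradient descent: p <- p - lr * g
        for (p, g) in params.iter_mut().zip(grads.iter()) {
            *p -= self.learning_rate * g;
        }
    }
}

fn main() {
    let mut params = vec![1.0f32, -2.0, 0.5];
    let grads = vec![0.1f32, -0.2, 0.05];
    let mut opt = Sgd { learning_rate: 0.01 };
    opt.step(&mut params, &grads);
    println!("{params:?}");
}
```
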
7cef35c84d Tweak some quantized args (#692)
* Print the args + change the default temp/repeat penalty.

* Minor formatting tweak.
2023-08-31 17:25:21 +01:00
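
For context on the temperature default touched by #692, here is the standard temperature-scaled softmax used before sampling. The value 0.8 below is a hypothetical placeholder, not necessarily the default chosen in that commit.

```rust
// Illustrative sketch of temperature scaling on raw logits before sampling.
// The default value 0.8 here is a hypothetical placeholder, not necessarily
// the default set in #692.
fn softmax_with_temperature(logits: &[f32], temperature: f32) -> Vec<f32> {
    let scaled: Vec<f32> = logits.iter().map(|l| l / temperature).collect();
    let max = scaled.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    let exps: Vec<f32> = scaled.iter().map(|l| (l - max).exp()).collect();
    let sum: f32 = exps.iter().sum();
    exps.iter().map(|e| e / sum).collect()
}

fn main() {
    let logits = [2.0f32, 1.0, 0.1];
    // Lower temperature sharpens the distribution, higher flattens it.
    println!("{:?}", softmax_with_temperature(&logits, 0.8));
}
```
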
7509c98970 Interactive mode for the quantized model. (#690) 2023-08-31 10:52:42 +01:00
9874d843f1 Fix the accelerate build (#678)
* Cosmetic changes.

* Fix the accelerate build for tanh.
2023-08-30 18:31:14 +02:00
7d753d3acd Mnist training dropout (#677)
* Use dropout in the mnist training.

* Fix.
2023-08-30 16:41:01 +01:00
618f4e4c78 Add some documentation. (#673)
* Add some documentation.

* Bump the crate version.
2023-08-30 11:54:00 +01:00
a1a5ab8b0a Neon optimized vecdot (#666)
* Q5k vecdot.

* Add the q3k vecdot.

* Q2k vecdot.

* Move the quantized model to its own file.
2023-08-29 22:28:46 +01:00
2d3fcad267 Simplify usage of the pool functions. (#662)
* Simplify usage of the pool functions.

* Small tweak.

* Attempt at using apply to simplify the convnet definition.
2023-08-29 19:12:16 +01:00
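
The "use apply to simplify the convnet definition" idea from #662 is easiest to see with a toy example. The `Module` trait, the `apply` helper, and the stub `Tensor` type below are assumptions standing in for the real types; the point is the left-to-right chaining instead of nested forward calls.

```rust
// Illustrative sketch of the `apply` pattern for chaining layers. The toy
// `Tensor`, `Module`, and layer types are stand-ins, not the code from #662.
struct Tensor(Vec<f32>);

trait Module {
    fn forward(&self, xs: &Tensor) -> Tensor;
}

impl Tensor {
    // `apply` lets layer calls read left-to-right instead of nesting.
    fn apply<M: Module>(&self, m: &M) -> Tensor {
        m.forward(self)
    }
}

struct Relu;
impl Module for Relu {
    fn forward(&self, xs: &Tensor) -> Tensor {
        Tensor(xs.0.iter().map(|v| v.max(0.0)).collect())
    }
}

struct Scale(f32);
impl Module for Scale {
    fn forward(&self, xs: &Tensor) -> Tensor {
        Tensor(xs.0.iter().map(|v| v * self.0).collect())
    }
}

fn main() {
    let xs = Tensor(vec![-1.0, 2.0, -3.0]);
    // Chained form: xs.apply(&relu).apply(&scale) rather than
    // scale.forward(&relu.forward(&xs)).
    let ys = xs.apply(&Relu).apply(&Scale(0.5));
    println!("{:?}", ys.0);
}
```
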
b31d41e26a Add a convnet training example. (#661)
* Add a convnet example.

* Dataset fix.

* Randomize batches.
2023-08-29 18:23:01 +01:00
a044907ffc Dilated convolutions (#657)
* Add the dilation parameter.

* Restore the basic optimizer example.

* Dilation support in cudnn.

* Use the dilation parameter in the cpu backend.

* More dilation support.

* No support for dilation in transposed convolutions.

* Add dilation to a test.

* Remove a print.

* Helper function.
2023-08-29 16:12:11 +01:00
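
The dilation parameter added in #657 follows the standard convolution arithmetic: a kernel of size k with dilation d covers d*(k-1)+1 input positions. A small helper makes the output-size formula concrete; the function name is ours, not code from the commit.

```rust
// Standard output-size formula for a 1d convolution with dilation. The
// helper name `conv_out_len` is ours, not necessarily what #657 added.
fn conv_out_len(input: usize, kernel: usize, stride: usize, padding: usize, dilation: usize) -> usize {
    // Effective kernel size grows to dilation * (kernel - 1) + 1.
    (input + 2 * padding - dilation * (kernel - 1) - 1) / stride + 1
}

fn main() {
    // A 3-wide kernel with dilation 2 covers 5 input positions.
    assert_eq!(conv_out_len(10, 3, 1, 0, 1), 8);
    assert_eq!(conv_out_len(10, 3, 1, 0, 2), 6);
    println!("ok");
}
```
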
1aca6fa291 Upgrading hf-hub. 2023-08-29 14:18:54 +02:00
14b4d456e8 Merge pull request #439 from huggingface/training_hub_dataset
[Book] Add small error management + start training (with generic dataset inclusion).
2023-08-29 13:10:05 +02:00
62ef494dc1 Use multiple transformer layers in the same cross-attn blocks. (#653)
* Use multiple transformer layers in the same cross-attn blocks.

* Make the context contiguous if required.
2023-08-29 11:13:43 +01:00
33c23c19b6 Preliminary support for SDXL. (#647)
* Preliminary support for SDXL.

* More SDXL support.

* More SDXL.

* Use the proper clip config.

* Querying for existing tensors.

* More robust test.
2023-08-29 09:00:04 +01:00
d726484a6d Re-enable local dir for mnist. 2023-08-28 15:15:27 +02:00
d7a273be51 Training:
- Removed a lot of API surface (SerializedFileReader ownership is really
  painful).
- Moved example + vision to hf.co version.
- Removed feature gate.
2023-08-28 15:15:01 +02:00
26e1b40992 Repeat-penalty in the falcon example. (#634) 2023-08-28 08:13:40 +01:00
72ebb12bca Remove some dead-code annotations. (#629)
* Remove some dead-code annotations.

* More dead code removal.

* One more.

* CI fix.
2023-08-27 18:52:33 +01:00
4c338b0cd9 VarBuilder cleanup (#627)
* VarBuilder cleanup.

* Implement the basic varbuilders.

* Add the sharded code.

* Proper support for tensor sharding.
2023-08-27 18:03:26 +01:00
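
The sharding work in #627 boils down to giving each rank a contiguous slice of one tensor dimension. A hedged sketch of that bookkeeping, with names of our own choosing rather than the VarBuilder code itself:

```rust
// Illustrative sketch of tensor sharding: splitting a dimension of size
// `dim_size` evenly across `world_size` ranks. The helper name is ours and
// not necessarily how the VarBuilder sharding in #627 is structured.
fn shard_range(dim_size: usize, rank: usize, world_size: usize) -> (usize, usize) {
    assert!(rank < world_size && dim_size % world_size == 0);
    let shard = dim_size / world_size;
    let start = rank * shard;
    (start, start + shard)
}

fn main() {
    // A 4096-wide dimension split across 4 ranks: rank 1 owns [1024, 2048).
    assert_eq!(shard_range(4096, 1, 4), (1024, 2048));
    println!("ok");
}
```
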
6e485f2deb Add some optional repeat penalty. (#623)
* Add some optional repeat penalty.

* Add the missing files.
2023-08-27 10:48:45 +01:00
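
The repeat penalty added in #623 is usually the CTRL-style rule: dampen the logits of tokens already present in the context. The sketch below shows that common formulation; the exact behaviour in the commit may differ.

```rust
// Common CTRL-style repeat penalty: dampen logits of tokens already seen in
// the context. This is the usual formulation; the exact rule in #623 may
// differ slightly.
fn apply_repeat_penalty(logits: &mut [f32], penalty: f32, context: &[u32]) {
    for &token in context {
        if let Some(logit) = logits.get_mut(token as usize) {
            // Dividing a positive logit (or multiplying a negative one) by the
            // penalty makes the token less likely to be sampled again.
            *logit = if *logit >= 0.0 { *logit / penalty } else { *logit * penalty };
        }
    }
}

fn main() {
    let mut logits = vec![2.0f32, -1.0, 0.5];
    apply_repeat_penalty(&mut logits, 1.3, &[0, 1]);
    println!("{logits:?}"); // tokens 0 and 1 are penalized.
}
```
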
aa67e5107d Merge pull request #600 from huggingface/codellama_gpu_support
Adding support for codellama in examples.
2023-08-25 18:25:26 +02:00
c105550405 s/panic/bail/ 2023-08-25 18:05:07 +02:00
ca6c050b04 Cleanup the pose reporting code. (#605) 2023-08-25 16:49:21 +01:00
0afbc435df Add some configurable legend for yolo detection. (#603)
* Add some configurable legend for yolo detection.

* Clippyness.
2023-08-25 13:50:31 +01:00
97909e5068 Move the yolo model bits in a separate file. (#602)
* Move the yolo model bits in a separate file.

* Improve the drawing.

* Bugfix.
2023-08-25 12:47:55 +01:00
8bc5fffa45 More support for pose estimation in yolo-v8. (#599)
* More support for pose estimation in yolo-v8.

* Support both object detection and pose-estimation in the yolo-v8 example.
2023-08-25 11:21:11 +01:00
4826a4212e Adding support for codellama in examples.
Codellama requires bf16 for now (converting from bf16 to f16 errors out).
The multiprocess demo is not functional for it because flash-attn only
supports f16 for now.
2023-08-25 09:56:11 +00:00
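
The dtype constraint described in that commit message can be summarised as a small selection rule. The enum and function below are ours, written only to illustrate the stated reasoning, not the example's actual code.

```rust
// Illustrative dtype selection implied by the commit message: codellama
// weights stay in bf16, other llama variants use f16, and flash-attn (f16
// only for now) is incompatible with the bf16 path. Not the actual code.
#[derive(Debug, Clone, Copy)]
enum DType {
    F16,
    BF16,
}

fn pick_dtype(is_codellama: bool, use_flash_attn: bool) -> Result<DType, String> {
    match (is_codellama, use_flash_attn) {
        // Converting bf16 weights to f16 errors out, so codellama stays in bf16.
        (true, false) => Ok(DType::BF16),
        // flash-attn only supports f16 for now, so the two are incompatible.
        (true, true) => Err("flash-attn requires f16 but codellama needs bf16".into()),
        (false, _) => Ok(DType::F16),
    }
}

fn main() {
    println!("{:?}", pick_dtype(true, false));
    println!("{:?}", pick_dtype(true, true));
}
```
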
c093b03d51 Generic implementation of vecdot for q80. (#596)
* Generic implementation of vecdot for q80.

* Add support for code-llama 7b.

* Support more code-llama.
2023-08-25 09:04:05 +01:00
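
For the generic q8_0 vecdot in #596: a Q8_0 block holds 32 int8 values plus one scale (stored as f16 in the real format), and the scalar dot product sums, per block pair, the product of the two scales times the integer dot product of the quants. The struct below is a simplified stand-in, not the crate's block type.

```rust
// Simplified stand-in for the Q8_0 block: 32 int8 values plus one f32 scale
// (the real format stores the scale as f16). A generic (non-SIMD) vecdot
// sums scale_x * scale_y * (integer dot product of the quants) per block.
const QK8_0: usize = 32;

struct BlockQ8_0 {
    d: f32,          // scale
    qs: [i8; QK8_0], // quantized values
}

fn vec_dot_q8_0(xs: &[BlockQ8_0], ys: &[BlockQ8_0]) -> f32 {
    xs.iter()
        .zip(ys.iter())
        .map(|(x, y)| {
            let int_dot: i32 = x
                .qs
                .iter()
                .zip(y.qs.iter())
                .map(|(&a, &b)| a as i32 * b as i32)
                .sum();
            x.d * y.d * int_dot as f32
        })
        .sum()
}

fn main() {
    let x = BlockQ8_0 { d: 0.5, qs: [1; QK8_0] };
    let y = BlockQ8_0 { d: 2.0, qs: [2; QK8_0] };
    // 0.5 * 2.0 * (32 * 1 * 2) = 64
    println!("{}", vec_dot_q8_0(&[x], &[y]));
}
```
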
189442a0fa Add the pose estimation head for yolo. (#589)
* Add the pose estimation head for yolo.

* Properly handle the added position dimensions.

* Integrate the pose estimation head in the forward pass.

* Renaming.

* Fix for pose estimation.
2023-08-24 22:12:34 +01:00
79916c2edb Use the hub weights for efficientnet. (#573) 2023-08-23 18:20:21 +01:00
431051cc32 Add Efficientnet (#572)
* EfficientNet.

* Complete the efficientnet implementation.

* Improve group handling.

* Get the efficientnet to work.
2023-08-23 18:02:58 +01:00
eedd85ffa7 Move the imagenet specific bits to a separate file. (#571) 2023-08-23 16:42:09 +01:00
329f661d9b Trace softmax (#568)
* Trace the softmax op.

* Inline the sum.

* Add min/max vec operations.
2023-08-23 15:25:50 +01:00
aba1e90797 Add some group parameter to convolutions. (#566)
* Add some group parameter to convolutions.

* Avoid some unnecessary groups checks.

* Move the tensor convolution bits.

* Proper handling of groups.

* Bump the crate version.

* And add a changelog.
2023-08-23 12:58:55 +01:00
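
The groups parameter from #566 splits a convolution into `groups` independent convolutions over channel slices, so both channel counts must be divisible by it. The checker below is illustrative, not the validation added in the commit.

```rust
// Grouped convolution: input and output channels are split into `groups`
// independent convolutions. This checker is illustrative only.
fn grouped_conv_shapes(c_in: usize, c_out: usize, groups: usize) -> Result<(usize, usize), String> {
    if c_in % groups != 0 || c_out % groups != 0 {
        return Err(format!(
            "both c_in ({c_in}) and c_out ({c_out}) must be divisible by groups ({groups})"
        ));
    }
    // Each group convolves c_in / groups input channels into c_out / groups
    // output channels.
    Ok((c_in / groups, c_out / groups))
}

fn main() {
    // Depthwise convolution is the groups == c_in special case.
    println!("{:?}", grouped_conv_shapes(32, 32, 32));
    println!("{:?}", grouped_conv_shapes(30, 32, 4));
}
```
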
4ee1cf038a Get the rms epsilon from GGUF. (#565) 2023-08-23 11:40:20 +01:00
0f4ff8a739 Fix the quantized example. (#564) 2023-08-23 11:09:55 +01:00
89a00b56cc add chat models in quantized example (#551)
* add chat models in quantized example

* cargo fmt
2023-08-23 11:05:33 +01:00
508d34daf2 GGUF support in the quantized model. (#559)
* GGUF support in the quantized model.

* Get the GGUF support to work on llama.
2023-08-23 09:20:57 +01:00
f9ecc84477 GQA support in the quantized model. (#555)
* GQA support in the quantized model.

* Fix the reshaping.

* Fix the main llama model.

* Infer the proper gqa from the model kind.
2023-08-22 19:41:10 +01:00
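
Grouped-query attention, as supported by #555, shares each key/value head across several query heads. The mapping below is the standard one; per the commit, the number of kv heads is inferred from the model kind rather than passed in.

```rust
// Grouped-query attention (GQA): n_head query heads share n_kv_head
// key/value heads, so each kv head serves n_head / n_kv_head query heads.
fn kv_head_for_query_head(query_head: usize, n_head: usize, n_kv_head: usize) -> usize {
    assert!(n_head % n_kv_head == 0);
    let group_size = n_head / n_kv_head;
    query_head / group_size
}

fn main() {
    // llama-2 70b style setup: 64 query heads over 8 kv heads.
    assert_eq!(kv_head_for_query_head(0, 64, 8), 0);
    assert_eq!(kv_head_for_query_head(63, 64, 8), 7);
    println!("ok");
}
```
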
cc22d4db20 Put the transcribe token before the language one. (#553) 2023-08-22 16:46:34 +01:00
9bc811a247 Improve the aspect ratio handling on yolo-v8. (#549)
* Fix the aspect ratio handling in yolo-v8.

* Typo.
2023-08-22 14:55:33 +01:00
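
Aspect ratio handling in a detector like yolo-v8 usually means scaling by the limiting dimension so the frame fits the fixed input size without distortion. A letterbox-style helper in that spirit; it is illustrative, not the exact fix from #549.

```rust
// Letterbox-style resize used to keep the aspect ratio when feeding images
// to a fixed-size detector input. Illustrative only.
fn fit_within(width: usize, height: usize, target: usize) -> (usize, usize) {
    // Scale by the larger dimension so the image fits inside target x target.
    let scale = target as f64 / width.max(height) as f64;
    let w = (width as f64 * scale).round() as usize;
    let h = (height as f64 * scale).round() as usize;
    (w, h)
}

fn main() {
    // A 1280x720 frame fed to a 640x640 input becomes 640x360, not 640x640.
    assert_eq!(fit_within(1280, 720, 640), (640, 360));
    println!("ok");
}
```
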
bb69d89e28 Move the yolo shared bits to a common place. (#548)
* Move the yolo shared bits to a common place.

* Share more code.

* Configurable thresholds.
2023-08-22 13:03:07 +01:00
20ce3e9f39 Sketch the yolo wasm example. (#546)
* Sketch the yolo wasm example.

* Web ui.

* Get the web ui to work.

* UI tweaks.

* More UI tweaks.

* Use the natural width/height.

* Add a link to the hf space in the readme.
2023-08-22 11:56:43 +01:00
44420d8ae1 Add some llama-v2 variants. (#545) 2023-08-22 08:35:15 +01:00
f16bb97401 Use the yolo-v8 weights from the hub. (#544)
* Use the weights from the hub.

* Add to the readme.
2023-08-21 22:07:36 +01:00
3507e14c0c Yolo v8 fixes (#542)
* Fixes for the yolo-v8 layout.

* Bugfixes.

* Another silly bugfix.

* Remove the hf-hub dependency.

* Remove the transformers dependency.
2023-08-21 21:05:40 +01:00
de50e66af1 Add yolo v8 as an example (#541)
* Sketching yolo-v8.

* Get the model to load.

* yolo-v8 forward pass.

* Complete(?) the forward pass.

* Fix some shape issues.

* Add the missing padding.

* Process the predictions.
2023-08-21 18:40:09 +01:00
cc2d6cf2e0 Improve the timestamps support in whisper (#539)
* Timestamp support for whisper.

* Properly display the timestamps.

* Bugfix for the timestamp units.
2023-08-21 12:26:59 +01:00
e3b71851e6 Retrieve the yolo-v3 weights from the hub. (#537) 2023-08-21 10:55:09 +01:00