candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-16 10:38:54 +00:00

Author	SHA1	Message	Date
Patrick von Platen	1f58bdbb1d	Apply suggestions from code review	2023-08-23 13:33:45 +02:00
Patrick von Platen	c98d3cfd8b	Update candle-book/src/guide/installation.md	2023-08-23 13:31:54 +02:00
Patrick von Platen	c5e43ad0ab	Apply suggestions from code review	2023-08-23 13:27:29 +02:00
Patrick von Platen	2c280007e8	Apply suggestions from code review	2023-08-23 13:26:21 +02:00
Patrick von Platen	649202024c	fix code snippets	2023-08-23 09:05:07 +00:00
Patrick von Platen	283f6c048d	fix code snippets	2023-08-23 09:04:36 +00:00
Patrick von Platen	c8211fc474	fix code snippets	2023-08-23 09:04:08 +00:00
Patrick von Platen	7732bf6238	correct	2023-08-23 08:54:48 +00:00
Patrick von Platen	7c0ca80d3a	move installation to book	2023-08-23 08:52:53 +00:00
Patrick von Platen	b558d08b85	improve	2023-08-23 08:42:47 +00:00
Patrick von Platen	34cb9f924f	improve	2023-08-23 08:40:23 +00:00
Patrick von Platen	d4968295a0	improve	2023-08-23 08:37:08 +00:00
Patrick von Platen	65e146c72d	Add installation section	2023-08-23 08:32:59 +00:00
Laurent Mazare	f9ecc84477	GQA support in the quantized model. (#555 ) * GQA support in the quantized model. * Fix the reshaping. * Fix the main llama model. * Infer the proper gqa from the model kind.	2023-08-22 19:41:10 +01:00
Laurent Mazare	07067b01dc	Avoid some mutable variables (take 2). (#554 ) * Avoid some mutable variables (take 2). * Fix.	2023-08-22 18:51:20 +01:00
Laurent Mazare	cc22d4db20	Put the transcribe token before the language one. (#553 )	2023-08-22 16:46:34 +01:00
Laurent Mazare	ec665acad7	Revert "Avoid some mut in quantized functions. (#550 )" (#552 ) This reverts commit `cf27b9b636`.	2023-08-22 15:57:46 +01:00
Laurent Mazare	cf27b9b636	Avoid some mut in quantized functions. (#550 ) * Avoid a couple more 'let mut'. * Tweaks.	2023-08-22 15:44:26 +01:00
Lukas Kreussel	352383cbc3	Add quantization support for `q2k`, `q3k`, `q4k` and `q5k` (#524 ) * first q2 implementation * First Q4K and Q5K implementations * fix `q2k` and `q5k` * Some first cleanups * run `clippy` on tests * finally implement `q3k` * deactivate `q3k` test on macos * also disable the test on linux * Fix floating bits in `q3k` dequantization * Refactoring pass + reorder quants in file * `fmt` * Re-add `src` asserts and redefine `dst`	2023-08-22 15:04:55 +01:00
Laurent Mazare	9bc811a247	Improve the aspect ratio handling on yolo-v8. (#549 ) * Fix the aspect ratio handling in yolo-v8. * Typo.	2023-08-22 14:55:33 +01:00
Laurent Mazare	bb69d89e28	Move the yolo shared bits to a common place. (#548 ) * Move the yolo shared bits to a common place. * Share more code. * Configurable thresholds.	2023-08-22 13:03:07 +01:00
Laurent Mazare	20ce3e9f39	Sketch the yolo wasm example. (#546 ) * Sketch the yolo wasm example. * Web ui. * Get the web ui to work. * UI tweaks. * More UI tweaks. * Use the natural width/height. * Add a link to the hf space in the readme.	2023-08-22 11:56:43 +01:00
Laurent Mazare	44420d8ae1	Add some llama-v2 variants. (#545 )	2023-08-22 08:35:15 +01:00
Laurent Mazare	f16bb97401	Use the yolo-v8 weights from the hub. (#544 ) * Use the weights from the hub. * Add to the readme.	2023-08-21 22:07:36 +01:00
Laurent Mazare	3507e14c0c	Yolo v8 fixes (#542 ) * Fixes for the yolo-v8 layout. * Bugfixes. * Another silly bugfix. * Remove the hf-hub dependency. * Remove the transformers dependency.	2023-08-21 21:05:40 +01:00
Laurent Mazare	de50e66af1	Add yolo v8 as an example (#541 ) * Sketching yolo-v8. * Get the model to load. * yolo-v8 forward pass. * Complete(?) the forward pass. * Fix some shape issues. * Add the missing padding. * Process the predictions.	2023-08-21 18:40:09 +01:00
Laurent Mazare	cc2d6cf2e0	Improve the timestamps support in whisper (#539 ) * Timestamp support for whisper. * Properly display the timestamps. * Bugfix for the timestamp units.	2023-08-21 12:26:59 +01:00
Laurent Mazare	e3b71851e6	Retrieve the yolo-v3 weights from the hub. (#537 )	2023-08-21 10:55:09 +01:00
Laurent Mazare	4300864ce9	Add some optional repeat penalty. (#535 )	2023-08-21 09:59:13 +01:00
Laurent Mazare	d70cffdab6	Fix the minimum/maximum gradient computations. (#534 )	2023-08-21 08:28:41 +01:00
Laurent Mazare	912561614f	Better handling of zero temperatures. (#532 )	2023-08-21 07:51:46 +01:00
Laurent Mazare	8c232d706b	Small tweaks to the pickle handling to be able to use libtorch files. (#530 ) * Small tweaks to the pickle handling to be able to use libtorch files. * Move the pytorch specific bits in a different function.	2023-08-20 23:25:34 +01:00
Laurent Mazare	11c7e7bd67	Some fixes for yolo-v3. (#529 ) * Some fixes for yolo-v3. * Use the running stats for inference in the batch-norm layer. * Get some proper predictions for yolo. * Avoid the quadratic insertion.	2023-08-20 23:19:15 +01:00
Laurent Mazare	a1812f934f	Add a yolo-v3 example. (#528 ) * Add a couple functions required for yolo. * Add the yolo-v3 example. * Add minimum and maximum. * Use the newly introduced maximum. * Cuda support for min/max + add some testing. * Allow for more tests to work with accelerate. * Fix a typo.	2023-08-20 18:19:37 +01:00
Laurent Mazare	e3d2786ffb	Add a couple functions required for yolo. (#527 )	2023-08-20 17:02:05 +01:00
Laurent Mazare	372f8912c5	Minor readme tweaks. (#526 )	2023-08-20 14:33:21 +01:00
Laurent Mazare	d2622a8160	Move the VarMap to a separate file (#525 ) * Move the var-map struct in a separate file. * Fix some typos.	2023-08-20 14:25:07 +01:00
Laurent Mazare	2fcb386f17	Add a broadcast variant to matmul. (#523 ) * Add a broadcast variant to matmul. * Get the test to pass.	2023-08-20 13:20:42 +01:00
Laurent Mazare	a8f61e66cc	Bump the crates version to 0.1.2. (#522 )	2023-08-20 08:07:07 +01:00
Laurent Mazare	aa207f2dd9	Print some per-step timings in stable-diffusion. (#520 ) * Skeleton files for neon support of quantization. * SIMD version for q4 vecdot. * Also simdify the q6k multiplication. * Add some timings to stable-diffusion.	2023-08-20 05:45:12 +01:00
Laurent Mazare	82410995a2	Neon support for quantization. (#519 ) * Skeleton files for neon support of quantization. * SIMD version for q4 vecdot. * Also simdify the q6k multiplication.	2023-08-19 22:07:29 +01:00
Laurent Mazare	d73ca3d28e	Line up the llama.cpp implementation with the candle one. (#518 ) * Separate the prompt stats from the post-prompt ones in the quantized example. * Slightly nicer output printing. * Line up with the llama.cpp implementation.	2023-08-19 20:12:07 +01:00
Laurent Mazare	551409092e	Small tweaks to tensor-tools. (#517 )	2023-08-19 16:50:26 +01:00
Laurent Mazare	6431140250	Retrieve tensor data from PyTorch files. (#516 )	2023-08-19 15:57:18 +01:00
Laurent Mazare	607ffb9f1e	Retrieve more information from PyTorch checkpoints. (#515 ) * Retrieve more information from PyTorch checkpoints. * Add enough support to load dino-v2 backbone weights.	2023-08-19 15:05:34 +01:00
Laurent Mazare	f861a9df6e	Add ggml support to tensor-tools (#512 ) * Pickle work-in-progress. * More unpickling. * More pickling. * Proper handling of setitems. * Clippy. * Again more pickling. * Restore the example. * Add enough pickle support to get the list of tensors. * Read the data from zip files. * Retrieve the tensor shape. * Extract the size and dtype. * More storage types. * Improve the destructuring. * Also support ggml files.	2023-08-19 11:45:22 +01:00
Laurent Mazare	ad33715c61	Preliminary support for importing PyTorch weights. (#511 ) * Pickle work-in-progress. * More unpickling. * More pickling. * Proper handling of setitems. * Clippy. * Again more pickling. * Restore the example. * Add enough pickle support to get the list of tensors. * Read the data from zip files. * Retrieve the tensor shape. * Extract the size and dtype. * More storage types. * Improve the destructuring.	2023-08-19 11:26:32 +01:00
Laurent Mazare	90ff04e77e	Add the tensor-tools binary. (#510 )	2023-08-19 09:06:44 +01:00
Laurent Mazare	42e1cc8062	Add a batch normalization layer (#508 ) * Add BatchNormalization. * More batch-norm. * Add some validation of the inputs. * More validation.	2023-08-18 20:05:56 +01:00
Laurent Mazare	b64e782c2d	Use the hub to retrieve dinov2 model weights. (#507 )	2023-08-18 18:27:31 +01:00

1 2 3 4 5 ...

943 Commits