candle/examples at f57e3164ae4ab94c68f2496b0800947f3c5f5f9f - candle - Gitea: Git with a cup of tea

huggingface/candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-19 03:54:56 +00:00

Files

History

Nicolas Patry 7161002a34 Finished scaffolding, lots of TODOs

- Most kernels just copy themselfs to get the shapes correct
- Matmul works only in 1 case and simply empty allocates otherwise
- Logits and randomized to make the demo finish itself.

Performance is quite bad (30ms/token), but lot's of prints and allocs and some actual sending to metal.

Couln't get it super high by removing the obvious blockers (println + the actual running matmuls).

Allocations takes between 1us and 100us and seems very stable, Maybe metal doesn't really have a smart allocator and we'll need to own it.

2023-11-02 15:32:28 +01:00

..

Handle LongStorage in pytorch checkpoints. (#1152 )

2023-10-22 18:34:36 +01:00

Adapt more examples to the updated safetensor api. (#947 )

2023-09-23 21:26:03 +01:00

Add a KV cache to marian decoding. (#1226 )

2023-10-31 08:47:44 +00:00

Convmixer example (#1074 )

2023-10-11 19:51:10 +01:00

Remove some dead-code annotations. (#629 )

2023-08-27 18:52:33 +01:00

Adapt more examples to the updated safetensor api. (#947 )

2023-09-23 21:26:03 +01:00

Adapt more examples to the updated safetensor api. (#947 )

2023-09-23 21:26:03 +01:00

Adapt more examples to the updated safetensor api. (#947 )

2023-09-23 21:26:03 +01:00

Use the hub model file when possible. (#1190 )

2023-10-26 20:00:50 +01:00

Adapt more examples to the updated safetensor api. (#947 )

2023-09-23 21:26:03 +01:00

Infer the config for llama2-c. (#1208 )

2023-10-28 19:00:39 +01:00

llama_multiprocess

Self-contained safetensors for the multiprocess llama example. (#950 )

2023-09-24 06:54:49 +01:00

Add a KV cache to marian decoding. (#1226 )

2023-10-31 08:47:44 +00:00

Quantized version of mistral. (#1009 )

2023-09-30 18:25:47 +01:00

Allow for different behavior between training and eval (#1213 )

2023-10-29 07:53:09 +01:00

Use candle_nn::LSTM in encodec. (#1051 )

2023-10-07 19:43:06 +01:00

Add support for the phi-hermes finetuned model. (#1192 )

2023-10-27 05:57:08 +01:00

Finished scaffolding, lots of TODOs

2023-11-02 15:32:28 +01:00

Do not use the kv-cache on external key-value states. (#1054 )

2023-10-07 22:37:19 +01:00

reinforcement-learning

Add DDPG and fix Gym wrapper (#1207 )

2023-10-28 19:53:34 +01:00

Add the quantized mpt model. (#1123 )

2023-10-18 16:29:38 +01:00

Expose the larger resnets (50/101/152) in the example. (#1131 )

2023-10-19 13:48:28 +01:00

segment-anything

Add negative prompts to segment-anything. (#1000 )

2023-09-30 06:17:42 +01:00

stable-diffusion

Mention the flash-attention restriction in the readme. (#1158 )

2023-10-23 10:26:56 +01:00

Quantized version of StableLM. (#1058 )

2023-10-08 15:42:38 +01:00

Add some reinforcement learning example. (#1090 )

2023-10-14 16:46:43 +01:00

Allow for different behavior between training and eval (#1213 )

2023-10-29 07:53:09 +01:00

Readme updates. (#1134 )

2023-10-20 09:08:39 +01:00

Simd128 optimized q8k vecdot. (#1026 )

2023-10-03 15:29:48 +01:00

Remove some unusued bits. (#1067 )

2023-10-09 19:49:57 +01:00

Make func cloneable. (#1137 )

2023-10-20 16:28:50 +01:00

Add fuse-conv-bn method for Conv2d (#1196 )

2023-10-27 15:56:50 +01:00