ee3d290f8b  Cuda support for dtype conversions.                2023-06-27 09:15:46 +01:00
59a59f41a6  Add the cuda mode to llama.                        2023-06-26 10:06:44 +01:00
d867155ef2  Load the weights for llama.                        2023-06-26 07:23:59 +01:00
7a3101f15f  Llama bugfix.                                      2023-06-26 07:07:56 +01:00
97424289d1  Fix the llama causal mask inversion.               2023-06-25 21:16:54 +01:00
117f014b55  Add where_cond and properly apply the causal mask. 2023-06-25 21:08:03 +01:00
25bcad290e  Fix the causal mask computation.                   2023-06-25 20:19:30 +01:00
8e404eb125  Get some first inference to work on llama.         2023-06-25 18:26:15 +01:00
87c5aab005  More llama fixes.                                  2023-06-25 18:08:41 +01:00
60a5598c8b  Fix some shape errors.                             2023-06-25 17:56:59 +01:00
817e4b5005  Rework the embeddings so that they work on non-contiguous weights + factor out some code.  2023-06-25 17:37:47 +01:00
334524e2c4  Take as input slices of tensors as well as slices of &Tensors.  2023-06-25 17:07:09 +01:00
90c140ff4b  Start sketching the llama example.                 2023-06-25 13:51:20 +01:00