Logo
Explore Help
Register Sign In
huggingface/candle
1
0
Fork 0
You've already forked candle
mirror of https://github.com/huggingface/candle.git synced 2025-06-16 10:38:54 +00:00
Code Issues Packages Projects Releases Wiki Activity
Files
62ced44ea94da7062430ed6c21ff17b36f41737d
candle/candle-kernels/src
History
Laurent Mazare 1a0f9ccf16 Import the ggml_cuda_dp4a function. (#2628)
2024-11-19 03:41:34 +01:00
..
affine.cu
Cuda backend optimization (#1886)
2024-03-20 18:32:55 +01:00
binary_op_macros.cuh
Cuda backend optimization (#1886)
2024-03-20 18:32:55 +01:00
binary.cu
Add support for i64 (#563)
2023-08-23 10:42:19 +01:00
cast.cu
Add cast_bf16_x/cast_x_bf16 when CUDA_ARCH<800 but CUDA_VERSION >= 11000 (#1919)
2024-03-23 13:44:10 +01:00
compatibility.cuh
Compat windows.
2023-08-10 17:46:47 +02:00
conv.cu
More efficient cuda implementation for ConvTranspose1d. (#2211)
2024-05-24 11:05:43 +02:00
cuda_utils.cuh
Relax the contiguous check for cuda kernels. (#2000)
2024-04-03 09:02:38 +02:00
fill.cu
Optimize the cat operation on contiguous tensors (#1855)
2024-03-17 10:49:13 +01:00
indexing.cu
Support scatter/index_add with i64 indices for f16 (#1915)
2024-03-22 11:51:41 +01:00
lib.rs
Add argsort. (#2132)
2024-04-27 20:17:35 +02:00
quantized.cu
Import the ggml_cuda_dp4a function. (#2628)
2024-11-19 03:41:34 +01:00
reduce.cu
Improved launch config for layer-norm/rms-norm. (#2591)
2024-11-04 10:42:18 +01:00
sort.cu
Add argsort. (#2132)
2024-04-27 20:17:35 +02:00
ternary.cu
Add support for i64 (#563)
2023-08-23 10:42:19 +01:00
unary.cu
Fix sigmoid gradient calculation and move sigmoid into a specialized op (#2114)
2024-04-29 11:04:43 +02:00
Powered by Gitea Version: 1.24.0 Page: 120ms Template: 6ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API