candle/candle-examples/examples
Kyle Birnbaum 1fdfb58de5 Updating Add qwen3 (PR 2903) to use HF weights (#2930)
* add Qwen3.rs

* fixed compile error

* attempting to get PR 2903 working with Qwen weights

* different qwen variants working

* added moe model

* clippy

* added additional eos token

* translated Korean comments to English as well as I can

* removed specialized Qwen3RmsNorm and replaced with generic Candle RmsNorm

* replaced custom repeat_kv implementation with candle's repeat_kv implementation

* replace linear with linear_b in attention initialization

* replaced custom kv_cache implementation with candle kv_cache

* style

* replaced explicit broadcast add with normal add in decoder layer

* removed keeping the Rotary embedding layer in the model struct

* used tie_word_embeddings bool from config instead of relying on existence of weights for lm head in CausalLM

* removed duplicate code from qwen3_moe

* removed sliding window from qwen3 attention

* removed MoE code

* removed unused option

* Fixed Typo

Co-authored-by: Laurent Mazare <laurent.mazare@gmail.com>

* fixed tie word embeddings to use the correct embedding weights instead of the opposite

---------

Co-authored-by: Max <naturale@hufs.ac.kr>
Co-authored-by: Laurent Mazare <laurent.mazare@gmail.com>
2025-05-02 06:05:53 +02:00