Update "Add qwen3" (PR 2903) to use HF weights (#2930)

* add Qwen3.rs

* fixed compile error

* attempting to get PR 2903 working with Qwen weights

* different qwen variants working

* added moe model

* clippy

* added additional eos token

* translated the Korean comments to English as best I could

* removed specialized Qwen3RmsNorm and replaced with generic Candle RmsNorm
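
  A minimal sketch of the swap, assuming candle-nn's standard constructor (the `hidden_size`/`rms_norm_eps` names and the var-builder path are illustrative):

      use candle_nn::{RmsNorm, VarBuilder};

      fn load_norm(hidden_size: usize, rms_norm_eps: f64, vb: VarBuilder) -> candle::Result<RmsNorm> {
          // The generic layer loads the `weight` tensor itself and computes
          // x / sqrt(mean(x^2) + eps) * weight on forward, so a hand-rolled
          // Qwen3RmsNorm is unnecessary.
          candle_nn::rms_norm(hidden_size, rms_norm_eps, vb.pp("input_layernorm"))
      }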

* replaced custom repeat_kv implementation with candle's repeat_kv implementation
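
  Roughly the call that replaces the custom helper (a sketch; the GQA ratio comes from the usual config fields):

      use candle::{Result, Tensor};
      use candle_transformers::utils::repeat_kv;

      fn expand_kv(kv: Tensor, num_heads: usize, num_kv_heads: usize) -> Result<Tensor> {
          // Grouped-query attention: tile each key/value head
          // num_heads / num_kv_heads times so they line up one-to-one
          // with the query heads.
          repeat_kv(kv, num_heads / num_kv_heads)
      }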

* replaced linear with linear_b in attention initialization
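
  One constructor now covers both the biased and bias-free cases (a sketch; Qwen3's attention projections are typically bias-free, so the `bias` flag here is illustrative):

      use candle_nn::{Linear, VarBuilder};

      fn load_q_proj(hidden: usize, n_heads: usize, head_dim: usize, bias: bool, vb: VarBuilder) -> candle::Result<Linear> {
          // linear_b loads `weight` plus an optional `bias` tensor, replacing
          // a manual `if bias { linear(..) } else { linear_no_bias(..) }` branch.
          candle_nn::linear_b(hidden, n_heads * head_dim, bias, vb.pp("q_proj"))
      }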

* replaced custom kv_cache implementation with candle's kv_cache
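
  A sketch of the candle-nn cache in use, assuming the usual `[batch, heads, seq, head_dim]` layout so the cache grows along dim 2:

      use candle::{Result, Tensor};
      use candle_nn::kv_cache::KvCache;

      // Per layer: KvCache::new(2, max_seq_len) caches along the sequence axis.
      fn attend_step(cache: &mut KvCache, k_new: &Tensor, v_new: &Tensor) -> Result<(Tensor, Tensor)> {
          // Appends the current step's keys/values and returns the full
          // (past + current) tensors to feed the attention scores.
          cache.append(k_new, v_new)
      }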

* style

* replaced explicit broadcast add with normal add in decoder layer

* stopped storing the rotary embedding layer in the model struct

* used the tie_word_embeddings bool from the config instead of relying on the existence of lm-head weights in CausalLM
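
  The shape of that logic (a sketch; field names follow the HF config, `embed_tokens` being the token-embedding layer):

      use candle_nn::{Embedding, Linear, VarBuilder};

      fn load_lm_head(tie_word_embeddings: bool, hidden: usize, vocab: usize, embed_tokens: &Embedding, vb: VarBuilder) -> candle::Result<Linear> {
          if tie_word_embeddings {
              // Tied: reuse the embedding matrix as the output projection
              // instead of probing whether `lm_head.weight` exists on disk.
              Ok(Linear::new(embed_tokens.embeddings().clone(), None))
          } else {
              candle_nn::linear_no_bias(hidden, vocab, vb.pp("lm_head"))
          }
      }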

* removed duplicate code from qwen3_moe

* removed sliding window from qwen3 attention

* removed MoE code

* removed unused option

* Fixed Typo

Co-authored-by: Laurent Mazare <laurent.mazare@gmail.com>

* fixed tie_word_embeddings so it uses the correct embedding weights (the condition was inverted)

---------

Co-authored-by: Max <naturale@hufs.ac.kr>
Co-authored-by: Laurent Mazare <laurent.mazare@gmail.com>
Author: Kyle Birnbaum
Date: 2025-05-01 21:05:53 -07:00
Committed by: GitHub
Parent: cd96fa80da
Commit: 1fdfb58de5
3 changed files with 421 additions and 3 deletions

@@ -97,6 +97,7 @@ pub mod quantized_stable_lm;
 pub mod quantized_t5;
 pub mod qwen2;
 pub mod qwen2_moe;
+pub mod qwen3;
 pub mod recurrent_gemma;
 pub mod repvgg;
 pub mod resnet;