Files
Kyle Birnbaum 0224a749f0 Add Qwen3 MoE (#2934)
* qwen-moe rebase

* lint

* fixed rebase error

* swapped normal MoE model with CausalMoE Model in example, and swapped the tie word embeddings if statement

* updated readme
2025-05-31 15:33:28 +02:00

1.8 KiB
Raw Permalink Blame History

candle-qwen: large language model series from Alibaba Cloud

Qwen 1.5 is a series of large language models that provide strong performances on English and Chinese.

Running the example

$ cargo run --example qwen --release  -- --prompt "Hello there "

Various model sizes are available via the --model argument, including the MoE variant.

$ cargo run --example qwen --release  -- --model moe-a2.7b --prompt 'def print_prime(n: int): '
def print_prime(n: int):  # n is the number of primes to be printed
    for i in range(2, n + 1):
        if all(i % j != 0 for j in range(2, i)):
            print(i)

The qwen3 MoE variant is also an option.

$ cargo run --example qwen --features metal --release  -- --prompt "Write a poem about butterflies. <think></think>." --model "3-moe-a3b"
> In morning's hush, where daisies sleep,  
> A fleeting dance through sunlit deep—  
> They flutter soft on gossamer thread,  
> The messengers of springs own head.
> 
> With painted sails and delicate grace,  
> They drift from bloom to blossom's face.  
> Each wing a tale in hues unseen,  
> Of ancient dreams and secrets between.
> 
> No sound they make, yet still they speak—  
> Of time that flies, of life so brief.  
> A fleeting kiss on summers breath,  
> A whisper lost before death.
> 
> Yet in their flight, the soul takes wing,  
> And for a moment, all is spring.  
> For though they fade, they never die—  
> Their beauty lives where hearts can fly.
> 161 tokens generated (3.00 token/s)