mirror of
https://github.com/huggingface/candle.git
synced 2025-06-14 09:57:10 +00:00

* qwen-moe rebase * lint * fixed rebase error * swapped normal MoE model with CausalMoE Model in example, and swapped the tie word embeddings if statement * updated readme
1.8 KiB
1.8 KiB
candle-qwen: large language model series from Alibaba Cloud
Qwen 1.5 is a series of large language models that provide strong performances on English and Chinese.
- Blog post introducing Qwen1.5.
- Model card on the HuggingFace Hub.
- Blog post for the mixture-of-experts (MoE) variant.
Running the example
$ cargo run --example qwen --release -- --prompt "Hello there "
Various model sizes are available via the --model
argument, including the MoE
variant.
$ cargo run --example qwen --release -- --model moe-a2.7b --prompt 'def print_prime(n: int): '
def print_prime(n: int): # n is the number of primes to be printed
for i in range(2, n + 1):
if all(i % j != 0 for j in range(2, i)):
print(i)
The qwen3 MoE variant is also an option.
$ cargo run --example qwen --features metal --release -- --prompt "Write a poem about butterflies. <think></think>." --model "3-moe-a3b"
> In morning's hush, where daisies sleep,
> A fleeting dance through sunlit deep—
> They flutter soft on gossamer thread,
> The messengers of spring’s own head.
>
> With painted sails and delicate grace,
> They drift from bloom to blossom's face.
> Each wing a tale in hues unseen,
> Of ancient dreams and secrets between.
>
> No sound they make, yet still they speak—
> Of time that flies, of life so brief.
> A fleeting kiss on summer’s breath,
> A whisper lost before death.
>
> Yet in their flight, the soul takes wing,
> And for a moment, all is spring.
> For though they fade, they never die—
> Their beauty lives where hearts can fly.
> 161 tokens generated (3.00 token/s)