Qwen3 quantized implementation (#2939)

* fixed quantized_phi3 implementation

* quantized_qwen3 implementation

* Update quantized_phi3.rs

* Update quantized_phi3.rs

* add quantized_qwen3 example

* Clippy fixes.

* Cleanup.

---------

Co-authored-by: Laurent <laurent.mazare@gmail.com>
This commit is contained in:
Lucien Thomas
2025-05-08 08:06:10 -05:00
committed by GitHub
parent 637473cb5e
commit 3d05f5cf3d
5 changed files with 755 additions and 1 deletions

View File

@ -90,6 +90,7 @@ pub mod quantized_mpt;
pub mod quantized_phi;
pub mod quantized_phi3;
pub mod quantized_qwen2;
pub mod quantized_qwen3;
pub mod quantized_recurrent_gemma;
pub mod quantized_rwkv_v5;
pub mod quantized_rwkv_v6;