implemented quantized-gemma3 (#2902)

* implemented quantized-gemma, inference not working

* Fixed a few modeling bugs: outputing the correct tokens for a few iterations then garbage

* lint

* clippy

* quantized-gemma3 example working

* added readme

* clippy
This commit is contained in:
Kyle Birnbaum
2025-04-18 22:46:41 -07:00
committed by GitHub
parent 21055b5697
commit b2904a830b
4 changed files with 781 additions and 0 deletions

View File

@ -79,6 +79,7 @@ pub mod phi3;
pub mod pixtral;
pub mod quantized_blip;
pub mod quantized_blip_text;
pub mod quantized_gemma3;
pub mod quantized_llama;
pub mod quantized_llama2_c;
pub mod quantized_metavoice;