Implement DeepSeek V2 (#2744)

* Add deepseek v2

* Fix

* Remove unused

* Add kv cache

* Remove from cargo.toml

* Fix dtype selection logic

* Fix unnecessary u32->f32->gather->u32

* Remove fromstr impl

* Use local scopes for some clarity

* Typo

* Repeat k_pe

* Chain calls to remove mut

* Actually, remove all muts

* Update readme
This commit is contained in:
Eric Buehler
2025-02-19 04:51:01 -05:00
committed by GitHub
parent fd7f7242a1
commit e6cc76fc37
4 changed files with 1367 additions and 0 deletions

File diff suppressed because it is too large Load Diff

View File

@ -29,6 +29,7 @@ pub mod convmixer;
pub mod convnext;
pub mod dac;
pub mod debertav2;
pub mod deepseek2;
pub mod depth_anything_v2;
pub mod dinov2;
pub mod dinov2reg4;