Files
Kyle Birnbaum 0224a749f0 Add Qwen3 MoE (#2934)
* qwen-moe rebase

* lint

* fixed rebase error

* swapped normal MoE model with CausalMoE Model in example, and swapped the tie word embeddings if statement

* updated readme
2025-05-31 15:33:28 +02:00
..
2024-07-23 23:10:57 +02:00
2024-08-01 08:26:19 +02:00
2024-07-23 23:10:57 +02:00
2025-04-03 09:18:29 +02:00
2024-09-28 23:48:00 +02:00
2024-10-01 11:48:39 +02:00
2025-04-11 21:43:35 +02:00
2025-05-15 21:50:27 +02:00
2025-02-19 10:51:01 +01:00
2025-05-15 21:50:27 +02:00
2024-07-23 23:10:57 +02:00
2025-04-03 09:18:29 +02:00
2024-08-28 11:20:09 +02:00
2024-12-31 09:21:41 +01:00
2025-04-03 09:18:29 +02:00
2025-04-30 19:38:44 +02:00
2024-08-01 11:59:22 +02:00
2024-08-01 14:19:41 +02:00
2025-04-03 09:18:29 +02:00
2025-04-15 21:40:18 +02:00
2024-08-16 18:57:14 +02:00
2024-07-23 23:10:57 +02:00
2025-04-03 09:18:29 +02:00
2024-08-04 19:52:40 +02:00
2025-01-13 08:39:27 +01:00
2024-12-03 10:56:01 +01:00
2025-05-14 19:18:02 +02:00
2025-05-10 07:05:03 +02:00
2025-04-13 12:02:17 +02:00
2024-09-29 19:56:56 +02:00
2025-05-21 10:18:33 +02:00
2024-09-30 21:23:54 +02:00
2025-05-31 15:33:28 +02:00
2025-04-03 09:18:29 +02:00
2025-04-03 09:18:29 +02:00
2025-04-07 08:23:47 +02:00
2025-04-03 09:18:29 +02:00
2024-07-23 23:10:57 +02:00
2025-04-03 09:18:29 +02:00
2025-04-03 09:18:29 +02:00
2025-04-03 09:18:29 +02:00