Files
FastDeploy/tests/operators
sunxin 2533836dbb [Optimization] Accelerate Qwen3 QK RMSNorm via Fused Triton Kernel (#5880)
* qk rmsnorm fused

* inplace

* glm

* fix

* add qknorm layer

* fix

* update

* fix qwen3 vl

* update rl baseline

* fix qwen3 vl moe

* test

* fix qwen vl moe rl

* fix
2026-01-12 05:10:21 -08:00
..
2025-08-28 14:42:24 +08:00
2025-08-20 08:57:17 +08:00
2025-08-20 08:57:17 +08:00
2025-08-20 08:57:17 +08:00
2025-12-22 13:39:41 +08:00
2025-08-28 14:42:24 +08:00