Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
Code Issues Actions 19 Packages Projects Releases Wiki Activity
Files
e580cf0fef6fc87ee0ee24dee3ce5ae3ebbcba68
FastDeploy/fastdeploy/model_executor/layers
T
History
周周周 e580cf0fef [OP] latent moe deepgemm support (#7537)
* commit

* commit

* commit

* commit

* commit

* commit

* commit

* commit
2026-04-22 13:47:06 +08:00
..
attention
[Feature][KVCache] Implement Cache Manager V1 with GPU + CPU Cache Support (1/n) (#7097)
2026-04-21 14:39:00 +08:00
backends
[Iiluvatar] fix ci error and update readme (#7453)
2026-04-17 20:42:56 +08:00
batch_invariant_ops
[Cleanup] Replace torch proxy alias with public compat API (#7348)
2026-04-13 11:43:26 +08:00
moe
[OP] latent moe deepgemm support (#7537)
2026-04-22 13:47:06 +08:00
pool
…
quantization
[Feature] Support MOE Cutlass backend for latent MOE (#7428)
2026-04-16 22:11:49 +08:00
sample
[XPU] Unify Spec and non-spec branch.(#6947) (#7180)
2026-04-16 14:58:38 +08:00
__init__.py
…
activation.py
…
embeddings.py
…
flashinfer_comm_fusion.py
[Optimization] enable trtllm_all_reduce fusion kernel in glm model (#6660)
2026-04-16 14:10:19 +08:00
linear.py
[Optimization] enable trtllm_all_reduce fusion kernel in glm model (#6660)
2026-04-16 14:10:19 +08:00
lm_head.py
…
mtp_linear.py
…
normalization.py
[Optimization] enable trtllm_all_reduce fusion kernel in glm model (#6660)
2026-04-16 14:10:19 +08:00
pooler.py
…
rotary_embedding.py
[BugFix] fix mm rope (#7274)
2026-04-14 11:36:08 +08:00
utils.py
[Feature] Support NVFP4 Flashinfer-cutedsl MoE on SM100 (#6963)
2026-03-30 11:37:04 +08:00
Powered by Gitea Version: 1.26.0 Page: 2917ms Template: 103ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API