Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-05-08 16:32:41 +08:00
Code Issues Actions 7 Packages Projects Releases Wiki Activity
Files
f1cee7fd5e4d1f2036dbba2a2a76c2fb33ed21a6
FastDeploy/fastdeploy/model_executor/layers
T
History
xiaoxiaohehe001 7ffa88bb01 [BugFix] fix mask_attn (#6214)
* [BugFix] fix mask attn

* [BugFix] fix mask attn
2026-01-26 07:46:51 -08:00
..
attention
[BugFix] fix mask_attn (#6214)
2026-01-26 07:46:51 -08:00
backends
[XPU] change XPU EP interface from xDeepEP to paddle (#5706)
2026-01-21 18:23:45 +08:00
batch_invariant_ops
…
moe
Improve deep_ep import handling with logging (#6207)
2026-01-24 22:41:42 -08:00
pool
…
quantization
[Loader] support dummy load weight (#6169)
2026-01-26 13:58:53 +08:00
sample
…
__init__.py
…
activation.py
[Feature] Unify fp8 block_wise quant ops (#5991)
2026-01-15 05:50:37 -08:00
embeddings.py
…
linear.py
Support MXFP4 for GPT-OSS (#5435)
2026-01-22 14:21:01 +08:00
lm_head.py
…
mtp_linear.py
…
normalization.py
Support MXFP4 for GPT-OSS (#5435)
2026-01-22 14:21:01 +08:00
pooler.py
…
rotary_embedding.py
…
utils.py
add scale_wrapper for per_block_cast_to_fp8 (#6183)
2026-01-23 00:37:20 -08:00
Powered by Gitea Version: 1.26.0 Page: 9165ms Template: 3609ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API