apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2026-04-23 17:11:21 +08:00
e580cf0fef6fc87ee0ee24dee3ce5ae3ebbcba68
FastDeploy/fastdeploy/model_executor/ops/triton_ops
Echo-Nie | 8819a039c9 | [Others] Fix typo (#7280) | 2026-04-14 17:28:22 +08:00
* typo * typo * typo * typo
__init__.py | [Models][OP][Optimization] Support DeepSeek-v3.2 model, integrate DSA & Indexer architecture with FlashMLA/DeepGEMM (#6689) | 2026-03-10 15:05:14 +08:00
pre_token_quant_fp8_kernel.py | [Models][OP][Optimization] Support DeepSeek-v3.2 model, integrate DSA & Indexer architecture with FlashMLA/DeepGEMM (#6689) | 2026-03-10 15:05:14 +08:00
qk_rmsnorm_fused_kernel.py | [Optimization] Accelerate Qwen3 QK RMSNorm via Fused Triton Kernel (#5880) | 2026-01-12 05:10:21 -08:00
repetition_early_stop_kernel.py | [Optimization] Use a separate driver when using Triton with Paddle (#6897) | 2026-03-24 10:56:00 +08:00
triton_utils_v2.py | [Others] Fix typo (#7280) | 2026-04-14 17:28:22 +08:00
triton_utils.py | [Others] Fix typo (#7280) | 2026-04-14 17:28:22 +08:00
wint2_fused_moe_kernel.py | polish code with new pre-commit rule (#2923) | 2025-07-19 23:19:27 +08:00