Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
Code Issues Actions 19 Packages Projects Releases Wiki Activity
Files
46e14f88f9db3c9d85eae0e8526f619cc22d1e55
FastDeploy/fastdeploy/model_executor/graph_optimization
T
History
GoldPancake 26674bbbb6 [Cherry-Pick][RL] Add clear_graph_opt_backend for glm4_mtp (#7378) (#7379)
* add clear_grpah func

* fix spell
2026-04-15 19:45:09 +08:00
..
__init__.py
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00
cudagraph_piecewise_backend.py
[Cherry-Pick][FDConfig] Auto-scale CUDA Graph Capture & CLI Quantization Params + CUDAGraph Validation (#7215,#7281) (#7301)
2026-04-10 16:10:31 +08:00
decorator.py
[Cherry-Pick][RL] Add clear_graph_opt_backend for glm4_mtp (#7378) (#7379)
2026-04-15 19:45:09 +08:00
dynamic_dims_marker.py
[SOT] Mark dynamic dims by type annotations (#2771)
2025-07-22 00:23:52 -07:00
graph_optimization_backend.py
[Graph Optimization] Support CUDAGraph for P/PD mixed Batch using SOT subgraph spliting mode (#6196)
2026-01-29 16:29:54 +08:00
utils.py
[Iluvatar] add vl into ci and support v1 loader (#4774)
2025-11-11 10:50:17 +08:00
Powered by Gitea Version: 1.26.0 Page: 1855ms Template: 147ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API