This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-05-08 16:32:41 +08:00
Code
Issues
Actions
7
Packages
Projects
Releases
Wiki
Activity
Files
aa35ce449d7461ebe05732be957c409c0d1d85d8
FastDeploy
/
fastdeploy
/
cache_manager
T
History
Juncai
0925d44f18
[PD Disaggregation] support different tp_size for prefill and decode (
#5296
)
...
* up * up * up * fix
2025-12-01 17:50:20 +08:00
..
transfer_factory
[PD Disaggregation] support different tp_size for prefill and decode (
#5296
)
2025-12-01 17:50:20 +08:00
__init__.py
polish code with new pre-commit rule (
#2923
)
2025-07-19 23:19:27 +08:00
cache_data.py
[Feature] mm support prefix cache (
#4134
)
2025-10-27 17:39:51 +08:00
cache_messager.py
[PD Disaggregation] support different tp_size for prefill and decode (
#5296
)
2025-12-01 17:50:20 +08:00
cache_metrics.py
[Feature] mm support prefix cache (
#4134
)
2025-10-27 17:39:51 +08:00
cache_transfer_manager.py
[Feature] dyc8 support prefixcache (
#5125
)
2025-11-21 19:46:26 +08:00
multimodal_cache_manager.py
[Feature] mm support prefix cache (
#4134
)
2025-10-27 17:39:51 +08:00
ops.py
dummy import fd (
#5192
)
2025-11-24 20:23:07 +08:00
prefix_cache_manager.py
[Metrics] Update time_to_first_token to include tokenization & queue time, and remove redundant metrics (
#4993
)
2025-11-26 14:42:17 +08:00