mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2026-04-23 00:17:25 +08:00
[Feature] Support KV Cache Storage (#5571)
* Support Mooncake Store * up * up * add op * fix conflict * fix error * up for comments * avoid thread lock * up * fix unittest * fix unittest * remove debug info * consider tp_size > 1 * add default rdma_nics * add utils * up * fix error --------- Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
This commit is contained in:
@@ -41,6 +41,8 @@ class Args:
|
||||
create_cache_tensor = False
|
||||
cache_dtype = "bfloat16"
|
||||
default_dtype = "bfloat16"
|
||||
kvcache_storage_backend = None
|
||||
write_policy = "write_through"
|
||||
|
||||
|
||||
# ==========================
|
||||
|
||||
Reference in New Issue
Block a user