mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2026-04-23 00:17:25 +08:00
[XPU] Support W4A8C8-TP4-300B Model (#4068)
* support w4a8 * delete ep block attn * delete moe_topk_select * update note * update * delte useless info * update * add some note * fix some format * update scale info * add ans baseline --------- Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
This commit is contained in:
@@ -73,10 +73,10 @@ Refer to the example code `offline_disaggregated_demo.py` in the `fastdeploy/dem
|
||||
|
||||
#### Prerequisite: Redis
|
||||
|
||||
> **⚠️ NOTE**
|
||||
> **Redis requirement: version 6.2.0 or higher**
|
||||
> **⚠️ NOTE**
|
||||
> **Redis requirement: version 6.2.0 or higher**
|
||||
> Versions below this may not support the required commands.
|
||||
>
|
||||
>
|
||||
* Installation via `conda`
|
||||
|
||||
```bash
|
||||
|
||||
Reference in New Issue
Block a user