20 Commits

Author SHA1 Message Date
yzwu 3b9d6c60d3 [Iluvatar] fix ci error and update readme (#7453) 2026-04-17 20:42:56 +08:00
yzwu 8789329457 [Iluvatar] Support wi4a16 group_gemm (#7078) 2026-03-30 19:03:51 +08:00
yzwu 901b38c936 [Iluvatar] Optimize decode group_gemm and Support cuda graph for ernie (#6803) 2026-03-12 19:21:17 +08:00
yzwu 6674131b0b [Iluvatar] Support CudaGraph and optimize flash_attn_unpadded and fused_neox_rope_embedding (#6553) 2026-03-02 14:07:17 +08:00
yzwu 7b6cc11952 [Iluvatar] Fix FD launch error when specifying CUDA_VISIBLE_DEVICES (#5735) 2025-12-26 14:01:27 +08:00
yzwu ac013803f3 [Iluvatar] Support V1_KVCACHE_SCHEDULER and paddleocr-vl rope mode (#5555) 2025-12-18 02:14:25 -08:00
JYChen a1218076dc remove load default_v1 since already been as default (#4980) 2025-11-12 16:49:48 +08:00
yzwu 08b96baa4a [Iluvatar][Doc] Add ERNIE-4.5-VL-28B-A3B-Thinking doc (#4955) 2025-11-11 19:15:19 +08:00
yzwu 3707af7a4f [Iluvatar] add vl into ci and support v1 loader (#4774) 2025-11-11 10:50:17 +08:00
yzwu 4b661512ca [Iluvatar GPU] Adapt VL model (#4313) 2025-10-17 16:13:38 +08:00
yangjianfengo1 ba5c2b7e37 [Docx] add language (en/cn) switch links (#4470)
* add install docs

* modify docs

* modify docs
2025-10-17 15:47:41 +08:00
yzwu 504461b6b5 [Iluvatar GPU] Optimize attention performance and fix moe load ckpt error (#3651) 2025-09-22 21:13:59 +08:00
co63oc 17a27170bc fix typos (#4093) 2025-09-15 18:33:30 +08:00
yongqiangma 5703d7aa0f update installation readme (#3429) 2025-08-15 19:09:41 +08:00
yzwu ce9180241e [Iluvatar GPU] Modify the names of some variables (#3273) 2025-08-13 11:38:02 +08:00
yzwu fbdd6b0663 [Iluvatar GPU] Optimize attention and moe performance (#3234) 2025-08-08 10:51:24 +08:00
Zero Rains 25698d56d1 polish code with new pre-commit rule (#2923) 2025-07-19 23:19:27 +08:00
liddk1121 865e856a94 update iluvatar gpu fastdeploy whl (#2675) 2025-07-02 14:47:21 +08:00
mayongqiang 0d39e23ab9 fix format 2025-06-30 11:39:59 +08:00
Jiang-Jia-Jun 92c2cfa2e7 Sync v2.0 version of code to github repo 2025-06-29 23:29:37 +00:00