wangyifei
|
b57c960837
|
cuda13.0, implement changes to CCCL (#6751)
|
2026-03-10 16:47:02 +08:00 |
|
gongweibao
|
30f9f33f34
|
[Feature][BugFix][OP] Enhance Deterministic Inference Mode with Kernel-level Fixes and Batch-invariant BMM (#6610)
* add fa deter
* add ut
* add long sentence
* fix basic
* fix bugs
* fix adn
* fix first
* fix single
* fix single
* fix single test
* refine
* add more test
* refine comments
* add comments of bmm
* fix ci
* remove probe
* add
* remove not need
* refine tests
* fix comments and refine code
* refine code
* refine test
* refine test
* mv 4cards tests
* fix tests
* add
* fix comments
* fix cover
* fix cover
---------
Co-authored-by: gongweibao <gognweibao@baidu.com>
|
2026-03-09 10:27:53 +08:00 |
|
gongweibao
|
ddb06ff83f
|
init (#6642)
Co-authored-by: gongweibao <gognweibao@baidu.com>
|
2026-03-04 21:55:31 +08:00 |
|
jiangjiajun
|
684703fd72
|
[LLM] First commit the llm deployment code
|
2025-06-09 19:20:15 +08:00 |
|