Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-05-10 09:31:48 +08:00
Code Issues Actions 6 Packages Projects Releases Wiki Activity
Files
b88537a4567441edb4df3fbf0edcc812ad571b4c
FastDeploy/test
T
History
xjkmfa 71018fb62e 【CI case】include total_tokens in the last packet of completion interface stream output (#3279)
* Add ci case for min token and max token

* 【CI case】include total_tokens in the last packet of completion interface stream output

---------

Co-authored-by: xujing43 <xujing43@baidu.com>
2025-08-11 10:59:47 +08:00
..
ce
【CI case】include total_tokens in the last packet of completion interface stream output (#3279)
2025-08-11 10:59:47 +08:00
ci_use
[Iluvatar GPU] Optimze attention and moe performance (#3234)
2025-08-08 10:51:24 +08:00
entrypoints/openai
[Bugfix] Fix uninitialized decoded_token and add corresponding unit test. (#3195)
2025-08-04 19:23:58 +08:00
graph_optimization
[Executor]Update graph test case and delete test_attention (#3257)
2025-08-07 14:05:15 +08:00
layers
[Executor]Update graph test case and delete test_attention (#3257)
2025-08-07 14:05:15 +08:00
operators
[New Feature] Support W4Afp8 MoE GroupGemm (#3171)
2025-08-06 10:34:05 +08:00
plugins
[plugin] Custom model_runner/model support (#3186)
2025-08-04 18:52:39 -07:00
utils
[fix] multi source download (#3259)
2025-08-07 19:30:39 +08:00
Powered by Gitea Version: 1.26.0 Page: 510ms Template: 11ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API