mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2026-04-23 00:17:25 +08:00
6b891da02b
* enable trtllm_all_reduce fusion kernel in glm model * fix conflict * format update * fix a bug * modify test * modify test * support empty tensor and modify test * fix test_linear config issues * modify test name * add edge test case * modify format * fix conflict * modify default max token num in trtllm_allreduce_fusion * add max token num branch for trtllm_allreduce_fusion * fix format * fix rmsnorm config issue * modify 2025 to 2026 * using compat grard * Lazily import flashinfer.comm and fix test config issue * fix test issues * add flashinfer cache dir clean machine * fix some issues
52 lines
922 B
Plaintext
52 lines
922 B
Plaintext
setuptools
|
|
pre-commit
|
|
yapf
|
|
flake8
|
|
ruamel.yaml
|
|
zmq
|
|
aiozmq
|
|
openai>=1.93.0
|
|
tqdm
|
|
pynvml
|
|
uvicorn>=0.38.0
|
|
fastapi
|
|
paddleformers>=1.1.1
|
|
redis
|
|
etcd3
|
|
httpx
|
|
fast_dataindex
|
|
cupy-cuda12x
|
|
pybind11[global]
|
|
tabulate
|
|
gradio
|
|
xlwt
|
|
visualdl
|
|
setuptools-scm>=8
|
|
prometheus-client
|
|
decord
|
|
moviepy
|
|
triton
|
|
crcmod
|
|
msgpack
|
|
gunicorn
|
|
modelscope
|
|
safetensors>=0.7.0
|
|
opentelemetry-api>=1.24.0
|
|
opentelemetry-sdk>=1.24.0
|
|
opentelemetry-instrumentation-redis
|
|
opentelemetry-instrumentation-mysql
|
|
opentelemetry-distro
|
|
opentelemetry-exporter-otlp
|
|
opentelemetry-instrumentation-fastapi
|
|
opentelemetry-instrumentation-logging>=0.57b0
|
|
partial_json_parser
|
|
msgspec
|
|
einops
|
|
setproctitle
|
|
aistudio_sdk
|
|
p2pstore
|
|
py-cpuinfo
|
|
flashinfer-python-paddle @ https://xly-devops.bj.bcebos.com/flashinfer/flashinfer_python_paddle-0.4.1.2-py3-none-any.whl
|
|
flash_mask @ https://xly-devops.bj.bcebos.com/flashmask/flash_mask-4.0.0%2Bg4c84f74-py3-none-any.whl
|
|
transformers>=4.55.1,<5.0.0
|