Files
FastDeploy/fastdeploy
yeliang2258 2a68a23baf Add paddle quantize model support for ORT, TRT and MKLDNN deploy backend (#257)
* add quantize model support for trt and paddle

* fix bugs

* fix

* update paddle2onnx version

* update version

* add quantize test

Co-authored-by: Jason <jiangjiajun@baidu.com>
2022-10-09 20:00:05 +08:00
..
2022-09-20 10:58:07 +08:00
2022-09-22 15:18:50 +08:00
2022-10-07 20:41:04 +08:00

FastDeploy C++ API Summary

Runtime

FastDeploy Runtime can be used as an inference engine with the same code, we can deploy Paddle/ONNX model on different device by different backends.
Currently, FastDeploy supported backends listed as below,

Backend Hardware Support Model Format Platform
Paddle Inference CPU/Nvidia GPU Paddle Windows(x64)/Linux(x64)
ONNX Runtime CPU/Nvidia GPU Paddle/ONNX Windows(x64)/Linux(x64/aarch64)/Mac(x86/arm64)
TensorRT Nvidia GPU Paddle/ONNX Windows(x64)/Linux(x64)/Jetson
OpenVINO CPU Paddle/ONNX Windows(x64)/Linux(x64)/Mac(x86)

Example code

Vision Models

Task Model API Example
object detection PaddleDetection/PPYOLOE fastdeploy::vision::detection::PPYOLOE C++/Python