TensorRT is Nvidia's deep learning SDK that enables applications to perform up to 40x faster than CPU-only platforms during inference. With CUDA's parallel programming model, TensorRT allows you to ...