nebuly.com
↩
My benchmarks on PyTorch 2.0 inference performances vs TensorRT and ONNX Runtime
2023-03-18 08:53:41 (Reddit: Learn Machine Learning)
Source:
Reddit: Learn Machine Learning
Review of the paper "Full Stack Optimization of Transformer Inference: a Survey"
2023-03-16 03:24:51 (Reddit: AI)
Source:
Reddit: AI
↩