Making Deep Learning Faster: Quantization Benchmarks with TensorRT
Introduction

Modern deep learning models are typically trained in FP32 (32-bit floating-point) precision to ensure numerical stability and maximum accuracy. However, when deploying these models...