Nvidia’s Tesla P4 And P40 GPUs Boost Deep Learning Inference Performance With INT8 …

Nvidia Tesl P40Nvidia continues to double down on deep learning GPUs with the release of two new “inference” GPUs, the Tesla P4 and the Tesla P40. The pair are the 16nm FinFET direct successors to Tesla M4 and M40, with much improved performance and support for 8-bit (INT8) operations. Deep learning consists of two steps: training and inference. For training, it can take billions of TeraFLOPS to achieve an expected result over a matter of days (while using GPUs). For inference, which is the running of the trained models against new data, it can take billions of FLOPS, and it can be done in real-time. The two steps in the deep learning process require different levels of performance, but also different features. This is why Nvidia is now releasing the…


Link to Full Article: Nvidia’s Tesla P4 And P40 GPUs Boost Deep Learning Inference Performance With INT8 …