Nvidia Aims Two New GPUs at Deep Learning Inferencing

Already entrenched in the deep learning community for neural network training, Nvidia now wants to secure its place as the go-to chipmaker for production inferencing workloads. At the GPU Technology Conference (GTC) in Beijing today (Tuesday), Nvidia CEO Jen-Hsun Huang unveiled the latest additions to the Tesla line, the Pascal-based P4 and P40 GPU accelerators, along with new software. All of it is aimed at improving performance for inferencing workloads, which undergird applications like voice-activated assistants, email spam filters, and movie and product recommendation engines. The new cards use the same form factors as the Maxwell-based M4 and M40 GPUs and were designed to accelerate inferencing. Most significantly, the GPUs feature specialized inference instructions based on 8-bit integer (INT8) operations. Using the VGG image recognition model as a benchmark, Nvidia reports that the P40 achieved a 45x faster response…

