Nvidia Aims Two New GPUs at Deep Learning Inferencing

Already entrenched in the deep learning community for neural net training, Nvidia wants to secure its place as the go-to chipmaker for inferencing production workloads. At the GPU Technology Conference (GTC) in Beijing today (Tuesday), Nvidia CEO Jen-Hsun Huang unveiled the latest additions to the Tesla line, Pascal-based P4 and P40 GPU accelerators, as well as new software all aimed at improving performance for inferencing workloads, which undergird applications like voice-activated assistance, email spam filters, and movie and product recommendation engines. Employing the same form factor as the Maxwell-based M4 and M40 GPUs, the new Pascal-based cards were designed to accelerate inferencing workloads. Most significantly, the GPUs feature specialized inference instructions based on 8-bit (INT8) operations. Using the VGG image recognition model as a benchmark, Nvidia reports that the P40 achieved a 45x faster response…


Link to Full Article: Nvidia Aims Two New GPUs at Deep Learning Inferencing

Pin It on Pinterest

Share This

Join Our Newsletter

Sign up to our mailing list to receive the latest news and updates about homeAI.info and the Informed.AI Network of AI related websites which includes Events.AI, Neurons.AI, Awards.AI, and Vocation.AI

You have Successfully Subscribed!