Inspur Releases Powerful Scale-Up AI Super-Server AGX-5 Accelerated by NVIDIA Tensor Core GPUs

By: Inspur Group

On November 12th at the SC18 International Conference for High Performance Computing, Networking, Storage and Analysis in Dallas, USA, Inspur released its AI super-server AGX-5. Capable of delivering up to 2 petaflops of deep learning performance within a single server, AGX-5 is one of the most powerful computers of its kind in the world.

Based on NVIDIA’s latest HGX-2 platform, the Inspur AGX-5 is designed to facilitate AI/deep learning and high-performance computing. A single server houses 16 NVIDIA Tesla V100 32GB GPUs, providing 10,240 Tensor Cores. AGX-5 uses the industry’s most advanced GPU fabric, NVIDIA’s NVSwitch interconnect, providing near-linear AI performance acceleration. AGX-5 is also equipped with two 28-core CPUs that provide excellent performance for general-purpose computing and 6 TB of persistent memory for high-speed data access.
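
As a rough check on the headline figures, the short Python sketch below rebuilds the per-server totals from per-GPU numbers. The GPU count and 32 GB memory size come from this release. The 640 Tensor Cores and 125 teraflops of peak Tensor performance per Tesla V100 are assumptions taken from NVIDIA's published V100 specifications, not from this release, and the function name is purely illustrative.

```python
# Back-of-the-envelope totals for a 16-GPU server.
GPUS_PER_SERVER = 16                # stated in the release
GPU_MEMORY_GB = 32                  # stated in the release (V100 32GB)
TENSOR_CORES_PER_GPU = 640          # assumption: NVIDIA's published V100 spec
PEAK_TENSOR_TFLOPS_PER_GPU = 125    # assumption: peak mixed-precision Tensor rate


def server_totals(gpus=GPUS_PER_SERVER):
    """Aggregate per-GPU figures into theoretical per-server totals."""
    return {
        "tensor_cores": gpus * TENSOR_CORES_PER_GPU,
        "peak_tensor_pflops": gpus * PEAK_TENSOR_TFLOPS_PER_GPU / 1000,
        "gpu_memory_gb": gpus * GPU_MEMORY_GB,
    }


if __name__ == "__main__":
    for name, value in server_totals().items():
        print(f"{name}: {value}")
```

Under these assumptions the totals match the 10,240 Tensor Cores and 2 petaflops quoted above. Both are theoretical peaks, so sustained application performance would be expected to come in lower.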

More and more industries are working to integrate AI technology into their existing businesses, rapidly expanding the range of application scenarios. As this technology enters a period of rapid growth, the speed and quality of AI innovation are becoming key to the competitiveness of many enterprises. To support this kind of innovation, companies will turn to computing platforms like the AGX-5, which are designed to accelerate these types of workloads.

“The capability of AI computing has become one of the key production factors that determine a company’s competitive edge and speed of innovation,” said Peter Peng, Vice President of Inspur Group. “NVIDIA is the world’s leading company in AI accelerated computing and a long-time partner of Inspur. As the latest product to emerge from this partnership, AGX-5 offers massively improved computing performance, chip-to-chip communication, and data throughput. It will no doubt provide a major boost of computing power to help drive unprecedented acceleration of AI applications in commercial and research environments.” 

“The value of fusing AI and HPC computing into a unified architecture based on NVIDIA Tensor Core GPUs is now recognized by more and more customers,” said Marc Hamilton, VP of Solutions Architecture and Engineering at NVIDIA. “Inspur has a deep understanding of customer demands and the ability to quickly, efficiently and innovatively develop scale-up computing systems based on the latest NVIDIA GPUs. The newly launched AGX-5 will help AI and HPC users worldwide break through their most complex computational bottlenecks and save significant cost, space, and energy in the data center.”

According to IDC’s 2017 China AI Infrastructure Market Survey Report, Inspur ranks first in China’s AI server market with a 57% market share. As a leading provider of AI computing, Inspur is fully engaged in the development of AI infrastructure in four areas: the computing platform, the management and performance suite, optimized deep learning frameworks, and application acceleration. Together, these deliver end-to-end, agile, cost-efficient, and optimized AI solutions. By offering these elements to its global customers through innovative design, Inspur has become a critical business partner of many leading companies around the world.