Intel Launches Sapphire Rapids 4th Gen Xeon Scalable
Processors
January 11, 2023

Intel marked one of the most important product launches in company history
with the unveiling of 4th Gen Intel Xeon Scalable processors (code-named
Sapphire Rapids), the Intel® Xeon® CPU Max Series (code-named Sapphire
Rapids HBM) and the Intel® Data Center GPU Max Series (code-named Ponte
Vecchio), delivering for its customers a leap in data center performance,
efficiency, security and new capabilities for AI, the cloud, the network and
edge, and the world’s most powerful supercomputers.
Working alongside its customers and partners with 4th Gen Xeon, Intel is
delivering differentiated solutions and systems at scale to tackle their
biggest computing challenges. Intel’s unique approach to providing
purpose-built, workload-first acceleration and highly optimized software
tuned for specific workloads enables the company to deliver the right
performance at the right power for optimal overall total cost of ownership.
Additionally, as Intel’s most sustainable data center processors, 4th Gen
Xeon processors deliver customers a range of features for managing power and
performance, making the optimal use of CPU resources to help achieve their
sustainability goals.
“The launch of 4th Gen Xeon Scalable processors and the Max Series product
family is a pivotal moment in fueling Intel’s turnaround, reigniting our
path to leadership in the data center and growing our footprint in new
arenas,” said Sandra Rivera, Intel executive vice president and general
manager of the Data Center and AI Group. “Intel’s 4th Gen Xeon and the Max
Series product family deliver what customers truly want – leadership
performance and reliability within a secure environment for their real-world
requirements – driving faster time to value and powering their pace of
innovation.”
Unlike any other data center processor on the market and already in the
hands of customers today, the 4th Gen Xeon family greatly expands on Intel’s
purpose-built, workload-first strategy and approach.
Leading Performance and Sustainability Benefits with the Most Built-In
Acceleration
Today, there are over 100 million Xeons installed in the market – from on-prem
servers running IT services, including new as-a-service business models, to
networking equipment managing Internet traffic, to wireless base station
computing at the edge, to cloud services.
Building on decades of data center, network and intelligent edge innovation
and leadership, new 4th Gen Xeon processors deliver leading performance with
the most built-in accelerators of any CPU in the world to tackle customers’
most important computing challenges across AI, analytics, networking,
security, storage and HPC.
When comparing with prior generations, 4th Gen Intel Xeon customers can
expect a 2.9x1 average performance per watt efficiency improvement for
targeted workloads when utilizing built-in accelerators, up to 70-watt2
power savings per CPU in optimized power mode with minimal performance loss,
and a 52% to 66% lower TCO3.
Sustainability
The expansiveness of built-in accelerators included in 4th Gen Xeon means
Intel delivers platform-level power savings, lessening the need for
additional discrete acceleration and helping our customers achieve their
sustainability goals. Additionally, the new Optimized Power Mode can deliver
up to 20% socket power savings with a less than 5% performance impact for
selected workloads11. New innovations in air and liquid cooling reduce total
data center energy consumption further; and for the manufacturing of 4th Gen
Xeon, it’s been built with 90% or more renewable electricity at Intel sites
with state-of-the-art water reclamation facilities.
Artificial Intelligence
In AI, and compared to previous generation, 4th Gen Xeon processors achieve
up to 10x5,6 higher PyTorch real-time inference and training performance
with built-in Intel® Advanced Matrix Extension (Intel® AMX) accelerators.
Intel’s 4th Gen Xeon unlocks new levels of performance for inference and
training across a wide breadth of AI workloads. The Xeon CPU Max Series
expands on these capabilities for natural language processing, with
customers seeing up to a 20x12 speed-up on large language models. With the
delivery of Intel’s AI software suite, developers can use their AI tool of
choice, while increasing productivity and speeding time to AI development.
The suite is portable from the workstation, enabling it to scale out in the
cloud and all the way out to the edge. And it has been validated with over
400 machine learning and deep learning AI models across the most common AI
uses cases in every business segment.
Networking
4th Gen Xeon offers a family of processors specifically optimized for
high-performance, low-latency network and edge workloads. These processors
are a critical part of the foundation driving a more software-defined future
for industries ranging from telecommunications and retail to manufacturing
and smart cities. For 5G core workloads, built-in accelerators help increase
throughput and decrease latency, while advances in power management enhance
both the responsiveness and the efficiency of the platform. And, when
compared to previous generations, 4th Gen Xeon delivers up to twice the
virtualized radio access network (vRAN) capacity without increasing power
consumption. This enables communications service providers to double the
performance-per-watt to meet their critical performance, scaling and energy
efficiency needs.
High Performance Computing
4th Gen Xeon and the Intel Max Series product family bring a scalable,
balanced architecture that integrates CPU and GPU with oneAPI’s open
software ecosystem for demanding computing workloads in HPC and AI, solving
the world’s most challenging problems.
The Xeon CPU Max Series is the first and only x86-based processor with high
bandwidth memory, accelerating many HPC workloads without the need for code
changes. The Intel Data Center GPU Max Series is Intel’s highest-density
processor and will be available in several form factors that address
different customer needs.
The Xeon CPU Max Series offers 64 gigabytes of high bandwidth memory (HBM2e)
on the package, significantly increasing data throughput for HPC and AI
workloads. Compared with top-end 3rd Gen Intel® Xeon® Scalable processors,
the Xeon CPU Max Series provides up to 3.7 times10 more performance on a
range of real-world applications like energy and earth systems modeling.
Further, the Data Center GPU Max Series packs over 100 billion transistors
into a 47-tile package, bringing new levels of throughput to challenging
workloads like physics, financial services and life sciences. When paired
with the Xeon CPU Max Series, the combined platform achieves up to 12.8
times13 greater performance than the prior generation when running the
LAMMPS molecular dynamics simulator.
Most Feature-Rich and Secure Xeon Platform Yet
Signifying
the biggest platform transformation Intel has delivered, not only is 4th Gen
Xeon a marvel of acceleration, but it is also an achievement in
manufacturing, combining up to four Intel 7-built tiles on a single package,
connected using Intel EMIB (embedded multi-die interconnect bridge)
packaging technology and delivering new features including increased memory
bandwidth with DDR5, increased I/O bandwidth with PCIe5.0 and Compute
Express Link (CXL) 1.1 interconnect.
At the foundation of it all is security. With 4th Gen Xeon, Intel is
delivering the most comprehensive confidential computing portfolio of any
data center silicon provider in the industry, enhancing data security,
regulatory compliance and data sovereignty. Intel remains the only silicon
provider to offer application isolation for data center computing with
Intel® Software Guard Extensions (Intel® SGX), which provides today’s
smallest attack surface for confidential computing in private, public and
cloud-to-edge environments. Additionally, Intel’s new virtual-machine (VM)
isolation technology, Intel® Trust Domain Extensions (Intel® TDX), is ideal
for porting existing applications into a confidential environment and will
debut with Microsoft Azure, Alibaba Cloud, Google Cloud and IBM Cloud.
Finally, the modular architecture of 4th Gen Xeon allows Intel to offer a
wide range of processors across nearly 50 targeted SKUs for customer use
cases or applications, from mainstream general-purpose SKUs to purpose-built
SKUs for cloud, database and analytics, networking, storage, and
single-socket edge use cases. The 4th Gen Xeon processor family is On
Demand-capable and varies in core count, frequency, mix of accelerators,
power envelope and memory throughput as is appropriate for target use cases
and form factors addressing customers’ real-world requirements. |