Name: AMD Data Center and AI Technology Premiere Live Blog (Starts at 10am PT/17:00 UTC)
Item: AMD Data Center and AI Technology Premiere Live Blog (Starts at 10am PT/17:00 UTC)

Original Link: https://www.anandtech.com/show/18916/amd-data-center-and-ai-technology-premiere-live-blog-starts-at-10am-pt1700-utc

AMD Data Center and AI Technology Premiere Live Blog (Starts at 10am PT/17:00 UTC)

VIEW ARTICLE

by Ryan Smith & Gavin Bonshor on June 13, 2023 12:00 PM EST

0 Comments

12:55PM EDT - AMD this morning is hosting their first data center and server-focused event in quite some time. Dubbed the "AMD Data Center and AI Technology Premiere," we're expecting a sizable number of announcements from AMD around their server and AI product portfolio

12:55PM EDT - Highlighting this should be fresh news on AMD's forthcoming MI300 accelerator, the company's first server APU, that combines Zen 4 CPU cores and CDNA 3 GPU cores on to a single chip. Aimed at the very high end of the market, when it ships later this year MI300 will be AMD's flagship accelerator, and their most significant offering yet for the exploding AI market

12:55PM EDT - We're also expecting some additional details on AMD's remaining server CPU products for the year. This includes the density-focused Bergamo CPU, which will offer up to 128 CPU cores based on AMD's Zen 4c architecture. Genoa-X, which is AMD's V-cache equipped version of the EPYC 9004 series, offering up to 1.1GB of L3 cache per chip. And Siena, a 64 core low-cost EPYC chip

12:55PM EDT - And not to be left out, AMD also has their FPGA and networking divisions, courtesy of recent acquisition like Xilinx and Pensando. Those teams have also been hard at work at their own products, which are due for announcements as well

12:56PM EDT - This is AMD's first live event focused on the data center market in a while. The low frequency of these events means that AMD's most recent slate of announcements were during the tail-end of the pandemic, before live events had resumed

12:57PM EDT - So while not AMD's first live event overall, it's certainly their most important data center event in quite some time

12:58PM EDT - The live stream link, for anyone who would like to watch along: https://www.youtube.com/watch?v=l3pe_qx95E0

12:59PM EDT - I am here in person in cloudy San Francisco, where AMD is holding their event. Backing me up, as always, is Gavin Bonshor, who is in decidedly warmer England

12:59PM EDT - AMD has asked everyone to silence their devices; the show is about to begin

01:00PM EDT - One thing to note that, for as important as this show is for AMD's customers, there's also a distinct element of pleasing AMD's shareholders

01:00PM EDT - Which, to be sure, as a company AMD is always looking to do that. But with the explosion in demand for AI products, there's a lot of pressure on AMD to make sure they're going to capture a piece of that pie

01:00PM EDT - And we're starting!

01:01PM EDT - Here's AMD CEO, Dr. Lisa Su

01:01PM EDT - Lisa is greeting the audience

01:01PM EDT - As well as welcoming remote viewers

01:01PM EDT - "We have a lot of products and exciting news to share with you today"

01:01PM EDT - So with no further ado, we're getting started

01:01PM EDT - "AMD technology is truly everywhere"

01:02PM EDT - EPYC processors, Instinct accelerators, and AMD's product ecosystem

01:02PM EDT - "Today we lead the industry" with their EPYC processors

01:02PM EDT - Lisa is going to show how AMD brings their current products together, and how they'll be expanding their portfolio

01:03PM EDT - Every major cloud provider has EPYC instances

01:03PM EDT - EPYC adoption is also growing in the Enterprise market

01:04PM EDT - AMD is still ramping their 4th generation "Genoa" EPYC processors

01:04PM EDT - Genoa technically launched last November

01:04PM EDT - AMD thinks Genoa is still by far the highest performance and most efficient processor in the industry

01:04PM EDT - Performance comparisons between EPYC and Intel's 4th gen Xeon platform (Sapphire Rapids)

01:05PM EDT - "We want leadership performance. But we must have best-in-class energy efficiency"

01:06PM EDT - The vast majority of AI workloads today are still being run on CPUs

01:06PM EDT - So AMD sees themselves as having a big stake - and big advantage - in that market

01:06PM EDT - Expect no shortage of guests today. Starting with AWS VP Dave Brown

01:07PM EDT - AMD and AWS are continuing to collaborate

01:08PM EDT - AWS has introed over 100 different AMD-based instances at this point

01:09PM EDT - AWS has a broad range of customers, who have benefitted from the cost savings of using AMD instances

01:09PM EDT - Brown is talking about the various things AWS's customers have been up to - and how much money they've saved

01:10PM EDT - What's next for the AMD/AWS partnership?

01:11PM EDT - AWS is building new EC2 instances using EPYC 9004 processors and AWS's Nitro system

01:11PM EDT - Announcing M7a instances for general purpose computing

01:12PM EDT - Up to 50% more perf than M6a instances

01:12PM EDT - Preview available today

01:13PM EDT - General availability in Q3

01:13PM EDT - AMD is using AWS today for their data analytics workloads

01:13PM EDT - AMD will be expanding their use of AWS to use the service for more technical workloads like EDA

01:14PM EDT - And that's AWS

01:14PM EDT - "We're really pleased with the response we're getting on Genoa"

01:14PM EDT - Oracle is also announcing new Genoa instances that will be available in July

01:14PM EDT - "Genoa is ramping nicely"

01:15PM EDT - More customres coming online in the coming weeks

01:15PM EDT - Now talking about the breadth of AMD's data center product stack

01:16PM EDT - Cloud computing clients have different needs than AMD's standard EPYC customers

01:16PM EDT - Which brings us to Bergamo

01:16PM EDT - AMD's density-optimized CPU design for higher core counts

01:16PM EDT - 128 cores per socket "for leadership performance and energy efficiency in the cloud"

01:16PM EDT - Lisa loves her chips

01:16PM EDT - "Our chips"

01:17PM EDT - 8 CCDs, each with 16 Zen 4c cores

01:17PM EDT - Same IOD as Genoa

01:17PM EDT - 82 billion transistors

01:17PM EDT - Zen 4c core is 2.48mm2 on TSMC 5nm, versus 3.84mm2 for Zen 4

01:18PM EDT - AMD starts from the same RTL as Zen 4, and then optimize the physical implementation for reduced area

01:18PM EDT - 35% smaller core, and substantially better perf-per-watt

01:18PM EDT - The only real difference between Genoa and Bergamo is the CCDs

01:18PM EDT - 8x12 versus 16x8

01:19PM EDT - Genoa and Bergamo use the same SP5 socket, and can be swapped

01:19PM EDT - Now for performance comparison benchmarks versus Intel's 4th gen Xeon

01:19PM EDT - Up to 2.0x energy efficiency

01:20PM EDT - Bergamo is shipping in volume now to AMD's hyperscale customers

01:20PM EDT - And now for another guest: Meta VP Infrastructure, Alexis Bjorlin

01:21PM EDT - Meta and AMD have been collabing on EPYC server design since 2019

01:22PM EDT - Meta is a big supporter and provider for the Open Compute Project (OCP)

01:22PM EDT - So Meta's server designs are in significant use in the world

01:23PM EDT - AMD has proven to be able to meet their commitments to Meta

01:24PM EDT - Some of the insights from Meta have helped to shape Bergamo

01:24PM EDT - Meta will be deploying Bergamo for their next-gen high density server platform

01:24PM EDT - With substantial TCO improvements over Milan

01:25PM EDT - AMD is looking forward to the coming years with their Meta partnership. And that's Meta.

01:25PM EDT - And that's Bergamo as well

01:26PM EDT - Now on to technical computing workloads

01:26PM EDT - Dan McNamara is now taking the stage. SVP and GM of AMD's server business unit

01:26PM EDT - Delivering two new products today

01:27PM EDT - He's starting with a look at how AMD has optimized its designed for the "technical computing" market

01:27PM EDT - AMD's second generation V-cache technology

01:27PM EDT - Over 1GB of L3 cache on a 96 core EPYC CPU

01:27PM EDT - 4th gen EPYC, Genoa-X

01:28PM EDT - Like Bergamo, Genoa-X is available now

01:28PM EDT - 4 new SKUs, from 16 cores to 96 cores

01:28PM EDT - Genoa-X is aimed at technical computing. Workloads that can benefit from substantially larger L3 cache sizes

01:29PM EDT - Now for some performance slides with some EDA workloads

01:29PM EDT - Versus Intel, of course

01:29PM EDT - Platforms featuring Genoa-X will be available next quarter

01:30PM EDT - Another guest: Microsot's GM for Azure (apologies, didn't get the name)

01:31PM EDT - Talking about the history of Azure's HB series instances

01:31PM EDT - Performance comparisons using ANSYS Fluent 2021 R1

01:32PM EDT - 4x performance growth in 4 years

01:32PM EDT - Announcing general availability of HBv4

01:33PM EDT - 1.2TB/sec of memory bandwidth on hBv4

01:34PM EDT - Azure is also offering the HX series for even higher performance (and lower latency)

01:35PM EDT - And now talking a bit about Azure's customer adoption, and what they've been doing with their instances

01:36PM EDT - Azure is going to be 100% renewable energy by 2025

01:36PM EDT - Which is helpful for their customers who are wanting to get to net-zero carbon emissions

01:38PM EDT - Meanwhile, ST Micro has been able to reduce their simulation time by 30%

01:38PM EDT - And that's Azure

01:38PM EDT - VMs with Genoa-X now available

01:39PM EDT - Final piece of the Zen 4 portfolio: Siena

01:39PM EDT - AMD's low-cost EPYC processor for telco and other markets

01:39PM EDT - More on that in the second half of the year

01:39PM EDT - Now on to Forrest Norrad, EVP and GM of AMD's data center solutions business group

01:40PM EDT - Who is bringing on another guest: Jeff Maurona, Managing Director and COO of Citadel Securities

01:40PM EDT - "World's most profitable hedge fund"

01:40PM EDT - As well as the world's largest market-making firm

01:41PM EDT - Citadel is basically doing real-time predictive analytics

01:43PM EDT - Citadel transistioned to AMD in late 2020

01:43PM EDT - EPYC's memory bandwidth in particular has unlocked a lot of performance for Citadel

01:44PM EDT - Citadel finds Xilinx FPGAs to be absolutely essential as well

01:45PM EDT - And that's Citadel

01:45PM EDT - Citadel is using over a million CPU cores in the cloud

01:46PM EDT - Now focusing on AMD's network portfolio. One of their recent expansions via the Pensando acquisition

01:46PM EDT - Networking is an increasingly important part of the data center market - and thus AMD's own offerings

01:47PM EDT - Forrest is talking about the challenges of offering a hybrid cloud environment

01:48PM EDT - Focusing in part on the CPU overhead involved in offering those services while maintaining the necessary isolation

01:48PM EDT - Now talking about the P4 DPU architecture

01:48PM EDT - A purpose-built architecture to provide important services at line rate

01:48PM EDT - While being fully programmable

01:49PM EDT - DPUs offload a good chunk of the CPU overhead

01:49PM EDT - Reducing the need for a set of external appliances

01:50PM EDT - And as part of Pensando SmartNICs, offer multiple new use cases

01:50PM EDT - Available on major public clouds

01:51PM EDT - Deployed into an existing infrastructure, or designed into a new one

01:52PM EDT - AMD is working with HP Aruba to develop a smart switch. An industry-standard switch enhanced with P4 DPUs

01:53PM EDT - And that's how AMD is helping customers evolve their data center environments and make them more efficient

01:53PM EDT - And now back to Lisa Su for a look at AI

01:54PM EDT - (An aurora background? That has to be intentional...)

01:54PM EDT - 3 key areas for AI: broad portfolio of CPUs and GPUs, open and proven software platform, and a deep ecosystem of partners

01:55PM EDT - AMD is uniquely positioned with a broad collection of AI platforms across everything from client to server

01:56PM EDT - Lisa is talking about some of the customers using AMD hardware today for AI tasks, not the least of which being NASA

01:56PM EDT - Meanwhile AMD is expecting more than 70 laptop designs to launch through later this year featuring Ryzen AI

01:57PM EDT - $150B+ opportunity in the data center AI acceleration market

01:57PM EDT - "We are very, very early in the lifecycle of the AI market"

01:57PM EDT - 50% compound annual growth rate, from $30B today

01:58PM EDT - Talking about some of AMD's supercomputer wins, including, of course, Frontier, the industry's first exascale supercomputer

01:59PM EDT - And now rolling a video on the Lumi supercomputer (#3 on the current Top500 list)

02:00PM EDT - More accurate and better models as a result of Lumi

02:00PM EDT - Generative AI requires hardware as well as good software

02:00PM EDT - Now on stage: AMD's President, Victor Peng, to talk about the software side of matters

02:01PM EDT - Peng also heads up AMD's newly formed AI group

02:01PM EDT - AMD software development: open, proven, ready

02:02PM EDT - Talking about some of AMD's accomplishments to date

02:02PM EDT - Recapping Ryzen 7040 series and Ryzen AI

02:03PM EDT - As well as sampling new Vitis AI products

02:03PM EDT - This requires a "leadership software stack"

02:04PM EDT - Recappoing the ROCm stack

02:04PM EDT - A significant portion of which is open source

02:04PM EDT - ROCm in its fifth generation, with a comprehensive suite of AI optimizations

02:04PM EDT - FP8 data type support, optimized kernels, etc

02:05PM EDT - Another guest on stage: Soumith Chintala, the founder of PyTorch and VP at Meta

02:06PM EDT - Recapping PyTorch and what it's used for. One of the most popular AI frameworks on the market

02:06PM EDT - Recently released PyTorch 2.0

02:08PM EDT - How does AMD's collab benefit the developer community?

02:09PM EDT - Removed a lot of the work required/friction in moving platforms

02:10PM EDT - And that's PyTorch

02:10PM EDT - PyTorch 2.0 offers day-0 support for ROCm 5

02:10PM EDT - Now on to talking about AI models

02:11PM EDT - Another guest on stage, Clement Delangue, CEO of Hugging Face

02:11PM EDT - Sharing his thoughts on why open source matters in AI

02:12PM EDT - Giving companies the tools to build AI themselves, rather than just relying on provided tools

02:13PM EDT - AMD and Hugging Face recently formalized their partnership, which is being announced today

02:14PM EDT - Hugging Face is the most used open platform for AI

02:14PM EDT - Over 5000 new models added to their service just last week

02:15PM EDT - And they will be optimizing these models for AMD's platforms

02:16PM EDT - "Democratize AI"

02:17PM EDT - AMD, of course, shares in this vision, which is why they're working with Hugging Face

02:17PM EDT - The rate of innovation for AI is unprecidented

02:18PM EDT - And that's AMD's software stack

02:18PM EDT - Now back to Lisa Su for hardware

02:18PM EDT - "We've made a tremendous amount of progress over the past year with ROCm"

02:18PM EDT - Turning to AI hardware

02:19PM EDT - Generative AI and LLMs have changed the landscape

02:19PM EDT - For both training and inference

02:19PM EDT - At the center of this are GPUs

02:19PM EDT - Turning to the Instinct GPU roadmap

02:19PM EDT - AMD's CDNA 3 GPU architecture

02:19PM EDT - New compute engine, latest data formats, 5/6nm processes

02:20PM EDT - Recapping the MI300, now known as the MI300A

02:20PM EDT - 24 Zen 4 CPU cores, 128GB HBM3 memory

02:20PM EDT - All in a single package with unified memory across the CPU and GPU

02:20PM EDT - MI300A is sampling now

02:20PM EDT - MI300A is slated for use in the El Capitan supercomputer in LLNL

02:21PM EDT - 13 chiplets in MI300A

02:21PM EDT - MI300 is now a family of products

02:21PM EDT - AMD is replacing the Zen CPU chiplets to create a GPU-only version: MI300X

02:22PM EDT - 129GB HBM3, 5.2 TB/second of memory bandwidth, 896GB/ec memory bandwith, 153B transistors

02:22PM EDT - "Leadership generative AI accelerator"

02:22PM EDT - It looks very similar to MI300A. Removed 3 CPU chiplets, added 2 GPU chiplets

02:22PM EDT - 12 5nm and 6nm chiplets in total

02:23PM EDT - So AMD has done an XPU 3+ years before Intel

02:23PM EDT - Comparing MI300X to NVIDIA's H100 accelerator in terms of HBM3 density and bandwidth

02:24PM EDT - AMD supports 8 HBM3 stacks, versus 6 on H100, which gives them a capacity and bandwidtbh advantage

02:24PM EDT - Doing a live demo with the Falcon-40B model running on one MI300X

02:26PM EDT - More memory and more memory bandwidth allows larger models, and also running LLMs on fewer GPUs overall

02:26PM EDT - Which AMD believes will offer a TCO advantage

02:27PM EDT - Also announcing the AMD Instinct Platform

02:27PM EDT - 8-way MI300X configuration

02:27PM EDT - Using an OCP platform (so OAM?)

02:27PM EDT - 1.5TB total HBM3 memory

02:28PM EDT - Making it easier to deploy MI300 into their existing server and AI infrastructure

02:28PM EDT - MI300A began sampling earlier this quarter. MI300X and the 8-way platform sampling in Q3 of this year

02:29PM EDT - Expecting both products to ramp into production later this year

02:29PM EDT - Now wrapping things up

02:29PM EDT - Recapping all of AMD's announcements, and the scope of AMD's overall data center product lineup

02:30PM EDT - And that's a wrap on AMD's data center event

02:30PM EDT - Thanks for joining us. Now off to find out more about MI300X

AMD Data Center and AI Technology Premiere Live Blog (Starts at 10am PT/17:00 UTC)

Log in

Don't have an account? Sign up now