Skip to content
Main Navigation Puget Systems Logo
  • Solutions
    • Content Creation
      • Photo Editing
        • Recommended Systems For:
        • Adobe Lightroom Classic
        • Adobe Photoshop
        • Stable Diffusion
      • Video Editing & Motion Graphics
        • Recommended Systems For:
        • Adobe After Effects
        • Adobe Premiere Pro
        • DaVinci Resolve
        • Foundry Nuke
      • 3D Design & Animation
        • Recommended Systems For:
        • Autodesk 3ds Max
        • Autodesk Maya
        • Blender
        • Cinema 4D
        • Houdini
        • ZBrush
      • Real-Time Engines
        • Recommended Systems For:
        • Game Development
        • Unity
        • Unreal Engine
        • Virtual Production
      • Rendering
        • Recommended Systems For:
        • Keyshot
        • OctaneRender
        • Redshift
        • V-Ray
      • Digital Audio
        • Recommended Systems For:
        • Ableton Live
        • FL Studio
        • Pro Tools
    • Engineering
      • Architecture & CAD
        • Recommended Systems For:
        • Autodesk AutoCAD
        • Autodesk Inventor
        • Autodesk Revit
        • SOLIDWORKS
      • Visualization
        • Recommended Systems For:
        • Enscape
        • Lumion
        • Twinmotion
      • Photogrammetry & GIS
        • Recommended Systems For:
        • ArcGIS Pro
        • Agisoft Metashape
        • Pix4D
        • RealityCapture
    • AI & HPC
      • Recommended Systems For:
      • Data Science
      • Generative AI
      • Large Language Models
      • Machine Learning / AI Dev
      • Scientific Computing
    • More
      • Recommended Systems For:
      • Compact Size
      • Live Streaming
      • NVIDIA RTX Studio
      • Quiet Operation
      • Virtual Reality
    • Business & Enterprise
      We can empower your company
    • Government & Education
      Services tailored for your organization
  • Products
    • Puget Mobile
      Powerful laptop workstations
      • Puget Mobile 16″
    • Puget Workstations
      High-performance desktop PCs
      • AMD Ryzen
        • Ryzen 9000:
        • Small Form Factor
        • Mini Tower
        • Mid Tower
        • Full Tower
      • AMD Threadripper
        • Threadripper 7000:
        • Mid Tower
        • Full Tower
        • Threadripper PRO 7000WX:
        • Full Tower
      • AMD EPYC
        • EPYC 9004/9005:
        • Full Tower
      • Intel Core Ultra
        • Core Ultra Series 2:
        • Small Form Factor
        • Mini Tower
        • Mid Tower
        • Full Tower
      • Intel Xeon
        • Xeon W-2500:
        • Mid Tower
        • Xeon W-3500:
        • Full Tower
    • Custom Computers
    • Puget Rackstations
      Workstations in rackmount chassis
      • AMD Rackstations
        • Ryzen 9000:
        • R132-4U
        • R550-6U 5-Node
        • Threadripper 7000:
        • T121-4U
        • T120-5U
        • Threadripper PRO 7000WX:
        • T141-4U
        • T140-5U (Dual 5090s)
        • EPYC 9004:
        • E140-4U (Quad 4090s)
      • Intel Rackstations
        • Core Ultra Series 2:
        • C132-4U
        • Xeon W-3500:
        • X131-4U
        • X141-5U
    • Custom Rackmount Workstations
    • Puget Servers
      Enterprise-class rackmount servers
      • Rackmount Servers
        • AMD EPYC:
        • E200-1U
        • E120-2U
        • E140-2U
        • E280-4U
        • E281-4U
        • Intel Xeon:
        • X200-1U
        • X240-2U
    • Comino Grando GPU Servers
    • Custom Servers
    • Puget Storage
      Solutions from desktop to datacenter
      • Network-Attached Storage
        • Synology NAS Units:
        • 4-bay DiskStation
        • 8-bay DiskStation
        • 12-bay DiskStation
        • 4-bay RackStation
        • 12-bay FlashStation
      • Software-Defined Storage
        • Datacenter Storage:
        • 12-Bay 2U
        • 24-Bay 2U
        • 36-Bay 4U
    • Recommended Third Party Peripherals
      Curated list of accessories for your workstation
    • Puget Gear
      Quality apparel with Puget Systems branding
  • Publications
    • Articles
    • Blog Posts
    • Case Studies
    • HPC Blog
    • Podcasts
    • Press
    • PugetBench
  • Support
    • Contact Support
    • Support Articles
    • Warranty Details
    • Onsite Services
    • Unboxing
  • About Us
    • About Us
    • Contact Us
    • Our Customers
    • Enterprise
    • Gov & Edu
    • Press Kit
    • Testimonials
    • Careers
  • Talk to an Expert
  • My Account
  1. Home
  2. /
  3. Hardware Articles
  4. /
  5. Stable Diffusion Performance – NVIDIA RTX vs Radeon PRO

Stable Diffusion Performance – NVIDIA RTX vs Radeon PRO

Posted on July 31, 2023 (November 15, 2023) by Evan Lagergren
Always look at the date when you read an article. Some of the content in this article is most likely out of date, as it was written on July 31, 2023. For newer information, see our more recent articles.

Table of Contents

  • Introduction
  • Test Setup
  • Automatic 1111
  • SHARK
  • PugetBench for Stable Diffusion
  • Is NVIDIA RTX or Radeon PRO faster for Stable Diffusion?

Introduction

Stable Diffusion is a deep learning model that is increasingly used in the content creation space for its ability to generate and manipulate images using text prompts. Stable Diffusion is unique among creative workflows in that, while it is being used professionally, it lacks commercially-developed software and is instead implemented in various open-source applications. Additionally, in contrast to other similar text-to-image models, Stable Diffusion is often run locally on your system rather than being accessible with a cloud service.

Stable Diffusion can run on a midrange graphics card with at least 8 GB of VRAM but benefits significantly from powerful, modern cards with lots of VRAM. We have published our own benchmark testing methodology for Stable Diffusion, and in this article, we will be looking at the performance of a large variety of Professional GPUs from AMD and NVIDIA that were released over the last five years. If you are interested in the performance of Consumer GPUs, we have also published an article covering over a dozen of those.

We want to point out that Tom’s Hardware also published their results back in January for an even wider variety of consumer GPUs. However, we could not fully replicate their results, so the numbers we are showing are slightly different. We do not believe that this is due to any issue with testing methodology from each party, but rather that Stable Diffusion is a constantly evolving set of tools. How it works today is very different than how it did even six months ago.

NVIDIA RTX A6000 and AMD Radeon PRO W7900 on a blue background with a
Image
Open Full Resolution

Below are the specifications for the cards tested:

GPUMSRPVRAMCUDA/Stream ProcessorsSingle-Precision PerformancePowerLaunch Date
RTX 6000 Ada Generation$6,80048 GB18,17691.1 TFLOPS300 WDec. 2022
RTX A6000$4,65048 GB10,75238.7 TFLOPS300 WOct. 2020
Radeon PRO W7900$4,00048 GB6,14461.32 TFLOPS295 WApril 2023
RTX A5000$2,50024 GB8,19227.8 TFLOPS230 WApril 2021
Radeon PRO W7800$2,50032 GB4,48044.78 TFLOPS260 WApril 2023
Radeon PRO W6800$2,25032 GB3,84017.83 TFLOPS250 WJune 2021
Radeon PRO VII$1,90016 GB3,84013.06 TFLOPS250 WMay 2020
Radeon PRO W9100$1,60016 GB4,09612.29 TFLOPS230 WJuly 2017

Test Setup

Test Platform

CPU: AMD Threadripper PRO 5975WX 32-Core
CPU Cooler: Noctua NH-U14S TR4-SP3 (AMD TR4)
Motherboard: ASUS Pro WX WRX80E-SAGE SE WIFI
RAM: 8x Micron DDR4-3200 16GB ECC Reg. (128GB total)
GPUs:
AMD Radeon PRO W7900 48GB
AMD Radeon PRO W7800 32GB
AMD Radeon PRO W6800 32GB
AMD Radeon Pro VII 16GB
AMD Radeon Pro WX 9100 16GB

NVIDIA RTX 6000 Ada 48GB
NVIDIA RTX A6000 48GB
NVIDIA RTX A5000 24GB
PSU: Super Flower LEADEX Platinum 1600W
Storage: Samsung 980 Pro 2TB
OS: Windows 11 Pro 64-bit (22621)

Benchmark Software

Automatic 1111
Version: 1.5.1, xformers: 0.0.17
Checkpoint: v1-5-pruned-emaonly
Automatic 1111 (lshqqytiger AMD fork)
Version: 1.3.1
Checkpoint: v1-5-pruned-emaonly
SHARK
Version: 20230701_796
Checkpoint: stabilityai/stable-diffusion-2-1-base
PugetBench for Stable Diffusion 0.3.0 alpha

To test the performance in Stable Diffusion, we used one of our fastest platforms in the AMD Threadripper PRO 5975WX, although CPU should have minimal impact on results. Following our test methodology, we used three implementations of Stable Diffusion: Automatic 1111, SHARK, and our custom in-development benchmark and the prompts given in our methodology article.

It is important to note that our primary goal is to test the latest public releases of the most popular Stable Diffusion implementations. While many would consider these to be cutting-edge already, there are even newer things that could be implemented, such as updated CUDA and PyTorch versions. However, these are not always stable and are not yet integrated into the public releases of Automatic 1111 and SHARK. We want to focus on what end-users would most likely use for real-world, professional applications today, rather than testing right at the bleeding edge. However, We will note that our “PugetBench” version of Stable Diffusion uses the latest versions of CUDA and PyTorch (among others) at the time of this article.

Tower Computer Icon in Puget Systems Colors

Looking for a Content Creation Workstation?

We build computers tailor-made for your workflow. 

Configure a System!
Talking Head Icon in Puget Systems Colors

Don’t know where to start?
We can help!

Get in touch with our technical consultants today.

Talk to an Expert

Automatic 1111

Stable Diffusion Automatic 1111 w/ xformers Geometric Mean Iterations per Second - Higher is Better. RTX 6000 Ada Generation: 21.11 RTX A6000: 19.09 RTX A5000: 15.3 Radeon Pro VII: 4.68 Radeon PRO W7900: 4.14 Radeon PRO W7800: 3.29 Radeon PRO W6800: 2.25 Radeon Pro WX 9100: 0
Image
Open Full Resolution

Starting off looking at the Automatic 1111 implementation with xFormers enabled, we see that the NVIDIA cards dramatically outperform the AMD cards, with the slowest NVIDIA card tested–the RTX A5000–having over three times the iterations per second as the fastest AMD card–the Radeon Pro VII. Otherwise, NVIDIA shows expected scaling, with the most expensive RTX 6000 Ada having the highest results, followed by the RTX A6000. Given the price gap between the 6000 Ada and the A6000, it may not be worth the relatively small performance bump.

Automatic 1111 isn’t typically the preferred implementation for AMD GPUs, however, and we see mixed results overall, with the year-old Radeon Pro VII outperforming the three-month-old W7900 by a small amount. The rest of the PRO cards fall in line with expectations, although the Pro WX 9100 did not complete the benchmark with this implementation. It is possible further tweaking could fix the compatibility issue, but we wanted to stick as close to the “stock” setup for each implementation as possible.

SHARK

Stable Diffusion Automatic 1111 w/ xformers Geometric Mean Iterations per Second - Higher is Better. Radeon PRO W7900: 18.53 RTX 6000 Ada Generation: 14.76 RTX A6000: 10.56 RTX A5000: 9.15 Radeon PRO W6800: 6.9 Radeon PRO W7800: 5.64 Radeon Pro WX 9100: 3.83 Radeon Pro VII: 0
Image
Open Full Resolution

Moving on to SHARK, we see why AMD users tend to favor this implementation. The Radeon PRO W7900 has the highest results of all the cards we tested, followed by the NVIDIA suite in its expected order. Interestingly, the Radeon PRO W6800 had results nearly 20% higher than its new replacement, the W7800–it seems likely this is a bug, but we are unsure if it is with the implementation or our testing. The Radeon Pro VII did not complete testing with the SHARK implementation, likely due to it lacking some newer features that are required by this package.

With AMD topping the board here, it is worth noting that AMD’s W7900 performance with SHARK gives comparable results to NVIDIA’s RTX A6000 in Automatic 1111, making it a viable alternative when using the correct implementation. This shows that using the proper implementation is extremely important as it can have anywhere from a 30% decrease in iterations per second to a massive 400% increase depending on the type of GPU you have.

PugetBench for Stable Diffusion

Stable Diffusion PugetBench for Stable Diffustion 0.3.0 alpha Geometric Mean Iterations per Second - Higher is Better. RTX 6000 Ada Generation: 22.97 RTX A6000: 20.06 RTX A5000: 16.28
Image
Open Full Resolution

In addition to the two most common packages for Stable Diffusion, we also have our own implementation that is still early in development. We are focusing it on benchmarking, which allows us to remove a lot of the bloat that is necessary when making a robust tool like Automatic 1111 or SHARK. Currently, it only supports NVIDIA cards, although we plan to add AMD support.

This implementation performs almost identically to the Automatic 1111 implementation with xFormers, although it eschews that package in favor of an updated PyTorch library. This means that when Automatic 1111 and SHARK update to the latest version of PyTorch, we likely won’t see a massive performance shift, although it will make the codebase much cleaner.

Tower Computer Icon in Puget Systems Colors

Looking for a Content Creation Workstation?

We build computers tailor-made for your workflow. 

Configure a System!
Talking Head Icon in Puget Systems Colors

Don’t know where to start?
We can help!

Get in touch with our technical consultants today.

Talk to an Expert

Is NVIDIA RTX or Radeon PRO faster for Stable Diffusion?

Although this is our first look at Stable Diffusion performance, what is most striking is the disparity in performance between various implementations of Stable Diffusion: up to four times the iterations per second for some GPUs. NVIDIA offered the highest performance on Automatic 1111, while AMD had the best results on SHARK, and the highest-end GPU on their respective implementations had relatively similar performance.

Assuming you are not tied to using either Automatic 1111, SHARK, or a different implementation, both AMD and NVIDIA offer similar performance at the top-end, with the AMD Radeon PRO W7900 48GB and NVIDIA RTX A6000 48GB both giving about 19 it/s. This currently gives AMD a slight price/performance advantage, although we will note that developers tend to give priority to NVIDIA GPUs, so that may change in the future. AMD is doing a lot of work in this vertical to improve GPU support, but that hasn’t quite caught up to NVIDIA.

If you have a bit more of a budget, however, NVIDIA is clearly the way to go. The NVIDIA RTX 6000 Ada Generation 48GB is the fastest GPU in this workflow that we tested. Conversely, if you are on more of a “budget”, NVIDIA may have the most compelling offering. The NVIDIA RTX A5000 24GB may have less VRAM than the AMD Radeon PRO W7800 32GB, but it should be around three times faster. If you are training models, the extra VRAM may still make AMD the right route, but if you are generating images, you likely will not need that additional VRAM as most Stable Diffusion models are optimized to generate either 512×512 or 768×7768 images.

Stable Diffusion is still a relatively new technology with rapid ongoing development, so it is worth noting that performance will only improve over the next few months and years. While we don’t expect that there will be a drastic change in relative performance, the results we found in this article are likely to shift over time.

If you are looking for a workstation for any of the applications we tested, you can visit our solutions page to view our recommended workstations for various software packages, our custom configuration page, or contact one of our technology consultants for help configuring a workstation that meets the specific needs of your unique workflow.

Tower Computer Icon in Puget Systems Colors

Looking for a content creation workstation?

We build computers tailor-made for your workflow. 

Configure a System
Talking Head Icon in Puget Systems Colors

Don’t know where to start?
We can help!

Get in touch with one of our technical consultants today.

Talk to an Expert

Related Content

  • NVIDIA RTX PRO 6000 Blackwell Workstation Content Creation Review
  • NVIDIA Blackwell GPU GenAI Software Support
  • Choosing the Right CPU for Unreal Engine: Ryzen X3D vs Threadripper
  • Do Video Editors Need GeForce RTX 50 Series GPUs?
View All Related Content

Latest Content

  • Unreal Fest 2025: Highlights from Orlando
  • NVIDIA RTX PRO 6000 Blackwell Workstation Content Creation Review
  • Puget Mobile 16″ Laptop (2025) Product Showcase
  • Why PugetBench for Premiere Pro is Dropping HEVC Encoding Tests
View All
Tags: AMD, GPU, NVIDIA, Radeon Pro VII, Radeon Pro W6800, Radeon Pro W7800, Radeon Pro W7900, Radeon Pro WX 9100, RTX 6000 Ada, RTX A5000, RTX A6000, stable diffusion

Who is Puget Systems?

Puget Systems builds custom workstations, servers and storage solutions tailored for your work.

We provide:

Extensive performance testing
making you more productive and giving better value for your money

Reliable computers
with fewer crashes means more time working & less time waiting

Support that understands
your complex workflows and can get you back up & running ASAP

A proven track record
as shown by our case studies and customer testimonials

Get Started

Browse Systems

Puget Systems Mobile Laptop Workstation Icon

Mobile

Puget Systems Tower Workstation Icon

Workstations

Puget Systems Rackmount Workstation Icon

Rackstations

Puget Systems Rackmount Server Icon

Servers

Puget Systems Rackmount Storage Icon

Storage

Latest Articles

  • Unreal Fest 2025: Highlights from Orlando
  • NVIDIA RTX PRO 6000 Blackwell Workstation Content Creation Review
  • Puget Mobile 16″ Laptop (2025) Product Showcase
  • Why PugetBench for Premiere Pro is Dropping HEVC Encoding Tests
  • NVIDIA Blackwell GPU GenAI Software Support
View All

Post navigation

 Stable Diffusion Performance – NVIDIA GeForce VS AMD RadeonAMD Radeon PRO W7600 and W7500 Content Creation Review 
Puget Systems Logo
Build Your Own PC Site Map FAQ
facebook instagram linkedin rss twitter youtube

Optimized Solutions

  • Adobe Premiere
  • Adobe Photoshop
  • Solidworks
  • Autodesk AutoCAD
  • Machine Learning

Workstations

  • Content Creation
  • Engineering
  • Scientific PCs
  • More

Support

  • Online Guides
  • Request Support
  • Remote Help

Publications

  • All News
  • Puget Blog
  • HPC Blog
  • Hardware Articles
  • Case Studies

Policies

  • Warranty & Return
  • Terms and Conditions
  • Privacy Policy
  • Delivery Times
  • Accessibility

About Us

  • Testimonials
  • Careers
  • About Us
  • Contact Us
  • Newsletter

© Copyright 2025 - Puget Systems, All Rights Reserved.