TL;DR: NVIDIA GeForce RTX 4090 24GB Content Creation Performance
Overall, the new NVIDIA GeForce RTX 4090 24GB GPU represents a massive leap in GPU performance. The exact amount depends highly on the application, with the greater benefit of course being found when the GPU is the primary bottleneck to performance.
For video editing, the RTX 4090 can be as much as 40% faster than the previous generation RTX 3090 and 3090 Ti, or almost 2x faster than the older RTX 2080 Ti. The RTX 40 Series also brings about a small performance boost for those using the GPU for either hardware decoding or encoding of H.264 and HEVC media.
Unreal Engine sees an even greater performance gain, with the RTX 4090 giving us roughly an 85% increase in FPS over the RTX 3090 and 3090 Ti across all our tests. Depending on the exact use case (ArchViz, Virtual Production, etc.), that means either faster renders, smoother performance, or the capacity for increased detail.
Lastly, GPU rendering is really where you are going to get the most out of a more powerful GPU, and the RTX 4090 comes through in spades. GPU Rendering is often nearly twice as fast as the previous generation RTX 3090 or 3090 Ti, or four times faster than the older RTX 2080 Ti.
With these performance gains come two major concerns, however. First, the new RTX 40 series (and their Quadro counterparts) apparently are doing away with NVLink entirely. This won't affect the majority of users, but if you have a workflow that requires, or benefits from, NVLink, be aware that that won't be an option. Second, the RTX 4090 requires four PCIe power plugs and is a full three slots in width. This is going to limit you to two GPUs at most outside of very specific circumstances.
Introduction
NVIDIA GPUs have long been a staple in our content creation workstations, as they provide a terrific mix of top-tier performance and reliability. For certain workflows, they are simply a requirement due to their proprietary CUDA technology. And now, they are launching the latest of the GeForce RTX lineup: the NVIDIA GeForce RTX 4090 24GB.
While the RTX 4090 only has the same 24GB of VRAM as the previous generation RTX 3090 and 3090 Ti, NVIDIA is advertising a very large increase in overall performance. Most of the focus so far has been on gaming, however, so we are looking forward to seeing how much faster the RTX 4090 is for content creation tasks like video editing, game development, and GPU rendering.
If you want to see the full specs for the latest NVIDIA GPUs we recommend checking out the NVIDIA GeForce RTX 40 Series product page. But at a glance, here are what we consider to be the most important specs:
GPU | VRAM | Cores | Boost Clock | Power | MSRP |
---|---|---|---|---|---|
RTX 3060 | 12GB | 3,584 | 1.78 GHz | 170W | $329 |
RTX 3060 Ti | 8GB | 4,864 | 1.67 GHz | 200W | $399 |
RTX 3070 | 8GB | 5,888 | 1.70 GHz | 220W | $499 |
RTX 3070 Ti | 8GB | 6,144 | 1.77 GHz | 290W | $599 |
RTX 3080 | 10GB | 8,704 | 1.71 GHz | 320W | $699 |
RTX 3080 Ti | 12GB | 10,240 | 1.67 GHz | 350W | $1,199 |
RTX 3090 | 24GB | 10,496 | 1.73 GHz | 350W | $1,499 |
RTX 4090 | 24GB | 16,384 | 2.52 GHz | 450W | $1,599 |
RTX 3090 Ti | 24GB | 10,752 | 1.86 GHz | 450W | $1,999 |
Although the RTX 4090 is priced slightly higher than the previous generation RTX 3090, the specs on paper are extremely impressive, with a 56% increase in CUDA cores and a 46% increase in boost clock. We will have to see how this translates to real-world performance, but there is no doubt that it will be significantly faster than any other NVIDIA GPU currently on the market.
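For readers who want to check the on-paper gains themselves, here is a minimal sketch that recomputes the two percentages from the table values. The helper name `percent_increase` is our own, chosen for illustration.

```python
# Spec values copied from the table above (CUDA cores, boost clock in GHz).
specs = {
    "RTX 3090": {"cores": 10_496, "boost_ghz": 1.73},
    "RTX 4090": {"cores": 16_384, "boost_ghz": 2.52},
}


def percent_increase(old: float, new: float) -> float:
    """Return how much larger `new` is than `old`, as a percentage."""
    return (new / old - 1.0) * 100.0


old, new = specs["RTX 3090"], specs["RTX 4090"]
core_gain = percent_increase(old["cores"], new["cores"])
clock_gain = percent_increase(old["boost_ghz"], new["boost_ghz"])

print(f"CUDA cores:  +{core_gain:.0f}%")
print(f"Boost clock: +{clock_gain:.0f}%")
```

These are paper specs only, so they say nothing on their own about how a given application will scale.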
While not a part of the specs above, another change NVIDIA is making with the RTX 40 Series is to have dual NVIDIA Encoders (NVENC). This should allow these cards to be significantly faster when encoding supported versions of H.264 and HEVC media, and we are very curious to see how well it works.
Puget Systems offers a range of powerful and reliable systems that are tailor-made for your unique workflow.
Test Setup
Test Platform | |
---|---|
CPU | AMD Threadripper PRO 5975WX 32-Core |
CPU Cooler | Noctua NH-U14S TR4-SP3 (AMD TR4) |
Motherboard | Asus Pro WS WRX80E-SAGE SE WIFI |
RAM | 8x Micron DDR4-3200 16GB ECC Reg. (128GB total) |
Video Card | 1-2x NVIDIA GeForce RTX 4090 24GB, NVIDIA GeForce RTX 3090 Ti 24GB, 1-2x NVIDIA GeForce RTX 3090 24GB, NVIDIA GeForce RTX 3080 Ti 12GB, NVIDIA GeForce RTX 3080 10GB, AMD Radeon RX 6900XT 16GB |
Hard Drive | Samsung 980 Pro 2TB |
Software | Windows 11 Pro 64-bit (2009) |
Benchmarks | PugetBench for After Effects 0.95.2 (After Effects 22.4), PugetBench for Premiere Pro 0.95.5 (Premiere Pro 22.6.1), PugetBench for DaVinci Resolve 0.93.1 (DaVinci Resolve Studio 18.0.2), Unreal Engine 4.26, OctaneBench 2020.1.5, Redshift 3.5.08, Blender Benchmark 3.3.0, V-Ray Benchmark 5.02.00 |
*Latest drivers, OS updates, BIOS, and firmware as of October 3rd, 2022
To see how the RTX 4090 performs, we will be comparing it to the full range of RTX 3000 series GPUs as well as the older NVIDIA GeForce RTX 2080 Ti, along with the AMD Radeon RX 6900XT to get a sense of performance versus NVIDIA’s main competitor. The test system we will be using is one of the fastest platforms currently available for most of the applications we are testing and is built around the AMD Threadripper Pro 5975WX in order to minimize any potential CPU bottlenecks.
For the tests themselves, we will be primarily using our PugetBench series of benchmarks using the latest versions of the host applications. Most of these benchmarks include the ability to upload the results to our online database, so if you want to know how your own system compares, you can download and run the benchmark yourself. Our testing is also supplemented with Blender to show off the GPU rendering performance of these cards.
Video Editing: DaVinci Resolve Studio
For our first look at the new RTX 4090, we are going to examine performance in DaVinci Resolve Studio. More so than any other NLE (Non-Linear Editor) currently on the market, Resolve can make terrific use of high-end GPUs, and even multi-GPU setups. The four main areas the GPU is used for are processing GPU effects, debayering (and optionally decoding) RAW media, H.264/HEVC decoding, and H.264/HEVC encoding.
To start off, we want to look at the overall score, which is a combination of everything we test for Resolve. This is often a good indicator of what kind of relative performance a “typical” (if such a thing exists) Resolve user may see with different hardware. Here, the RTX 4090 scores about 9% higher than the RTX 3090 Ti, or 10% higher than the RTX 3090.
However, this is the overall score, which includes a number of tasks that are either purely CPU driven, or often bottlenecked by the CPU. If we look at the GPU Score (chart #2), we get to see how the GPUs perform for tasks like OpenFX and noise reduction where the performance of the GPU itself is typically the limiting factor. In this case, the RTX 4090 ended up being 34% faster than the RTX 3090 Ti, or 42% faster than the RTX 3090. The RTX 4090 isn’t able to quite keep up with a dual RTX 3090 setup, but dual RTX 4090 is a nice 40% faster than dual RTX 3090.
This is also a good place to mention the older RTX 2080 Ti. We included this card in our testing to give a reference for how fast the RTX 4090 is compared to a somewhat older GPU, and for GPU effects, the RTX 4090 came in at just over 2x faster than the RTX 2080 Ti.
Next, we wanted to look specifically at H.264 encoding (chart #3). There is no easy way to test this in a complete vacuum (since you always have to decode or process a source clip), so we are going to be looking at the results for exporting a ProRes project to H.264, where the main bottleneck should be the H.264 encoding portion. The RTX 40 series includes dual encoders, which are supposed to dramatically reduce export times when using NVENC. We didn’t see a massive increase in performance, but the RTX 4090 was able to complete our “ProRes to H.264” test almost exactly 25% faster. It is also worth pointing out that H.264 encoding and decoding are the one area where AMD is able to keep up with NVIDIA. Everywhere else (RED/BRAW processing, GPU Effects, etc.), NVIDIA has a strong lead.
The last primary area where the GPU is used is RED/BRAW debayering (chart #5), but performance for that was not significantly different from the RTX 30 series in our testing. Most likely, we are simply CPU bottlenecked, so until we get even more powerful processors, a faster GPU like the RTX 4090 isn’t going to show any benefit here.
Video Editing: Adobe Premiere Pro
Adobe Premiere Pro may not utilize the GPU quite as much as DaVinci Resolve (and effectively doesn’t take advantage of multi-GPU setups at all), but having a strong GPU can still make an impact depending on your workflow.
Once again, we are going to start with the Overall Score from our Premiere Pro benchmark, which shows the new RTX 4090 coming in at just a small 7% faster than the RTX 3090 and RTX 3090 Ti. You will notice, however, that there really isn’t much in the way of performance difference between the RTX 3080 all the way up to the RTX 3090 Ti. There is a clear benefit to going with NVIDIA, but most of the performance differences between the various NVIDIA GPU models are right along the generational line. The RTX 40 series is faster than the RTX 30 series, which is in turn faster than the RTX 20 series. This is largely because most of the tasks we test in our benchmark are impacted more by the performance of the NVENC/NVDEC (NVIDIA encoder and decoder) on the cards than by the raw performance of the GPU itself.
The one area where a more powerful GPU from within a single generation can get you a boost is for GPU-accelerated effects like Lumetri Color, most blurs, etc. (chart #2). For these tasks, the RTX 4090 is about 14% faster than the RTX 3090 Ti, or about 17% faster than the RTX 3090. That still isn’t anywhere near the gains we saw in Resolve, but keep in mind that this is just with the base version of Premiere Pro. If you use plugins that take advantage of the GPU (such as many noise reduction plugins), the RTX 4090 could give you a much bigger boost than what we are showing here.
Lastly, on chart #3 we are looking at the geometric mean of all our H.264 and HEVC results. The current version of our Premiere Pro benchmark doesn’t pull out a single score based on the type of source codec, but this is something we are starting to do more and more often, and will likely be included natively in the next major update of our benchmark. As we mentioned earlier, performance for H.264/HEVC decoding/encoding is often more dependent on the generation of the GPU, rather than the raw performance since the NVENC and NVDEC portion of the GPU tends to be the same across all models. Still, there is definitely a small, but noticeable, performance gain from the RTX 40 series, with the new RTX 4090 scoring about 13% higher than the RTX 30 series cards.
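To illustrate how a geometric mean combines individual test results into one number, here is a minimal sketch. The test names and scores are hypothetical, made up purely for illustration, and the helper `combined_score` is our own name rather than anything from the benchmark itself.

```python
from statistics import geometric_mean

# Hypothetical per-test scores, made up for illustration (not measured results).
baseline = {
    "H.264 playback": 92.0,
    "HEVC playback": 88.5,
    "H.264 export": 104.0,
    "HEVC export": 97.5,
}


def combined_score(scores: dict) -> float:
    """Geometric mean of the individual test scores."""
    return geometric_mean(list(scores.values()))


# Model a uniform 13% uplift on every test.
uplifted = {name: score * 1.13 for name, score in baseline.items()}

ratio = combined_score(uplifted) / combined_score(baseline)
print(f"Combined score ratio: {ratio:.2f}")
```

One reason a geometric mean suits this kind of summary is that a uniform percentage gain across every test carries through to the combined score unchanged, and no single high-scoring test dominates the result.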
Motion Graphics/VFX: Adobe After Effects
Adobe After Effects just barely makes the cut for our GPU testing in general, since the vast majority of projects in After Effects are going to be bottlenecked by the CPU. Still, the GPU does come into play at times, so for now, we are still including it.
In terms of overall performance, you are not going to notice too much of a difference between the RTX 30 series and the RTX 4090. The RTX 4090 is within 2% of the RTX 3090, with the 3090 actually scoring higher overall. This doesn’t mean that the 3090 is necessarily faster since the margin of error for this kind of real-world testing is about 5%. Rather, it means that any RTX 30 or RTX 40 series GPU should perform pretty much on par for most users.
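As a rough sketch of how a margin-of-error check like this can be applied, the snippet below treats two scores as equivalent when their relative gap is inside 5%. The scores are hypothetical and the function name `within_margin` is our own.

```python
MARGIN_OF_ERROR = 0.05  # ~5% variance quoted for this kind of real-world testing


def within_margin(score_a: float, score_b: float,
                  margin: float = MARGIN_OF_ERROR) -> bool:
    """Return True when the relative gap between two scores is inside the margin."""
    gap = abs(score_a - score_b) / max(score_a, score_b)
    return gap <= margin


# Hypothetical scores, made up for illustration (not measured results).
print(within_margin(1000.0, 1020.0))  # about 2% apart
print(within_margin(1000.0, 1100.0))  # about 9% apart
```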
Just like with Premiere Pro, however, we can drill down to the “GPU Score” portion of our benchmark. This tests projects specifically designed to put as much load on the GPU as possible while minimizing the load on the rest of the system. This is a borderline synthetic set of tests, since no one really uses After Effects in this manner, but it does provide a “best case” situation for what a GPU upgrade can get you in the base version of After Effects without factoring in plugins, GPU-based rendering engines, etc.
Even for the GPU score (chart #2), the RTX 4090 is still within the margin of error compared to the RTX 3090 Ti and RTX 3090. With the powerful AMD Threadripper PRO 5975WX processor, we are able to see a benefit going from the RTX 2080 Ti or AMD Radeon 6900XT to the RTX 3080 and above, but beyond that, we are once again CPU bottlenecked even in this extreme situation.
Game Dev/Virtual Production: Unreal Engine
Moving on to Unreal Engine, things get a bit more nuanced. While there are a few niche use cases that can take advantage of multiple GPUs (more on this later) this benchmark focuses on single GPU performance – specifically for real-time graphics. This score looks at three different Unreal Engine scenes, with and without ray tracing, and at a variety of resolutions. For this testing, we are not using DLSS. Currently, only the 4090 supports DLSS 3.0, while the 3000 series cards use DLSS 2.0, so it’s hard to get a good apples-to-apples comparison. Instead, we are looking at the raw horsepower. Furthermore, not all professionals will be using DLSS in their daily workflow as it does have an impact on the final pixels that each user will need to check to see if it is acceptable for their project.
Overall, the RTX 4090 sees a roughly 78% increase in framerates across all scenes compared to the 3090 Ti, and 93% when compared to the 3090. For those working at a fixed 30-60 FPS for virtual production, this means you can do a lot more in your scene at the same framerate. If you were riding that edge on a 3000 series GPU, you’ll have plenty of headroom with the RTX 4090, allowing you to push the realism even further. Many in the Virtual Production space will wait for the new RTX 6000 Ada in order to have access to Quadro Sync and 48GB of VRAM, but based on past generations, the 4090 should give us a really solid glimpse of what to expect.
One important note about this generation of video cards (both RTX 40 Series and the “Ada Lovelace” Quadro cards) is that NVIDIA has removed NVLink. Currently, NVLink is required if you want to use multiple GPUs for tasks like GPU lightmass, inner/outer frustum in nDisplay, and the path tracer in Unreal 5.1. Technically, nDisplay does not require NVLink, but the performance hit when not using it is so great that everyone I’ve talked to, including people at Epic, says it is not worth attempting.
There are some potential fixes, such as Epic recently adding SMPTE 2110 in 5.1, improved PCIe handling (though that may not come about until PCIe 5.0), or some other solution. There aren’t many people that would be using a GeForce card for this anyway as you lose the Quadro Sync option, so there is still time to find a workable solution. Worst case scenario, the RTX 4090 is about the same performance as two RTX 3090s, but without the hassle of getting dual GPUs working.
GPU Rendering: V-Ray
New generations of video cards are always an exciting time for GPU rendering, as it is where we typically see the most significant performance increase. Starting off our GPU rendering benchmarks is V-Ray from Chaos, which scales very well with both GPU processing power and adding multiple GPUs.
As you can see from the results, the new NVIDIA GeForce RTX 4090 is twice as fast as the RTX 3090. In fact, it is faster than dual RTX 3090s due to the imperfect nature of GPU scaling. And if you are using a somewhat older GPU, the new RTX 4090 is 4 times faster than the RTX 2080 Ti, which is only 4 years old. If you tend to upgrade every other generation, this will be a monumental leap in performance.
We were fortunate enough to also test with multiple RTX 4090 GPUs. Currently, we can only do two RTX 4090s in a desktop system due partly to physical space, but more importantly because of power draw. The RTX 4090 requires four dedicated 8-pin PCIe power plugs from the power supply, and since 1600W power supplies typically have at most 9 total PCIe cables, two RTX 4090s will fill a power supply’s capacity for PCIe power.
The results show that the RTX 4090 scales well in V-Ray with two of the cards being roughly 83% faster than just one. To put this number further into perspective, two RTX 4090s would be roughly equivalent to 4x RTX 3090s, or around 7-8 RTX 2080 Ti’s.
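The equivalence above can be sketched with some quick arithmetic. This assumes the rounded single-card ratios quoted in this section (RTX 4090 at 2x the RTX 3090 and 4x the RTX 2080 Ti) and expresses the dual RTX 4090 result in units of one older card, so it is an approximation rather than a measured figure.

```python
# Relative single-card V-Ray throughput, normalised to the RTX 2080 Ti,
# using the rounded ratios quoted in the text (assumed, not re-measured).
single_card = {"RTX 2080 Ti": 1.0, "RTX 3090": 2.0, "RTX 4090": 4.0}

DUAL_4090_SCALING = 1.83  # two RTX 4090s were roughly 83% faster than one

dual_4090 = single_card["RTX 4090"] * DUAL_4090_SCALING

for card in ("RTX 3090", "RTX 2080 Ti"):
    equivalent = dual_4090 / single_card[card]
    print(f"2x RTX 4090 ~= {equivalent:.1f}x the throughput of one {card}")
```

Because older cards also scale imperfectly when you add more of them, the number of physical cards needed to match a dual RTX 4090 setup would be somewhat higher than these single-card multiples.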
One big point to note is that the RTX 4090 does not offer NVLink. If you use very large scenes and needed NVLink to pool VRAM, that is not going to work with the new RTX 40 series. You will have to wait for the RTX 6000 Ada to get a card with 48GB of VRAM.
GPU Rendering: OctaneRender
Moving on to Octane from Otoy, we see that the new NVIDIA GeForce RTX 4090 again nearly doubles the speed of the previous generation’s RTX 3090 and RTX 3090 Ti, being 92% faster and 83% faster respectively. In fact, it is practically on par with dual RTX 3090s. And if you do opt for a dual RTX 4090 system, the RTX 4090 scales almost perfectly, allowing for really impressive render times.
Otoy has made some tremendous improvements recently, and these results are primarily for the standard rendering system. You should see similar performance improvements in their real-time preview, or when using Octane within any DCC, Unreal Engine, or plugins such as EmberGen. Also, Otoy is working on improving out-of-core performance, and other ways of getting large scenes to perform better on GPUs if you don’t have enough VRAM. This should help mitigate the lack of NVLink and not being able to pool VRAM.
GPU Rendering: Redshift
Up next in our GPU rendering suite is Redshift from Maxon. Here we don’t see as big an improvement, with the RTX 4090 only being about 60% faster than the RTX 3090. To a certain extent, this may be a limitation of Redshift’s benchmark. Both Octane and V-Ray look at how much work can be accomplished in a given amount of time, whereas Redshift renders a single frame and reports how long it took. This was a fine approach for a long time, but modern GPUs are getting so fast that the overhead of initiating the render is going to start to impact the final time. We’ve gone from 251 seconds for the RTX 2080 Ti, to 145 seconds for the RTX 3090, to only 87 seconds for the RTX 4090. Amazingly, dual RTX 4090s completed the render in 45 seconds. Again, there is no NVLink, so you will be limited to the VRAM of a single card.
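To show why a fixed startup cost matters more as renders get shorter, here is a minimal sketch that converts the render times quoted above into speedups, with and without subtracting an overhead. The 2-second overhead is a hypothetical value picked for illustration, since we did not measure it, and the `speedup` helper is our own.

```python
# Render times in seconds, as quoted in the text.
render_seconds = {
    "RTX 2080 Ti": 251,
    "RTX 3090": 145,
    "RTX 4090": 87,
    "2x RTX 4090": 45,
}

OVERHEAD_S = 2.0  # hypothetical fixed cost to initiate a render (assumption)


def speedup(old_s: float, new_s: float, overhead_s: float = 0.0) -> float:
    """Speedup of `new` over `old` after removing a fixed per-render overhead."""
    return (old_s - overhead_s) / (new_s - overhead_s)


configs = list(render_seconds)
for slower, faster in zip(configs, configs[1:]):
    raw = speedup(render_seconds[slower], render_seconds[faster])
    adjusted = speedup(render_seconds[slower], render_seconds[faster], OVERHEAD_S)
    print(f"{slower} -> {faster}: {raw:.2f}x as timed, "
          f"{adjusted:.2f}x with overhead removed")
```

The gap between the two figures widens as the total render time shrinks, which is the effect described above for time-based benchmarks on very fast GPUs.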
We must also mention that while these are the results from Redshift, Maxon has begun adding GPU-based physics simulations to Cinema 4D as well. These new features were too new for us to be able to test before the RTX 4090’s launch, but we hope to have some specific data soon. We fully expect to see similar results, however, with the RTX 4090 performing anywhere from 60-90% faster than the RTX 3090.
GPU Rendering: Blender
Rounding out our GPU rendering tests, we turn to Blender. One thing to note is that the official Blender benchmark does not support multiple GPUs, even though Blender itself does. Even so, we opted to stick to Blender’s official benchmark so that anyone can directly compare their system to the cards we tested. This is also the only renderer in our testing that supports AMD video cards, so we threw the AMD Radeon 6900XT in for reference.
Like many of the other rendering engines, the new RTX 4090 once again scores nearly double the RTX 3090 or 3090 Ti. Taken as a whole, the new RTX 4090 is a beast when it comes to GPU rendering. In most cases, it is nearly double the best 30-series video card.
How Well Does the NVIDIA GeForce RTX 4090 Perform for Content Creation?
NVIDIA tends to give us great performance gains whenever they launch a new series of GPUs, and the new RTX 4090 is no different. It may not have more VRAM than the previous generation (24GB), but for applications that can benefit from having a more powerful GPU, the RTX 4090 can give you a massive boost to performance.
For video editing, the big winner is DaVinci Resolve Studio, as it makes more use of the GPU (and multi GPU configurations) than any other NLE currently on the market. In the instances where the GPU is the primary bottleneck (OpenFX and noise reduction), the RTX 4090 is around 40% faster than the previous generation RTX 3090, or just over 2x faster than the older RTX 2080 Ti. This isn’t quite enough for a single RTX 4090 to match a dual RTX 3090 setup, but it does bring it to within a very respectable 10%.
Resolve is also one of the few video editing applications that can take advantage of multiple GPUs, and we found that going from one RTX 4090 to two gave us a 55% increase in performance. This is right in line with what we see from previous generation cards like the RTX 3090, so it is good to see that we are not yet hitting any sort of CPU bottleneck.
Unreal Engine, being a real-time engine, benefits even more from powerful GPUs. Testing across a variety of scenes, from basic game environments to high-end Virtual Production sets, with and without ray tracing, the RTX 4090 averaged around 85% higher FPS. For those in ArchViz, that translates to faster render times, or smoother VR experiences. Users in Virtual Production who are capped at 30 or so FPS will be able to do more on-screen, making their sets even more realistic.
GPU rendering in Octane, Redshift, V-Ray and Blender sees nearly a doubling in performance over the RTX 3090 and 3090 Ti. All of these renderers also benefit from using multiple GPUs, and it is typical to see an 80-90% speed up by adding a second video card to a renderer. We would see even more performance by using more than two GPUs, but you will almost always be limited to just 1-2 cards due to the massive size and power constraints of these new cards.
A big concern for this generation is the lack of NVLink. With NVLink, GPU renderers could pool the VRAM of two video cards to hold larger, more complex scenes. Exceeding the VRAM of a GPU means falling back to system RAM, which dramatically decreases performance, or crashing the system. Without NVLink, VRAM pooling isn’t even an option.
The missing NVLink is going to be an even bigger concern for many in Virtual Production, especially those filming on large LED volumes. Currently, they use multiple GPUs so that one video card can be dedicated to the inner frustum and then transfer that frame to the other GPU to display on the wall. Not everyone in this space opts for this workflow, as it can be tricky to get set up, but when it works, it allows for much better performance. There are other options on the horizon, but nothing concrete at the time of writing. Not many users of this workflow would use a GeForce card anyway due to its lack of Quadro Sync support, but it is worth being aware of ahead of the RTX 6000 refresh.
Beyond NVLink, another concern we have about the RTX 4090 is simply how much power it demands (and how much heat output that will translate to). The physical design of the card is going to make using more than two RTX 4090 cards impossible without liquid cooling, but even then, you will find yourself power limited very quickly. The Founders Edition cards we are using straight from NVIDIA require a total of four(!) 8-pin PCIe plugs each, which, combined with the one plug required for our WRX80 motherboard, meant that we were using every single available PCIe power cable from our 1600W power supply in order to test dual RTX 4090 cards.
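The cable budget can be sketched in a few lines. This uses the figures from our test system (nine PCIe cables on the 1600W supply, four plugs per Founders Edition card, one plug for the motherboard) and only counts connectors; it does not model total wattage, and other power supplies or partner cards may differ.

```python
PSU_PCIE_CABLES = 9      # PCIe cables available on our 1600W power supply
PLUGS_PER_RTX_4090 = 4   # 8-pin plugs per Founders Edition card
MOTHERBOARD_PLUGS = 1    # extra PCIe plug used by our WRX80 motherboard


def max_gpus(cables: int, plugs_per_gpu: int, reserved: int = 0) -> int:
    """How many GPUs can be fully cabled after reserving plugs for other parts."""
    return max(0, (cables - reserved) // plugs_per_gpu)


gpus = max_gpus(PSU_PCIE_CABLES, PLUGS_PER_RTX_4090, MOTHERBOARD_PLUGS)
spare = PSU_PCIE_CABLES - MOTHERBOARD_PLUGS - gpus * PLUGS_PER_RTX_4090

print(f"Cards that can be cabled: {gpus}")
print(f"Spare PCIe cables: {spare}")
```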
NVLink and power concerns aside, however, there is no question that the GeForce RTX 4090 24GB is an extremely capable GPU. Anytime we see performance gains of 2x over the previous generation cards, even in just a few workflows, it is hard not to be impressed. In the end, whether the RTX 4090 is going to be worth the investment for you is going to come down to your individual workflow, and what kind of ROI (return on investment) you might expect given the time savings it would be able to give you. But, if you are often limited by the performance of the GPU in your system, the RTX 4090 is almost certain to be a solid investment.
If you are looking for a workstation with the NVIDIA GeForce RTX 4090 24GB, you can visit our solutions page to view our recommended workstations for various software packages, our custom configuration page, or contact one of our technology consultants for help configuring a workstation that meets the specific needs of your unique workflow.