Adobe Premiere Pro CC 2015 Multi Core Performance (Update1)

Written on June 8, 2016 by Matt Bach
Table of Contents:
  1. Introduction
  2. Test Setup
  3. Exporting to 1080p
  4. Exporting to 4K
  5. Rendering Previews
  6. Conclusion
  7. Related Hardware Articles
  8. Recommended Systems for Premiere Pro

Introduction

When designing a computer there are literally thousands of different hardware components to choose from and each one will have an impact on the overall performance of your system in some shape or form. Depending on the software you will be using, however, some components will simply be more important than others. In the case of Adobe Premiere Pro CC 2015, one of the most critical hardware components that will help with the performance of your system is the CPU. The question is: how do you know which CPU will give you the best performance?

Before even attempting to answer this question, it is important to understand the two most basic CPU specifications:

  1. The frequency is how many clock cycles a single CPU core completes each second, which largely determines how quickly that core can work through operations (how fast it is).
  2. The number of cores is how many physical cores there are within a CPU (how many operations it can run simultaneously).

This doesn't take into account the differences between CPU architectures, but in an ideal world a CPU that has the same frequency but twice the number of cores would be exactly twice as fast. Unfortunately, making software utilize multiple cores (and do so effectively) is difficult in most situations and almost impossible in others. Add in the fact that higher core count CPUs tend to have lower operating frequencies and it becomes even more difficult to ensure that you are choosing the right CPU for your software.

In this article, we want to find out how well Premiere Pro can utilize multiple cores - also known as multi-threading - to help determine what type of CPU (either one with a high frequency or a high core count) will give you the best possible performance. Since exporting videos and rendering previews are the two major tasks that users tend to wait on when using Premiere Pro, those are what we will be focusing on in this article. If you want to skip over our individual benchmark results and simply view our conclusions, feel free to jump ahead to the conclusion section.

Test Setup

For our test system, we used the following hardware:

Since we want to determine how many CPU cores Premiere Pro can effectively utilize, we used a pair of Xeon E5 2687W V4 CPUs to give us 24 physical CPU cores with which to test. To see if the efficiency changes depending on the amount of GPU power in the system, we also tested with both one and two GTX Titan X video cards. To help with consistency - and since the benchmarks we performed ran for several days - we programmed a custom script using AutoIt to start Premiere Pro, set the CPU affinity (which CPU cores it is allowed to use), load the relevant project, export the timeline with the appropriate settings or generate previews, close Premiere Pro to clear any data from the system RAM, then loop while making more and more cores available.
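
The affinity step of that loop can be sketched in a few lines. The original script was written in AutoIt and is not reproduced here; the Python below is only an illustration of how a mask for "the first N cores" could be built. It assumes logical processors are numbered from 0 and that one bit in the mask allows one processor, which is the convention used by the hexadecimal mask of the Windows `start /affinity` command. The function names are hypothetical, and the sketch does not account for Hyper-Threading, where each physical core appears as two logical processors.

```python
def affinity_mask(core_count: int) -> int:
    """Bitmask with the lowest `core_count` bits set (one bit per allowed core)."""
    if core_count < 1:
        raise ValueError("at least one core must be allowed")
    return (1 << core_count) - 1


def affinity_plan(max_cores: int) -> list[tuple[int, str]]:
    """One (core_count, hex_mask) pair per benchmark pass, from 1 core up to max_cores."""
    return [(n, format(affinity_mask(n), "X")) for n in range(1, max_cores + 1)]


if __name__ == "__main__":
    # 24 matches the number of physical cores in the test system described above.
    for cores, mask in affinity_plan(24):
        print(f"{cores:2d} cores -> affinity mask 0x{mask}")
```

Each pass of the benchmark would then launch Premiere Pro with the next mask in the list, so the only thing that changes between runs is the number of cores available.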

To analyze the data, we will be presenting our results in terms of how long it took each action to complete with X number of cores compared to how long it took to complete with just a single core. From these results, we will then use Amdahl's Law to estimate the parallel efficiency for the action. 100% is perfect efficiency where a high core count CPU is ideal, but as the efficiency drops lower and lower having a high frequency CPU becomes more and more important. For more information on Amdahl's Law and how it works we recommend reading our Estimating CPU Performance using Amdahl's Law article.
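
As a rough illustration of what those efficiency percentages imply, here is a minimal sketch of Amdahl's Law. It takes "parallel efficiency" to mean the fraction of the work that can run in parallel, which is an assumption about how the term is used here; the function name is hypothetical.

```python
def amdahl_speedup(cores: int, parallel_fraction: float) -> float:
    """Speedup over one core when `parallel_fraction` of the work scales perfectly."""
    if cores < 1:
        raise ValueError("cores must be >= 1")
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be between 0 and 1")
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / cores)


if __name__ == "__main__":
    for efficiency in (1.00, 0.95, 0.90):
        row = ", ".join(
            f"{n} cores: {amdahl_speedup(n, efficiency):.2f}x" for n in (2, 6, 12, 24)
        )
        print(f"{efficiency:.0%} parallel -> {row}")
```

With a 90% parallel fraction, 24 cores come out at roughly 7.3 times the single-core speed rather than 24 times, which is why even efficiencies in the low 90s shift the balance toward operating frequency.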

The files we will be testing with came from a variety of sources:

  • 1080p and 4K H.264/CineForm - provided by Jerry Berg (Barnacules Nerdgasm - YouTube)
  • ProRes 4K - Grant Petty, Blackmagic Design Forum (available for public download)
  • 4K RED RAW - provided by Mike Pecci, Director & Photographer
  • 6K RED RAW - Neumann Films, RED Dragon Test Shot (available for public download)

In order to make our testing as accurate as possible, we used relatively simple timelines for our testing in this article. In the past, we've loaded on the accelerated effects to show the maximum difference between cards, but we found that this was not representative of real-world performance gains. Our test timelines consisted of:

  • 4-5 clips arranged in series to make a 60 second timeline
  • A basic transition was applied to each clip
  • Lumetri color correction effect applied to each clip
  • Vector-based logo graphic added to the bottom corner of the footage

Exporting to 1080p

While more and more people are starting to shoot in 4K and higher resolutions, 1080p is still by far the most common resolution to export to. Because of this, we thought we would start our testing by looking at how Premiere is able to utilize multiple CPU cores when exporting from a variety of resolutions and codecs to H.264 1080p:

[Graphs: Exporting to 1080p - Single GPU, Dual GPU]

In the graphs above, the lines with dots are the actual speedup we recorded in our testing. The solid lines show the calculated efficiency we arrived at by applying Amdahl's Law to the results. The green line indicates the initial speedup we saw, while the orange line is the efficiency after the system hit some sort of bottleneck (whether due to the GPU, RAM, cache, or simply an issue within Premiere itself).
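
One way to turn a set of measured speedups into a single efficiency figure is a least-squares fit of Amdahl's Law. The sketch below is an illustration only, not necessarily the procedure used for these graphs; the function name and the sample measurements are hypothetical.

```python
def fit_parallel_fraction(measurements: dict[int, float]) -> float:
    """Least-squares estimate of the parallel fraction from {cores: speedup}.

    Amdahl's Law rearranges to (1 - 1/speedup) = p * (1 - 1/cores), a straight
    line through the origin, so the best-fit p is sum(x*y) / sum(x*x).
    """
    xs, ys = [], []
    for cores, speedup in measurements.items():
        if cores > 1:  # the single-core point carries no information about p
            xs.append(1.0 - 1.0 / cores)
            ys.append(1.0 - 1.0 / speedup)
    if not xs:
        raise ValueError("need at least one measurement with more than one core")
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)


if __name__ == "__main__":
    # Hypothetical speedups relative to one core; not taken from the article's data.
    measured = {1: 1.00, 2: 1.80, 3: 2.45, 4: 3.00, 5: 3.45}
    print(f"estimated parallel fraction: {fit_parallel_fraction(measured):.1%}")
```

For a curve with a bend in it, the same fit could be run separately on the points before and after the bend to get an initial and a later efficiency.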

What is surprising is simply how consistent the results were between the different codecs and resolutions. The amount of time it took in seconds to export each was vastly different, but the speedup per core was very similar. From our data, it appears that there is little to no benefit to having more than four or five cores with a single high-end GPU when exporting to 1080p. In fact, we even saw a drop in performance when we added a second physical CPU when using the RED 4K footage.

Adding a second GPU helped a bit and made it so that it was beneficial to have roughly six CPU cores instead of just four or five. With dual GPUs, however, we saw three instances where having two physical CPUs was worse than just having one (H.264 4K, CineForm 4K, and RED 4K).

What this basically means is that if you are exporting to 1080p, you ideally want the highest frequency six core CPU if possible. A higher core count shouldn't hurt anything (unless you have two physical CPUs), but it makes no sense to get a CPU with a higher core count if it means making any sort of a sacrifice in terms of operating frequency.

Exporting to 4K

Although 4K isn't as widespread as 1080p quite yet, more and more Premiere Pro users are either exporting to 4K or are considering exporting to 4K in the near future. Exporting to higher resolutions is more taxing on the system, so it will be interesting to see if the multi core efficiency is any better when exporting to 4K versus exporting to 1080p:

[Graphs: Exporting to 4K - Single GPU, Dual GPU]

Compared to exporting to 1080p, exporting to 4K in Premiere Pro appears to be much more effective at utilizing higher CPU core counts. Where we didn't see a benefit to having more than six cores before, in some situations we are seeing advantages to having even dual CPUs.

In general, it appears that with a single high-end GPU, having around ten to twelve CPU cores is going to be ideal. With two video cards, it gets a little better to the point where having fourteen or even sixteen CPU cores may be beneficial. One thing to keep in mind, however, is that the average point where the efficiency drops to a lower value is only at about nine or ten CPU cores. This means that while more cores can be beneficial, even a small drop in operating frequency could potentially make a CPU with more cores slower than a CPU with fewer cores but a higher operating frequency.

Rendering Previews

While export times are extremely important, the time it takes to generate previews is also a key consideration. In fact, while it is not a perfect correlation, the faster a machine is able to generate previews, the more likely it will be able to perform live playback of a timeline without the need for previews in the first place.

[Graphs: Rendering Previews - Single GPU, Dual GPU]

For this test, we left the preview resolution at full as that is ideally what you would want your system to be capable of.

Overall, the results are strikingly similar to what we saw when exporting to 1080p, although they are much more inconsistent between the different footage codecs. The main difference is that with only a single high-end GPU, Premiere was able to effectively use a few more cores - anywhere from five to seven or even eight cores. Interestingly, while two GPUs allowed for slightly better scaling, the improvement was not as large as what we saw when exporting.

Conclusion

From our initial multi-core testing in Premiere Pro, we've known that Premiere is only moderately effective at utilizing multiple CPU cores. This round of testing (with more realistic timelines and a wider range of source codecs and resolutions), however, indicates that it is actually even a little bit worse than we thought.

To summarize our results, let's first look at the individual efficiencies when exporting to H.264 1080p:

Exporting to H.264 1080p   Single GPU Parallel Efficiency   Dual GPU Parallel Efficiency
H.264 1080p                87% (0% after 5 cores)           95% (50% after 5 cores)
CineForm 1080p             92% (50% after 3 cores)          95% (0% after 6 cores)
H.264 4K                   90% (25% after 4 cores)          94% (50% after 6 cores)
CineForm 4K                90% (35% after 4 cores)          95% (35% after 6 cores)
ProRes 4K                  90% (0% after 4 cores)           90% (0% after 4 cores)
RED 4K                     92% (25% after 5 cores)          93% (0% after 5 cores)
RED 6K                     95% (25% after 3 cores)          93% (40% after 5 cores)

With a single GTX Titan X video card, we saw an initial efficiency of about 91% on average. After about four or five cores, however, this dropped to pretty much nothing. Dual video cards were better, with an initial average efficiency of about 93%, but this also dropped off to almost nothing after about six cores. What this means is that if you are exporting to H.264 1080p, you would ideally want a CPU with about four to six cores. Eight cores can be beneficial in some instances, but if you have to sacrifice more than 0.1-0.2 GHz for those extra cores, it will likely result in lower performance.
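
To see how a frequency sacrifice of that size can cancel out extra cores, here is a rough sketch. It rests on two assumptions that do not come from the testing itself: performance scales linearly with clock speed, and the "X% after N cores" figures can be read as Amdahl's Law applied again with the N-core result as the new baseline. The function names and the CPUs being compared are hypothetical.

```python
def amdahl(ratio: float, fraction: float) -> float:
    """Amdahl's Law: speedup when `fraction` of the work scales by `ratio`."""
    return 1.0 / ((1.0 - fraction) + fraction / ratio)


def two_stage_speedup(cores: int, initial: float, break_cores: int, later: float) -> float:
    """Speedup over one core under an ASSUMED two-stage model.

    Up to `break_cores` the work scales with the `initial` fraction; beyond that,
    the `break_cores` result is a new baseline that scales with the `later` fraction.
    """
    if cores <= break_cores:
        return amdahl(cores, initial)
    return amdahl(break_cores, initial) * amdahl(cores / break_cores, later)


def relative_performance(cores: int, ghz: float, initial: float,
                         break_cores: int, later: float) -> float:
    """Clock speed times speedup (assumes linear scaling with frequency)."""
    return ghz * two_stage_speedup(cores, initial, break_cores, later)


if __name__ == "__main__":
    # "H.264 4K" single-GPU row from the table above: 90% (25% after 4 cores).
    row = dict(initial=0.90, break_cores=4, later=0.25)
    # Hypothetical CPUs; the clock speeds are illustrative, not real models.
    for cores, ghz in ((6, 3.6), (8, 3.5), (8, 3.4)):
        score = relative_performance(cores, ghz, **row)
        print(f"{cores} cores @ {ghz} GHz -> relative performance {score:.2f}")
```

Under these assumptions, the eight-core CPU stays ahead of the six-core one at 3.5 GHz but falls behind at 3.4 GHz, which lines up with the 0.1-0.2 GHz rule of thumb.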
 

Exporting to H.264 4K   Single GPU Parallel Efficiency   Dual GPU Parallel Efficiency
H.264 4K                98% (75% after 8 cores)          98% (78% after 8 cores)
CineForm 4K             96% (65% after 8 cores)          97% (65% after 8 cores)
ProRes 4K               92% (0% after 9 cores)           92% (0% after 9 cores)
RED 4K                  97% (70% after 12 cores)         97% (50% after 12 cores)
RED 6K                  98% (40% after 9 cores)          98.5% (70% after 11 cores)

When we upped the export resolution to 4K, we saw much better efficiency numbers. Not only was the initial efficiency higher (about 96.5% on average), Premiere Pro was also able to effectively use a much higher number of CPU cores. Where at 1080p anything more than six or maybe eight cores wouldn't be beneficial, in this case the initial multi core efficiency numbers lasted until somewhere between eight and twelve cores, depending on the source footage. In other words, when exporting to 4K a ten core CPU should be great, and in some instances a CPU (or multiple CPUs) with even more cores may give you a performance increase.
 

Render Full Res. Previews   Single GPU Parallel Efficiency   Dual GPU Parallel Efficiency
H.264 1080p                 87% (0% after 5 cores)           95% (50% after 4 cores)
CineForm 1080p              82% (0% after 4 cores)           92% (0% after 4 cores)
H.264 4K                    96% (50% after 4 cores)          94% (50% after 6 cores)
CineForm 4K                 90% (35% after 3 cores)          88% (35% after 6 cores)
ProRes 4K                   60% (0% after 4 cores)           70% (0% after 4 cores)
RED 4K                      92% (60% after 6 cores)          92% (65% after 6 cores)
RED 6K                      95% (45% after 7 cores)          95% (25% after 9 cores)

Unlike exporting, our results for rendering full resolution previews were really inconsistent. Some source footage (like RED 6K) scaled well and should see a nice performance increase with up to ten CPU cores. Other footage (such as ProRes 4K) scaled poorly, to the point that you almost want to ignore the number of cores and focus entirely on getting the highest possible CPU frequency. Overall, with the exception of RED 6K, you would ideally want a CPU with around 6-8 cores.

Unless you are very familiar with CPU scaling and Amdahl's Law, a lot of these numbers are probably not all that useful. If you want to see how well different Core i7 and dual Xeon CPUs actually perform in Premiere Pro, we recommend reading our Adobe Premiere Pro CC 2015.3 CPU Comparison article. In that article, we found that the three best CPU options for Premiere Pro today are the Intel Core i7 6850K, 6900K, and 6950X. However, we would advise against a dual Xeon option, as in most cases these expensive configurations are slower than the Intel Core i7 6950X.

Related Hardware Articles

If you are configuring a workstation for Premiere Pro, we have a number of articles regarding the hardware requirements for Premiere Pro that you may be interested in. If you prefer a summary, we also have a list of Hardware Recommendations for Premiere Pro based on the findings of these articles.

  • Premiere Pro CC 2015.4 Storage Optimization - How many drives should you use and how should you configure them?
  • NAB Webinar: Choosing the right hardware for your Premiere Pro Workstation - Comprehensive guide for what hardware to use in your workstation
  • Premiere Pro CC 2015.3 CPU Comparison - What CPU should you choose for your Premiere Pro workstation?
  • Premiere Pro CC 2015 Multi Core Performance - Does having more CPU cores give you more performance?
  • Premiere Pro CC 2015.3 Pascal GPU Performance - Will a more expensive video card improve performance in Premiere?


Recommended Systems for Premiere Pro

 

Compact Workstation

Utilizing an Intel Core i7 CPU with up to 10 cores, this workstation provides the best possible performance for Premiere Pro in a compact package.

Standard Workstation

While physically larger than the Compact Workstation, this system allows for up to 512GB of RAM and a wider range of storage options.

 

Tags: Adobe, Premiere Pro, Multi-threading
Peter White

What's not clear to me from this is what happens when Premiere is running with effects on the timeline - especially non GPU-accelerated effects. For example, Neat Video (noise reduction) is compute intensive. So are many third party effects like Tiffen DFX, Boris Continuum, etc.

These benchmarks could imply that a 6-8 core system is good enough for Premiere, but is that necessarily the case when your timeline is bogged down by such effects? It would be interesting to see a set of benchmarks measuring various effects, in addition to those here that seem to discriminate only on input and output formats.

Posted on 2016-10-26 02:42:00
Balubish

So I just went from a 3770K to a 6800K, hoping for much faster render times in 4K. And now I see you guys recommend 64GB. How much difference would there be between my 32GB 3200MHz and 64GB?

Posted on 2016-11-02 12:49:46
Berth Ljunggren

Interesting read. I have just built a workstation with dual E5-2640 v4 10-core Xeons, 128 GB of RAM and a PNY P5000 graphics card. It should do well for games, PPro, AE, Lightwave 3D, ZBrush, development, and virtual machines. I am coming from an i7 Extreme Edition (990X) with maxed-out memory (24 GB); that CPU has 6 cores but a higher clock than the 2640 (2.4 GHz, turbo 3.4). I might have gone a bit overboard, but the machine should last me a long time.

Posted on 2016-11-05 00:57:23
Joe

My question is how taxing on the CPU is it when playing a timeline of multi camera 4K XAVC files? Editing isn't just about rendering, it's also about smooth timeline playback.

As an example, when editing two multicam 4K XAVC Sony files (30 minutes in length each) with one layer of color grading applied, my 5960X will stutter and reach 100% CPU load or hover in the 90%+ range. This happens when playing back the multicam sequence in the timeline.

In that scenario would dual xeons diminish the load during timeline playback? Or would the second Xeon be unutilized in premiere? I haven't seen this answered anywhere I've looked. Does a dual Xeon setup offer smoother timeline playback and skipping through the timeline?

Posted on 2016-12-01 14:42:15