Skip to content

Main Navigation

Puget Systems Logo
  • Solutions
    • Recommended Systems For:
    • Content Creation
      • Photo Editing
        • Recommended Systems For:
        • Adobe Lightroom Classic
        • Adobe Photoshop
      • Video Editing
        • Recommended Systems For:
        • Adobe After Effects
        • Adobe Premiere Pro
        • DaVinci Resolve
        • Foundry Nuke
      • 3D Design and Animation
        • Recommended Systems For:
        • Autodesk 3ds Max
        • Autodesk Maya
        • Blender
        • Cinema 4D
        • Houdini
        • ZBrush
      • Real-Time Engines
        • Recommended Systems For:
        • Game Development
        • Unity
        • Unreal Engine
        • Virtual Production
      • Rendering
        • Recommended Systems For:
        • OctaneRender
        • Redshift
        • V-Ray
      • Digital Audio
        • Recommended Systems For:
        • Ableton Live
        • FL Studio
        • Pro Tools
    • Engineering
      • CAD
        • Recommended Systems For:
        • Autodesk AutoCAD
        • Autodesk Inventor
        • Autodesk Revit
        • SOLIDWORKS
      • Photogrammetry
        • Recommended Systems For:
        • ArcGIS Pro
        • Agisoft Metashape
        • Pix4D
        • RealityCapture
    • Scientific Computing
      • Recommended Systems For:
      • Data Science
      • Machine Learning / AI
      • Scientific Computing
    • More
      • Recommended Systems For:
      • Compact Size
      • Live Streaming
      • NVIDIA RTX Studio
      • Quiet Operation
      • Virtual Reality
  • Products
    • Intel Core i7 & i9
      Workstations with 13th Gen Intel Core i7 & i9 processors on Z690 and Z790 chipsets
    • AMD Ryzen 7 & 9
      Workstations with AMD Ryzen 7000 Series processors on B650 and X670 chipsets
    • AMD Threadripper PRO
      Workstations with AMD Threadripper PRO 5000 WX processors on the WRX80 chipset
    • Intel Xeon W
      Workstations with Intel Xeon W 3300 processors on the C621E chipset
    • Rackmount & Server
      Servers and workstations in rackmount chassis
    • Custom Computers
      Customize a desktop workstation from scratch
    • Custom Servers
      Customize a rackmount server from scratch
    • QNAP Network Attached Storage
      Check out our external storage options as an authorized reseller for QNAP
    • Recommended Third Party Peripherals
      View our list of recommended peripherals to use with your new PC
  • Publications
    • Articles
    • HPC Blog
    • Blog Posts
    • Case Studies
    • Podcasts
    • Press
  • Support
    • Contact Support
    • Support Articles
    • Warranty Details
  • About Us
    • About Us
    • Contact Us
    • Press Kit
    • Testimonials
    • Careers
  • Talk to an Expert
  • My Account
  1. Home
  2. /
  3. Hardware Articles
  4. /
  5. Mental Ray Multi Core Performance

Mental Ray Multi Core Performance

Posted on March 30, 2016 by Matt Bach
Always look at the date when you read an article. Some of the content in this article is most likely out of date, as it was written on March 30, 2016. For newer information, see our more recent articles.

Table of Contents

  • Introduction
    • Theoretical CPU Performance:
  • Test Setup
  • Single Frame Render Test Results
  • Animation Render Test Results
  • Conclusion

Introduction

Mental Ray is a CPU-based rendering engine that is currently owned and developed by NVIDIA. While NVIDIA is primarily a GPU-focused company, Mental Ray does not currently have many features that are GPU accelerated. Instead, the primary hardware component that is going to impact render times is the CPU. In this article we want to benchmark Mental Ray and analyze how effective it is at using multiple CPU cores. Based on this analysis, we can then determine exactly what CPU models should give you the best possible performance for your budget.

Before we get too far, it is important to understand the two most basic CPU specifications:

  1. The frequency is essentially how many operations a single CPU core can complete in a second (how fast it is).
  2. The number of cores is how many physical cores there are within a CPU (how many operations it can run simultaneously).

If life was easy, you would be able to compare the performance of different CPUs by simply multiplying the number of cores by the frequency – creating a sort of "combined" or "aggregate" frequency. Unfortunately, that is not how it works in the real world. In reality, the way to calculate the effective performance of a CPU is a bit more complicated and includes using a formula derived from Amdahl's Law: 

Theoretical CPU Performance:

S=F*\frac{1}{(1-P)+\frac{P}{n}}

  • S is the theoretical overall speed of the CPU
  • F is the frequency of the CPU
  • P is the fraction of the task that can be made parallel
    (parallel efficiency)
  • n is the number of CPU cores

This doesn't take into account the differences between CPU architectures – which can make a one generation newer CPU as much as 10-15% faster than an older CPU with identical specs – but what this formula is saying is that the theoretical overall speed of a CPU (S) depends on both the frequency it runs at (F) and an "adjusted" core count that takes into account how much of the task is actually able to be run across multiple cores (P). This P value is commonly called the "parallel efficiency" and while it is very often overlooked when comparing CPU models, it is actually extremely important. The higher the efficiency, the better performance you will see with a higher core count. The lower the efficiency, the less effective more CPU cores will be which makes it more important it is to have a CPU that runs at a higher frequency.

What we will be doing is performing a series of benchmarks to find out the speed of Mental Ray with various numbers of cores, then using another formula from Amdahl's Law to determine it's parallel efficiency (P). Once we have this efficiency number, we can then leverage the formula above to estimate the relative performance of a CPU based on the number of cores and the frequency it runs at. By doing so, we can ensure that you will not purchase a more expensive CPU that actually gives you less performance than a cheaper option.

If you want a more in-depth explanation of how we use Amdahl's Law, we have a guide available for Estimating CPU Performance using Amdahls Law.

Test Setup

To accurately benchmark Mental Ray, we will be using a pair of Xeon E5-2687W V3 3.1GHz Ten Core CPUs . This will allow us to test with up to 20 physical CPU cores across two CPUs to see exactly how well Mental Ray is able to utilize both a high number of cores as well as multiple physical CPUs. The full specifications for our test system are:

Testing Hardware
Motherboard: Asus Z10PE-D8 WS
CPU: 2x Intel Xeon E5-2687W V3 3.1GHz Ten Core
RAM: 8x Crucial DDR4-2133 16GB ECC Reg.​ (128GB total)
GPU: NVIDIA GeForce GTX 980 4GB
Hard Drive: Samsung 850 Pro 512GB SATA 6Gb/s SSD
OS: Windows 10 Pro 64-bit
PSU: Antec HCP Platinum 1000W
Software: 3ds Max 2016 SP3 V2 using Mental Ray

To make sure our results are as consistent as possible we used a custom script using AutoIt to start 3ds Max, load the test scene, adjust how many CPU cores are available by setting the affinity of the process, then time how long it took to render the scene. For our test scenes, we used three Mental Ray sample scenes from the 3ds Max 2016 samples files. We left these scenes on the pre-set render settings when possible although we did modify the Arch Interior scene slightly. For that scene, we increased the resolution to 1920×1080 to make the single frame testing a bit more intensive. When we tested that scene as an animation, however, we left the resolution at 640×480 and limited the animation to just 11 frames as otherwise it would have taken an absurdly long time to render with the lower core counts.

Night Caustics
(autodesk_mrwhite_handout_night_caustics.max)
Single frame – camera1 – 640×360 – 99 objects – 43 lights

Living Room
(living.room.max)
Single Frame -camera03 – 510×680 – 59 objects – 3 lights

 

Evermotion Arch Interior Vol1 Scene09
(evermotion.arch.interior.vol.01.scene.09.mentalray.max)
Single frame – camera10 – 1920×1080 (HDTV) – 219 objects – 3 lights
Animation – camera10 – 640×480 – 11 frames – 219 objects – 3 lights

 

To analyze the results of our testing, we will be presenting our results in terms of how long it took to render the scene with up to twenty cores compared to how long it took to complete with just a single core. From these results, we will then use Amdahl's Law to estimate the parallel efficiency for the render. 100% is perfect efficiency where a high core count CPU is ideal, but as the efficiency drops having a CPU with a higher frequency (even at the cost of a lower core count) becomes more and more important.

One thing we want to make very clear is that our testing is only 100% accurate for the files and settings we used. While this should be able to give us a fairly accurate measurement for how well Mental Ray can use multiple CPU cores, if you want more accurate results for the scenes you tend to work with we recommend following our Estimating CPU Performance using Amdahls Law guide. It can be a time consuming process (even with automation and rendering the scenes at only moderate settings the testing for this article took a significant amount of machine time) but it is really the only way to know for sure what the parallel efficiency is for exactly what you do.

About our testing: We rely on our customers and the community at large to point out anything we may have missed in our testing. If there is some critical part of Mental Ray you think we skipped in our testing, please let us know in the comments at the bottom of the page! Especially if you are able to provide a file that we can integrate into our testing, we really want to hear from you.

Single Frame Render Test Results


From these three renders, we saw an efficiency of 98%, 95.25%, and 97% respectively. If we combine all of these results, we get a overall multi core efficiency of ~97% when rendering a single frame using Mental Ray. 97% may sound very high (and it is actually quite decent), but what this basically means is that instead of seeing a speedup of 20x with 20 CPU cores – which is what a program with 100% efficiency would have – we actually only saw an average speedup of about 13x. Due to this, in many cases a CPU with a slightly lower core count but higher frequency may out-perform a CPU with a higher core count and lower frequency.

Animation Render Test Results

Animations have a bit more going on than just a single frame render, since the program needs to do a number of calculations between each frame. These in-between tasks are often single-threaded (only able to use one core) so the overall parallel efficiency for an animation tends to be lower compared to rendering a single frame. In fact, that is exactly what we saw in our testing.

While the single frame render for this scene had an efficiency of 97%, rendering an 11 frame animation dropped the efficiency down to 95.25% That isn't a huge drop, but it basically makes our 20 cores about 25% less effective than they were when rendering just a single frame. One thing to keep in mind is that the longer it takes to render each individual frame of an animation, the less impact there should be the overall multi core efficiency. This is due to the fact that doing things like changing the camera's location in space tends to take a fairly fixed amount of time – so the less relative time is spent on those actions, the less impact there should be on the overall efficiency.

Conclusion

Overall, Mental Ray has fairly good CPU scaling and is able to take advantage of a large number of CPU cores. It is not perfect, however, so in order to ensure that you are purchasing the exact right CPU we need to determine the relative performance of Mental Ray for various CPUs. To do this, we will once again Amdahl's Law – but this time in reverse to find the theoretical performance of a CPU based on it's core count, frequency, and the parallel efficiency we measured. Note that these results were calculated using the "all-core Turbo Boost" frequency (the speed at which the CPU actually runs when all cores are active) and not the base frequency that is in the product name. We've covered the difference between the advertised frequency, maximum Turbo Boost and that all-core Turbo Boost in a number of recent posts including Xeon E5v3 All Core Turbo Boost and Amdahl's Law and Actual CPU Speeds – What You See is Not Always What You Get. Using the all-core Turbo (which unfortunately is not often listed int he CPU's specifications) results in a much more accurate estimation of the real-world performance for each CPU.

[-] Hide estimated performance for all Xeon E5 V3 CPUs
Single Xeon E5 V3 / Core i7 Seconds to complete render (97% efficiency)
Xeon E5-2603 V3 1.6GHz Six Core 15MB 85W  3018
Xeon E5-2609 V3 1.9GHz Six Core 15MB 85W  2542
Xeon E5-2623 V3 3.0GHz Quad Core 10MB 105W  2081
Xeon E5-1620 V3 3.5GHz Quad Core 10MB 140W  1907
Xeon E5-2637 V3 3.5GHz Quad Core 15MB 135W  1907
Xeon E5-2620 V3 2.4GHz Six Core 15MB 85W  1857
Xeon E5-1630 V3 3.7GHz Quad Core 10MB 140W  1807
Xeon E5-2630 V3 2.4GHz Eight Core 20MB 85W  1466
Core i7 5820K 3.3GHz Six Core 15MB 140W 1420
Xeon E5-2640 V3 2.6GHz Eight Core 20MB 90W  1361
Xeon E5-1650 V3 3.5GHz Six Core 15MB 140W  1341
Core i7 5930K 3.5GHz Six Core 15MB 140W 1341
Xeon E5-2643 V3 3.4GHz Six Core 20MB 135W  1341
Xeon E5-2650 V3 2.3GHz Ten Core 25MB 105W  1231
Xeon E5-1660 V3 3.0GHz Eight Core 20MB 140W  1155
Core i7 5960X 3.0GHz Eight Core 20MB 140W  1155
Xeon E5-2667 V3 3.2GHz Eight Core 20MB 135W  1121
Xeon E5-2660 V3 2.6GHz Ten Core 25MB 105W  1103
Xeon E5-1680 V3 3.2GHz Eight Core 20MB 140W  1089
Xeon E5-2670 V3 2.3GHz Twelve Core 30MB 120W 1074
Xeon E5-2683 V3 2.0GHz Fourteen Core 35MB 120W  1001
Xeon E5-2687W V3 3.1GHz Ten Core 25MB 160W 1000
Xeon E5-2680 V3 2.5GHz Twelve Core 30MB 120W 963
Xeon E5-2690 V3 2.6GHz Twelve Core 30MB 135W 901
Xeon E5-2695 V3 2.3GHz Fourteen Core 35MB 120W  893
Xeon E5-2698 V3 2.3GHz Sixteen Core 40MB 135W 816
Xeon E5-2697 V3 2.6GHz Fourteen Core 35MB 145W  807
Xeon E5-2699 V3 2.3GHz Eighteen Core 45MB 145W  755
Dual Xeon E5 V3  
2x Xeon E5-2603 V3 1.6GHz Six Core 15MB 85W  1745
2x Xeon E5-2609 V3 1.9GHz Six Core 15MB 85W  1470
2x Xeon E5-2623 V3 3.0GHz Quad Core 10MB 105W  1155
2x Xeon E5-2620 V3 2.4GHz Six Core 15MB 85W  1074
2x Xeon E5-2637 V3 3.5GHz Quad Core 15MB 135W  1059
2x Xeon E5-2630 V3 2.4GHz Eight Core 20MB 85W  878
2x Xeon E5-2640 V3 2.6GHz Eight Core 20MB 90W  816
2x Xeon E5-2643 V3 3.4GHz Six Core 20MB 135W  776
2x Xeon E5-2650 V3 2.3GHz Ten Core 25MB 105W  761
2x Xeon E5-2660 V3 2.6GHz Ten Core 25MB 105W  682
2x Xeon E5-2670 V3 2.3GHz Twelve Core 30MB 120W 682
2x Xeon E5-2667 V3 3.2GHz Eight Core 20MB 135W  672
2x Xeon E5-2683 V3 2.0GHz Fourteen Core 35MB 120W  652
2x Xeon E5-2687W V3 3.1GHz Ten Core 25MB 160W 618
2x Xeon E5-2680 V3 2.5GHz Twelve Core 30MB 120W 612
2x Xeon E5-2695 V3 2.3GHz Fourteen Core 35MB 120W  582
2x Xeon E5-2690 V3 2.6GHz Twelve Core 30MB 135W 572
2x Xeon E5-2698 V3 2.3GHz Sixteen Core 40MB 135W 543
2x Xeon E5-2697 V3 2.6GHz Fourteen Core 35MB 145W  525
2x Xeon E5-2699 V3 2.3GHz Eighteen Core 45MB 145W  512

[+] Show estimated performance for all Xeon E5 V3 CPUs

If you wish to see the estimated performance for all single and dual E5 V3 CPUs (as well as the Core i7 equivalents), feel free to expand the option above. If you really dig into these numbers, you will find that it is very easy to end up spending more money on a CPU that will actually give you lower performance with Mental Ray. To help prevent that, we closely examined both the estimated performance numbers as well as the cost associated with each CPU choice and found seven CPU options (three single CPU and four dual CPU) that should give you the best performance for their price:

Single CPU

  • Intel Xeon E5-1650 V3 3.5GHz Six Core
    (or Intel Core i7 5930K 3.5GHz Six Core)
  • Intel Xeon E5-1660 V3 3.0GHz Eight Core
    (or Intel Core i7 5960X 3.0GHz Eight Core)
  • Intel Xeon E5-2680 V3 2.5GHz Twelve Core

Dual CPU

  • 2x Intel Xeon E5-2630 V3 2.4GHz Eight Core
  • 2x Intel Xeon E5-2650 V3 2.3GHz Ten Core
  • 2x Intel Xeon E5-2680 V3 2.5GHz Twelve Core
  • 2x Intel Xeon E5-2697 V3 2.6GHz Fourteen Core
 

To give you an idea about how each of these CPUs will perform relative to each other (without having to dig through the collapsed table above) we created a graph showing the theoretical time to render a scene in Mental Ray that has the same parallel efficiency as those we tested:

Mental Ray CPU Performance
As you can see, each increase in model (as well as price) gives a significant decrease in render times. The difference in render times between each of the CPUs above ranges from 8% to almost 20%, but on average the difference is about 14.5%. 

One last thing we will note is that while these CPUs should give you the best bang for your buck for Mental Ray, they may not be the best CPU for your system as a whole. Depending on what else you use the system for, it may be better to sacrifice a bit of performance in Mental Ray to substantially increase the performance of any other programs you regularly use. This can often be a tricky balance to find, so we highly recommend speaking with one of our consultants before purchasing a system that is going to be used for multiple tasks.

Phone: 425.458.0273
Email: [email protected]

Tags: CPU, Mental Ray, Processor, Rendering

Who is Puget Systems?

Puget Systems builds custom PCs tailored for your workflow

Extensive in-house testing
making you more productive and giving you more performance for your dollar

Reliable workstations
with fewer crashes and blue screens means more time working, less time waiting on your computer

Support that understands
your complex workflows and can get you back up and running ASAP

Proven track record
check out our customer testimonials and Reseller Ratings

Get Started

Browse Our Workstations

Fractal Design Define 7 Chassis with Puget Systems Logo

Select your workflow:

Content Creation
Engineering
Scientific Computing
More

Latest Articles

  • Puget Systems Hardware Trends of 2022
  • Guide To Removing Instapak Foam
  • Basic Guide to Identify and Remove Malware
  • Should I Upgrade My Gpu
  • Case Study with Lost Boys Interactive
View All

Post navigation

 Samsung SATA SSD’s Amazing ReliabilityNVIDIA Iray GPU Performance Comparison 
Puget Systems Logo
Build Your Own PC Site Map FAQ
facebook instagram linkedin rss twitter youtube

Optimized Solutions

  • Adobe Premiere
  • Adobe Photoshop
  • Solidworks
  • Autodesk AutoCAD
  • Machine Learning

Workstations

  • Content Creation
  • Engineering
  • Scientific PCs
  • More

Support

  • Online Guides
  • Request Support
  • Remote Help

Publications

  • All News
  • Puget Blog
  • HPC Blog
  • Hardware Articles
  • Case Studies

Policies

  • Warranty & Return
  • Terms and Conditions
  • Privacy Policy
  • Delivery Times
  • Accessibility

About Us

  • Testimonials
  • Careers
  • About Us
  • Contact Us

© Copyright 2023 - Puget Systems, All Rights Reserved.