At Puget Systems we record a huge amount of data from every system we sell including benchmarks, BIOS screenshots, thermal images, and system photos. In fact, much of this data is published on our website and can be accessed through our part information pages. Simply scroll to the bottom of a part information page and you can view this data from recent systems we built.
While this data is nice, one of the most important things we track is something we don't have publicly available at all times: the failure rates of individual components. Reliability is one of our primary goals, so this data is invaluable for tracking individual component, product line, and overall brand failure rates. With 2016 coming to a close, we thought we would run some reports and share what hardware we found to be the most reliable in the last year.
One thing we want to make clear is that we can only comment on products that we have sold significant quantities of. There may be an even more reliable product out there, but either we have not sold it or we have not sold it in large enough quantities for us to have enough data to call it reliable or not. With that said, let's get started!
A reliable motherboard is essential in a high quality computer. Not only is a motherboard very difficult to swap out, but the effects of a poor quality motherboard can be far reaching and difficult to troubleshoot. This is complicated by the fact that motherboards are one of the most complex components in a computer. There are SATA, USB, fan, and network controllers as well as the physical ports, audio chips, and everything else that is needed to interconnect every component in your system. This is a huge number of delicate parts that have to work perfectly together, and any one of them could potentially have a problem. If there is a single dead USB port, slight static over the audio, or a voltage level that measures outside the norm, the board does not meet our standards and is considered to have failed.
Because of this, motherboards have one of the highest overall failure rates of any core component, with about 1 out of every 18 motherboards (5.5%) failing for one reason or another. This may seem like a high failure rate, but the silver lining is that we catch the large majority of these failures in-house before the system is shipped to the customer. In fact, motherboards as a whole only have a 1% failure rate (or one out of every 100) when you only look at issues that occurred in the field.
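To make the split between in-house and field failures concrete, the figures above can be worked through in a few lines of Python. This is only a back-of-the-envelope sketch: it assumes every failure is either caught in-house or happens in the field, it uses the rounded "1 out of every N" figures quoted above, and the function and variable names are purely illustrative.

```python
def one_in_n_to_percent(n: int) -> float:
    """Convert a '1 out of every n' figure into a percentage."""
    return 100.0 / n


# Figures quoted above: about 1 in 18 boards fail overall, 1 in 100 in the field.
overall_pct = one_in_n_to_percent(18)
field_pct = one_in_n_to_percent(100)

# Assumption: overall failures = in-house failures + field failures.
in_house_pct = overall_pct - field_pct
share_caught_in_house = in_house_pct / overall_pct

print(f"Overall failure rate: {overall_pct:.2f}%")
print(f"Caught in-house:      {in_house_pct:.2f}% of boards")
print(f"Share of failures caught before shipping: {share_caught_in_house:.0%}")
```

With these rounded inputs, roughly four out of five motherboard failures are caught before a system ships.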
We like to look at reliability in two ways. First, we look at the overall failure rate, which includes both DOA (dead on arrival) failures and failures that happen in the field. Second, we look only at failures that happened in the field, meaning failures we were not able to catch in-house before the system shipped. Sorting with these methods, there are a number of boards we found to have an above-average history of reliability in 2016:
Best overall reliability:
(Lowest combination of DOA and field failures)
Best long-term reliability:
(Lowest field failure rate)
2016 was a bit of a rocky year for CPUs in terms of failure rate. Last year, we only saw an overall failure rate of .33%, but this year it jumped up to .76%. This is actually still very good, just not as good as it was last year. We did not sell enough AMD CPUs/APUs in 2016 for us to make a call on their reliability, but for Intel CPUs we can give numbers divided up between all Intel CPUs, Core i3/i5/i7, and Xeon E3/E5:
|Intel CPUs||Total Failure Rate||Field Failure Only|
|Intel Core i3/i5/i7||1.0%||.34%|
|Intel Xeon E3/E5||.32%||.16%|
One interesting thing to point out is that we saw less than half the failure rate with Intel Xeon CPUs versus the Intel Core i3/i5/i7 CPUs. Given the jump in Core i3/i5/i7 failures compared to last year, we recommend using a Xeon CPU where possible if you want the most reliable CPU.
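For readers who want to check the "less than half" comparison themselves, here is a minimal sketch that divides the Xeon rates by the Core rates from the table above. The dictionary layout and function name are just illustrative choices, not part of any tool we use.

```python
# Failure rates (in percent) copied from the table above.
rates = {
    "Core i3/i5/i7": {"total": 1.0, "field": 0.34},
    "Xeon E3/E5": {"total": 0.32, "field": 0.16},
}


def xeon_to_core_ratio(metric: str) -> float:
    """Xeon failure rate as a fraction of the Core rate for one metric."""
    return rates["Xeon E3/E5"][metric] / rates["Core i3/i5/i7"][metric]


for metric in ("total", "field"):
    ratio = xeon_to_core_ratio(metric)
    print(f"{metric:>5}: Xeon rate is {ratio:.0%} of the Core rate")
```

Both ratios come out below 50%: about a third for total failures and just under half for field failures.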
The brands and models of memory we offer in our workstations change based on a number of factors including reliability and availability, but in 2016 we primarily offered Crucial and Samsung RAM. Separating standard RAM from ECC and Reg. ECC RAM, let's first take a look at the one model of standard RAM with no failures in 2016:
|Desktop DDR4 RAM||Failure Rate|
|Crucial DDR4-2133 4GB (CT4G4DFS8213)||0%|
Interestingly, this exact model made our most reliable list last year as well, which makes it extra good! We offer both 8GB and 16GB versions of this stick as well, but unfortunately both of those models had a number of sticks that failed. They were still decent overall (and had very low field failure rates), but nothing like the Crucial 4GB stick.
|Server/Workstation DDR4 RAM||Failure Rate|
|Crucial DDR4-2133 8GB ECC Reg. (CT8G4RFS4213)||0%|
|Crucial DDR4-2133 16GB ECC (CT16G4WFD8213)||0%|
|Samsung DDR4-2133 8GB ECC Reg. (M393A1G40DB1-CRC)||0%|
|Samsung DDR4-2133 16GB ECC Reg. (M393A2G40DB1-CRC)||0%|
|Samsung DDR4-2133 32GB ECC Reg. (M393A4K40BB1-CRC)||0%|
ECC RAM (and Reg. ECC) is specifically designed to be reliable, and in 2016 it actually turned out to be about 4x more reliable than standard RAM (.2% vs .9%). Because of this, we had a number of different models that we sold a significant quantity of that had absolutely no failures.
Just like RAM, we typically only use a few brands of SSDs and hard drives that we historically know to be extremely reliable. In 2016 we sold primarily Samsung and Intel SSDs and the reliability for both brands was excellent. In fact, the failure rates were so low that instead of listing individual models we are simply going to list the failure rates for the two product lines we found to be exceedingly reliable in 2016:
If you compare the two drive lines above to our list from last year, you will notice that it is much shorter. The Samsung 850 Pro line is still exceptional with no failures at all in 2016, but the 850 EVO is no longer on the list. The EVO line is still terrific with only a .26% failure rate, but solid state drives in general are so reliable that a rate that low is not all that uncommon. Also no longer on the list are the Intel 750 NVMe drives - these had a number of failures in the field this year which knocked them off.
Traditional platter drives - with all their moving parts - did not fare as well in 2016. In fact, the failure rates were high enough that we are not comfortable putting any models onto this list. We primarily offer WD Red, Gold, and RE drives and while none of these were terribly bad in terms of reliability (we wouldn't carry them if they were), at the same time they are not anything special. Platter drives as a whole had about a 1% failure rate overall, with more than half of the failures occurring in the field. So if reliability is a key concern for you, we recommend using an SSD if possible.
Like motherboards, video cards tend to have a bit higher failure rate than other hardware components. In addition, with the wide range of video cards we offer (including a mix of different brands), naming the most reliable model is a bit tough as we don't tend to sell incredibly large quantities of any one card. However, of the cards we sold enough of to have a good feel for their reliability, there are a few that stand out with no failures in 2016. In no particular order, these cards are:
EVGA GeForce GTX 950 2GB SC
(0% failure rate)
EVGA GeForce GTX 970 4GB SC
(0% failure rate)
EVGA GeForce GTX Titan X 12GB
(0% failure rate)
EVGA GeForce GTX 1070 8GB
(0% failure rate)
PNY GeForce GTX 1080 8GB
(0% failure rate)
PNY Quadro M2000 PCI-E 4GB
(0% failure rate)
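A 0% failure rate means more the larger the number of units behind it, which is why we only list products we sold in significant quantities. As a rough illustration, the sketch below computes an upper bound on the true failure rate when zero failures are observed, using both the exact binomial bound and the common "rule of three" approximation (3 divided by the number of units). The unit counts are hypothetical, since we are not publishing per-model sales figures here, and the calculation assumes failures are independent.

```python
def upper_bound_zero_failures(units_sold: int, confidence: float = 0.95) -> float:
    """One-sided upper bound on the true failure rate given zero failures.

    Solves (1 - p) ** units_sold = 1 - confidence for p.
    """
    return 1.0 - (1.0 - confidence) ** (1.0 / units_sold)


def rule_of_three(units_sold: int) -> float:
    """Quick approximation of the 95% upper bound: 3 / n."""
    return 3.0 / units_sold


# Hypothetical quantities; not actual sales figures.
for units in (50, 100, 500):
    exact = upper_bound_zero_failures(units)
    approx = rule_of_three(units)
    print(f"{units:>4} units, 0 failures: true rate likely below "
          f"{exact:.2%} (rule of three: {approx:.2%})")
```

In other words, zero failures across 100 cards is consistent with a true failure rate of up to roughly 3%, while zero failures across 500 cards narrows that to well under 1%.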
Video cards as a whole have a couple of interesting trends we also wanted to point out:
- As a whole, Quadro cards are about three times more reliable than GeForce cards. We saw a low 1% overall failure rate across the Quadro product line and only had a single card fail in the field. GeForce, on the other hand, had a higher 3.4% overall failure rate - although only .6% of the GeForce cards we sold had issues in the field. This may seem odd since only a single Quadro card made it onto our list, but this is mostly because we don't sell huge quantities of Quadro cards compared to GeForce. In addition, for Quadro we only sell a single brand (PNY) and there are no variations of each model - such as overclocked versions - so there is also a higher chance of there being a failure for each specific model. Taken as a whole, however, Quadro cards have a noticeably lower failure rate than GeForce.
- Overclocked (or SC) cards from EVGA are actually very good in terms of reliability. Where standard EVGA cards have a total failure rate of 1.87% (.65% in the field), the SC cards were actually slightly better at 1.58% (or .32% in the field). It is possible that the cards will burn out faster over time, but going back to the start of 2015, EVGA SC cards did not have a higher failure rate than their standard counterparts.
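The comparisons in the two points above can be reproduced with a short sketch. It uses only the percentages quoted in the text, treats "a low 1%" as exactly 1.0% for Quadro, and the group labels and function name are illustrative.

```python
# (total failure %, field failure %) as quoted in the two points above.
groups = {
    "GeForce (all)": (3.4, 0.6),
    "EVGA standard": (1.87, 0.65),
    "EVGA SC": (1.58, 0.32),
}


def field_share(total_pct: float, field_pct: float) -> float:
    """Fraction of all failures that happened in the field."""
    return field_pct / total_pct


for name, (total, field) in groups.items():
    print(f"{name:<14} total {total:.2f}%  field {field:.2f}%  "
          f"({field_share(total, field):.0%} of failures were in the field)")

# Assumption: the Quadro line's "low 1%" overall rate is taken as 1.0%.
quadro_total_pct = 1.0
geforce_vs_quadro = groups["GeForce (all)"][0] / quadro_total_pct
print(f"GeForce overall failure rate is {geforce_vs_quadro:.1f}x the Quadro rate")
```

The last line is where the "about three times" figure comes from, and the per-group output shows that SC cards had both a lower total rate and a smaller share of failures in the field than standard EVGA cards.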
Out of all the power supplies we sold an appreciable amount of in 2016, there were two models that stood out as being extra reliable:
EVGA SuperNOVA 550W G2
(.9% failure rate - 1 PSU)
Seasonic M12II EVO 520W
(.76% failure rate - 1 PSU)
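Because each of these models had exactly one failure, the quoted percentages also hint at roughly how many units stand behind them. The sketch below does that arithmetic. It assumes the failure rate is simply failures divided by units sold, and since the published rates are rounded, the results are only approximate. The function name is illustrative.

```python
def implied_units_sold(failures: int, failure_rate_pct: float) -> float:
    """Units sold implied by a failure count and a failure rate in percent."""
    return failures / (failure_rate_pct / 100.0)


# Figures quoted in the list above: one failed unit per model.
evga_units = implied_units_sold(1, 0.9)
seasonic_units = implied_units_sold(1, 0.76)

print(f"EVGA SuperNOVA 550W G2:  roughly {evga_units:.0f} units")
print(f"Seasonic M12II EVO 520W: roughly {seasonic_units:.0f} units")
```

At these volumes a single additional failure would roughly double a model's failure rate, so small differences between the two models should not be read too closely.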
Oddly, both of the power supplies are relatively low wattage (for our workstations at least). We use very high quality power supplies from the most reliable manufacturers, but it is interesting to see that none of the higher wattage and higher efficiency power supplies made our list this year. In fact, compared to the 2.6% overall failure rate from 2015, power supplies as a whole increased to a 3.11% failure rate in 2016 (with less than half failing in the field).
So there you have it: the most reliable hardware we sold in 2016. While there is too much data to make many broad generalizations, there are a few trends we want to point out:
- Samsung SSDs continue to be one of the most reliable components within our systems, with a total failure rate of just .16%.
- In the past we haven't seen a large difference in reliability between Intel Core i3/i5/i7 CPUs and Intel Xeon CPUs, but this year Xeon had about 1/3 the DOA rate and just under half the failure rate in the field.
- ECC and Reg. ECC RAM is much more reliable than standard RAM. Where normal DDR4 RAM saw about a .9% failure rate (with a .14% failure rate in the field), ECC and Reg. ECC RAM had just a .2% failure rate (.04% in the field).
- Power supplies took a small hit this year, with a combined failure rate (across primarily Seasonic and EVGA units) increasing from 2.6% last year to 3.1% this year. This isn't anything hugely concerning quite yet, but it is something we are keeping an eye on.
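To put the year-over-year changes and the ECC comparison on the same footing, here is a small sketch that works out the relative changes from the percentages quoted in this article. The helper names are illustrative and the inputs are the rounded figures from the text, so the outputs are approximate.

```python
def relative_change(old_pct: float, new_pct: float) -> float:
    """Relative change from old to new, as a fraction of the old value."""
    return (new_pct - old_pct) / old_pct


def times_lower(higher_pct: float, lower_pct: float) -> float:
    """How many times smaller the lower rate is than the higher one."""
    return higher_pct / lower_pct


# Year-over-year overall failure rates (2015 -> 2016), in percent.
psu_change = relative_change(2.6, 3.11)
cpu_change = relative_change(0.33, 0.76)

# Standard DDR4 vs ECC / Reg. ECC, total and field rates, in percent.
ecc_total_factor = times_lower(0.9, 0.2)
ecc_field_factor = times_lower(0.14, 0.04)

print(f"Power supplies: {psu_change:+.0%} relative change")
print(f"CPUs:           {cpu_change:+.0%} relative change")
print(f"ECC vs standard RAM: {ecc_total_factor:.1f}x lower total, "
      f"{ecc_field_factor:.1f}x lower in the field")
```

Note that a large relative change can still be a small absolute one: the CPU failure rate more than doubled, yet it remains under 1%.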
Keep in mind that if you purchase a system from us that includes parts not on this list, that does not mean you are at a significantly higher risk of your machine failing. Our internal testing catches the majority of problems before we ship the system, so if anything, this list is more about the parts that make the production process smoother for us than about parts that our customers should specifically use in their systems.
This wraps up the most reliable hardware of 2016. Overall, it was a great year for reliability at Puget Systems with just the normal shifts in reliability as new products and technologies come out. We saw some small increases in failure rates compared to 2015, but the majority of those failures were found in-house. As a whole, hardware appears to be pretty stable in terms of reliability without a significant shift in any direction.