HPC Posts Archive

Problems With RTX4090 MultiGPU and AMD vs Intel vs RTX6000Ada or RTX3090

Posted on February 15, 2023 by Dr. Donald Kinghorn

I was prompted to do some testing by a commenter on one of my recent posts. They had concerns about problems with dual NVIDIA RTX4090s on AMD Threadripper Pro platforms. I ran some applications to reproduce the problems reported above and tried to dig deeper into the issues with more extensive testing. The included table below tells all!

Intel Xeon E5 v4 Broadwell Buyers Guide (Parallel Performance)

Posted on July 1, 2016 by Dr. Donald Kinghorn

Intel’s Xeon E5 v4 processors are available and there are lots of them! The changes from the v3 Haswell are mostly small clock changes and increases in core count. You can now get a E5-2699v4 with 22 cores. In a dual socket system that’s 44 cores to work with. If the programs you want to run scale well with thread count then that could be a great processor for you. However, if your parallel scaling is not near linear then it may not be the best value. We have a dynamic chart of performance based on Amdahl’s Law that may help you decide which processor is best for your uses.

NAMD Molecular Dynamics Performance on NVIDIA GTX 1080 and 1070 GPU

Posted on June 23, 2016 by Dr. Donald Kinghorn

The new NVIDIA GeForce GTX 1080 and GTX 1070 GPU’s are out and I’ve received a lot of questions about NAMD performance. The short answer is — performance is great! I’ve got some numbers to back that up below. We’ve got new Broadwell Xeon and Core-i7 CPU’s thrown into the mix too. The new hardware refresh gives a nice step up in performance.

GTX 1080 CUDA performance on Linux (Ubuntu 16.04) preliminary results (nbody and NAMD)

Posted on May 27, 2016 by Dr. Donald Kinghorn

Just got a NVIDIA GTX 1080 for testing. I hacked up an install with Ubuntu 16.04 and CUDA 7.5 along with a beta display driver that works! First run after compiling the cuda samples nbody gave 5816 GFLOP/s! A GTX 980 on the same system does 2572 GFLOP/s. However, it’s not all good news …

Intel Broadwell Xeon E5 2600v4 performance test

Posted on May 18, 2016 by Dr. Donald Kinghorn

The Intel Xeon E5 2600 v4 Broadwell processors are finally available. My first Linpack testing with a E5-2687W v4 shows a greater than 35% performance increase over the v3 Haswell version! And, it’s the same price as the v3 version! It’s significantly better than expected.

NVIDIA CUDA with Ubuntu 16.04 beta on a laptop (if you just cannot wait)

Posted on March 8, 2016 by Dr. Donald Kinghorn

I was preparing a Puget Systems Traverse Skylake based laptop for GPU accelerated molecular dynamics demos at the upcoming ACS meeting and decided to see if I could get Ubuntu 16.04 beta working with NVIDIA CUDA 7.5. It worked!

Windows 10 with Xeon Phi

Posted on November 13, 2015 by Dr. Donald Kinghorn

Can you use an Intel Xeon Phi with Windows 10? Yes, you can. However, just because you can do something, doesn’t mean that you should do it! I did a set up and a little testing mainly just to see if it would work — it does!

Molecular Dynamics Performance on GPU Workstations — NAMD

Posted on October 27, 2015 by Dr. Donald Kinghorn

Molecular Dynamics programs can achieve very good performance on modern GPU accelerated workstations giving job performance that was only achievable using CPU compute clusters only a few years ago. The group at UIUC working on NAMD were early pioneers of using GPU’s for compute acceleration and NAMD has very good performance acceleration using NVIDIA CUDA. We show you how good that performance is on modern Nvidia GPU’s

OpenACC for free! — NVIDIA OpenACC Toolkit

Posted on July 14, 2015 by Dr. Donald Kinghorn

NVIDIA and PGI are offering “PGI Accelerator with OpenACC” free to academia (or 90 day trial for commercial users) under the banner “NVIDIA OpenACC Toolkit”. It’s about time!

Xeon Phi 5110p and Free Intel Parallel Studio Cluster Edition

Posted on June 22, 2015 by Dr. Donald Kinghorn

Another amazing deal on Xeon Phi from Intel! This time you can get a 90% discount on a Phi 5110p and get the Intel Parallel Studio Cluster edition with a 1 year license for free.

HPC Posts