HPC Posts Archive

Skylake-S i7 6700K and i5 6600K for compute? maybe?

Posted on August 5, 2015 by Dr. Donald Kinghorn

I have done a little informal testing with the new i7 and i5 processors running the Linpack benchmark and a NAMD MD simulation. Mixed results!

CentOS 7 kernel boot order bug

Posted on July 30, 2015 by Dr. Donald Kinghorn

I have been butting heads with a particularly annoying bug. I hit it frequently on installs, since I work with systems that need kernel modules recompiled for CUDA and the Xeon Phi. I have it mostly figured out and have a fix in this post.

OpenACC for free! — NVIDIA OpenACC Toolkit

Posted on July 14, 2015 by Dr. Donald Kinghorn

NVIDIA and PGI are offering “PGI Accelerator with OpenACC” free to academia (or as a 90-day trial for commercial users) under the banner “NVIDIA OpenACC Toolkit”. It’s about time!

Xeon Phi 5110p and Free Intel Parallel Studio Cluster Edition

Posted on June 22, 2015 by Dr. Donald Kinghorn

Another amazing deal on Xeon Phi from Intel! This time you can get a 90% discount on a Phi 5110p and get the Intel Parallel Studio Cluster Edition with a 1-year license for free.

GTX 980 Ti Linux CUDA performance vs Titan X and GTX 980

Posted on June 12, 2015 by Dr. Donald Kinghorn

NVIDIA has just launched the GTX 980 Ti and I got to run some benchmarks on one. How is the Linux CUDA performance? Almost as good as the Titan X! This is another great card from NVIDIA for single precision compute loads. We’ve got some numbers to show it.

Install NVIDIA CUDA on Fedora 22 with gcc 5.1

Posted on May 19, 2015 by Dr. Donald Kinghorn

Fedora 22 is full of new goodness like kernel 4.0 and gcc 5.1 and yes, you can install and run CUDA on it! It’s not officially supported but I did manage to get it working!

5 Ways of Parallel Programming

Posted on May 12, 2015 by Dr. Donald Kinghorn

Modern computing hardware is all about parallelism. This is because we essentially hit the wall several years ago on increasing core clock frequency to speed up serial code execution. The transistor count has continued to follow Moore’s Law (doubling every 1.5-2 years) but these transistors have mostly gone into multiple cores, vector units, memory controllers, etc. on a single die. To utilize this hardware, software needs to be written to take advantage of it, i.e., you have to go parallel.

GTC 2015 Deep Learning and OpenPOWER

Posted on April 6, 2015 by Dr. Donald Kinghorn

Another great GTC meeting. NVIDIA does this right! The most interesting aspects for me this year were the talks on “Deep Learning” (Artificial Neural Networks) and OpenPOWER. I have some observations and links to recordings of the keynotes and talks. Enjoy!

NVIDIA CUDA GPU computing on a (modern) laptop

Posted on March 13, 2015 by Dr. Donald Kinghorn

Modern high-end laptops can be treated as desktop system replacements, so it’s expected that people will want to try to do some serious computing on them. Doing GPU-accelerated computing on a laptop is possible and performance can be surprisingly good with a high-end NVIDIA GPU. [I’m looking at GTX 980m and 970m.] However, first you have to get it to work! Optimus technology can present serious problems to someone who wants to run a Linux-based CUDA laptop computing platform. Read on to see what worked.

Intel vs NVIDIA, IBM, Mellanox, AMD and everybody!

Posted on March 2, 2015 by Dr. Donald Kinghorn

The next 18 months are going to see more shakeup and factioning in the computing world than we have seen in over a decade. Intel is pulling more and more of the compute architecture onto a single piece of silicon and tightly integrating the whole hardware stack. That’s good and bad. It may let them achieve better performance. However, this is going to leave users with a choice of “all Intel” or something else entirely. And the “something else” is starting to seriously take shape.