Hyper-Threading may be Killing your Parallel Performance

Hyper-Threading, hyperthreading, or just HT for short, has been around on Intel processors for over a decade and it still confuses people. I’m not going to do much to help with the confusion. I just want to point out an example from some testing I was doing recently with the ray-tracing application POV-ray that surprised me. Hyper-threading dramatically lowered the performance on a multi-core test system running Windows when running POV-ray in parallel.

Install CUDA and PGI Accelerator with OpenACC

I’m going to walk you through a basic install and configuration for a development system to do CUDA and OpenACC GPU programming. This is not a detailed howto but if you have some linux admin skills it will be a reasonable guide to get you started. We’ll do a basic NVIDIA GPU programming setup including CentOS 6.5, CUDA development environment and a PGI compiler setup with OpenACC. The most interesting part may be the OpenACC setup. OpenACC is a relatively new option for GPU programming and allows for a directive (pragma) based coding model.

Quad Xeon vs Opteron, Zemax OpticStudio

We take a look at Quad Xeon and Quad Opteron performance and parallel scaling with Zemax OpticStudio including an analysis using Amdahl’s Law. Based on this analysis we then make performance predictions for other processors.

NVIDIA GPU Starter DevKit with OpenACC

NVIDIA Tesla K20 plus PGI Accelerator compilers with OpenACC in a package deal with a system. Yes, it’s official. If you’ve wanted to do some development work with OpenACC on Tesla, this is a nice way to get started with a heavily discounted K20 and PGI compiler package pre loaded on a Peak Mini.

LinuxFest Northwest 2014

The annual Northwest pilgrimage for the Linux faithful to the Bellingham Technical College in Bellingham, WA is nearly upon us! Puget Systems is donating a great machine to the raffle, a Serenity mini with a commemorative case etching!

NVIDIA HPC future directions

Where is NVIDIA heading with High Performance Computing hardware? Ever since Intel announced Xeon Phi Knights Landing as a stand-alone processor integrated at the board level as a full compute unit, I’ve been wondering what NVIDIA would do along these lines. It just makes sense that they would do something similar since getting the GPU off of the PCIe bus and tightly integrated with plentiful system memory would be a huge step forward for usability and performance. Here’s my guess about where NVIDIA is heading.

GTC 2014 Brief Recap — content links

I had the pleasure of attending the NVIDIA Graphics Technology Conference ( GTC ) last week. Wonderful conference! If you have any doubts about the quality of the conference you are in luck. They have most of the content on-line, you can check it out yourself …