Intel E5 v3 processors will run at “All Core Turbo” under load if properly cooled. This “clock” measurement is a better predictor of theoretical performance than base clock. We present a table of CPU performance at “all-core-turbo” using different parallel scaling factors from Amdahl’s Law. We have a dynamic graph that will show how much performance you lose when your parallel scaling is less than perfect. Just because your dual socket 16-core system shows all 32 cores at 100% doesn’t mean your problem is running 32 times faster!
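To make the Amdahl's Law point concrete, here is a minimal sketch of the standard formula, speedup = 1 / ((1 - P) + P / N), where P is the fraction of the work that runs in parallel and N is the core count. The function name and the sample values of P are illustrative assumptions, not figures taken from the post's table.

```python
def amdahl_speedup(parallel_fraction: float, cores: int) -> float:
    """Ideal speedup on `cores` cores under Amdahl's Law.

    `parallel_fraction` (P) is the share of the runtime that scales
    perfectly with core count; the remaining 1 - P stays serial.
    """
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be between 0 and 1")
    if cores < 1:
        raise ValueError("cores must be at least 1")
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / cores)


if __name__ == "__main__":
    cores = 32  # e.g. a dual socket 16-core system
    # Hypothetical parallel fractions, chosen only for illustration.
    for p in (1.0, 0.99, 0.95, 0.90):
        print(f"P = {p:.2f}: {amdahl_speedup(p, cores):5.1f}x on {cores} cores")
```

Even with 95% of the work running in parallel, 32 cores give only about a 12.5x speedup under this model, which is why 32 busy cores do not imply a 32x faster job.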
What is Machine Learning
Machine Learning is getting a lot of attention these days and with good reason. There are mountains of data to work with and computing resources to handle the problems are easily attainable. Even a single GPU accelerated workstation is capable of serious work.
Intel Python Preview
Intel is working on an optimized Python! I did a quick test with the preview version and it looks good.
Windows 10 with Xeon Phi
Can you use an Intel Xeon Phi with Windows 10? Yes, you can. However, just because you can do something doesn’t mean that you should do it! I did a setup and a little testing, mainly just to see if it would work, and it does!
Molecular Dynamics Performance on GPU Workstations — NAMD
Molecular Dynamics programs can achieve very good performance on modern GPU accelerated workstations, giving job performance that was achievable only on CPU compute clusters a few years ago. The group at UIUC working on NAMD were early pioneers of using GPUs for compute acceleration, and NAMD gets very good acceleration from NVIDIA CUDA. We show you how good that performance is on modern NVIDIA GPUs.
Intel Skylake 6700K with Parallel Studio XE 2016 vs 2015 on Fedora 23: Much Better!
Intel Skylake Core-i7 CPU — 256 GFLOP/s Linpack result with Intel Parallel Studio XE 2016 and MKL 11.3 vs 200 GFLOP/s using Intel Parallel Studio XE 2015 and MKL 11.2!
Skylake-S i7 6700K and i5 6600K for Compute? Maybe?
I have done a little informal testing with the new i7 and i5 processors running the Linpack benchmark and a NAMD MD simulation. Mixed results!
CentOS 7 kernel boot order bug
I have been butting heads with a particularly annoying bug that I hit frequently on installs since I work with systems that need to have kernel modules recompiled for CUDA and the Xeon Phi. I have it mostly figured out and have a fix in this post.
OpenACC for free! — NVIDIA OpenACC Toolkit
NVIDIA and PGI are offering “PGI Accelerator with OpenACC” free to academia (or as a 90-day trial for commercial users) under the banner “NVIDIA OpenACC Toolkit”. It’s about time!
Xeon Phi 5110p and Free Intel Parallel Studio Cluster Edition
Another amazing deal on Xeon Phi from Intel! This time you can get a 90% discount on a Phi 5110p and get the Intel Parallel Studio Cluster Edition with a 1-year license for free.




