64 bit Fortran Execution Time Benchmarks - Linux64 on AMD X2 5600

Benchmark          Absoft       g95  GFortran     Intel     Lahey       Nag Pathscale       PGI       Sun
Version              10.2      0.91     4.3.0  11.0.074       8.1       5.1       3.2     8.0-2       8.3
AC                   9.08     17.62     13.28     11.58     13.93     21.51      8.89     12.13     17.57
AERMOD              23.50     39.45     32.40     20.11     24.26     37.86     36.10     24.67     26.44
AIR                 13.34     19.27     12.98     11.47     49.96     12.70     12.38     13.53     11.71
CAPACITA            55.29     94.69     83.18     75.56     99.11     83.84     56.93     54.17     82.11
CHANNEL             16.50     23.70     11.51     14.45     14.82     13.40     17.33     12.45     11.06
DODUC               33.82     44.78     40.81     30.45     39.16     43.88     35.40     31.70     39.29
FATIGUE              5.33     36.82      9.30      7.20     10.40     17.54      5.29      6.99      7.16
GAS_DYN              5.19     18.88     10.24      6.40      7.72     12.65      7.81      8.01     10.71
INDUCT              28.29     40.67     41.65     34.99     42.86     34.45     26.07     31.71     36.55
LINPK               25.42     25.05     23.43     23.23     23.32     21.52     25.49     24.09     23.93
MDBX                17.13     19.09     18.74     17.04     17.06     18.65     16.97     18.31     15.91
NF                  26.04     42.72     28.49     22.77     33.74     24.21     24.90     25.58     25.91
PROTEIN             42.73     62.26     50.62     41.31     70.47     48.29     45.64     50.28     57.54
RNFLOW              27.49     45.20     31.09     33.06     35.10     36.93     32.35     37.94     32.03
TEST_FPU            17.26     30.77     19.50     16.51     18.65     18.45     17.33     17.74     14.90
TFFT                 7.19      7.41      7.24      7.13      7.10      7.20      7.36      7.82      7.06

Geometric Mean      17.93     30.28     21.69     18.61     24.29     23.55     19.06     19.56     20.59
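
The Geometric Mean row summarises each compiler's column as a single figure. As a minimal sketch of how such a figure can be reproduced (the function name `geometric_mean` and the list `absoft` are illustrative names, not part of the benchmark harness), the following Python takes the Absoft column and computes the geometric mean via the average of the logarithms:

```python
import math

def geometric_mean(times):
    """Return the geometric mean of a list of positive execution times (seconds)."""
    # exp(mean(log t)) is equivalent to the n-th root of the product,
    # but avoids building up a very large intermediate product.
    return math.exp(sum(math.log(t) for t in times) / len(times))

# Absoft column copied from the table, AC through TFFT (16 benchmarks).
absoft = [9.08, 23.50, 13.34, 55.29, 16.50, 33.82, 5.33, 5.19,
          28.29, 25.42, 17.13, 26.04, 42.73, 27.49, 17.26, 7.19]

print(f"Absoft geometric mean: {geometric_mean(absoft):.2f} s")
```

The result should land close to the 17.93 reported for Absoft; a small difference in the last digit is possible because the table entries are themselves rounded to two decimals.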

Compiler Switches
Absoft af95 -m64 -Ofast -speed_math=10 -march=opteron -msse3 -xINTEGER
g95 g95 -march=opteron -ffast-math -funroll-loops -O3
gfortran gfortran -march=native -ffast-math -funroll-loops -O3
Intel ifort -O3 -fast -ipo -no-prec-div
Lahey lf95 --fast --static -x -
NAG nagf95 -O4 -mismatch_all -ieee=full -Bstatic
Pathscale pathf90 -Ofast -WOPT:if_conv=0 -OPT:ro=1 -LNO:fu=9:full_unroll_size=7000
PGI pgf90 -Bstatic -V -fastsse -Munroll=n:4 -Mipa=fast,inline -tp k-8-64
Sun sunf95 -fast -xtarget=opteron
 

Notes

All figures are execution times in seconds, measured on a Dell Dimension E521 with an AMD X2 5600 processor (2.8 GHz) and 4 x 1024 MB of 533 MHz DDR2 memory, running openSuSE 10.2. Each figure is the average over at least 10 runs (many more for some). Measurement error is typically <1%. Green cells highlight figures within 10% of the fastest for that benchmark. Red cells indicate figures which are more than 150% of the fastest.
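
The green and red highlighting rule can be sketched in a few lines of Python. This is an illustration only, under two assumptions: that "within 10% of the fastest" means a time no greater than 1.10 times the row minimum, and that "more than 150% of the fastest" means a time greater than 1.50 times the row minimum. The function `classify` and its parameter names are hypothetical and are not taken from the benchmark harness.

```python
def classify(times, green_within=0.10, red_above=1.50):
    """Label each time in one benchmark row as 'green', 'red' or 'plain'.

    Assumed thresholds (an interpretation of the notes, not confirmed):
      green: t <= fastest * (1 + green_within)
      red:   t >  fastest * red_above
    """
    fastest = min(times)
    labels = []
    for t in times:
        if t <= fastest * (1.0 + green_within):
            labels.append("green")
        elif t > fastest * red_above:
            labels.append("red")
        else:
            labels.append("plain")
    return labels

compilers = ["Absoft", "g95", "GFortran", "Intel", "Lahey",
             "Nag", "Pathscale", "PGI", "Sun"]

# FATIGUE row copied from the table.
fatigue = [5.33, 36.82, 9.30, 7.20, 10.40, 17.54, 5.29, 6.99, 7.16]

for name, label in zip(compilers, classify(fatigue)):
    print(f"{name:10s} {label}")
```

Under these assumptions, the FATIGUE row marks Absoft and Pathscale as green, since both are within 10% of the 5.29 s minimum, and g95, GFortran, Lahey and Nag as red.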

So far as possible, we have used the compiler switches which give the best overall results.  We have not attempted to tune individual benchmarks, and, in particular cases, different switch settings may give better results. 

Thanks are due to Jos Bergervoet for permission to use his CAPACITA benchmark, to Quetzal Associates for permission to use their CHANNEL, FATIGUE, GAS_DYN, INDUCT, PROTEIN and RNFLOW benchmarks, to David Frank for his TEST_FPU benchmark, and to Ted Addison of McVehil-Monnett Associates for permission to use AERMOD, an air quality model used by the US Environmental Protection Agency.

All the benchmarks have been modified slightly to fit into our benchmarking harness. 

The NF benchmark uses "nested factorization", a little-known but very effective iterative linear solver for huge finite difference matrices. A paper describing nested factorization and comparing it to other methods is available here.

These benchmarks were also used to compare Win32 and Linux compilers on a Pentium-based machine.

Download Polyhedron Benchmarks