Professional Documents
Culture Documents
Outline
What is a cluster?
Simulation results
1. Test problems
2. Criteria
3. Evaluations
Whats coming
Introduction
2007 3.4
2009 3.5a
2012 - 4.2
History of HPC
1954
1958
1962
1967
1969
1964
1976
1976
1996
2002
What is this?
What is this?
Test Problems
Benchmark criteria
Test Criteria
Benchmark criteria
Memory
CPU time
Software
Number of
elements
Memory cost
(MB)
Compared values
case
Case 1:
Used
COMSOL 4.2
COMSOL 3.5
3,341
3,372
468
264
164
240
Totdis-max:27.62m
Totdis-max:21.97m
Circulator
COMSOL 4.2
COMSOL 3.5
5,051
9,635
20.101
5,032
10,779
16,893
14,031
14,089
299
434
489
170
360
480
338
280
4
5
8
3
8
22
23
103
Xdis-max=3.065m
Xdis-max=3.069m
Xdis-max=3.067m
Xdis-max=3.065m
Xdis-max=3.069m
Xdis-max=3.066m
reflection, isolation and
insertion loss
Case 4:
COMSOL 4.2
32,850
441
13
Bmax=1.267T
COMSOL 3.5
32,395
190
17
Bmax=1.257T
FSI
Case 2:
COMSOL 4.2
Actuator
COMSOL 3.5
Case 3:
Generator
What is a cluster?
PC Clusters
Beowulf
SIMD vs MIMD
Test Measures
Speedup
Scalability
When you compute with multiple processors in
a parallel environment, you will also want to
know how your code scales.
The scalability of a parallel code is defined as
its ability to achieve performance proportional
to the number of processors used.
Linear Speedup
If it takes one processor an amount of
time t to do a task and if p processors can
do the task in time t/p, then you have
perfect or linear speedup (Sp= p).
Slowdown
Efficiency
Amdahl's Law
This is the fraction of code that will have to be run with just one
processor, even in a parallel run.
Amdahl's Law defines speedup with p processors as follows:
Amdahls Law
Speedup Limitations
Wrong algorithm
Load imbalance
Problem Size
SGI - USAFA
COMSOL Tests
The following codes were tested on the
SGI Altix System at the USAFA
1. sar_in_human_head (RF)
2. Induction motor
3. Turbulence backstep
4. Free convection
5. Tuning fork
(plus other models now being run)
DOF
node time
(s)
Sp
Ep
RF Module
780308
108
5.62
19.21
0.800
Induction
Motor
95733
51
2.6
19.61
0.817
86786
67
3.48
19.25
0.802
Turbulent
Backstep
70405
99
5.15
19.22
0.800
46557
56
2.91
19.24
0.801
Free
Convection
10360
11
0.57
19.29
0.803
Tuning
Fork
96822
204
10.62
19.20
0.800
Whats coming?
Computer performance
Name
FLOPS
yottaFLOPS
1024
zettaFLOPS
1021
exaFLOPS
1018
petaFLOPS
1015
teraFLOPS
1012
gigaFLOPS
109
megaFLOPS
106
kiloFLOPS
103
TNB
Contacts
Darrell W. Pepper
USAFA, DFEM, Academy, CO 80840
darrell.pepper@usafa.edu
Dept. of ME, UNLV, NCACM
University of Nevada Las Vegas
Las Vegas, NV 89154
darrell.pepper@unlv.edu
David Kan
COMSOL
Los Angeles, CA
david.kan@comsol.com