Professional Documents
Culture Documents
Horst Simon
Lawrence Berkeley National Laboratory
and UC Berkeley
2
A decadal DOE plan for providing exascale
applications and technologies for DOE mission
needs
3
Process for identifying exascale applications and technology
for DOE missions ensures broad community input
First-Principles Calculations of
Materials Genome (K. Persson)
7
Advanced Computation in Energy
Science
First-Principles Calculations of
Materials Genome (K. Persson)
8
First-principles Methods Predictive Power
O
covered!
9
Leveraging the Information Age
Scalable
Infrastructure
Web
NERSC
Invert the Design Problem Automation
(MongoDB)
Engine
API
Computing ~200 compounds/day
Completely new
New phosphate class of materials
discovered @MIT synthesized based
through on computational
computations predictions
LBNL NERSC
Kathy Yelick LBNL CRD
David Skinner
Juan Meza
Eli Dart David Bailey
Shreyas Cholia
Alex Kaiser
Daniel Gunter Maciej Haranczyk
Annette Greiner
Zhenji Zhao
12
Overview
13
Extreme Scale Computing Today
Computational
Science ASC and ASCR provide much more than machines:
• Applications (Computational Science)
Computer Applied • Algorithms (Applied Mathematics)
Science Mathematics
• Systems (Computer Science)
• Integration (SciDAC, Campaigns)
14
Jaguar: Most Powerful Computer in the U.S. in 2011
Most Powerful Computer in the World in 2009
!"#$%&"'()'*#+,"% -.//-%!0%
!"#$%&'&%&()"' *++',-'
./#0'#123%' 4+'5-'
5)(3%##()#' 6678'
12%3)4.%-556% 5(9%)' :;<='>?'
1-%3)4.%-525%
15
ASCI Red: World’s Most Powerful
Computer in 1999
12%3)4.%2666%
!"#$%&"'()'*#+,"% /.278%90%
!"#$%&'&%&()"' 4;646',-'
./#0'#123%' 46;=',-'
5)(3%##()#' <6<@'
5(9%)' @=+'0?'
16
Comparison:
Jaguar (2009) vs. ASCI Red (1999)
• 24x processors/cores
• 8.2x power
Significant increase
in operations cost
Essentially the same architecture and software
environment
17
Traditional Sources of Performance
Improvement are Flat-Lining (2004)
• New Constraints
– 15 years of exponential
clock rate growth has
ended
18
5%)A()&2B3%'.%C%D(1&%B$'
8/.@@%!0<)&=>%
100 Pflop/s
10 Pflop/s -.7:%!0<)&=>%
1 Pflop/s
10 Tflop/s
1 Tflop/s
3C2%
2.2:%90<)&=>%
100 Gflop/s
76.:%;0<)&=>%
10 Gflop/s 3C755%
1 Gflop/s
100 Pflop/s
10 Pflop/s
AB?%
1 Pflop/s
100 Tflop/s
3C2%
10 Tflop/s
1 Tflop/s
3C755%
100 Gflop/s
10 Gflop/s
1 Gflop/s
100 Mflop/s
G(B3H))%B3"'I%C%D#'
Jack‘s Notebook
Moore’s Law reinterpreted
22
,($2D'5(9%)'I%C%D#'J0?K'A()',L5=++'#"#$%&#'
5(9%)'MN3/%B3"'J>O(1#P?2QK'A()'F/R%)%B$''
5)(3%##()'S%B%)2T(B#'
Koomey’s Law
• Computations per
kWh have improved
by a factor about 1.5
per year
• “Assessing Trends
in Electrical
Efficiency over
Time”, see IEEE
Spectrum, March
2010
25
Overview
26
Trend Analysis
27
We won’t reach Exaflops with the current approach
From Peter
Kogge, DARPA
Exascale Study
28
… and the power costs will still be staggering
1000
System Power (MW)
100
10
1
2005 2010 2015 2020
29
Potential System Architecture Targets
30
Comparison:
“2018” vs. Jaguar (2009)
31
DOE Exascale Technology Roadmap
32
Where do we get 1000x performance
improvement for 10x power?
1. Processors
2. On-chip data movement
3. System-wide data movement
4. Memory Technology
5. Resilience Mechanisms
33
Summary
34