: Michael Resch, Sabine Roller, Katharina Benkert, Martin Galle, Wolfgang Bez, Hiroaki Kobayashi
: Sabine Roller, Katharina Benkert, Martin Galle, Wolfgang Bez, Hiroaki Kobayashi
: High Performance Computing on Vector Systems 2009
: Springer-Verlag
: 9783642039133
: 1
: CHF 114.00
:
: Sonstiges
: English
: 250
: Wasserzeichen
: PC/MAC/eReader/Tablet
: PDF
This book covers the results of the Tera op Workbench, other projects related to High Performance Computing, and the usage of HPC installations at HLRS. The Tera op Workbench project is a collaboration between the High Performance C- puting Center Stuttgart (HLRS) and NEC Deutschland GmbH (NEC-HPCE) to s- port users in achieving their research goals using High Performance Computing. The rst stage of the Tera op Workbench project (2004-2008) concentrated on user's applications and their optimization for the former ag ship of HLRS, a - node NEC SX-8 installation. During this stage, numerous individual codes, dev- oped and maintained by researchers or commercial organizations, have been a- lyzed and optimized. Within the project, several of the codes have shown the ability to outreach the TFlop/s threshold of sustained performance. This created the pos- bility for new science and a deeper understanding of the underlying physics. The second stage of the Tera op Workbench project (2008-2012) focuses on c- rent and future trends of hardware and software developments. We observe a strong tendency to heterogeneous environments on the hardware level, while at the same time, applications become increasingly heterogeneous by including multi-physics or multi-scale effects. The goal of the current studies of the Tera op Workbench is to gain insight in the developments of both components. The overall target is to help scientists to run their application in the most ef cient and most convenient way on the hardware best suited for their purposes.
Preface5
Contents7
I Petaflop/s Computing14
Lessons Learned from 1-Year Experience with SX-9 and Toward the Next Generation Vector Computing15
Introduction15
SX-9 System Overview16
HPC Challenge Benchmark Results18
Case Study Analysis of Memory-Conscious Tuning for SX-925
Multi-Vector Cores Processor Design30
Summary33
References34
BSC-CNS Research and Supercomputing Resources35
Overview35
Supercomputing Resources at BSC36
MareNostrum36
MareNostrum Performance 200838
Shared Memory System38
Backup and HSM Service39
Spanish Supercomputing Network39
PRACE Prototype40
Research at BSC42
Challenges and Opportunities of Hybrid Computing Systems43
Introduction43
European Context45
Validation Scenario46
Initial Results47
Operational Requirements49
Conclusions and Future Work51
References51
Going Forward with GPU Computing52
Computing needs at CEA52
Starting the Process54
Available Hardware54
Choosing a Programming Language57
CUDA57
OpenCL58
RapidMind58
HMPP58
A Remark on Languages60
Training Sessions60
The System Administration Side60
The Grand Challenges Strategy61
Foreseen Problems61
First Results62
Conclusion63
Optical Interconnection Technology for the Next Generation Supercomputers64
Introduction64
Components and Structure66
Performance67
Conclusions69
References69
HPC Architecture from Application Perspectives70
Introduction70
Trend of CPU Performance72
Architectural Challenges74
SIMD-based Approaches75
Conclusions77
References78
II Strategies79
A Language for Fortran Source to Source Transformation80
Compiler80
Self Defined Transformations81
The Transformation Language81
Transformation Variables82
Transformation Constructs82
Self Defined Procedures in the Transformation Code83
Intrinsic Procedures83
Parsing Primitives in Parsing Mode84
Examples85
Concluding Remarks87
The SX-Linux Project: A Progress Report88
Introduction89
Project Paths89
Progress and Status91
The GNU Toolchain91
Binutils91
GCC92
Current Toolchain Status93
Future Work94
User Space and I/O Forwarding94
Newlib94
Future of Newlib95
Virtualization Layer96
I/O Forwarding97
I/O Forwarding-Current Implementation98
I/O Forwarding Library Status99
Kernel99
Kitten LWK100
Implementation and Status100
Bootstrapping100
Early Introspection100
Stack and Memory Layout101
Interrupts102
System Calls102
Context Switch102
User Space103
Status103
Outlook104
References105
Development of APIs for Desktop Supercomputing106
Introduction106
Client APIs for GDS108
Client APIs108
Script Generator API108
Implementation of Script Generator API in AEGIS110
Development of GDS Application of Three-dimensional Virtual Plant Vibration Simulator112
Three-dimensional Virtual Plant Vibration Simulator112
Development of GDS Application of Three-dimensional Virtual Plant Vibration Simulator113
Summary114
References115
The Grid Middleware on SX and Its Operation for Nation-Wide Service117
Introduction117
Structu