<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<meta http-equiv="content-type" content="text/html;
charset=ISO-8859-1">
</head>
<body text="#000000" bgcolor="#ffffff">
<style>@font-face {
font-family: "Courier New";
}@font-face {
font-family: "Wingdings";
}@font-face {
font-family: "MS 明朝";
}@font-face {
font-family: "Cambria";
}@font-face {
font-family: "Lucida Grande";
}p.MsoNormal, li.MsoNormal, div.MsoNormal { margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman"; }a:link, span.MsoHyperlink { color: blue; text-decoration: underline; }a:visited, span.MsoHyperlinkFollowed { color: purple; text-decoration: underline; }pre { margin: 0cm 0cm 0.0001pt; font-size: 10pt; font-family: Courier; }p.MsoAcetate, li.MsoAcetate, div.MsoAcetate { margin: 0cm 0cm 0.0001pt; font-size: 9pt; font-family: "Lucida Grande"; }p.MsoListParagraph, li.MsoListParagraph, div.MsoListParagraph { margin: 0cm 0cm 0.0001pt 36pt; font-size: 12pt; font-family: "Times New Roman"; }p.MsoListParagraphCxSpFirst, li.MsoListParagraphCxSpFirst, div.MsoListParagraphCxSpFirst { margin: 0cm 0cm 0.0001pt 36pt; font-size: 12pt; font-family: "Times New Roman"; }p.MsoListParagraphCxSpMiddle, li.MsoListParagraphCxSpMiddle, div.MsoListParagraphCxSpMiddle { margin: 0cm 0cm 0.0001pt 36pt; font-size: 12pt; font-family: "Times New Roman"; }p.MsoListParagraphCxSpLast, li.Mso
ListParagraphCxSpLast, div.MsoListParagraphCxSpLast { margin: 0cm 0cm 0.0001pt 36pt; font-size: 12pt; font-family: "Times New Roman"; }span.BalloonTextChar { font-family: "Lucida Grande"; }span.HTMLPreformattedChar { font-family: Courier; }span.BalloonTextChar1 { font-family: "Lucida Grande"; }div.Section1 { page: Section1; }ol { margin-bottom: 0cm; }ul { margin-bottom: 0cm; }</style>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;">Dear QE users & developers, <span
style=""> </span></span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;"> </span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;">We are happy to announce that the first
beta GPU-enabled release of Quantum ESPRESSO (QE) has been
committed today in the official repository. </span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;"> </span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;">You can download the new version of the
code using the following command:<span style=""> </span></span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;"> </span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;">$ svn checkout
svn://scm.qe-forge.org/scmrepos/svn/q-e/branches/espresso-PRACE
<span style=""> </span></span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;"> </span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;">The Irish Centre for High-End Computing
(ICHEC, </span><a href="http://www.ichec.ie"><span
style="font-size: 10pt; font-family: Courier;">www.ichec.ie</span></a><span
style="font-size: 10pt; font-family: Courier;">) has been mainly
responsible for extending the QE suite to enhance calculations
on NVIDIA GPUs. The porting activity has been supported within
the PRACE 1st Implementation Phase project. It is currently
carried out through the Sub-task "Accelerator”, led by ICHEC,
within the Work-Package "Programming Techniques for
High-Performance Applications” in collaboration with CINECA.<span
style=""></span></span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;"> </span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;">The porting activity is concerned mainly
with the PWscf package. But ICHEC and the Irish QE user
community are interested in exploring any other initiatives
which come forward from QE users or developers interested in
porting on GPGPU architecture any of the QE suite related codes.<span
style=""> </span></span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;"> </span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;">We have successfully accelerated the
linear algebra part of the QE suite using a library called
phiGEMM, some explicit computational kernels (newd, addusdense,
vloc_psi) and the 3D FFT for the single CPU/GPU version. Both
linear algebra (matrix multiplication) and the FFT accelerated
version make use of CUDA libraries. The porting is mainly based
on wrappers that permit the use of libraries for accelerators.
The distributed 3D FFT version is still in progress, since this
porting requires important changes of the current structure of
the code and data distribution. While running the parallel and
distributed multi-GPUs version it still uses the original 3D FFT
implementations.</span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;"> </span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;">The phiGEMM library is distributed as an
independent open-source external package together with the
Quantum ESPRESSO suite. It aims to perform matrix multiplication
([SDZ]GEMM) taking advantage of the underlying BLAS kernel
functions on both CPU and NVIDIA CUDA-based GPU, mixing and
overlapping computation between the host (CPU) and the
accelerator (GPU). Whatever code makes intensive use of GEMM it
can derive a significant advantage linking this library when
running on a CPU/GPU hybrid system. </span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;"> </span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;">Even if the 3D FFT is accelerated only
for a single CPU process (not when using MPI), other parts are
already enabled to take advantage of a distributed parallel
hybrid system. All of this allows PWscf to potentially use
distributed message-passing parallelization (MPI) plus
multi-threading (OpenMP) plus accelerators (NVIDIA GPUs) all
together and produce good performance enhancement using the
latest version of NVIDIA GPUs (high performance double precision
is needed). This porting activity is still in progress,
especially the parallel 3D FFT component that represents a
bottleneck for large calculations. We have been testing this
beta release using some small/medium benchmarks used in the
DEISA official bench-suite and several GPU hardware (Tesla and
Fermi architectures). Special thanks goes to both E4 Computer
Engineering and CEA for providing access to hybrid GPU systems
with differing configurations to those available at ICHEC.</span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;"> </span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;">We look forward with interest to
receiving any suggestions for improvement, feedback or request
for collaboration by anyone who is interested to try and
validate PWscf CUDA version on different platforms using
different scientific cases.<span style=""> </span>For
additional information please contact <a
class="moz-txt-link-abbreviated" href="mailto:qe-gpu@ichec.ie">qe-gpu@ichec.ie</a>
or replay at this mail. We'll be shortly available a dedicated
forum <a href="http://qe-forge.org/mail/?group_id=10">q-e-gpgpu@qe-forge.org</a>.
Please use this list for bug report and any other issue related
to the use of the PWscf GPU version.<br>
<br>
</span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;">Although compilation of the GPU
implementation is fairly straight-forward, we kindly suggest
that users carefully read the README.GPU that is included. The
intrinsic characteristics of hybrid multi- and many-core systems
require careful consideration to best exploit the available
computing power.<span style=""><br>
</span></span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;"> </span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;">Any and all suggestions are welcome and
will be very much appreciated.<span style="">
</span></span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;"> </span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;">Ivan Girotto & Filippo Spiga</span></p>
<p class="MsoNormal" style=""><span style="font-size: 10pt;
font-family: Courier;"> </span></p>
<span style="font-size: 10pt; font-family: Courier;">--- <br>
<br>
ICHEC GPU developer team <br>
<br>
</span>
<pre class="moz-signature" cols="72">The Tower - 7th floor
Trinity Technology & Enterprise Campus
Grand Canal Quay - Dublin 2 - Ireland
+353-1-5241608 (ph) / +353-1-7645845 (fax)
</pre>
</body>
</html>