[QE-developers] QE GPU test results

Pietro Davide Delugas pdelugas at sissa.it
Wed Jun 21 12:04:09 CEST 2023


It should be possible.
If the test is very small, it can be a little difficult, though.
For a small test without many plane waves and bands, one needs to use as many pools as possible.
However, the program needs to run with 1 MPI rank per GPU. It is possible to oversubscribe the GPUs, but you need NVIDIA MPS on the node for this to be efficient.
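
As a rough sketch of the advice above, the helper below builds a pw.x command line with one MPI rank per GPU and as many k-point pools as the rank count allows. The `-nk` option is the pw.x flag for the number of pools; the `mpirun` launcher, the input file name `nscf.in`, and the function name `build_pw_command` are assumptions for illustration, and your cluster may need `srun` or another launcher instead.

```python
def build_pw_command(n_gpus, n_kpoints, input_file="nscf.in", ranks_per_gpu=1):
    """Return an argument list for launching pw.x with k-point pools.

    Hypothetical helper: launcher and file name are assumptions.
    ranks_per_gpu > 1 oversubscribes the GPUs, which per the message
    above is only efficient with NVIDIA MPS running on the node.
    """
    if n_gpus < 1 or n_kpoints < 1 or ranks_per_gpu < 1:
        raise ValueError("n_gpus, n_kpoints and ranks_per_gpu must be >= 1")

    n_ranks = n_gpus * ranks_per_gpu

    # The number of pools must divide the number of MPI ranks, and there is
    # no point in having more pools than k-points. Pick the largest such value.
    n_pools = max(
        p for p in range(1, n_ranks + 1)
        if n_ranks % p == 0 and p <= n_kpoints
    )

    return [
        "mpirun", "-np", str(n_ranks),
        "pw.x", "-nk", str(n_pools),
        "-i", input_file,
    ]


# Example: 4 GPUs, 1 rank per GPU, a k-mesh with 100 k-points.
print(" ".join(build_pw_command(n_gpus=4, n_kpoints=100)))
```

With many k-points and few ranks, this simply sets the pool count equal to the rank count, so each GPU works on its own subset of k-points.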



From: Małgorzata Wierzbowska <malwi45 at gmail.com>
Sent: Wednesday, June 21, 2023 10:00 AM
To: developers at lists.quantum-espresso.org
Subject: [QE-developers] QE GPU test results


 Dear QE Team,

we tested v.7.2 on the Athena computer with GPUs
https://www.cyfronet.pl/en/19073,artykul,athena.html

A simple nscf case with a large k-mesh, which runs in 37 min on 96 CPUs on Ares
https://www.cyfronet.pl/en/computers/18827,artykul,ares_supercomputer.html

gives the following results on Athena:

1-1-1-1: PWSCF : 1h 6m CPU 1h13m WALL

1-2-1-2: PWSCF : 1h 3m CPU 1h 8m WALL

1-4-1-4: PWSCF : 1h 1m CPU 1h 6m WALL

1-8-1-8: PWSCF : 1h12m CPU 1h17m WALL

where a configuration label such as 1-2-3-4 means:
1 node, 2 tasks (MPI processes), 3 cpus-per-task, 4 GPGPU cards
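
To make the comparison with the 37 min CPU run on Ares easier to read, here is a small sketch that parses the configuration labels and WALL times quoted above and prints each GPU run's wall time relative to that reference. The names `RunConfig`, `parse_config`, and `wall_minutes` are hypothetical helpers, not part of Quantum ESPRESSO.

```python
import re
from typing import NamedTuple


class RunConfig(NamedTuple):
    nodes: int
    tasks: int          # MPI processes
    cpus_per_task: int
    gpus: int


def parse_config(label):
    """Parse a label like '1-2-3-4' into nodes, tasks, cpus-per-task, GPUs."""
    nodes, tasks, cpus, gpus = (int(part) for part in label.split("-"))
    return RunConfig(nodes, tasks, cpus, gpus)


def wall_minutes(text):
    """Convert a PWSCF time string such as '1h13m' or '1h 8m' to minutes."""
    match = re.fullmatch(r"\s*(?:(\d+)h)?\s*(?:(\d+)m)?\s*", text)
    if match is None or not any(match.groups()):
        raise ValueError(f"unrecognised time string: {text!r}")
    hours, minutes = (int(g) if g else 0 for g in match.groups())
    return 60 * hours + minutes


# WALL times reported in this message for the Athena GPU runs.
results = {
    "1-1-1-1": "1h13m",
    "1-2-1-2": "1h 8m",
    "1-4-1-4": "1h 6m",
    "1-8-1-8": "1h17m",
}
reference_minutes = 37  # CPU run with 96 CPUs on Ares

for label, wall in results.items():
    cfg = parse_config(label)
    ratio = wall_minutes(wall) / reference_minutes
    print(f"{cfg.gpus} GPU(s), {cfg.tasks} rank(s): "
          f"{wall_minutes(wall)} min, {ratio:.2f}x the Ares time")
```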

Is it possible to improve on these timings?

With best regards,
Malgorzata

dr hab. Malgorzata Wierzbowska, prof. IHPP PAS

Institute of High Pressure Physics (Unipress)

Polish Academy of Sciences

Sokolowska 29/37, 01-142 Warsaw, Poland

email: malwi45 at gmail.com
