[QE-users] [QE-GPU] computing force and stress - time cost

Iurii TIMROV iurii.timrov at epfl.ch
Mon Jan 4 12:06:50 CET 2021


Dear Mohammad,


> 2. You are using DFT+U, more precisely
>     U_projection_type = 'ortho-atomic'
> The GPU acceleration of this functionality is limited and some portions
> of the algorithm will still run on the CPU. In addition, I fear that the
> evaluation of forces with that projection method scales pretty bad with
> the number of atoms, but I let the experts (and developers) of this new
> functionality further comment this last point.


Yes, indeed, in DFT+U with the 'ortho-atomic' Hubbard manifold the calculations of Hubbard forces and stress take much more time than with the 'atomic' Hubbard manifold. See Phys. Rev. B 102, 235159 (2020), in particular see Appendix C.


As Pietro said, the ortho-atomic Hubbard forces and stress were ported to the GPU version of QE but this is still not optimal and can be improved in the future.


Greetings,

Iurii


--
Dr. Iurii TIMROV
Postdoctoral Researcher
STI - IMX - THEOS and NCCR - MARVEL
Swiss Federal Institute of Technology Lausanne (EPFL)
CH-1015 Lausanne, Switzerland
+41 21 69 34 881
http://people.epfl.ch/265334
________________________________
From: users <users-bounces at lists.quantum-espresso.org> on behalf of Pietro Bonfa' <pietro.bonfa at unipr.it>
Sent: Monday, January 4, 2021 11:52:31 AM
To: users at lists.quantum-espresso.org
Subject: Re: [QE-users] [QE-GPU] computing force and stress - time cost

Dear Mohammad,

Paolo is right, but a couple of comments can already be made:

1. you are not using OpenMP parallelism, and I believe you have more
than 2 cores in your system. In order to achieve a decent speedup (at
least in the SCF) it's mandatory to enable openmp and exploit the whole
CPU power.

2. You are using DFT+U, more precisely

     U_projection_type = 'ortho-atomic'

The GPU acceleration of this functionality is limited and some portions
of the algorithm will still run on the CPU. In addition, I fear that the
evaluation of forces with that projection method scales pretty bad with
the number of atoms, but I let the experts (and developers) of this new
functionality further comment this last point.

Best regards and happy new year,
Pietro




On 1/4/21 10:07 AM, Paolo Giannozzi wrote:
> The most important piece of information (the final time report) is not
> contained in your 45Mb output.
>
> Paolo
>
> On Mon, Jan 4, 2021 at 6:33 AM Mohammad Moaddeli
> <mohammad.moaddeli at gmail.com <mailto:mohammad.moaddeli at gmail.com>> wrote:
>
>     Dear Pietro,
>
>     It takes about 22 hours to perform an scf (about 2 hours to perform
>     diagonalization until convergence is achieved, and about 20 hours to
>     compute force and stress).
>     Here is the google drive link containing input and output files:
>     https://drive.google.com/file/d/1DFtLqFvrc8CFo1_q_jjnMFErvXpWEAHB/view?usp=sharing
>     <https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fdrive.google.com%2Ffile%2Fd%2F1DFtLqFvrc8CFo1_q_jjnMFErvXpWEAHB%2Fview%3Fusp%3Dsharing&data=04%7C01%7Cpietro.bonfa%40unipr.it%7Ce34d20b789e54091c08008d8b0906452%7Cbb064bc5b7a841ecbabed7beb3faeb1c%7C0%7C0%7C637453483257890985%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&sdata=s5PSyKLXmuHXgzPSj9xcsCqPXt%2BmKPw9mj%2BZACc9sSk%3D&reserved=0>
>
>     hp.x will be performed after vc-relax is done.
>
>     Thanks in advance,
>     Mohammad
>
>     On Mon, Jan 4, 2021 at 2:38 AM Pietro Bonfa' <pietro.bonfa at unipr.it
>     <mailto:pietro.bonfa at unipr.it>> wrote:
>
>         Dear Mohammad,
>
>         the performance of the GPU code depends dramatically on the
>         portions of
>         computation that are still performed on the CPU. Only a portion
>         of all
>         contributions to forces have been accelerated, and what is left
>         out may
>         be optimized for MPI parallelism rather than openmp.
>
>         That being said, the behavior that you report is definitively
>         unusual.
>         Would you mind sharing input and output files?
>
>         Best regards,
>         Pietro
>
>
>
>         On 1/3/21 8:52 AM, Mohammad Moaddeli wrote:
>          > Dear all,
>          >
>          > GPU enabled QE v.6.7 is compiled on a VOLTA card. I am trying
>         to run a
>          > vc-relax for a bulk containing 48 atoms. Although
>         diagonalization
>          > (davidson) is about 3x faster than CPU, it takes a lot of
>         time (a couple
>          > of hours) to compute force and stress. Is this something
>         related to the
>          > code itself?
>          >
>          > Best,
>          >
>          > Mohammad Moaddeli
>          > ShirazU
>          >
>          > _______________________________________________
>          > Quantum ESPRESSO is supported by MaX (<http://>www.max-centre.eu
>         <https://eur01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.max-centre.eu%2F&data=04%7C01%7Cpietro.bonfa%40unipr.it%7Ce34d20b789e54091c08008d8b0906452%7Cbb064bc5b7a841ecbabed7beb3faeb1c%7C0%7C0%7C637453483257900986%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&sdata=ayiwKOVdIHx6hQ4VpfrjS1rTXOEiHF3ZWgGTRVbawCo%3D&reserved=0>)
>          > users mailing list users at lists.quantum-espresso.org
>         <mailto:users at lists.quantum-espresso.org>
>          > https://lists.quantum-espresso.org/mailman/listinfo/users
>         <https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.quantum-espresso.org%2Fmailman%2Flistinfo%2Fusers&data=04%7C01%7Cpietro.bonfa%40unipr.it%7Ce34d20b789e54091c08008d8b0906452%7Cbb064bc5b7a841ecbabed7beb3faeb1c%7C0%7C0%7C637453483257900986%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&sdata=nnTzluNl%2FpWGpQ3HPy0Zm%2Bjvt4qUlNIW5mNyncKQWoU%3D&reserved=0>
>          >
>         _______________________________________________
>         Quantum ESPRESSO is supported by MaX (<http://>www.max-centre.eu
>         <https://eur01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.max-centre.eu%2F&data=04%7C01%7Cpietro.bonfa%40unipr.it%7Ce34d20b789e54091c08008d8b0906452%7Cbb064bc5b7a841ecbabed7beb3faeb1c%7C0%7C0%7C637453483257910979%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&sdata=VDmrELj9IT2pHITCuC6xEAj%2Bc9PJuhhij7noi0qPqsM%3D&reserved=0>)
>         users mailing list users at lists.quantum-espresso.org
>         <mailto:users at lists.quantum-espresso.org>
>         https://lists.quantum-espresso.org/mailman/listinfo/users
>         <https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.quantum-espresso.org%2Fmailman%2Flistinfo%2Fusers&data=04%7C01%7Cpietro.bonfa%40unipr.it%7Ce34d20b789e54091c08008d8b0906452%7Cbb064bc5b7a841ecbabed7beb3faeb1c%7C0%7C0%7C637453483257920970%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&sdata=IBKLu4F2OC%2BLtjc0eeKxBt1m1dMEzLizqXZ1EbH3%2FSI%3D&reserved=0>
>
>     _______________________________________________
>     Quantum ESPRESSO is supported by MaX (<http://>www.max-centre.eu
>     <https://eur01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.max-centre.eu%2F&data=04%7C01%7Cpietro.bonfa%40unipr.it%7Ce34d20b789e54091c08008d8b0906452%7Cbb064bc5b7a841ecbabed7beb3faeb1c%7C0%7C0%7C637453483257920970%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&sdata=OYY1he5ImqA5ypHVCP0oxQLCMVoqVa3hXbngEJRmZhw%3D&reserved=0>)
>     users mailing list users at lists.quantum-espresso.org
>     <mailto:users at lists.quantum-espresso.org>
>     https://lists.quantum-espresso.org/mailman/listinfo/users
>     <https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.quantum-espresso.org%2Fmailman%2Flistinfo%2Fusers&data=04%7C01%7Cpietro.bonfa%40unipr.it%7Ce34d20b789e54091c08008d8b0906452%7Cbb064bc5b7a841ecbabed7beb3faeb1c%7C0%7C0%7C637453483257930965%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&sdata=TliW5iLzHDjHe1J2j51oBqQAi2iQ8VIb12QQesVXmZA%3D&reserved=0>
>
>
>
> --
> Paolo Giannozzi, Dip. Scienze Matematiche Informatiche e Fisiche,
> Univ. Udine, via delle Scienze 208, 33100 Udine, Italy
> Phone +39-0432-558216, fax +39-0432-558222
>
>
> _______________________________________________
> Quantum ESPRESSO is supported by MaX (www.max-centre.eu<http://www.max-centre.eu>)
> users mailing list users at lists.quantum-espresso.org
> https://lists.quantum-espresso.org/mailman/listinfo/users
>
_______________________________________________
Quantum ESPRESSO is supported by MaX (www.max-centre.eu<http://www.max-centre.eu>)
users mailing list users at lists.quantum-espresso.org
https://lists.quantum-espresso.org/mailman/listinfo/users
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.quantum-espresso.org/pipermail/users/attachments/20210104/499c6838/attachment.html>


More information about the users mailing list