[QE-users] MPI in qe-6.5

Vinson, John T. (Fed) john.vinson at nist.gov
Fri Dec 20 17:47:48 CET 2019


Hi Ian,

I was able to get this crash when I compiled w/o scalapack (nor elpa, etc.). Assuming you’re also running w/o scalapack, try editing LAXlib/ptoolkit.f90. Starting at line 4212 you should have:

    ! split communicator is present and must be freed on all processors
    CALL mpi_comm_free( col_comm, ierr )
    IF( ierr /= 0 ) &

Change it to:

    ! split communicator is present and must be freed on all processors
    IF( col_comm /= MPI_COMM_NULL ) THEN
      CALL mpi_comm_free( col_comm, ierr )
      IF( ierr /= 0 ) &
            CALL lax_error__( " pdtrtri ", " in mpi_comm_free 25 ", ABS( ierr ) )
    END IF

And see if that fixes it for you.

John

From: users <users-bounces at lists.quantum-espresso.org> on behalf of Ian Shuttleworth <shuttleworth.ian at gmail.com>
Reply-To: Quantum ESPRESSO users Forum <users at lists.quantum-espresso.org>
Date: Friday, December 20, 2019 at 10:59 AM
To: Quantum ESPRESSO users Forum <users at lists.quantum-espresso.org>
Subject: Re: [QE-users] MPI in qe-6.5

I've attached "test.in<https://gcc01.safelinks.protection.outlook.com/?url=http%3A%2F%2Ftest.in&data=02%7C01%7Cjohn.vinson%40nist.gov%7Cc07d9fa036e5429e021908d785659d51%7C2ab5d82fd8fa4797a93e054655c61dec%7C1%7C1%7C637124543805045449&sdata=6MhdnfIg9tQ2YmKTC6bye%2BaAaGnYStkcSCZ3flJgRrc%3D&reserved=0>" the input file - the pseudos just come directly from the QE web-site

With thanks

Ian

On Fri, Dec 20, 2019 at 3:47 PM Pietro Delugas <pdelugas at sissa.it<mailto:pdelugas at sissa.it>> wrote:

Dear Ian

I just compiled pw with gcc-4.8.5 and openmpi 1.10.7 and the test-suite tests are passing, so either it is a problem in someway related to your input, to some issue related to the way you compiled the program or to some feature of you system other than the compiler and the mpi library.

kind regards - Pietro

Could you send you input
On 20/12/19 16:23, Ian Shuttleworth wrote:
Dear all

I encounter MPI communication errors when running 6.5 compiled both with gcc-4.8.5 and openMPI 1.10.7, and also - on another HPC - compiled with gcc 6.4.0 and open  OpenMPI 2.1.2.


The code starts correctly with statements:

     This program is part of the open-source Quantum ESPRESSO suite
     for quantum simulation of materials; please cite
         "P. Giannozzi et al., J. Phys.:Condens. Matter 21 395502 (2009);
         "P. Giannozzi et al., J. Phys.:Condens. Matter 29 465901 (2017);
          URL http://www.quantum-espresso.org<https://gcc01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.quantum-espresso.org&data=02%7C01%7Cjohn.vinson%40nist.gov%7Cc07d9fa036e5429e021908d785659d51%7C2ab5d82fd8fa4797a93e054655c61dec%7C1%7C1%7C637124543805055443&sdata=0gEwitsOMxujynLItOAJe4Q8n3AUD97ZuXiEzjrMvd0%3D&reserved=0>",
     in publications or presentations arising from this work. More details at
     http://www.quantum-espresso.org/quote<https://gcc01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.quantum-espresso.org%2Fquote&data=02%7C01%7Cjohn.vinson%40nist.gov%7Cc07d9fa036e5429e021908d785659d51%7C2ab5d82fd8fa4797a93e054655c61dec%7C1%7C1%7C637124543805055443&sdata=Uxu67hdAKo%2Fh2TGa%2FXA9EWLXx7v0QDFTfV07marSZX8%3D&reserved=0>
     Parallel version (MPI), running on    16 processors



The execution stops with the statement:

     Starting wfcs are  192 randomized atomic wfcs




and the following error messages then appear:

[node44:20988] *** An error occurred in MPI_Comm_free
[node44:20988] *** reported by process [140653000785921,140724603453442]
[node44:20988] *** on communicator MPI_COMM_WORLD
[node44:20988] *** MPI_ERR_COMM: invalid communicator
[node44:20988] *** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
[node44:20988] ***    and potentially your MPI job)



Compiling 6.4.1 using exactly the same gcc/openmpi's doesn't produce the same problem, and the execution in fact completes without error. So my question is: what differences are there in the MPI between 6.5 and 6.4.1, and are there any 'tweaks' that could be applied to the compile script to remove the problems I'm seeing with 6.5?

With thanks

Ian Shuttleworth
(Nottingham Trent University)




_______________________________________________

Quantum ESPRESSO is supported by MaX (www.max-centre.eu/quantum-espresso<https://gcc01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.max-centre.eu%2Fquantum-espresso&data=02%7C01%7Cjohn.vinson%40nist.gov%7Cc07d9fa036e5429e021908d785659d51%7C2ab5d82fd8fa4797a93e054655c61dec%7C1%7C1%7C637124543805065437&sdata=1RmTscxQZflZOy0rA2mt1B%2FuDIIYt79fUX0ZN5AnOxs%3D&reserved=0>)

users mailing list users at lists.quantum-espresso.org<mailto:users at lists.quantum-espresso.org>

https://lists.quantum-espresso.org/mailman/listinfo/users<https://gcc01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.quantum-espresso.org%2Fmailman%2Flistinfo%2Fusers&data=02%7C01%7Cjohn.vinson%40nist.gov%7Cc07d9fa036e5429e021908d785659d51%7C2ab5d82fd8fa4797a93e054655c61dec%7C1%7C1%7C637124543805065437&sdata=Byr38ph91oRMhfIUZkMGB0vosBDrKuqtZb%2Fc5CN%2F%2FXY%3D&reserved=0>
_______________________________________________
Quantum ESPRESSO is supported by MaX (www.max-centre.eu/quantum-espresso<https://gcc01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.max-centre.eu%2Fquantum-espresso&data=02%7C01%7Cjohn.vinson%40nist.gov%7Cc07d9fa036e5429e021908d785659d51%7C2ab5d82fd8fa4797a93e054655c61dec%7C1%7C1%7C637124543805075434&sdata=%2BhVcfRsTvDVMzSRQwthUjUk7bTu1UGhA3WmxnnSJ%2Bh4%3D&reserved=0>)
users mailing list users at lists.quantum-espresso.org<mailto:users at lists.quantum-espresso.org>
https://lists.quantum-espresso.org/mailman/listinfo/users<https://gcc01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.quantum-espresso.org%2Fmailman%2Flistinfo%2Fusers&data=02%7C01%7Cjohn.vinson%40nist.gov%7Cc07d9fa036e5429e021908d785659d51%7C2ab5d82fd8fa4797a93e054655c61dec%7C1%7C1%7C637124543805075434&sdata=L5bIozKD3Aqs8Z1Y%2B7y2hBlZxLBMDIqCxANwcJZaONI%3D&reserved=0>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.quantum-espresso.org/pipermail/users/attachments/20191220/f6bcc1b3/attachment.html>


More information about the users mailing list