<div dir="ltr"><div dir="ltr"><div>Dear all,</div><div> Today i tried to run a quantum espresso code in parallel execution via the command line as,</div><div> " mpirun -np 32 '/home/mainak/Desktop/qe-6.5/bin/pw.x' -npool 4 - bgrp 4 -ndiag 36 <<a href="http://pha_nscf.in" target="_blank">pha_nscf.in</a>| tee pha_nscf.out "</div><div> the run was completed and the results are correct but in terminal it shows,</div><div><br></div><div><br></div><div>" PWSCF : 41m 7.66s CPU 7h22m WALL<br><br><br> This run was terminated on: 20:59:32 25Dec2020 <br><br>=------------------------------------------------------------------------------=<br> JOB DONE.<br>=------------------------------------------------------------------------------=<br>[mainak-linpc:26225] [[38684,1],0]-[[38684,0],0] mca_oob_tcp_msg_recv: readv failed: Connection timed out (110)<br>[mainak-linpc:26227] [[38684,1],2]-[[38684,0],0] mca_oob_tcp_msg_recv: readv failed: Connection timed out (110)<br>[mainak-linpc:26229] [[38684,1],4]-[[38684,0],0] mca_oob_tcp_msg_recv: readv failed: Connection timed out (110)<br>--------------------------------------------------------------------------<br>mpirun has exited due to process rank 2 with PID 26227 on<br>node mainak-linpc exiting improperly. There are two reasons this could occur:<br><br>1. this process did not call "init" before exiting, but others in<br>the job did. This can cause a job to hang indefinitely while it waits<br>for all processes to call "init". By rule, if one process calls "init",<br>then ALL processes must call "init" prior to termination.<br><br>2. this process called "init", but exited without calling "finalize".<br>By rule, all processes that call "init" MUST call "finalize" prior to<br>exiting or it will be considered an "abnormal termination"<br><br>This may have caused other processes in the application to be<br>terminated by signals sent by mpirun (as reported here).<br>-------------------------------------------------------------------------- "</div><div> <br></div><div><br></div><div> As you can see in the last line,</div><div><br></div><div>" PWSCF : 41m 7.66s CPU 7h22m WALL "</div><div>
I, personally unable to find out what went wrong ! Advices will be
very much helpful to me. For concern, i would like to tell that
my cpu has 4 physical cores. Here in the following i am also attaching
the '.out' file.</div><div> <br></div><div> Thanks in advance.</div><div> Mainak Ghosh</div><div> University of Calcutta, India<br></div></div></div>