<div dir="ltr"><div>Dear all,</div><div><br></div><div>I hope this is the right place to ask this question.</div><div><br></div><div>At some point this year (I don't know exactly when), pw calculations run on many cores started to stop with this error message:</div>
<div><br></div><div><div> %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%</div><div> Error in routine blk2cyc_redist (1):</div><div> nb less than the number of proc</div><div> %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%</div>
<div><br></div><div>The error usually happens when the number of cores is larger than the number of bands. Older versions of the code do not have this problem.</div><div><br></div><div>Looking into the code, the meaning of nb seems misleading, because the error is triggered by this statement:</div>
<div><br></div><div><div style="font-family:arial,sans-serif;font-size:13.333333969116211px"> IF( desc%n < nproc ) &</div><div style="font-family:arial,sans-serif;font-size:13.333333969116211px"> CALL errore( ' cyc2blk_redist ', ' nb less than the number of proc ', 1 )</div>
<div style="font-family:arial,sans-serif;font-size:13.333333969116211px"><br></div><div style="font-family:arial,sans-serif;font-size:13.333333969116211px">However, in this part of the code nb is defined as:</div><div style="font-family:arial,sans-serif;font-size:13.333333969116211px">
nb = desc%nrcx ! leading dimension of the local matrix block<br></div><div style="font-family:arial,sans-serif;font-size:13.333333969116211px"><br></div><div style="font-family:arial,sans-serif;font-size:13.333333969116211px">
On the other hand, this error happens when the number of processors is larger than the number of bands. </div><div style="font-family:arial,sans-serif;font-size:13.333333969116211px"><br></div><div style="font-family:arial,sans-serif;font-size:13.333333969116211px">
So what does nb actually mean here? After all, I really don't understand why the code should stop when I am using 32 cores to calculate a system with 6 bands. The plane-wave parallelization should not care about how many bands there are in total. </div>
<div style="font-family:arial,sans-serif;font-size:13.333333969116211px"><br></div><div style="font-family:arial,sans-serif;font-size:13.333333969116211px">Could anyone comment on this? Thank you very much. I have not attached an input file, because the error is easy to reproduce by calculating any small molecule with many cores on a parallel machine. </div>
</div></div><div><br></div><div>===================<br>Ge Xiaochuan(Giovanni)<br>4th year PHD Student<br>Condensed Matter<br>SISSA,Italy<br>===================</div>
</div>