[Pw_forum] Problem using SGE
Mahmoud Payami
mpayami at aeoi.org.ir
Tue Feb 21 09:55:53 CET 2012
Dear QE users,
I am using sge for running a parallel job.
My "file.qsub" contains the following lines:
--
#!/bin/bash
#
#$ -cwd
#$ -j y
#$ -S /bin/bash
/opt/openmpi/bin/mpirun /opt/qe/bin/pw.x -npool 2 -ndiag 16 < /home/mahmoud/file.in
--
Then I use the orte parallel env and use the command:
qsub -V -pe orte 32 file.qsub
Everything is ok until the first david diagonalization during which the load on some nodes increases the number of processors (that is, the node has totally 8 cores but the load shows at the crash time to be more than 16) , and then those nodes hangup.
Any comment is highly appreciated.
Best regards,
Mahmoud Payami
--------------------------------
Mahmoud Payami
Physics Group, AEOI,
Tehran-Iran
Email: mpayami at aeoi.org.ir
Phone: +98 (0) 21 82064393
----------------------------------------------
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.quantum-espresso.org/pipermail/users/attachments/20120221/60732863/attachment.html>
More information about the users
mailing list