[Pw_forum] parallel version error in quad core Xeon
Paolo Giannozzi
giannozz at democritos.it
Thu Apr 29 12:55:07 CEST 2010
TAE BUM LEE wrote:
> Does anybody know the reason of following error?
> rank 3 in job 37 Lynx_60167 caused collective abort of all ranks
> exit status of rank 3: killed by signal 9
http://www.quantum-espresso.org/user_guide/node48.html
or, updated version:
\subsection{pw.x crashes in parallel execution with an obscure message
related to MPI errors}
Random crashes due to MPI errors have often been reported, typically
in Linux PC clusters. We cannot rule out the possibility that bugs in
\qe\ cause such behavior, but we are quite confident that
the most likely explanation is a hardware problem (defective RAM
for instance) or a software bug (in MPI libraries, compiler, operating
system)
P.
--
Paolo Giannozzi, Democritos and University of Udine, Italy
More information about the users
mailing list