[Pw_forum] parallel version error in quad core Xeon

Paolo Giannozzi giannozz at democritos.it
Thu Apr 29 12:55:07 CEST 2010


TAE BUM LEE wrote:

> Does anybody know the reason of following error?

> rank 3 in job 37  Lynx_60167   caused collective abort of all ranks
>   exit status of rank 3: killed by signal 9

http://www.quantum-espresso.org/user_guide/node48.html

or, updated version:

\subsection{pw.x crashes in parallel execution with an obscure message
   related to MPI errors}
Random crashes due to MPI errors have often been reported, typically
in Linux PC clusters. We cannot rule out the possibility that bugs in
\qe\ cause such behavior, but we are quite confident that
the most likely explanation is a hardware problem (defective RAM
for instance) or a software bug (in MPI libraries, compiler, operating
system).
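
A note on the quoted message itself: on Linux, signal 9 is SIGKILL, which
means rank 3 was killed from outside the process (for instance by the
kernel's out-of-memory killer or by the job launcher) rather than stopping
on an error trapped inside pw.x. The following is only a minimal sketch for
decoding such a number, not part of the user guide; the helper name
describe_signal is made up for illustration.

```python
import signal


def describe_signal(signum: int) -> str:
    """Return the symbolic name of a signal number on this platform."""
    try:
        # signal.Signals is the standard-library enum of known signals.
        return signal.Signals(signum).name
    except ValueError:
        # The number is not a defined signal on this platform.
        return f"unknown signal {signum}"


if __name__ == "__main__":
    # Signal number taken from "killed by signal 9" in the report above.
    print(describe_signal(9))
```

If the name comes back as SIGKILL, checking the kernel log on the node that
hosted the killed rank for out-of-memory messages is a reasonable first step.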

P.
-- 
Paolo Giannozzi, Democritos and University of Udine, Italy


