<div dir="ltr">I don't have an answer, but we have seen the same error message which is associated with an intermittent segmentation fault which we think may be associated with the interaction of QE and openmpi 3.1.0 and openmpi 3.1.1 compiled with the intel 2018 compiler on our Red Hat
RHEL6u9 cluster. The error happens less frequently when we use openmpi 2.1.0. In our case the error channel prints the following:<div><br></div><div><div>forrtl: severe (174): SIGSEGV, segmentation fault occurred</div><div>Image PC Routine Line Source </div><div>ph.x 0000000000D99A1D for__signal_handl Unknown Unknown</div><div>libpthread-2.12.s 0000003271E0F7E0 Unknown Unknown Unknown</div><div>mca_btl_vader.so 00002AB74BBB99A7 Unknown Unknown Unknown</div><div>libopen-pal.so.40 00002AB738AD3A54 opal_progress Unknown Unknown</div><div>libmpi.so.40.10.1 00002AB7384DBC04 ompi_request_defa Unknown Unknown</div><div>libmpi.so.40.10.1 00002AB7385384C5 ompi_coll_base_ba Unknown Unknown</div><div>libmpi.so.40.10.1 00002AB7384F26F1 MPI_Barrier Unknown Unknown</div><div>libmpi_mpifh.so.4 00002AB73826D013 MPI_Barrier_f08 Unknown Unknown</div><div>ph.x 0000000000BA9E0E Unknown Unknown Unknown</div><div>ph.x 0000000000B9835B Unknown Unknown Unknown</div><div>ph.x 000000000057FE26 Unknown Unknown Unknown</div><div>ph.x 00000000004BE229 Unknown Unknown Unknown</div><div>ph.x 00000000004A0F10 Unknown Unknown Unknown</div><div>ph.x 0000000000415A65 Unknown Unknown Unknown</div><div>ph.x 000000000040EE73 Unknown Unknown Unknown</div><div>ph.x 000000000040EDDE Unknown Unknown Unknown</div><div><a href="http://libc-2.12.so">libc-2.12.so</a> 000000327161ED1D __libc_start_main Unknown Unknown</div><div>ph.x 000000000040ECE9 Unknown Unknown Unknown</div><div>--------------------------------------------------------------------------</div><div>mpirun detected that one or more processes exited with non-zero status, thus causing</div><div>the job to be terminated. The first process to do so was:</div><div><br></div><div> Process name: [[24484,1],12]</div><div> Exit code: 174</div><div>--------------------------------------------------------------------------</div><div>It seems to be a big mystery. Sincerely, Natalie Holzwarth</div><div><br><div> <br clear="all"><div><div dir="ltr" class="gmail-m_-8527648235566977528m_2461529398462829279gmail_signature">N. A. W. Holzwarth email: <a href="mailto:natalie@wfu.edu" target="_blank">natalie@wfu.edu</a><div>Department of Physics web: <a href="http://www.wfu.edu/~natalie" target="_blank">http://www.wfu.edu/~natalie</a></div><div>Wake Forest University phone: 1-336-758-5510 </div><div>Winston-Salem, NC 27109 USA office: Rm. 300 Olin Physical Lab</div></div></div><br></div></div></div></div><br><div class="gmail_quote"><div dir="ltr">On Sun, Aug 12, 2018 at 3:06 PM Sina Malakpour <<a href="mailto:sina.malakpour@gmail.com" target="_blank">sina.malakpour@gmail.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">Dear all,<div><br></div><div>Recently, I am working on linear response method implemented in QE to do the phonon calculations for a structure. I apply different isotropic strain to the optimized structure and then I run the phonon computations through these steps:</div><div><br></div><div>1. scf run</div><div>2. ph run</div><div>3. q2r run</div><div><br></div><div>For all strains, the scf run is ok, but for some strains, after ph run I get this message at the end of the output file:</div><div><br></div><div><div>-------------------------------------------------------</div><div>Primary job terminated normally, but 1 process returned</div><div>a non-zero exit code.. Per user-direction, the job has been aborted.</div><div>-------------------------------------------------------</div><div>--------------------------------------------------------------------------</div><div>mpirun detected that one or more processes exited with non-zero status, thus causing</div><div>the job to be terminated. The first process to do so was:</div><div><br></div><div> Process name: [[47696,1],6]</div><div> Exit code: 28</div><div>--------------------------------------------------------------------------</div><div><br></div><div>and So, the file of force constants would not be generated. I really appreciate it if you guide me through this and let me know what to do to fix the problme?</div><div><br></div><div>Thanks,</div><div>Sina </div><div><div class="m_-8527648235566977528m_2461529398462829279m_509883546116463105gmail_signature"><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div><div><div><div><div><br>Sina Malakpour Estalaki<br></div></div>PhD student <br></div></div>Department of Aerospace and Mechanical Engineering<br></div>University of Notre Dame</div><div>Notre Dame, IN, US</div><br></div></div></div></div></div></div></div></div></div>
</div></div>
_______________________________________________<br>
users mailing list<br>
<a href="mailto:users@lists.quantum-espresso.org" target="_blank">users@lists.quantum-espresso.org</a><br>
<a href="https://lists.quantum-espresso.org/mailman/listinfo/users" rel="noreferrer" target="_blank">https://lists.quantum-espresso.org/mailman/listinfo/users</a></blockquote></div>