Thanks for your suggestion. I will try one of the examples as soon as 

Current status: the ifort-compiled pw.x and ph.x complete the job
normally. The NEC executables, however, pass a larger "nrxx" value: 22200
on NEC vs. 20736 with Intel, for nr1=24, nr2=24, nr3=36 (24*24*36 = 20736).
So on NEC some zero "zeta" values are passed to the dmxc_spin subroutine,
which leads to a "divide by zero" error at line 1192 of
Modules/functionals.f90. Interestingly, pw.x built with the sxcross
compiler and with ifort give the same scf results, while ph.x on NEC
does not work...
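
As a quick sanity check on the numbers above, here is a minimal Python
sketch (the variable names and the small padding search are my own
illustration, not taken from the QE source). It only checks the arithmetic:
how many array elements on NEC lie beyond the 24x24x36 grid, and whether
22200 can be written as a product of slightly padded dimensions. It does not
show what the NEC build actually does.

```python
from itertools import product

# Grid dimensions and array sizes reported in this post.
nr1, nr2, nr3 = 24, 24, 36
nrxx_intel = 20736
nrxx_nec = 22200

# Number of real grid points, and how many extra elements NEC allocates.
logical_points = nr1 * nr2 * nr3
extra = nrxx_nec - logical_points

# Hypothetical check: try padding each dimension by 0..max_pad and keep
# the combinations whose product equals the NEC value. The padding idea
# itself is an assumption, not something confirmed from the code.
max_pad = 2
matches = [
    (nr1 + a, nr2 + b, nr3 + c)
    for a, b, c in product(range(max_pad + 1), repeat=3)
    if (nr1 + a) * (nr2 + b) * (nr3 + c) == nrxx_nec
]

print("grid points      :", logical_points)
print("matches Intel    :", logical_points == nrxx_intel)
print("extra NEC entries:", extra)
print("padded candidates:", matches)
```

The Intel value is exactly nr1*nr2*nr3, and the NEC array has 1464
additional elements. 22200 equals 24*25*37 (or 25*24*37), so the number is
at least consistent with two of the dimensions being padded by one. If that
is the case, the extra elements would not correspond to real grid points,
which would fit with the zero "zeta" values, but I have not verified this.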

Any suggestion is welcome, e.g. compiler options, preprocessor flags...

