[Pw_forum] PHONON errors varies when i use 6 or 2 cpu?
Xunlei Ding
ding at sissa.it
Fri Jun 8 08:27:03 CEST 2007
Dear Xu,
I think,
error for 6 cpu calculation is just because one of the six nodes is down,
and error for 4 cpu calculation is because you change 6 cpu to 4 cpu.
So my suggestion is, doing the ph calculation with 6 cpu again.
Hope it will works.
Yours,
ding
xu yuehua wrote:
> hi everyone?
> today i met a problem when i compute phonon :first i do scf using 6
> cpu ,then i also use 6 cpu to do phono at G,BUT a problem came out in
> out.file :
>
>
>
> Proc/ planes cols G planes cols G columns G
> Pool (dense grid) (smooth grid) (wavefct grid)
> 1 5 3284 53988 4 2408 34052 719 5577
> 2 4 3283 53987 4 2407 34051 719 5577
> 3 4 3283 53987 4 2407 34049 719 5577
> 4 4 3283 53987 4 2407 34051 719 5577
> 5 4 3283 53987 4 2407 34049 719 5577
> 6 4 3283 53987 4 2407 34051 720 5576
> 0 25 19699 323923 24 14443 204303 4315 33461
>
>
> nbndx = 20 nbnd = 20 natomwfc = 30 npwx = 4282
> nelec = 40.00 nkb = 50 ngl = 10269
> p0_9381: p4_error: net_recv read: probable EOF on socket: 1
> Killed by signal 2.^M
> forrtl: error (69): process interrupted (SIGINT)
> Killed by signal 2.^M
> Killed by signal 2.^M
> Killed by signal 2.^M
> Killed by signal 2.^M
> p0_9381: (12.363281) net_send: could not write to fd=4, errno = 32
> Fri Jun 8 09:41:35 CST 2007
>
> because i do not know the reason .and then i try to use 4 cpu to
> compute phono ,this time the error is like this :
>
>
>
>
> Representation 44 1 modes - To be done
>
> Representation 45 1 modes - To be done
> IOS = 36
>
> %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
> from davcio : error # 20
> i/o error in davcio
> %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
>
>
> stopping ...
>
> %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
> from davcio : error # 20
> i/o error in davcio
> %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
>
>
> stopping ...
> [0] MPI Abort by user Aborting program !
> [0] Aborting program!
> p0_11006: p4_error: : 0
> Killed by signal 2.^M
> forrtl: error (69): process interrupted (SIGINT)
> p0_11006: (18.296875 ) net_send: could not write to fd=4, errno = 32
> Fri Jun 8 09:57:22 CST 2007
>
> above two case ,the same input:
> phonons of fiveringwater at Gamma
> &inputph
> tr2_ph=1.0d-14,
> prefix='fxx_specify_ibra_500_12+force',
> epsil=.true.,
> amass(1)=1.0,
> amass(2)=15.999,
> outdir='/raid/xx/pwscf/tmp/',
> fildyn='fxx.dynG',
> /
> 0.0 0.0 0.0
>
>
>
>
>
> so my question is why different number of cpu can change the error ?
> befor a few days ago ,i use 2 cpu to do relax ,scf and phonon about
> another case ,there was well ,but now .....?
> i need your help .thanks
>
> --
> Xu Yuehua
> physics Department of Nanjing university
> China
More information about the users
mailing list