[Pw_forum] Energy variations in noncolin-constrain_total.in with OpenMP & MKL

stefano de gironcoli degironc at sissa.it
Tue Jan 26 19:29:56 CET 2016


what was the conv_thr in this example ?
the default is 1.d-6 which is the range of variations you are reporting.
If so, could you try with a tighter threshold ?
stefano

On 26/01/2016 17:55, Paolo Giannozzi wrote:
> Very interesting, and excellent questions, for which unfortunately I 
> have no clear answer (nor has anybody else, I am afraid).
>
> One should obtain the same numbers - within the errors due to 
> roundoff, though -  in serial, OpenMP, MPI execution, and on different 
> machines, and with different compilers and mathematical libraries. In 
> practice, there are invariably small differences, that sometimes do 
> not completely disappear even if one pushes convergence thresholds to 
> very strict limits.  In addition to noncolin-constrain_total.in 
> <http://noncolin-constrain_total.in>, another notable offender is 
> vdw-ts.in <http://vdw-ts.in>.
>
> This could signal a small bug, but in my experience, most of those 
> cases can be linked to specific optimized mathematical libraries or 
> compiler versions. As long as we can blame somebody else, not  a big 
> problem :-)
>
> Paolo
>
> On Tue, Jan 26, 2016 at 12:23 PM, Nick Wilson 
> <nw.qeforge.5211 at family-wilson.me.uk 
> <mailto:nw.qeforge.5211 at family-wilson.me.uk>> wrote:
>
>     Hi,
>
>     I’ve been testing the OpenMP build of Quantum Espresso 5.3.0  on
>     our system using the Intel compiler and MKL and have a question
>     about variation of energy with the number of OpenMP threads used.
>
>     I ran all the plane wave tests in the test-suite directory using
>     between 1 and 16 OpenMP threads and they all gave consistent
>     results apart from pw_noncolin/noncolin-constrain_total.in
>     <http://noncolin-constrain_total.in> which showed variation in 
>     between -55.54478325 Ry and -55.54478414 Ry.
>
>     I ran the test through the Intel Inspector tool but that didn’t
>     show up any threading deadlocks or data races.
>
>     I dropped the compiler optimisation to -O0 and added the
>     “-fp-model strict” and “-fp-model source” compiler flags but that
>     had no effect.
>
>     I tried using some of the relevant environment variables
>     (KMP_DETERMINISTIC_REDUCTION=1 and MKL_CBWR=COMPATIBLE) which also
>     had no effect.
>
>     Changing to use the internal BLAS library resolved the issue so it
>     looks to be MKL-related. It’s present with both the GNU and Intel
>     compilers.
>
>     I dropped back to an earlier version of MKL but the effect was
>     still present.
>
>     As it was thread-related I tried linking against the sequential
>     version of MKL but that didn’t help.
>
>     So, I guess my questions are:
>     Should the results always be invariant of the number of OpenMP
>     threads?
>     Is there anything unique about the noncolin-constrain_total.in
>     <http://noncolin-constrain_total.in> test case which would cause
>     it to behave differently to the rest of the tests?
>
>     Best regards,
>     Nick Wilson
>
>
>     System details:
>       Intel Sandy Bridge E5-2650 CPU
>       CentOS Linux release 7.2.1511
>       MKL from Intel compiler 16.0.0
>      GNU compiler version 4.8.5
>     _______________________________________________
>     Pw_forum mailing list
>     Pw_forum at pwscf.org <mailto:Pw_forum at pwscf.org>
>     http://pwscf.org/mailman/listinfo/pw_forum
>
>
>
>
> -- 
> Paolo Giannozzi, Dip. Scienze Matematiche Informatiche e Fisiche,
> Univ. Udine, via delle Scienze 208, 33100 Udine, Italy
> Phone +39-0432-558216, fax +39-0432-558222
>
>
>
> _______________________________________________
> Pw_forum mailing list
> Pw_forum at pwscf.org
> http://pwscf.org/mailman/listinfo/pw_forum

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.quantum-espresso.org/pipermail/users/attachments/20160126/3d8ce4d1/attachment.html>


More information about the users mailing list