<div dir="ltr"><div>The performances of a large and complex code are affected by a number of factors: hardware[1], compilation [2], mathematical libraries[3], parallelism[4], obvious[5] and less obvious[6] conflicts for resources, phase of the moon[7] and alignment of planets[8]...</div><div><br></div><div>Paolo</div><div><br></div><div>[1] Not only CPU speed but also speed of memory access, size of cache,...As a rule: the cheaper the hardware, the slower the speed<br></div><div>[2] Can make a sizable difference, but, as Lorenzo says, not a factor 10<br></div><div>[3] These, especially FFTs and linear algebra, can also make a sizable difference<br></div><div>[4] There are many ways to use a given number of processors; just increasing their number is not always the right thing to do</div><div>[5] Some other process running on the same processor</div><div>[6] Memory access conflicts, unwanted OpenMP threads, ...<br></div><div>[7-8] By far the most important factors:-) Many years ago, I watched in disbelief the same run taking 27 s in one case, 30 in another, reproducibly. The difference? a single print statement, executed once.<br></div><div><br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Sat, Jan 23, 2021 at 10:22 PM Robert Stanton <<a href="mailto:stantor@clarkson.edu">stantor@clarkson.edu</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">Dear all,<div><br></div><div> I am able to successfully install parallel QE and get everything to run smoothly, however after testing against an install on an externally maintained cluster I'm finding my speed to be slower by a factor of 5-10x when keeping the system, core number, QE version, etc. the same. </div><div><br></div><div> I'm not sure the degree to which this could be a hardware discrepancy (Xeon Silvers vs new EPYCs), v.s. how much could be a sub-optimal compilation of QE. The slower QE install is being done with blas/lapack 3.8.0 and openmpi 1.10.7. I'm just curious if there is anything very clearly not ideal about this environment, or if it is likely something I'll just need to troubleshoot with smaller settings? Thanks so much in advance for any insights you can provide!</div><div><br></div><div>Regards, </div><div>Robert Stanton</div><div>Graduate Student</div><div>Clarkson University</div><div><br></div><div><div dir="ltr"><div dir="ltr"><div dir="ltr"></div></div></div></div></div>
_______________________________________________<br>
Quantum ESPRESSO is supported by MaX (<a href="http://www.max-centre.eu" rel="noreferrer" target="_blank">www.max-centre.eu</a>)<br>
users mailing list <a href="mailto:users@lists.quantum-espresso.org" target="_blank">users@lists.quantum-espresso.org</a><br>
<a href="https://lists.quantum-espresso.org/mailman/listinfo/users" rel="noreferrer" target="_blank">https://lists.quantum-espresso.org/mailman/listinfo/users</a></blockquote></div><br clear="all"><br>-- <br><div dir="ltr" class="gmail_signature"><div dir="ltr"><div><div dir="ltr"><div>Paolo Giannozzi, Dip. Scienze Matematiche Informatiche e Fisiche,<br>Univ. Udine, via delle Scienze 206, 33100 Udine, Italy<br>Phone +39-0432-558216, fax +39-0432-558222<br><br></div></div></div></div></div>