[Pw_forum] Efficient parallelization
Axel Kohlmeyer
axel.kohlmeyer at theochem.ruhr-uni-bochum.de
Wed Apr 6 14:02:09 CEST 2005
On Wed, 6 Apr 2005, Sergey Lisenkov wrote:
SL> Dear Axel,
dear sergey,
SL> >looks perfectly. 6 times 4 is 24. the parallelization is first
SL> >across the k-points, since those calculations are almost independent
SL> >AND THEN for EACH k-point across g/r-space. the latter parallelization
SL> >is less efficient, so the choice of the -npool parameters is very
SL> >important for efficient use of your cpu resources (if you have
SL> >multiple k-points). this is explained in the manual in more detail.
SL>
SL> If I run the code on SMP machine, does k-parallelization efficient?
it should be. there may be a problem with memory requirements.
if you run several k-points in parallel, you need to be able to store
those wavefunctions in memory simulaneously, whereas with g-/r-space
parallelization the total amount of memory should not increase that
much when increasing the number of processors. i was referring to
the efficiency of the algrorithm itself. there are always technical
issues to be considered. under some cicumstances i found, that
it is occasionally better to not use all cpus to have more memory
bandwidth per processor (e.g. on dual Xeon machines).
best regards,
axel
SL>
SL> Best wishes,
SL> Sergey
SL>
SL>
--
=======================================================================
Dr. Axel Kohlmeyer e-mail: axel.kohlmeyer at theochem.ruhr-uni-bochum.de
Lehrstuhl fuer Theoretische Chemie Phone: ++49 (0)234/32-26673
Ruhr-Universitaet Bochum - NC 03/53 Fax: ++49 (0)234/32-14045
D-44780 Bochum http://www.theochem.ruhr-uni-bochum.de/~axel.kohlmeyer/
=======================================================================
If you make something idiot-proof, the universe creates a better idiot.
More information about the users
mailing list